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Resonant Frequencies of an Electromagnetic Cavity in an 
Accelerated System of Reference* 


C. V. HEER 
Department of Physics, Ohio State University, Columbus, Ohio 
(Received 27 December 1963) 


The resonant frequencies of an electromagnetic cavity in a nonpermanent gravitational field such as that 
provided by rotation is considered. The constitutive equations for accelerated macroscopic matter are 


developed and it is shown that the electromagnetic-field tensors are related by H21=K,Fz,and, Kal” =F, 
etc. For the special case of angular rotation about the z axis the constitutive equations are given in terms 
of noncovariant field vectors B, H, D, E. Using these constitutive equations Maxwell’s equations are solved, 
and it is shown that the degeneracy between the axially degenerate modes, i.e., clockwise and counter- 
clockwise traveling waves, is removed by the rotation. In vacuum due to the similarity between energy flux 
and momentum a shift in frequency of Q-J/h or a splitting between traveling wave modes of 2Q-J/h is 
predicted. This led to a suggestion of the “Coriolis-Zeeman” effect for photons. Using either the above 
development or the energy density it is shown that the effect depends on the moment of the energy flux and 
is only proportional to the angular momentum. In the geometrical-optics region an index of refraction valid 
for accelerated macroscopic matter is developed and is applied to the square Fabry-Perot cavity rate 
gyroscope to yield the same result as either of the above methods. An experiment to show the validity of 
Minkowski's decomposition of the energy-momentum tensor is discussed. 


INTRODUCTION 


HE effect of acceleration on the resonant fre- 
quencies of an electromagnetic resonant structure 

or cavity is considered in this paper. Interference of 
light rays in a rotating system was suggested by Michel- 
son as early as 1904 on the basis of the ether. The 
observations of subsequent workers? confirmed the 
measurement of rotation by the interference technique 
and Michelson and Gale succeeded in measuring the 
angular rotation of the earth.? With the advent of the 
general theory of relativity i£ was shown that problems 
related to light transmission in accelerated frames of 
reference were more properly treated within the frame- 
-Work of general relativity. Adopting this viewpoint the 
author suggested that in rotating resonant-electro- 
Magnetic structures the degeneracy of the traveling 
Wave modes about the axis of rotation would be removed 


*This work was supported by a grant from the National 
Science Foundation, maleate in part by the enone Aero- 
Nautics and Space Administration through Grant NsG-552. 

, À- A. Michelson, Phil. Mag. 8, 716 (190)... Har- 

26. Sagnac, Compt. Rend. 157, 708 and 1410 (19 nE s 
s dissertation, Jena, 1911 (unpublished); and B. Pogany, 

nn: Physik 85, 244 (1928). E 

WW NER ed c Gale, J. Astrophys. 61, 1401 (1925). 


by rotation and a beat frequency between the two modes 
of 2m? would be observed. This work was extended to 
optical masers? and confirmed by the experiments of 
Macek and Davis,* and Cheo and Heer.’ It is the pur- 
pose of this paper to extend this earlier work. 

In order to pursue the subject in a systematic manner 
the general principle of relativity that all systems of 
reference are equivalent with respect to the fundamental 
laws of physics is invoked. This requirement that the 
laws of nature be expressible in the form of equations 
which are form invariant or covariant is made possible 
by incorporating the metric gag into the physical laws. 
The gag are defined by the quadratic form : 


d= g,adx*daf. (1) 


For a system which is rotating with angular velocity ^ 


Q about the z axis and described by coordinates x!— x, 
w=y, r—z, x'—c, the symmetric tensor gig has 


^ 


* OxV. Heer, Bull. Am. Phys. Soc. 6, 58 (1961). 

5 C. V. Heer, Proceedings of the Third International Conference 
on Quantum Electronics (Dunod Cie, Paris,-and Columbia Uni- 
versity Press, New York, 1963); A. H. Rosenthal, J. Opt. Soc. 
Am. 52, 1143 (1962). E 

$ W. Macek and D. Davis, Appl. Phys. Letters 2, 67 (1963). 

1 P. K. Cheo and C. V. Heer (to be published). 
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the following nonzero covariant and contravariant 
components. 

£u g^'— (—Qy/c) ; 
877 = (1-0y?/2) ; 


Ézz7 Syy=822=1; 
£y g!! — (Ox/c); 


EQ) g=; (2) 
g= (Oxy/c) ; Su=—~[1-O (+ y")/e?]; 
gt=—1; detg=g=—1. 


Of the simple systems, ie., constant linear velocity, 
linear acceleration, and angular rotation, only angular 
rotation does not permit the introduction of an in- 
Stantaneous rest frame for the entire system and the 
metric cannot be placed in diagonal form. This aspect 
of the metric is considered in some detail by Méller.® 
It is also for this reason that effects of the order of v/c 
are observed in the rotating system. Thus, the experi- 
ment of Michelson can be explained by regarding the 
trajectory of a light ray as given by ds?=0. 


Maxwell's Equations 


Following the procedure suggested in the Introduc- 
tion, Maxwell's equations are written in a covariant 
form in an instantaneous rest frame, and by the rules 
of tensor algebra, remain valid in an arbitrary frame. 
For macroscopic matter Maxwell’s equations in covari- 
ant form are 


OF ap OFg, OF xa 


Oxy  Qx* dxf 


b 


and A 
— 01/2. — gy Fap] — ja 
(—g) m g) 2 Hee |= ja. (3) 


In the subsequent discussion gag remains dimensionless, 
Fag and H® have the dimensions of (energy)!?, and 
the m.k.s. system of units is used. Although Fas and 
H*? can be written in terms of a polar vector and an 
axial vector'in a covariant manner, à noncovariant 
notation? is found more convenient and consistent with 


conventional techniques. Nonconvariant vectors are 
defined as 


eo? E,— Fa; ui 3? B,—38,,F,,; noi I {= 47; 4 
ej 12] ,— Ht»; Bo ^H ,— 30, Hv:; ( ) 
where Baya is the three-dimensional Levi-Cevita symbol. 
Maxwell's equations take on their conventional ap- 
e pearance in this notation, 
curlE4-9B49/—0, divB=0, (5) 
ES curlH— 9D/3t—J, divD=p, 


erations cur] and div are defined only for 
ering of the differentiation. 


ej Mp jt; 
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Instantaneous Rest Frame 


Using the rules of tensor algebra for two Systems of 
coordinates which are related by 


dX? c A? da? ? dx? = A “sd X? D) (6) 
AtA tr — 5 y, A%yA% = 5%, 


the covariant and contravariant components of a tensor 
transform as 


Top=A*A%pTav, and T= A«AB, TO, (7) 


Following the procedure discussed by Möller? an in- 
stantaneous rest frame is regarded as a system of 
inertia for which Gyx=Gyy=Gzz=1 and Grr=—1. 
If («,y,2,ct) and (X,Y,Z,cT) are regarded as instan- 
taneously at rest with respect to each other, then 
Atp=A*,=0. (8) 


Since gag is a tensor, the components of the metric Lap 
are related to those in the instantaneous rest frame by 


gap = A9 AY Gay 8 (9) 
For subsequent reference, 
£u— —ÀAT,AT,, and £u^ —AT,AT,. (10) 


Also in the remainder of this paper capital Greek or 
Latin indices refer to the instantaneous rest frame, 
small indices refer to the frame under consideration, and 
repeated Greek letters are summed over the four 
indices. Repeated Latin letters are summed over only 
the space indices. 


CONSTITUTIVE EQUATIONS 


For vacuum H= F*f, and the relationship between 
B, H, E and D follow immediately from the transfor- 
mation [b= ge"7PYF,.. Macroscopic matter requires 
further elaboration. A procedure similar to that used by 
Minkowski for introducing the constitutive equations 
in special relativity is used to find the constitutive equa- 
tions in a nonpermanent gravitational field. For isotropic 
dielectric and paramagnetic substances the constitutive 
equations in an instantaneous rest frame have their 
usual form 


Dx=K.eEx, Br=KmuHx; Jx-eEx (11) 


in noncovariant notation. K, is the dielectric constant, 
Km the magnetic constant, and c the electrical conduc- 
tivity. In the instantaneous rest frame there is no dis- 
tinction between covariant and contravariant com 
ponents and these constitutive equations may be written 
as 


Hxr=K Fxr; Fyz=K,Hyz; 
jx-—(uye)eFxp; jT= epr: (12) 


In any other frame of reference the compon a 
follow from the rules given by Eqs. (7) and (8). Thu 


*o [D 3 ) 
Fa—AX.ATQ RS (sum X only), (13a 
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and similar expressions follow for H,,. A simple relation- 
ship between H.: and Fz follows, i.e., 


Hu=A* AT Hxr=KA*,A' Fyr=KFa. (13b) 


Following a similar procedure for F7” and H zv, the first 
two constitutive equations may be written in the simple 
form as 


Hu=K Fa and K,Hev— pes, (14a) 


Noncovariant notation may be introduced by relating 
the covariant and contravariant components; 


K,H*v— Fzt—grogWbE, g, and KeF =H a7 Seeks. 

For the special example of rotation along the z axis, 
the introduction of the metric given by Eqs. (2) and 
the noncovariant field vectors yields the following 


constitutive equations. With Q=Qa, these expressions 
reduce to 


esee 


Qar 
-Kaut (| ; 


se 
ABA 


For vacuum, K,—1— K,, these equations may be re- 
arranged to read 
B=poH-+ (uo/ e) ^[ (Qar/c) X D ], (140) 
= €oE— (eno)! ^ (Qxr/c)XB]. 

Rotation has the effect of making even the vacuum 
appear anisotropic. This effect occurs in first order of 
Q and corresponds to the type term introduced by 
Michelson for the description in terms of the “aether.” 

The source equation transforms as a four vector, and 
by the rules of tensor algebra 

j;—ÀX.xd Tur ) and j=Al jr E (15) 
For macroscopic matter the constitutive equations in 
the instantaneous rest system are given by Eqs. (12). 
Substituting for jx end jı yields 
Ja= (~ gu) M (uy e)! PoF c gi (— 8) je (16a) 
Again using the metric to determine the components of 
j*, the current and charge in the rotating system may 
be written as 


"ie tete d 
x[1- eet Veh, 


OF (p) se n-ee cover 


[7 


(14b) 


(16b) 


Matter with a finite conductivity in the presence of 
an oscillating electric field will appear to have an in- 
duced oscillating current and charge. It is often desire- 
able to separate the property of matter from the vacuum 
field. In the notation used in the-paper this is expressed 
by 

Hc8 = Fob Me 
and noncovariant vectors M and P for the magnetiza- 
tion and polarization are defined as 


uj^M.—15,,M*", and eP, =M. (17) 


With these definitions, Maxwell's equations may be 
written in terms of B and E if J is replaced by 


J — J--curl M 4-0P/9t. (18) 


RESONANT FREQUENCY OF A ROTATING CAVITY 


The approximate frequency of oscillation of a reso- 
nant electromagnetic cavity or structure is found by a 
method used by Slater? for microwave cavities. His 
method is modified to include traveling waves and it is 
necessary to use complex wave functions. A set of 
orthonormal functions E, and H, which are solutions of 
the vector Helmholtz equation and which satisfy the 
boundary conditions nX E, —0 and n-H,=0 on a per- 
fectly conducting surface S and nX H,—0 and n-E,=0 
on an insulating surface S’ are introduced. These vectors 
are related by —ikaHa=curlE, and Z&E,-curlH,. 
k.F.—grady, for the scalar potential. Orthogonality 
is chosen such that /dvE,-E,*=6(a=5), etc., and an 
arbitrary vector may be expanded in terms of this 
basic set, i.e., 


E(r/(—X, E, f dE- E* +3 Fy f dvE-Fy*, (19) 
a B 


where the volume integrals are the time-dependent 
coefficients. Expanding Maxwell’s equations in this 
manner yields 


d 
-ika | E-E [en 
t 


= l dA(nXE)-H.*, (20a) 
S 


d 
tite [ a IE ^ 
t 
= Í dvJ- E,*— J dA(nxH)-E,*, (20b) 
Sn 
Í doD- gradyo*+ f dogpa*=07 (200) 
This expansion in normal modes im plies that the element 


J. C. Slater, Microwave Electronics (D. Van Nostrand Com- 
pany, Inc., Princeton, New Jersey, 1950). 
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Fic. 1. Four mirrors M form the square Fabry-Perot cavity 
with diameter D. Optical media of length d and index of refraction 
(K.K,,)! shifts the splitting in resonant frequency of the axially 
degenerate modes which is caused by the angular rotation Q from 
the vacuum value given by Eq. (25) to that given by Eq. (26). 
Beats are observed with the photodetector R. 


of volume is dxdydz and the surface elements dxdy, etc. 
"This is not true in the rotating system and a correction 
of the 9 (0?) is implied in the wave functions. The above 
equations are correct to O(0?) and should yield the reso- 
nant frequency to ©(Q?)., Omitting the source terms on 
the right and using the constitutive equations to elimi- 
nate B and D, the remaining equations may be solved 
to yield the resonant frequency. If va? is the resonant 
frequency, and E, and H, the normal modes for 2=0, 
the resonant frequency correct to 9(Q) becomes 


v= nel 1-- (K Kne) e 


| ders E^] , (21) 


where as emphasized earlier Q is along the z axis. If two 
modes a and a’ are axially degenerate for 2=0, then 
the frequency splitting between the two modes is given 
by 4 


Av/va= (K Kme) POQ 
j dofxe(BoXHA+E,* XH.) +00). (22) 


The cylindrical microwave cavity with rotation Q 
s along the z or symmetgy axis provides an excellent 
example. dn the T'E,,, mode the zero-order wave 
functions are 9f the form 


: O E, Adde X grade" Z, (20) ]) sinmpz/Zo. (23) 
i \ direct evaluation of the integral for vacuum yields 
Tora Ao 2 Ay— 2m , (24) 
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and the rotation removes the twofold de 
the axial mode. 

A system of interest as a sensitive rate £yroscopes- 
is the square Fabry-Perot cavity with confocal or flat 


generacy of 


mirrors. If E, is an almost plane wave with a finite 


cross section, and if the perpendicular axis of the cavity 
makes angle y with the z axis of rotation, integration 
over the beam volume yields a frequency splitting of 


Av/ v, — (QD/c) cosy (25) 


for vacuum, D is the diameter of the square. Other 
configurations may be treated in a similar manner. If for 
convenience y=0 and one arm of the Fabry-Perot con- 
tains matter as shown in Fig. 1 with an index of refrac- 
tion (K,K,,)!? and of length d, then 


Av/va= (QD/c){1— (d/4D)1— (K,K,)3?]) (26) 


is the required correction for the matter. 


Plane Waves’ 


In the limit of short wavelengths or of the validity of 
geometrical optics the invariance of the phase d e — &,dx? 
and the invariance of the scalar k,k* can be used to 
determine the index of refraction as measured by an 
observer in a nonpermanent gravitational field. Fer- 
mat's principle can be applied to determine the path. 
Using the transformation between k, and kr, the scalar 
invariant becomes 


kak? = kak*= (w/c) (KKK,—1)(—gu4)3, (27) 


and is of course the usual null vector for vacuum. If 
do is a line element along the path of the ray and n° 
is the contravariant component of the ray direction, 
then 

n^ — da*/do = k"! (— gu/ KK)? (28) 


follows from Eq. (27) and the various relationships for 
the gag."' Further manipulation yields for the phase 


dg=kedx*=wel{ [(— KK m/ gu)? 
—gun*/gu]de—cdi). (29) 


If the quantity inside the square brackets is regarded as 
the index of refraction u, 


u= (— K, Ks gu) ?— gain®/ gu ) 


then the path follows the variation 
6 Jl udo -—0. (31) 


Equation (31) is quite general and is Fermat's principle 
applicable for macroscopic matter and nonpermanent 
gravitational fields. If the principle of equivalence 15 
invoked and for the g,,—0, the bending of a light ray 
in a macroscopic medium and in a permanent gravita- 
"A See D. Landau and E. Lifshitz, The Classical Theory of Fields 


(Addison-Wesley Publishing Company, Inc., Reading, Massa- 
chusetts, 1951), p. 280, for a similar development for vacuum. 


(30) 
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tional field described by gy: is readily obtained. (— gi, 
-14-2x/e where x is the gravitational potential.) It 
is interesting to note that the bending by a gravita- 
tional field is not separable from that bending by matter. 
For the special example a system rotating about the z 
axis and described by the metric given by Eq. (2, u 
becomes 


u= (KeK n) +H (QXr)-f--0(9), 


where f; is a unit normal in the direction of the ray. 
Macroscopic matter does not effect the term depending 
on ga and the term of O(Q) in the index of refraction 
depends on the vacuum properties only. If this index 
of refraction is used for the square Fabry-Perot cavity, 
or cavities in which geometrical optics may be used 
with some validity, the shift in resonant frequency is 
the same as indicated by Eqs. (25) and (26). 


(32) 


CORIOLIS-ZEEMAN EFFECT FOR PHOTONS 


The removal of the twofold degeneracy of the axial 
mode of the cylindrical cavity discussed earlier yields 
an angular frequency of 


Gmap— Omnp-EmQ (m=0, 1,2, ---). (33) 


If the photon concept is used and J is the orbital angu- 
lar momentum of the photon, the above equation may 
be written as 

(34) 


and the correction is proportional to Q-J. For vacuum, 
Eq. (22) is consistent with the concept of angular 
momentum. In fact, Eq. (25) is readily obtained by 
noting that 


Tio — Ti? 4- J Qo, 


Ay/v—2Q-J/h»— (QD/c) cosy, (35) 


where J;— (/m/c)(D/2) cosy is the projection of the 
orbital angular momentum J along the z axis. The ob- 
server in the rotating system would find the resonant 
frequency dependent on the angular momentum of the 
photon and would regard the level splitting as a 
“Coriolis-Zeeman” effect for the photon. This concept 
is valid only for vacuum. The right side of Eq. (22) is 
the expression for the moment of energy flux rather than 
angular momentum. This distinction between energy 
flux and momentum is apparent only in the presence of 
matter or in 0 (Q?). 
If the energy-momentum tensor is defined as 


Supe deti 30,2 Fp He, (36) 


whose covariant divergence yields the four-force 


12 Although an experiment with matter waves does not seem 
Possible ai this timer is interesting to note that using the Rae 
ormalism with the Klein-Gordon equation a phase difference be- 
tween the c.w. and c.c.w. paths is obtained in which the Compro 
Wavelength \=//me replaces the optical yardas n T P dum 
Shift is 8 D*0/Ac in an experiment similar to that use He en 
Son (Ref, 2) -Tnterference depends on the Compton rather 
the de Broglie wavelength (Ref. 4). 
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Ja= Fa5j*, the various components in the noncovariant 
notation are 


Sé=—c(EXH),, S/=c(DxB)., 
$,— —4(E-D--B-H). 


In the instantaneous rest frame 577 is the energy den- 
sity, S7* the energy flux, and Sy? the momentum, and 
these quantities have the same interpretation in other 
systems of coordinates. The energy-momentum tensor 
in the instantaneous rest frame and the frame of obser- 
vation are related by 


(37) 


Sot =A AtSd, (38) 
and the energy density may be written as 
$772 SE (gex/ gu) Sr. (39) 


For the stationary system which is being considered the 
Zap do not depend on the local time variable ¢ and for 
short periods of time in which losses may be ignored, 
solutions of Maxwell's equations which are independent 
of the time variable may be obtained. The observer in 
the moving system interprets the integral of S;! over 
the volume of the cavity or radiation field at time £ 
asa constant of motion for the system. If a single photon 
is present the photon frequency is measured as 


pate E Site | ats, (ge2/get)S:7]. (40) 


Considered as a traveling wave the integral over S77 
is independent of the sense of rotation and the beat 
frequency caused by the interference of clockwise (c.w.) + 
and counter-clockwise (c.c.w.) photons at a detector 
attached to the moving system is 


Av= uei [etel 
= rita. i d4rx(EXH)]--o(Q). (41) 


This is the same as the earlier expression given by Eq. 
(22) when E and H are replaced by zero-order ortho- 
normal E, and H, wave functions. The dependence on 
the projection of the moment of the energy flux rather 
than the angular momentum is immediately apparent. 


MINKOWSKPS ENERGY-MOMENTUM TENSOR 


The lack of symmetry in the Minkowski energy- 
momentum tensor between the, energy flux S7* and the 
momentum Sz” in the presence of macroscopic matter 
has been discussed frequently in the literature.” 

The energy-momentum tensor is symmetric in an 
inertia frame in the absence of matter and becomes 


13 See Ref. 8, p. 204 for a discussion and references to related 
discussions. 
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nonsymmetric in the presence of matter or nonperma- 
nent gravitational fields. A first-order nonsymmetry 
appears in the presence of rotation. Minkowski’s theory 
requires that transparent matter not exchange energy 
with the electromagnetic field. It is shown in the earlier 
developments in this paper that the frequency shift in 
first order of Q depends on the moment of the energy 
flux and also that the change in index of refraction in 
the geometrical optic region is independent of the pres- 
ence of matter to O(@). These same results follow with 
the Minkowski energy-momentum tensor given by 
Eq. (36). Thus an observation of the beat frequency in 
accord with Eq. (26) in the presence of matter would 
support the Minkowski formulation. This effect should 
be observable in the square Fabry-Perot cavity with 
flat mirrors for which the mode of operation is simple. 
Since the presence of the maser material allows the 
cavity to oscillate at both frequencies and the beat 
frequency is sensitive to rotation only, the shift in the 
beat frequency is measureable. Combining Eqs. (25) 
and (26) for va coinciding with a maser line, the frac- 
tional shift is 


(Avy/Av) -1— (d/A4D)1— (K,K,)-1*?], — (42) 


where Avy, is the beat frequency with matter and Avo 
with vacuum. If a large portion of the maser is filled 
with matter an effect as large as 30% can be expected. 
Unfortunately terms of 0(Q?) cannot be tested by pres- 
ent experimental apparatus. Applying the principle 
of equivalence it is the terms of 9(Q?) which are equi- 
valent to the effects of a permanent gravitational field. 
Although not emphasized in the development, the equa- 
tions developed in this paper may be applied in a perma- 
nent gravitational field by invoking the principle of 
equivalence. à 


EXCITATION OF CAVITY MODES 


Using the form invariance of the fundamental laws of 
physics and the concept of the instantaneous rest frame, 
the constitutive equations for macroscopic matter in 
an accelerated system have been developed and used 
with Maxwell’s equations to discuss the resonant fre- 
quencies of electromagnetic cavities. The deformation 
of matter by the acceleration forces has been neglected. 
Since the deformation of matter by the acceleration 
would change the shape of the cavity and the physical 
constants of enclosed matter, the small shifts suggested 
would be difficult to observe. For axially degenerate 


‘modes the deformations affect both c.w. and c.c.w. 


traveling waves in the same manner and the splitting 
between modes given by'Eq. (22) is almost independent 
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of the deformations. The splitting between mod 

: ; 7 : es 
given by Eq. (22) is valid over the entire region of the 
electromagnetic spectrum for which cavity modes mas 
be defined. Thus, in the microwave region a resoa 
size cavity would have a value of m~10, while in the 
optical region a value of m> 10* is possible. For a closed 
microwave cavity the normal modes may be determined 
in à standard manner and wall losses introduced b 
discussing the decay of stored energy in terms of the Q. 
Since the beat frequency would be of the order of Cycles 
per second for a rotation rate of 1 rad per sec, a Q of the 
order of 10? would be required to resolve the splitting 
between the c.w. and c.c.w. modes. Such a Q is almost 
possible with superconducting cavity walls. In the opti- 
cal region the Fabry-Perot cavity shown in Fig. 1 may 
be discussed in a similar manner. A system with dimen- 
sions of the order of 1 m has the Q limited to 108 by 
diffraction and reflection losses and a beat note of the 
order of kilocycles is not resolved. 

In order to enhance the apparent Q of the system a 
maser material in the appropriate frequency range may 
be used.577 The volume source or sink term on the right- 
hand side of Eq. (20b) may be due to either the polari- 
zation, magnetization, or conductivity as shown by Eq. 
(18). If the maser media has a narrow linewidth de- 
scribed by e"(v) or x"(v)/* either property may be 
discussed in terms of a generalized negative resistance 
c (v) which is proportional to it. In this linear approxima- 
tion the response of the c.w. mode depends on the value 
of c(v) at the frequency of the c.w. mode. A similar 
argument follows for the c.c.w. mode, and both modes 
will be excited for a maser linewidth greater than the 
splitting. Oscillation will occur for both modes in the 
linear approximation. If the polarization depends on the 
electric field in a nonlinear manner, the question of 
coupling between these closely spaced modes needs 
further consideration.!5 It may be possible to study these 
nonlinearities by using the square Fabry-Perot cavity 
as a precision spectrometer system. 
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The electronic thermal conductivity of superconductors is discussed without recourse to the quasi- 
particle approximation. The Kubo formula for the thermal conductivity is used as the starting point. This 
is first examined in the Hartree-Fock approximation in the Nambu space. It is shown that in the Eliashberg 
approximation of neglecting the momentum dependence of the electronic self-energy, the calculation of the 
thermal conductivity is reduced to a quadrature, involving however the complex energy gap and renormaliza- 
tion functions which are solutions of the Eliashberg gap equations at finite temperatures. These equations 
are given in an Appendix. The problem is also considered in the ladder approximation in the Nambu space, 
anda generalized Boltzmann equation is derived which includes corrections to the Hartree-Fock approxima- 
tion corresponding to the replacement of the scattering by the transport cross section. It is shown that the 
standard Boltzmann equation for superconducting quasiparticles is obtained in the weak-coupling limit. 
No numerical calculations are performed in this paper, but a clear scheme for such calculations is outlined. 
Reasons for believing that such calculations will explain the anomalous drop in the electronic thermal con- 
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ductivity of superconducting lead are given. 


I. INTRODUCTION 


| essaie on the strong elemental super- 
conductors lead and mercury show that the elec- 
tronic thermal conductivity of these materials differs 
markedly from that of typical weak superconductors 
like tin or indium. For lead and mercury! the ratio, 
K./Kn, of the thermal conductivity in the supercon- 
ducting and normal states when plotted against the 
reduced temperature T/T. shows a steep positive slope 
of about 5 near T=T.. For tin? and indium,’ on the 
other hand, the experiments yield a smaller slope of 
about 1.6. 

A theory of the electronic thermal conductivity of 
superconductors based on the quasiparticle approxima- 
tion and the Boltzmann equation approach of Bardeen, 
Rickayzen, and one of the present authors! has been 
carried through previously.5 The results obtained are in 
substantial agreement with the data on tin and indium. 
The lack of agreement between the theory given in I 
and the data on lead is not surprising since, as shown in 


* Alfred P. Sloan Foundation Fellow. ‘ i 
+ Permanent address: Institut für Angewandte Physik, Uni- 
versitát Hamburg, Hamburg, Germany. E 
1 T. K. Hulm, Proc. Roy. Soc. (London) A204, 98 (1956); J. H. 
P. Watson and G. M. Graham, Can. J. Phys. 41, 1738 (1963). 
2A. M. Guenault, Proc. Roy. Soc. (London) A262, 420 (1961). 
3 A. M. Toxen and R. E. Jones, Phys. Rev. 120, 1167 (1960). 
‘J. Bardeen, G. Rickayzen, and L. Tewordt, Phys. Rev. 113, 
982 (1959). 129, 657 ( 
5L. hys. Rev. 129, h d t 
as I. Mud n this subject is contained in L. emot 2i 
128, 12 (1962), hereafter referred to as II; L. P. Eus an 
P. Martin, ibid. 124, 678 (1961); B. T. Geilikman, V. Z. em 
Zh. Elsperim. i Teor. Fiz. 4l, 1142 (1961) [English transl.: 
Soviet Phys JETP 14, 816 (1962) ]. 


1963), hereafter referred to 


the work of Schrieffer, Scalapino, and Wilkins,? the 
strong electron-phonon coupling causes the quasi- 
particle picture to be quite meaningless over much of 
the energy spectrum. 

It therefore seems reasonable to study the thermal 
conductivity of lead on the basis of the strong-coupling 
theory used by Schrieffer e£ al. in explaining tunneling 
characteristics. This is the aim of the present paper. We 
use the method of thermodynamic Green's functions’ 
and thereby avoid the quasiparticle approximation. We 
expect that in the limit of almost stable excitations our 
theory will reduce to the theory developed in I, and this 
expectation is realized. In this paper we carry the theory 
as far as we are able to without using numerical methods. 
We expect to use the formulas obtained here as the 
basis for further calculation. 

Our starting point is the Kubo formula in which the 
thermal conductivity is expressed in terms of the corre- 
lation function of two heat current operators.’ For the 
superconducting state it is convenient to use the two 
component space introduced by Nambu.? In Sec. II the 
problem is treated in the Hartree-Fock approximation 
which means that the correlation function is factored 
into two Nambu matrix Green's functions. The mo- 
mentum integration in the expression for the thermal 


6 J. R. Schrieffer, D. J. Scalapino, and J. W. Wilkins, Phys. Rev. 
Letters 10, 336 (1963). 3 

7 Several references on the Green's function method are now 
available. A reference in which the treatment is similar in style 
and notation to that of the present paper is V. Ambegaokar, 1962 
Brandeis Lectures (W. A. Benjamin, Inc., New York, 1963), Vol. 2. 
Other references are given here. 


8 We use the form given by J. S. Langer, Phys. Rev. 128, 110 o> e 


(1962). See also Ref. 
? Y. Nambu, Phys. Rev. 117, 648 (1960). 
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conductivity can be carried out if one makes the weak- 
momentum approximation introduced by Eliashberg,'? 
that is, if one neglects the p dependence of the re- 
normalization function Z,(w) and the gap function 
A,(). In contrast to the result of the quasiparticle 
approach, the thermal conductivity is expressed as an 
integral over all excitation energies rather than over all 
momenta. The integrand contains the complex functions 
Z(w) and A(@w). These functions have to be determined 
from the generalization to finite temperatures of 
Eliashberg’s gap equations. The generalized gap equa- 
tions are given in Appendix A. In the limit of zero tem- 
perature these equations have been solved for lead by 
Schrieffer et al. However, for the problem of the thermal 
conductivity one needs the solutions near Te. 

In Sec. III the problem is treated in the more general 
ladder approximation in the Nambu space. This is the 
approximation that must be used with the super- 
conducting approximation to the single-particle self- 
energy operator in order to preserve conservation laws! 
or, equivalently, Ward identities. To facilitate the treat- 
ment of this approximation we use a method introduced 
recently by Langer, Maradudin, and Wallis? in discuss- 
ing optical absorption by anharmonic crystals. The 
integral equation for the vertex part is simplified by 
using a double spectral representation (derived in 
Appendix B) for the vertex function. The resulting 
integral equation for one of the spectral functions corre- 
sponds to a generalized. Boltzmann equation in which 
this function essentially plays the role of the non- 
equilibrium part of the distribution function. The 
integral equation simplifies if one makes an Ansatz for 
the vector part of the spectral function which corre- 
sponds to Bloch’s Ansatz in the elementary theory, and 
further, if one again uses the weak-momentum approxi- 
mation. In this way one obtains an integral equation in 
the energy variable alone which represents the generali- 
zation of Bloch’s equation for thermal conductivity to 
a strongly coupled superconductor. In the limit of weak 
electron-phonon interaction the equation reduces to the 
Boltzmann equation used in I. 


II. HARTREE-FOCK APPROXIMATION 


If we neglect the entropy flow and measure energies 
with respect to the chemical potential, the Kubo 
formule for the thermal conductivity K may be written 
as follows (we use units in which Z— 1): 


, 2 9 f 
K=— Im Í tale Í d*xid*xo(u(x1,0) -u(xs,/9)). (2.1) 
3VT  J— 4 

10 G. M. Eliashberg, Zh: Eksperim. i Teor. Fiz. 38, 960 (1960) 
[English transl. Soviet Phys.—JETP 9, 1385 (1959) ]. 

* 1G, Baym and L. P. Kadanoff, Phys. Rev. 124, 287 (1961). 

2 J. S. Langer, A. ‘A. Maradudin, and R. F. Wallis, Proceedings 
International Conference on Lattice Dynamics, 1963 (to be 
ed). E 
L. P. Kadanoff and P. C. Martin, Ann. Phys. (N. V.) 24, 
63), for a discussion of this point. 
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Above V is the volume of the system, T is the tempera. 
ture, and u is the heat-current density operator at the 
space-time point 1— (x;,/;). The brackets denote an 
average in the grand canonical ensemble. For the inter- 
acting electron-phonon system u has the form u=y 
-Fup,, where 2 


1 ô ð 
ETEA NET +v) £s) v 
5 =e I aly! 1 Js) D) | va (2.2) 


is the heat current for the electronic system. Tt includes 
that part of the electron-phonon interaction energy 
which is due to electronic charge fluctuations. [The 
operators V,!(1' and wv,(1) create and annihilate 
electrons with spin ø at 1’ and 1, respectively. ] 

The detailed form of the phonon part of the heat- 
current density operator! is not needed for the purposes 
of this paper. It suffices to note that the correlation 
function of uy, with itself gives rise through Eq. (2.1) 
to a thermal conductivity which may be interpreted as 
the thermal conductivity of the lattice as limited by 
electron scattering. In the normal state this contribu- 
tion is proportional to 7? and is too small to be observed 
in a pure system. In the superconducting state the 
lattice thermal conductivity is increased relative to that 
in the normal state because the energy gap in the 
electronic spectrum leads to an increase in the relaxation 
time for phonons. However, because of the smallness of 
the lattice term, the electronic contribution (though 
rapidly falling) will dominate the thermal conductivity 
just below T, which is the region of interest to us. The 
cross terms in the expression (2.1) between u., and ups 
describe “phonon drag" effects which are totally 
negligible for heat conduction because the electronic 
heat current is odd with respect to reflections in the 
Fermi surface. In what follows, therefore, we take (2.2) 
as the expression for the heat current operator and drop 
the subscript “el.” In the language of conventional 
transport theory we are assuming that the lattice 1s 
always in equilibrium. The contribution to the thermal 
conductivity of the processes by which the lattice does 
in fact achieve equilibrium are, as we have seen, small in 
the region of interest to us. 


Let us introduce the correlation function 
.P(1,2)7 (T[u(1):u(2) 


where T is Wick's time ordering operator. In the stand- 
ard way we make Fourier transformations 1n the 
spatial and imaginary time difference coordinates, 50 


(2.3) 


14 In a continuum Debye approximation this has the form give? 
by K. Baumann, Ann. Phys. (N. Y.) 23, 221 (1963). T 

!5 Because the phonon lifetimes contain the small quantity 
(m/M)Y2 where M is the ionic mass, the quasiparticle oppio 
tion is valid for the phonons. {See Ref. 10 and A. B. Migdal, set 
Eksperim. i Teor. Fiz. 34, 1438 (1958) [English transl.: Soy 
Phys.—JETP 14, 816 (1962)].] By using the methods of Te 
present paper one can recover a Boltzmann equation for | 
thermal conduction by phonons as limited by electron scattering. 
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that 


12 ae j 
P( D =f Ony g = Pm) 


Xexp{ilq: (i—x5) —vs(5—15)]). (2.4) 
Here v,-—2-mi/8, m runs over all integers, and 
B= (keT) with kp Boltzmann's constant. In terms of 
P(g,vs) Eq. (2.1) becomes 


1d 
K= 7r gy RE Ora) loue) DT (2.5) 


q=0 


It is convenient to use Nambu’s two-component field 
operators defined by 


y: (1) 


«o- e PO Gu). 


In terms of these and the Pauli spin matrices r; the 
correlation function may be written as follows 


1 ð ð 
( M SERA ) 
4m? NOE Oly’ 


ð ð 9 
1 vé e) eaea" 
Ol» 


Ol» 


XCDOV Qi 2 wit) Don (2-6) 


2! =2+ 


In writing the correlation function in the form (2.6) 
we have neglected certain discontinuity terms that arise 
from the time derivatives acting on the time ordering 
operator. These terms make no contribution to the 
thermal conductivity. 

In this section we shall derive the expression for K in 
the Hartree-Fock approximation. This is defined by 


(TE: (Dpi V0) Sa1229; Q1), (2.7) 
where Ga is Nambu's matrix Green’s function 


Gi(12)= —i(TLWi(1)yrt (2) p. (2.8) 


Then we obtain from Eqs. (2.6), (2.7), and (2.4) 


as 4 : : 
P (gvm) ef als Y, eh [oi(pt+-a)+ Get») p T 
4m? J (2r) Bu 


XTr(ri8(9-9, +r Ym)T3G(P,50)} » (2.9) 


where t;— (2/-+1)ri/8 and } takes on integral values. 
he Hartree-Fock approximation for the heat-current 
Correlation function is represented by the diagram in 
Fig. 1. 
We how insert the spectral representation for $(,57), 


Fic. 1. Diagrammatic representation of the Hartree-Fock 
approximation for the heat-current correlation function of super- 
conductors. The solid lines on the left of the equality sign represent 
Nambu matrix propagators G. The equivalent diagrammatic repre- 
sentation in terms of Gorkov’s normal and anomalous Green's 
functions is shown on the right. In the present formalism the terms 
on the right arise from taking the trace Tr[/729739]. 


namely, 
* do alpu) 


S5) (2.10) 


—»2- (i—o 
into (2.9) and carry out the sum over (1. Introducing the 
result into (2.5) one finds 
1 d pdes k 
K= — fo? sech? (48w) 
24m?khpT? J (2r)* 


XTrira(pw)rsa(p,w)} . 


(2.11) 


The Nambu matrix propagator $(p,¢;) with ¢; continued 
to the complete complex z plane has the form 


ZZ »(Z)--€prad- bp (z)T1 
PZ? (z)—ey —ġp (z) 


where ep= (f?/2:)—y. The renormalization function 
Z5 (z) and the gap function p(z), both complex, have to 
be determined from the Nambu-Eliashberg self-con- 
sistency condition. We now make the weak-momentum 
approximation introduced by Eliashberg, that is we 
neglect the p dependence of Z,(z) and ¢,(z). Then the 
integration over p, or ej, in Eq. (2.11) can be done. The 
spectral function a(f,») is obtained from (2.12) from 
the relation 


S(5,2)— (2.12) 


a(p,o)=2 Img (p, «—1i0*) 
= (1/1) (p, o— 10") — G (p, «-T-10*) ]. 


The integrals over e that are then needed are 


(2.13) 


n- f de Tr( (5 o—i0)79 (5, w—i0*)} 


=0, . (2.14) 


and 


taf de Tr( 38 (2, w—10*) 735 (6, w+10*)} 
ot |A@)] 
x( 


le a(o] (245) 
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Fic. 2. Integral equation for the vertex function in the ladder 
approximation, represented in terms of diagrams. G and D denote 
particle and phonon propagators, respectively. 


The integrations are straightforward and are sketeched 
in Appendix C. The complex functions Z (w) and A(o), 
the latter being the complex energy gap parameter, are 
related to the functions occurring in (2.12) by 


Zw) — Zi (9)--iZ«(9) — Z (i 9—i0*); 


A (e) — Alw) FiA (9) 
—$(z—o—10*)/Z(z— »—10*). 


(2.16) 


The thermal conductivity K is now given by 
n c2 w? sech?(4Bw) 
i m i doc 
SmkgT?Jo — |Im(Z(e)[o?— A?) ]'} | 


(: x) . (Qu) 


Above n is the density of electrons. Near T=0 the 
factor in the large brackets becomes zero for w<Ao, 
where Ao— A1(Ao) is the gap edge, since A» vanishes for 
w<Ao. At finite temperatures the existence of thermally 
excited phonons may cause A» to have a small nonzero 
value for o «Ae. 

If for those frequencies w that contribute pre- 
dominantly to the integral in (2.17) the inequality 
2A5A1«o?— A+A? is satisfied, the following approxi- 
mate formula for K obtains 


S n j w? (w? — A+ AQ?) 1/2 
Rda 


|Z2(w?— As?2- A9?) — ZAA] 
o— A — A»? 
sec) (1 ) . (9.8) 


|o?— A+ A2] 


To first order in A» and Zz the expression in the de- 
nominator of the integrand in (2.18) reduces to WZI, 
where T is one-half the decay rate for a quasiparticle as 


- calculated in II. Equation (2.18) is thus somewhat more 


reneral than fhe quasiparticle approximation. In the 
of lead it would appear that the condition A241 
W — AHA? is reasonably well satisfied for fre- 


Jue: icies below 2 or 357, the latter being roughly the 
6 
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frequency which contributes maximally to K, but in 
numerical calculation it is probably no harder to use di 
exact form (2.17). 


III. GENERALIZED BOLTZMANN EQUATION 


We now wish to obtain a Boltzmann-like equation for 
the correlation function P(1,2). It is convenient first to 
rewrite the expression for this quantity as follows: 


il y) ð 
P(1,2) ( Vid-—wv ) 
2m\ Oty Oly’ 


-Tr{rsx(1,2,1’)} | l-1*,; (3.1) 
where the three-point function x(1,2,1’) is defined by 
Kj (T: Au (2) (1^) ). (3.2) 


[In writing (3.1) we have again neglected, for the same 
reason as given below Eq. (2.6), terms arising from the 
nonzero commutator of the time derivative and time- 
ordering operations.] Because of the homogeneity of 
space and time x depends only on two space-time coordi- 
nate differences, so that x= x (1-2,2-1^). Since, further- 
more, this quantity has periodicity properties along the 
imaginary time axis, one may make Fourier integral and 
series expansions in both space-time arguments. We shall 
denote the Fourier transform by x(p--q, £id-va; p, £1). 
Inserting the Fourier expansion for x into (3.1) one 
obtains the following expression for P(g,vm): 


14 
Pg Ym) = —— — 2 
2m ti 
-Tr{rax (pq, titm; P, 6}. (3.3) 


In Appendix B we have derived the following double 
spectral representation for x (see also Ref. 12) 


x(p+q, Fit rn; p; £2 
iE Ii —- (p-+q, 01; P, w2) 
—J-— (2r) L (Gt—e)(v»—ov») . 
(2) : 
te (p+a, 01; D; we) | f (3.4) 
(Fi - vs — 01) (Ym— 92) 


The explicit forms of the spectral functions f? and {2 
are given in Appendix B. We insert Eq. (3.4) into Eq: 
(3.3), perform the sum over {;, and introduce the result 
for P(q,ym) into (2.5). The result for K in terms of £0 
and f? is then found to be 


dp 
eto i TP lerton) Htoo] 
(2)? 


== 


O = A L) 
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Here we have used the abbreviation f,? (w,v)=£ (p,w,p,v), i=1, 2. 
: To proceed further we must determine the spectral functions f? and f in some approximation. We consider the 
integral equation for z in the ladder approximation which is represented by the diagrams in Fig. 2 and is given by 


, 1 1 
x (p, ıt Vm; P, £2 = Eo (2¢,+ Um)G(P, Gris »)n8(5:)4- PLS, C+ Vgy)T34- T8 (b, 1] 
m 


1 dk 
ae »» — —G(2, Cr vs)rax(p—k, sr Fum Hs; p—k, Fi un) 735 (P,5)D (Kun) . (3.6) 
B Hn (27)? 
Here D(k,u,) is the phonon propagator (us — 2«ni/B with n an integer) which has the spectral representation 


* dy d(k,v) - 
Da - | — 5 (3.7) 
—» 2T Un—v 


Introducing this spectral representation as well as those for 9 and x into (3.6) and doing the sum over z we find 


dwidw — fy? (0,02) fy (01,02) 
i Ce AF mee] 
dodo»  a(p,oi)raa (p,os) 
(2r)? (Civ — o1) (1-092) 


a p [| = a(p,w1)T3 Pn d*k ie Erorien 
2m 2r N'r»&4—o1 fior m (2r) Be "a 


a(p,v1)r3 [fox (unu) — fy P (unua) Taa (D,vz) 
aee | 
Cr Y2—»i 


p 
E Hen) i 
2m 


(3.8) 


Sibi 3 (rt ym Hiv- (Ym — uz) (£1— v2) 


The Fermi and Planck functions are abbreviated by f and JV, respectively. From Eq. (3.8) we can derive two 
coupled equations for f? and f? by calculating first the discontinuity across the real ¢; axis at ¢;=w for fixed 
imaginary v, and then the discontinuity across the real vm axis at v4 — v, or by carrying out these operations in the 
opposite order. The two equations are 


f, O (w,v) = — (p/2m) Qo-1-»)a(5, o+ v)rsa(p,w)+ (20) | ttr (ms) +N 00) MCE) 


Xa(p,v1) Taf fO (41,u2)2 Im[ (e— va—41—:0*)-1 (w— va—10*)-1 ] 
X2 Im[ (e4-»—»1—:0*)7 (v—us—320*)! -- f? i? (1,42) 276 (6 — v2) 


X2 Im[ (v—42—10*) (o--» — vı ið) (w+ v— vs—u1—10*)-']r22(p,»:), (3.9) 
and A 


iP (w,v) +4, otr, v) = (on) d'hiver [Coi -N CNPC alr 


X {fp (11,»)2 Im[ (e4-»— ri 10*) (@— »a— i10) 1 (9— v2 —10*) J +f (11,0) 
X2 Im[ (e4-v— vs—11— 10) (od-»—v1—10*) 1 (0—»$—10*)-1 ralh, v). (3.10) > 


3 
W ond and third terms on the right of Eqs. (3.6) and (3.8), which are discontinuity terms of the 
S A E de omitted, do not in fact contribute to the double spectral functions of physical interest. Let 


us now consider Eq. (3.10) in the limit » > 0. We make the Ansatz > 
£,9 (40) = (p/m) (epo), i12, (3.11) 


tion for gY-+g. On the right-hand side of this equation there occurs the angular 
eglects terms of order kaT/p and kaT/cpr (where cisa = ^ 
be transformed into integrations over k, or the phonon . 


and obtain from (3.10) an equa l 
factor (p—k)-p/#*, which may be replaced by 1 if one n 
typical velocity of sound). The d'k integration may now 


CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar : 


i ee 


€ 


ex 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


A810 V. AMBEGAOKAR 


ANED We, WAR; WV ORIDE 


frequencies cx (k), where \ designates the phonon branch, and over ej. Since the latter integral runs over practi- 


cally all ej-x, we introduce the functions 


e 


g)= ] deg C? (ew) 


(3.12) 


and integrate both sides of the equation for g% (&o)-- g?? (ec) over all e. The result can be written in the form 


gO (w) +g (9) = (2r) derer P» [oro XUI Oel 


X rag (ur) - 9 (m) ]rsa(e,v2)2 Im[ (e— vi— 10*)7 (e — va—ui—10*) 1 (0—v3—10*)71]. (3.13) 


Above we have followed Schrieffer et al. in characterizing 
the phonons by frequency distributions /,(w,) and 
spectral functions d(w),v) which are of the form 


dy (wy, v) = 2ray? (wy) [5(v—wr) —8(v- F3) ], (3.14) 


where a, measures the strength of the electron-phonon 
coupling. All of the constants arising from transforming 
the variables of integration are contained in a. 

The functions g(w) are matrices in the Nambu 
space. We make the decomposition 


gO (o) = gol (9)1H- 0 (w)r 
+ go (w)rotgs(w)ra. — (3.15) 


On the left side of (3.13) we are interested only in the 
73 component because only this enters the integral for 
the thermal conductivity. One finds that the terms 
containing go“ on the right of (3.13) do not enter the 
ra component of the equation. The terms proportional 
to go and gi“? make no contribution to the rz compo- 
nent when integrated over e since the integrands are odd 
in e We are then left with a homogeneous integral 
equation for gs“) (w)-+g3%)(w). We may now explicitly 
do the e integral and perform the sum in the Nambu 
space. When this is done one finds that the right-hand 
side is zero because there occurs an integral we have met 
with in Sec. II, namely, 


n= [ de Tr[_73G(€, o—:0*)758 (e, o—20*) |=0. (3.16) 


(See Appendix C for the evaluation.) 
We have thus shown that 


1 - 
2p f dep: Tr{ ralf (o0) +f ® (@,0) ]) 
2u 


= gD (w)+g32(w)=0. (3.17) 


G 
fp a) (@,0) Sy (p y) T; (p,o) T34 (p 9) 
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Hereafter we shall write 


g” (v) - g” @) =). (3.18) 


One can also show, using time reversal invariance and 
(3.17), that in our approximation we may take g(w) to 
be real (Appendix B). 

Using (3.17) in the general expression for the thermal 
conductivity (3.5) we see that the second term in this 
last equation does not contribute. The third term on the 
right-hand side of (3.5) also does not contribute. To see 
this we take the derivative of both sides of Eq. (3.10) 
with respect to v and then set » —0. We now remove the 
vector dependence by an Ansatz similar to (3.11) and 
integrate the resulting equation for scalar quantities 
over all e. The procedure is identical to that described 
above Eq. (3.13). We then take the 73 projection and 
find that the right-hand side is zero either because of 
oddness in e or because of (3.16) and (3.17). In other 
words, in our approximation we have 


ð 
i dep Telr C oH (estes 2m =0. 
(3.19) 


We therefore see that the thermal conductivity (3.5) is 
entirely determined by the 7s component of p-f, (w,0) 
or equivalently, through Eqs. (3.11), (3.12), (3.15), and 
(3.18), by the scalar function g(w). To determine this 
function let us consider Eq. (3.9) in the limit y=0. In 
the curly bracket on the right of (3.9) there are eight 
terms which arise from explicitly taking the imaginary 
parts. It is convenient to increase this number from 
eight to ten by adding and subtracting the term 
Arf, x O (144,42) (w— v3— u1) ô (o — v2) d(w— v1)8 (us) inthe 
curly bracket. The resulting equation for f, (w,0) may 
be written in the form 


| +n) | d'y f(v—w) +N ELS (5, 0—0 rf (o—», 0) rig (b; o+i0) 


+6(p, ot1i0+)raf iO (e — v, 0) r3 (p, w—i0)] i Hip (0,0) « (3.20) 


an 
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(The occurrence of the +70*’s in the arguments of the Green's functions in the curly bracket will be important 


in what follows.) In (3.20) T (pw) describes the corrected vertex associated with the ladder approximation and is 


given by 


dk 
roe) | v F 


duıdusdvz 
(27r)? 


{C/(— m) +N Gs) Jd (k v) fo 0 (uyu) + fpr” tur tea) ] 


XRe[(e—v—gi—i0*)71 (— p — i0]. (3.21) 


The function f,’(w,0) consists of four interference terms. The explicit form is 


dk dud 
£/(»0)-2 eun 
(20)*J (2x)? 

atte ak 

Xrs[Img(2, o—i09]-2 f EY 


dysdv; 


(27)? 


; 1 
Lf(—mi) +N (v3) Jd (k,v)P—[ReS{p, w—i0*) Jral fp" (2,0) + f (1,0) ] 


wW— V3— Hi 


1 
Doaa) +N (oo) P(—) 


H2 


X(LImg(?, w— 10*) Jrafp-x™ (w— vs, uz) rd Reg(, o—i0*)] 


+[ReS(p, w—i0*) ]rafp r” (w— vs, H2)TaLImG(p, w—i0+)]}. 


We shall return in a moment to f’. However, let us first 
consider the vertex corrections (3.21). In general, we 
may define the Fourier coefficients of a vertex operator 
by the relation 


x (p, Fid- vs; P, £2 
= (p/2m)[S(p, Gar Ym) T3+73G (6,61) ] 


—G(5, Ctra) Pp (Er vu, £095.60. (3.23) 


The function I'(p,w) is then related to the analytic 
continuation of the Fourier series coefficients by 


T (2,9) 7 Po (or, v)|,—o; (3.24) 
where 
T (o+ V, v) =3{T (firm, t) | fr=0 10" 
Tr. Vm, £2] haw tiot} < (3.25) 


ym =v -r10* 


It is straightforward to verify that T (p,w) as defined by 
(3.23)-(3.25) is in fact given by (3.21) in the ladder 
approximation of (3.6). i 

There exists a Ward identity, reflecting the conserva- 
tion of energy density, for the vertex function intro- 
duced above. This Ward identity has been discussed 
with great clarity by Langer; and it is therefore not 
necessary to give the derivation here. The result in 
terms of Fourier series coefficients is'? 


Talent) = —nvsg (50 
a a E Jh ((/2m)—9)nrs— X52 ]- (3.26) 


dditional terms due 

18 In Langer's result (Ref. 8) there occur acan ders. 
to the aoi in space of the Coulomb interachonbe cone 
uch terms.do not occur for the electron-phonon au fentonithithe 
the direct Coulomb interaction it is in any case consis 


(3.22) 


We notice that in the Eliashberg approximation of 
neglecting the momentum dependence of the self-energy 
operator the vertex correction is zero. We may therefore 
make the replacement 

T (pw) — (p/m)wr3. (3.27) 
Ina more exact treatment one would have to include the 
effective mass and wave function renormalization 
corrections implied by (3.26). 

We now return to Eq. (3.20), remove the vector 
dependence, and integrate both sides of the resulting 
equation over all e. The 7; component of this equation is 
then found to be 


TM 
oe (n 
|ImZ (w2— A2)12] 


1 dv 
NE dex F (wo) d lonny) 


20 J 2m x 
XLfG—e)4-N (v)]g@—v)} +8’), (3.28) 


where g’(w) comes from the interference terms f^ given 
in (3.22). Whereas in the other terms of (3.20) the 7; 
component on the left is only coupled to the 7; compo- 
nent on the right, g’ turns out to depend only on ge. The , 


Eliashberg approximation to neglect such terms. It should also be 
remarked that since the Ward identity (3.26) folldws from the 
strict conservation of energy density it requires tfe inclusion in the 
vertex function of the heat current ups introduced above Eq. (2.2) 
and then rapidly discarded. This means that in using (3.26) we are 
including a part of the "phonon-drag" thermal conduction. Since 
we have already argued that processes of this type 
effect on the thermal conductivity, their inclusion here is of no 
consequence. 


" CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar ( A 
^ 


have a negligible _ 


EJ 


Cd 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


A812 
explicit form we find is 
ima IMA f 
|ImZ (w — A?) ||? A?| 
dvdyi 


——— =i Nv 
x| gy n 


g'(w)= 


Filwr) dh lwy 
XE Ja ER po aH] 


(w—y—p1)p 


dvdys 
——[ f(»—o)2-N (v 
+f aseo 


Fy (wx), (ev) 
do onal 
xz | Gy, 


X [ge (e—», we) — g2 (o—», u2) |p. (3.29) 


The coupling between the 73 and the 7» components, 
though physically interesting, appears to be of negligible 
importance for our problem. In the first place, an 
examination of Eq. (3.13) shows that the 7» component 
on the left is coupled only to the 7» component on the 
right. In other words, go“ (»)-- g;9 (w) satisfies a linear 
homogeneous integral equation. Therefore this quantity 
can at most be nonzero for certain values of the electron- 
phonon coupling constant, an unphysical possibility 
e which it seems safe to ignore. In comparing the remain- 
ing term in Eq. (3.29) with the homogeneous term in 
(3.28) we encounter the ratio [ImA(w)/w]. From the 
numerical results of Schrieffer et al. we see that for the 
frequencies of interest, i.e., o « 1077, this factor is less 
than 0.1 at T=0 and consequently exceedingly small 
just below T= T.. Thus, barring the occurrence of other 
large factors, the second term in (3.29) is also negligible. 
An order of magnitude comparison of the integrands of 
this term and the homogeneous term in (3.28) yields the 
ratio T (»—»)/ (e—»), where T (w—») is a typical width 
- at frequency (w—y). Although this factor is not small 
over the entire region of interest, it is never large. It 
therefore seems safe to conclude that the terms in g' are 
of no significance. These terms evidently describe weak- 
ation processes peculiar to the superconducting 
hich involve the motion of condensed pairs 
to the single-particle excitations." We plan to 
these processes further. 
he coupling beteen different components in 
b» space came solely from g' in (3.28), none 
, whenthis term is omitted. On introducing the 
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eminiscent of the collective excitation 
se of the superconductor to finite 
ectromagnetic disturbances. 
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spectral function (3.14) we then obtain 
Q2— |^ | 2 

ey 
le*—A*| 


x | 1— (2af —)E jl dine (END 


cum ERO D 


XE lono) lN ())g (o —o») 
+a) Nodeto]. (3.30) 


It follows from (3.30) that g(w) is odd in w, which one 
also expects on general grounds. Further one sees that 
near T=0 g(w)=0 for o« Ac. From (3.5) and our sub- 
sequent analysis we see that the thermal conductivity 
K is determined once g(w) is known by the quadrature 


c REL SE E ea 
=— w sech?(sBw)g(w). (3.31 
= T i E 


The Hartree-Fock result (2.17) is recovered if we 
approximate g(w) by the inhomogeneous term of 
(3.30). In the small width limit |Im[Z (o?— A2)/?]|-t — 
(w?— A32)2/oT'. If one introduces instead of g(w) a new 
function G(w) defined according to 


N 
=) (), (8.32) 


A 
w — Ar 


glw)=—r sena GA) | 
one obtains an integral equation for G(w) which is 
identical to the integral equation obtained previously 
from the Boltzmann equation. [Essentially Eq. (2.18) 
of I if the differences in phonon spectra used are taken 
into account. ] 

If the width is not small, (3.30) is the most con- 
venient form for numerical calculations. The complex 
functions A (w) and Z (w) have to be determined from the 
set of integral equations given in Appendix A. 


IV. CONCLUSIONS 


In spite of some anfractuous analysis, we have been 
led to a relatively straightforward scheme for calculating 
the electronic thermal conductivity of strongly coupled 
superconductors. The Eliashberg approximation of weak 
momentum dependence is essential for much of the final 
simplicity. We have found in this approximation that 
the thermal conductivity is determined by an integral 
of the expression w sech?(48w)g(w) over all excitation 
energies w. The function g(w) plays the role of the 
deviation from an equlibrium distribution and satisfies 
the generalized Boltzmann equation (3.30). 

If one neglects the difference between the transport 
and the scattering cross sections, or equivalently keeps 
only the inhomogeneous term of the integral equation 
(3.30), one recovers the result of the Hartree-Fock 
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approximation of Sec. II. In this case, for frequencies 
not too far above the gap edge g(w) becomes approxi- 
mately equal to — 2r (w?— A?) ?/T (w), where T (w) plays 
the role of a width parameter and is in the weak-coupling 
limit identical with the reciprocal of twice the quasi- 
particle lifetime as calculated in IT. The expression for 
Tlw) is P@)=|Z.@—A?)—Z,A,A.|o-1. The renor- 
malization function Z(w) and the gap function A(w) 
have to be determined from the Eliashberg gap equa- 
tions given in Appendix A. 

The gap equations have been solved for lead by 
Schrieffer e al. in the limit of zero temperature, but not 
for temperatures near T, which are of interest here. We 
can therefore only speculate about the manner in which 
the great reduction in the thermal conductivity of super- 
conducting lead might be explained by our theory. The 
calculations in I have shown that in the small width 
limit the decrease of K,/K, with decreasing temperature 
is mainly due to the increase of A, in the factor 
(w*— A52)? in the integrand of K., while Tr (as a 
function of Bw) is relatively insensitive to changes in 
temperature near T.. Let us assume that this is also true 
for lead. Then it is important to note that near T, 
energies up to about wm= 10&5T' contribute to the heat 
current. This means that the first peak in the curve for 
Ai(w) calculated by Schrieffer eż al. which occurs at 
w'=4.4X10 eVz10&sT. occurs roughly at wm. If 
the shape of the A; (w) curve were preserved at tempera- 
tures near T, or, more precisely, if the ratio A1(w)/Ao 
were nearly independent of T, then the effective gap Ai 
for the excitations predominantly carrying the heat 
current would clearly be much greater than the gap edge 
Ao. This would lead to a substantial reduction of K,/K, 
compared to the value obtained in the small width limit. 

To go beyond the Hartree-Fock approximation one 
must numerically solve the integral equation (3.30). 
From the experience of the calculations reported in I, 
it is not expected that this more exact treatment will 
drastically affect the thermal conductivity. Neverthe- 
less, without a solution of the integral equation it seems 
difficult to estimate the errors introduced by the 
Hartree-Fock approximation. It is worth noting that 
the most complicated part of the integral equation 
(3.30), which depends on the solution of the gap equa- 
tions, is the Hartree-Fock value of g(w) which occurs as 
a factor in both the homogeneous and inhomogeneous 
terms. It should also be noted that the same electron- 
phonon coupling strengths, y, and phonon frequency 
distributions F,(w,) occur in the integral equation for 
g(w) and those for A(») and Z(#). es. 

In the small width limit, i.e., a — 0, we have indi- 
cated how (3.30) may be transformed into the D. 
Boltzmann equation for the superconducting state. 
byproduct of the analysis given in the present paper has 

+ uei g f this equation, as 
thus been the justification of the use o q uem 
in I, for example. In our extreme Voce E 
frequency limit the only validity criterion 10r i do 
of the Boltzmann equation is that well defined quasi 
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particles exist in the thermally excited frequency range 
below roughly 10ksT. This criterion also applies to the 
normal state, and when.it fails one must resort to the 
more general approach described in this paper. 
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APPENDIX A 


In the Nambu representation the superconductive 
approximation to the electronic self-energy operator has 
the Fourier coefficients 


oe | a! 
2(p,61)=—- 
bki p Qxy * 


XG(o—-4, 61— Ym) raD (q,¥m) - (A1) 


Here 
Gt (5,2) EMG (5,2) (A2) 
and the Nambu Ansatz for Z(5,z) is 


Z (5,2) — zt (z)H- Xp(2)a4- p (2)71- (A3) 


Following Eliashberg we neglect x the even in frequency 
component of the diagonal (in Nambu space) part of Z 
as contributing only a shift in the chemical potential. 
Then S^! has the form given in Eq. (2.12), namely 


G71(5,2) =22 (2)—ers— (2) 71, (A4) 


where Z(z)=1—¢(z). The functions ¢ and Z have the 
properties 
Z(z)-Z*(; Z(—z)=Z(2), 


o(2*)=9*(@); e(—2-6Q). 


Z(z) is even because z¢ (z) is the odd component of the 
diagonal part of Z. The function (z) may be taken to 
be even in z for s state pairing because of spin-rotation 
invariance, time-reversal symmetry, and spatial-in- 
version symmetry.!® Using (A5) it is easily seen that the 
spectral function a(p,w) defined by Eq. (2.13) has the 
properties 


(A5) 


Tr[a(5,9)]- Tr[ e$, —e)1, 
Tr[ 7:2 (,9)] mem Tr[ r:a (b, — w)] r2 


Now we insert the spectral representations for G and D 


^ 


18 Questions of symmetry are very carefully considered in 
P. Nozieres, Theory of Interacting Fermion Systems (Wa A. 
Benjamin, Inc., New York, 1963). 


(A6) 
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[ Eqs. (2.10) and (3.7) ] into (A1) and perform the sum 
over vm. We then make the weak-momentum approxi- 
mation for ¢ and Z and perform the integral over d?g in 
the manner discussed in Sec. IIT. The integral of a(eo) 
over all e which is needed here is given by®” 


(oZ(w)+4(o)r 


] dea\e,w) = — 2r RA ) | , (A7) 
—o eio 


— A1 (oj) ]/2] ^x 


AND L. TEWORDT 


where e; is the root of &+¢?(w)— (w—i0+)272 (w)=0 
which lies in the upper half-plane. [For real argument 
w, $ (v) and Z (w) are given by (2.16). ] After performing 
the steps outlined above, we decompose the integral 
equation for Z(z) in the Nambu space and equate 
coefficients of 1 and 73 on the right and left sides. The 
symmetries (A6) may be used to simplify the equations, 
The final equations are 


1 2 A (c3 
Aor | dei ne — e | iot on 


X {LV (x) + f(—e1) ]E (14-9 4-2) 1 9- (914-9 — 2) ] 
— EN (e) - f (1) JL (—91i--94-2) + (wtw), 


i-ze- | dos Rel 


[93 — A? (w) 7?) a 


(A8) 


e Iz [soe 


{LV @)+f(—o1) ]L (14-9 4-2)71— (w14-e — 2) ] 
+L (e) + f (3) ]E(C— 12-3 4-2) 31— (— 914-9 — 2)71]) . 


The physical values of A and Z are obtained by setting 
z=w—10* in (A8). The sign of the square root in (A8) is 
to be determined by Im{Z(w?— A?)!?) <0. 


APPENDIX B 


The spectral representation (3.4) for the object 
xis (l1,l0,t3) = T W(t) u (to); (ts)) follows from its perio- 
dicity properties for imaginary times. (We suppress the 
spatial arguments for brevity.) Consider the region of 
time Reż, t2, 45—0; 0>Imh, tz 43> —B=ir, and let T 
order the operators from right to left in order of 
increasing negative. imaginary time.’ Then measuring 
energies with respect to the chemical potential, and 
using the cyclic invariance of the trace operation con- 
tained in the ground canonical ensemble average, one 
finds the periodicity properties 


Xij (0,t2,t3) = — Xij (T tzt), 
Xilin) = +X (t1,7,¢3) , (B1) 
i5 (515,0) = — Xi (tta T). 

As a result we may make the Fourier series expansion 


^ 1 
% +5 (t1,lo,t3) zm IA z Xij (nm,D)e 3ntig-irmtag-ituts , (B2) 


© T? lmn 


where (n= (2n4-1)m/7, v» — 2mm/7, ti— (212- 1)r/7. Be- 
cause x (I1,5,5) = x (I— t», £j—15) we expect x (nml) to be 
zero unless €,--v5,4-61— 0. Inverting the Fourier series 
we have E 


E Kiy(n,m,l) = / diy jr día f dlsX;(t1,t2,t3) 
2 Fe LE Q b, 9 0 x 


giintigirnt2git its : (B3) 
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To do the integral we break up the domain of integration 
into six regions corresponding to the six time-ordered 
constituents of x. In each of these regions the time 
dependence of the Heisenberg operators can be dis- 
played and the integrals trivially performed. One verifies 
that (nml) is indeed zero unless the condition written 
above (B3) is satisfied. It is convenient to take as 
independent variables (; and vm and to introduce 
Xx (Civ, t)mvTx(Id-m, —m, —1— 1) so that the Fourier 
expansion reads 


1 
Ki(I— te, 15—19) 2 — 22 Xu(Fr ym, £2 
T? im 
K e-i(Sttym) lto) eitilt), (B4) 


After explicitly carrying out the time integrals in (B3) 
in the manner indicated above, one finds that the 
Fourier coefficient x(fi--v5, ¢1) can be written in the 
form (3.4). The spectral functions f(? and f? obtained 
explicitly in this way are sums of products of three 
matrix elements, and two delta functions involving the 
frequencies o; and ws. After some rearrangement one 
finds that the spectral functions may be written com- 
pactly in the form 


fao f zl dí; 


X ((u(—2), ¥:(0)], V; (—1)) 
x eieitigio:ta | (B5) 
and 


£;0 Gud | a dis(([u(— to), V1 (0) ]; Wi(ts)}) 


X eiwitigivnts > (B6) 


am 
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Above, the curly bracket indicates the anticommutator 
and the square bracket the commutator. 

We now wish to substantiate the remarks made after 
Eq. (3.18) about the reality of g(w). If one uses time 
reversal invariance on the closed forms (B5) and (B6) 
one finds the relation 


fy (1,02) = — (ro) inkue® (—601, w2)*(72)1;. (BT) 


Further from explicit complex conjugation of (B5) and 
(B6) one finds 
fy (ww) *= —15/0 (v1, —x). (B8) 
From (B7) and (B8) we can derive the exact identity 
Tr[ £9 (1,02)73]= — Tr. f (w, —e2)ra]*. (B9) 


In the ladder approximation we have proved that 
| dep-Tr{ ral (,0) +£@(,0)]}=0. (B10) 


It follows from (B9) and (B10) that in our approxima- 
tion 


n dep-Tr[ raf (,0)] 
F 


= | [acp mte (B11) 


Using the definition of g(w) contained in Eqs. (3.18), 
(3.15), and (3.11) we see that (B11) implies that this 
function is real. 


APPENDIX C 


Here we sketch the evaluation of the integrals 71 and 
I» given in Eqs. (2.14) and (2.15). Although the evalua- 
ton is straightforward it seems worth presenting, 
particularly because (2.14) plays an important role in 
Sec. III. 
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Consider first J;. We have using (2.12) 
n= f de Tr[ 72G fc, w—10*) 7G (e, w—10*) | 
p o9? Z?--e— g? 5 
zl de =, (C1) 
-o (e~e)? (ea)? 
where e= +e; are the roots of 
eZ—ge—g-0. (C2) 


We choose" 4-e; to lie in the upper half-plane. Then by 
contour integration 


wL — e? 
n= -—— 


4e E 


(C3) 


where the last step follows from (C2). The integral 75, 
on the other hand, is given by 


n= f de 'Tr[738 (e, »—20*)739(e, w +70) | 
Azete lol 


=2 | ee 
- (e-e)(erea)e-a*)(ta*) 


Since e is in the upper half-plane, so is —&*. By a 
straightforward contour integration, 


eZ lolita o^ Z|*— [o] *4- et? 
Ide] 35 ] 


(C4) 


1 


8ie Ime; Ree; 8i Ree; Ime;-e* 


(C5) 


T ae 
Ime; i 


Using (C2) and the relation A(w)=¢(w)/Z (w), (C5) 
may be put in the form given in (2.15). 


1,12 
| €1i 
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Dissipation in Quantum Mechanics. Two-Level System. II 


I. R. SENITZKY 
U. S. Army Electronics Research and Development Laboratories, Fort Monmouth, New Jersey 
(Received 20 December 1963) 


The theory of a previous article, dealing with two types of two-level systems coupled to a loss mechanism 
(LM), is extended. The first extension consists of the consideration of the most general type of two-level 
system (TLS), in which the dipole moment is expanded in terms of the three Pauli spin matrices and unit 
matrix, the expansion coefficients being vectors (dipole vectors). The second extension consists of the addi- | 
tion to the thermal-reservoir type of LM of a large number of systems identical to the TLS under considera- 
tion. The TLS is described in terms of the time development of the Pauli matrices and differential equations 
are obtained for their expectation value in the presence of arbitrary driving fields. The Bloch equations 
fora magnetic dipole of spin 3 are exhibited as a special case of these equations, corresponding to a particular 
combination of the dipole vectors. All other combinations describe electric dipole systems. Equations for two 
simple special cases of such systems are presented, one treated in the previous article and the other having 
permanent dipole moment. The frequency of oscillation of a freely decaying TLS is derived and shown to be 
shifted by an amount that depends on the relationship between the dipole vectors. It is pointed out that 
the commonly held belief that any TLS can be represented as a magnetic dipole of spin 4 is only approxi- 
mately correct in the presence of dissipation. The conditions under which the differential equations for the 
expectation values of the dynamical variables of the TLS can be converted into differential equations for 


macroscopic variables are discussed. 


INTRODUCTION 


N a previous article, it was pointed out that there are 
many important physical problems which may be 
considered, with some simplification, as that of a two- 
level system (TLS) coupled to a loss mechanism (LM), 
and a study was made of this problem. The LM was a 
thermal reservoir type of system, and two types of TLS 
were considered, the magnetic dipole type and the 
electric dipole type without permanent dipole moment 
that couples to its environment through one dynamical 
variable only. In the present article the analysis of I will 
be extended in two aspects. One extension is the con- 
sideration of the most general type of two-level dipole 
system, of which the two types considered in I are 
special cases.? The results will produce, on the one hand, 
a unified method of treating all two-level systems, and 
on the other hand, an explicit and systematic exhibition 
of differences among two-level systems. The other 
extension is the consideration of a more complicated 
LM, one which consists not only of a thermal reservoir 
but also—in addition to the reservoir—of a large 
number of systems identical to the TLS under considera- 
1 tion, loosely coupled to it, and surrounded by the same 
environment. (In the language of magnetic resonance, 
coupling of the “spin-spin” type as well as the “spin- 
lattice" type will be considered.) Part I and part II 
< contain the first and second extensions, respectively. 
“ Some consequences of the results and several special 
cases are discussed in part TII. 


* 


i 


STR: Senitzky, Phys. Rev. 131, 2827 (1963); hereafter referred 
ric dipole TLS is far richer in possibilities than the 


ole TLS, since the latterlis restricted by the special 
angular momentum. All cases other than the two 
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The defining property of a dipole system may be given 
by the form of the interaction energy of the system with 
an electric or magnetic field, 


HI pM doi. (1) 


respectively, where d, the dipole moment, is an operator 
referring only to the system under consideration. Since 
our system is a TLS, the components of d, that is dz, dy, 
and d, are 2X2 Hermitian matrices, and for the most 
general TLS, these are arbitrary. Now, any 2X2 
Hermitian matrix may be written as a linear super- 
position of the Pauli spin matrices together with the 
unit matrix. We may therefore write 


4 
d=p > asa, (2) 


a=] 


where c is the unit matrix, 01, c2, and e; are the three 
Pauli spin matrices, u is a quantity having the dimension 
of dipole moment, and the a's are four real 3-dimensional 
vectors (determined by 12 real numbers, as many as 
determine three Hermitian 2X2 matrices) which deter- 
mine completely the dipole moment operator of the 
TLS. The three Pauli spin matrices obey the well-known 
properties 


oP=1, (c7)-0, [mn]-2i., ©) 


where i j, { ) is the anticommutator bracket, and k, 
l, m stand for the cyclic permutation of 1, 2, 3. The ais 
may be normalized by the requirement that 


d?(=d-d)=np2, (4) 
[the right side of Eq. (4) is to be considered multiplied 


A816 


fen a Pears 


i 
4 
| 
z 

a 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


DISSIPATION IN QUANTUM MECHANICS. TWO-LEVEL SYSTEM A817 


by the unit matrix] which gives 


4 
È ag-—n, (5) 


a=! 


where ” is a number of the order of unity chosen for 
convenience in the particular case under consideration. 
The representation of the o’s is such that the energy of 
the TLS is given by 27:95. For an electric dipole system, 
nothing more can be said about the a’s without addi- 
tional information about the system. For a magnetic 
dipole TLS, however, the proportionality between 
magnetic moment and angular momentum requires 
that a,—0, and that a;, as, and a; be perpendicular to 
each other and equal in magnitude. 

The TLS is subject to fields produced by the LM and 
to externally imposed fields, Exar and Eext, respectively, 
for the electric system, and Hrm and Hex for magnetic 
system. We introduce the notation 


F=—(2/h)pErm, í-— (2/%)uEext (6a) 
for the electric dipole case, and 
F=—(2/h)pHim, f-— (2/h)uHext (6b) 


for the magnetic dipole case. The Hamiltonian may 
now be written, for either case, as 


H=Hiythhowostsn Dia Tata Y, (7) 
where 


F=F+E. (7a) 


Comparison of Eq. (7) with Eqs. (I.60) and (1.70) 
shows that the electric dipole case in I corresponds to 
@a=5e1 and the magnetic case corresponds to ai=@, 
az= fj, a5—2, a,—0, 4, fj, € being unit Cartesian coordi- 
nate vectors. 

A TLS can have only three linearly independent 
dynamical variables, since each variable corresponds to 
a 2x2 Hermitian matrix and, as mentioned previously, 
there are only three such linearly independent matrices 
possible, aside from the physically uninteresting unit 
matrix. In the present analysis, (as in I), the TLS will 
be described by the three Pauli spin matrices. Once we 
have these, we can obtain the dipole moment, the 
energy, and any other variable. The LM will be de- 
scribed (likewise as in I) by Hım and F. 

The expression for the Hamiltonian, Eq. (7), may 


be rewritten as 


4 3 
H- Hyu-F Moss M DL 0daniyn; (8) 


a=] m-—l 


where the vectors have been resolved along Cartesian 
axes labeled 1, 2, and 3. As in I, we consider Fy, Fs, and 
F; to act independently of one another; we assume that 
they refer to three independent but identical LM's. 
Correspondingly, we write 


^7 Hiu- Hit Het Hs. (9) 


(This separation of Hrm was not indicated explicitly in 
I, but is implicit in the argument used there.) The 
equations of motion for the TLS and LM, obtained 
from Eqs. (8) and (9), are 


= —wor H} m (aont m3— G3mtym2) , (10a) 


6277 901-- 2 m (U3miym1— Cmts m3) ; (10b) 


= Dm (a1ni$m92— Cambs m1) J (10c) 
P= — GIA Fm Hn], (10d) 
H n= — (0/2)*2a Camal Hm Emn]. (10e) 


The last two equations may be combined to give 


EnO — G/A)EF nEn (OJH 2HE dan J di, 


0 
XCF nl) CE nll), Hs) ]r«()], (11) 
which may be rewritten as an integral equation 


t ty 
F4 (0) = EnO O+0/2h)> tan | zl dl; U m(t— 11) 
a 0 0 


XLF m(t), Fs), Ho 5) oallt) Us 1 0—15), (12) 


where 


U (7) =exp[ (2/7) H4, (0)7 ] , (12a) 
and where Fm® (f) is defined by 
Fn (f) - F,(0) , (12b) 


P,0()— —G/8)EFS(0),8,(0)]. — (12e) 


The important properties of F (z) needed for present 
purposes, and derived in I, are the following: 


(Fn ())=0, (13) 
(Fm (t1)F x (2) 


= ôn (2/7) i * dello") cose! (ht) 
—i£(w') sino'(4—1:) ], (14) 


where 
E(w!) = 3ahZ 1 B(e")[1— exp — ho / kT)], (14a) 
nlo) - 3 hZ-3B (o) L1d- exp — Ie / ET) ], (14b) 


2- [| iD zn, < tel 


B= | EQ HCE), 
XP(E+ ho’, E) exp(—E/RT), (144) 


p(E) being the density of energy states of the LM, 
F(E; Ex) being the average over smail ranges of E; and 


Er of |F ® |*, and T being the LM temperature. Both — 


n(w’) and £(w’) are assumed to become vanishingly small 
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as w’ approaches zero, and approximately constant in 
the neighborhood of w’=o, the values in this neighbor- 
hood being denoted by n ande (without argument) 
respectively. The expression (F (1;)F (t2)) will occur, 
in the present analysis, as a factor in an integrand, the 
integration being over 4, or i», or both. Consider the 
expression 


diio(h)(F ()F (0) (15) 


for >w. Substituting from Eq. (14) into (15), and 
interchanging the order of integration, one sees that if 
e(l) is approximately an oscillating function with 
angular frequency o", the main contribution to the 
integration comes from the neighborhood oc". The 
same argument applies if the integration in (15) is over 
to [with e(I;) instead of (A) ]. We may therefore write 
for use in subsequent integration 


(FO (A)F(t)) 


- I (162) 


l— le 


z= a|» (w"’) 5 (t1— 12) at, (e^) 


and 
({F (th) ,F (£)))e4n(9")8(5—1:)). — (16b) 


The values of o" to be encountered in the present 
article will be only w and 0. For the latter value, it is 
clear that (F (/;)F € (t2))=0. 

Equations (12) may be written as 


Fa) PO (04-75 (Dd Vo OY (0) ? (17) 
where 


1 t ty 
Fent()=— | as diU ,,(1— 11) 
2h Jo 3 OF . 


X [PF m(t1), EF (te); H mlt) Joa (2) ] 
XUmi(t—h). (17a) 


By use of approximations based on the fact that the LM 
is affected only slightly by the TLS, it is shown in I that 


Fin (t) = — £s), (182) 
Fm (i) — £n), (18b) 


where £ is a (c-number) function of ¢ which is zero at 

t=0 and approaches the constant £ in a time large com- 

pared tu w but short compared to the time during 

which secular changes (in the TLS) take place. It is also 
“shown in I [Eq. (1.76) ] that 


1 Fay (20, (192) 
and by ax identical argument? it can be shown that 
Fi, (0) «0. (19b) 


quation (192) follows from the fact that a(t) is a slowly vary- 
unction [while o1(#) and o2(¢) are approximately oscillatory 
ctions with (angular) frequency w]. Equation (19b) follows 
‘the fact that c4 is the unit matrix and does not vary at all. 
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Substituting from Eqs. (18) and (19) into (17), we 
obtain 4 
Fy, (0) = EnO (t) E ais Eos (L) E d» E01 (£) D (20) 


This expression for F,,(/) may now be substituted into 
Eqs. (10a)—(10c), the equations of motion for the TLS 
Before we do that, however, it is convenient to rewrite 
the products of m and c, in these equations as sym- 
metrized products. (Note that [F,(¢),oa(t) ]=0.) Then 
by substituting from Eq. (20) and utilizing the proper. 
ties of the o’s given in Eq. (3), we get 


6,7 —oo1- 2; Gam (8 0,73) 
m 

—las»s (yn 0 02) + 15035) , (21a) 

03— 901-2, (Sas (8. 9,01) 
m 

—lai(Sn 9,03) Famam), (21b) 

$3 2. [ 3015(5» 0,02) — 24 2m{ Fm 71} 
m 


xus (dimt Gon?) E] E (216) 
where 
s mE, O- fn. 


Equations (21) contain as unknowns, dynamical vari- 
ables referring to the TLS only. F is a “prescribed” 
field, and so is, of course, f. F® is the quantum- 
mechanical version of a stochastic force. With £ replaced 
by £, and with f—0, Eqs. (21) are of the type referred 
to in I as the Langevin equations for a TLS. 

It should be noted that, while F,,(¢) and ca(t) com- 
mute, Fm® (t) and ca(t) do not. The commutation rela- 
tions between the latter two operators may be obtained 
from Eq. (20), utilizing the fact that [Fin(t),oa(t) ]=0; 
the result is 


[cy F,, 9 ]— 2ia15 £05 ; (22a) 
Lon Fn |= 2ias,s £03, (22b) 
[es Em w] = Qué (@imo1+ 025,02) o (22c) 


The problem may now be considered as formally 
defined by Eqs. (21)—Eqs. (22) being part of the pre- 
scription for F—with initial values for the o’s given 
by (their usual description when the Schrédinger picture 


Ee o 4 D —3 
OE ap nou ): 


e 


From Eqs. (21) and (22) it can be shown, in a manner 
similar to that employed in I, that the solutions of 
Eqs. (22) subject to the initial conditions (23) are true 
spin operators, satisfying Eqs. (3). These solutions, 9 
course, are operators both with respect to the TLS an 
LM. It is easy to see that this satisfactory state of 


(23) 


0 8 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


DISSIPATION IN QUANTUM 
affairs is due to the fact that £ vanishes at /—0, since 
F (0) and e4(0) are taken to be the uncoupled operators, 
and the coupling between TLS and LM is assumed to 
begin at /—0. Having shown the consistency of the for- 
malism, we now replace £ by £, as approximation for 
compulational purposes. 

A solution of Eqs. (21) that is an operator in both 
TLS and LM spaces has not been found (even when all 
a’s but a; vanish, as in the electric dipole case in I). In 
order to make progress with Eqs. (21), we take expecta- 
tion values in LM space. This does not simplify the 
situation automatically since expectation values of 
products, ({F,,,c¢2}), occur, and they are certainly not 
equal to the product of the expectation values. The 
evaluation of expectation values of these products will 
involve a significant approximation. 

We consider § (that is, both F(? and f) to be a 
small quantity (compared to w) of first order. Equations 
(21) can be rewritten as integral equations for the o’s in 
which the kernels are of first order. Defining the symbols 
€ (/), Be(t), &3(t) by rewriting Eqs. (21) as 


o1(t) = —wo2(t) +41 (2) , (24a) 
é()= oni ()+62(), (24b) 
o3(t)=@(t) , (24c) 
we obtain, as equivalent integral equations, 
t 
net | di, cose (t— £)di(4) 
0 
— | di sine(/—A)P»(&), (25a) 
0 


t 
sem | dli cosc(1— 1) s(t) 
0 
t 
T i dt; sinw(t—ty)®1(), (25b) 
0 
t 
scat | dt,®3(t1) , 
0 


where the o Os are the solution for the free TLS satisfy- 
ing the initial conditions. A substitution is now made 
from Eqs. (25) into all products {Em0 Sa} occurring in 
Eqs. (21). Thus 


(CF, (t1),o1(t1)} )uat 
=), iE dty([ain sine(1—11) 
n Jo 


ar, coso (ih) {Fn (), (85509 Crs) 
— as UP, (D, {Fn (£,02 (5233 cose (I— 11) 


— as (F9 (D, (85 9 (ta), 2x} 
pd Xsinw(i—h))im, (26) 


MECHANICS. 


TWO-LEVEL SYSTEM A819 
where use has been made of the fact that (Fn ca ras 
=(F, ow, and of Eq. (13).4 The significant approxi- 
mation consists of ignoring the noncommutativity of 
Ca and F,, and replacing the product of the LM 
variables by its expectation value, both operations to 
be performed only in terms of higher order than the first. 
(See I for a discussion of this approximation.) Utilizing 
Eq. (13) and the Kronecker delta of Eq. (14), we obtain, 
with the above approximation, 


(Fm 1} um 
"i di { Fo (t), Fm (t1)}) 
0 


x {o3(t:) [dim sinw(t{—t,)-+- dom cosw(t—t;) ] 
(27) 


The considerations related to Eqs. (15) and (16) may 
now be applied. o5(/41) is a slowly varying function of /1, 
so that Eq. (16b) with w’=w must be used for the 
integration associated with first square bracket in Eq. 
(27). The second square bracket is shown in I [Eq. 
(1.81) ; note that cı and oz approximately oscillate with 
frequency w] to be itself a slowly varying function of /1, 
so that Eq. (16b) with w’”=O must be used for the 
integration associated with the second square bracket, 
and the contribution of this integration vanishes. The 
result is 


—d3ml_o1(t1) sinw({—1,)-+-ao(t;) coso(1— 5)h F 


((F,, 9, 01) ) ri = 2damn(o2) rA - (28a) 


In an entirely similar manner, one obtains 


((F,, 9 os) im = — 2d1mn(os) uM , (28b) 


and 
((F,, 9,03) um = 2n (a15 (02). — a2 (01) 1M) . (28c) 
Substituting from Eqs. (28) into Eqs. (21), we have 


a= (—w-aj- aon) o2— aneit ar an (05 — 00) 


--as* fo3— az- fo, (29a) 


G2= (w+ a1- a2)01— a1 noz 22* am (03— 00) 


4-as*fo1— ar foz, (29b) 

= a a n 
where the notation 

gom: —E/n (29d) 


is used, and where LM expectation value brackets are 
omitted. (Since these are operator equations in TLS . 
space, oo is to be understood as being multiplied by the 
unit matrix.) It is easily seer» that the results of I for 
the magnetic and electric dipole cases treated there are 
obtained by setting @in=dim in the former case and 
0i, 8580»1 in the latter. Equations (29) describe the 
* Where brackets are used to indicate expectation value with 
respect to the LM, the subscript “LM” will be added if the en- 


closed expression refers to both the TLS and LM, while no®sub- 
script will be used if the enclosed expression refers only tothe LM. - 
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behavior of the most general TLS coupled to a thermal- 
reservoir type LM. 

We can take expectation values with respect to the 
'TLS in Eqs. (29). This leaves the equations formally 
unaltered, and the o’s now stand for their respective 
expectation values in both LM and TLS spaces. Hence- 
forth, Eqs. (29) will be understood to refer to these 
expectation values. Discussion of the physical signifi- 
cance of these equations will be postponed until the 
second generalization has been carried out. 


II. 


The second generalization refers directly to the LM 
rather than to the TLS. Instead of assuming the LM to 
be only a thermal reservoir of given temperature, we 
consider the LM to consist of two parts: The first part 
(to be referred to as LM3) is the same thermal reservoir 
considered heretofore; the second part (to be referred to 
as LM») consists of a large number of two-level systems 
identical to the TLS under consideration, situated in a 
similar environment, and loosely and randomly coupled 
to our system and to each other. The individual two- 
level systems of LM; are thus coupled (in addition to 
the coupling to each other and to the TLS under con- 
sideration) to a thermal reservoir and to external forces. 
The thermal reservoir may be LM; itself, but since the 
possibility of the transmission of effects through LM; 

will be excluded, it is simpler to consider the systems of 
LM? coupled to a thermal reservoir that is identical to 
LM; but independent. In the notation to be used, LM» 
consists of the mutually coupled two-level systems only, 
the thermal reservoir and external forces being described 

| separately. 

The Hamiltonian for our problem is obtained by 
adding several terms to the expression in Eq. (7), the 

| result being 


| A= Fhoost+zh Da [UC (FO+F- f) 
-FHiwidHiwstHO, (30) 


where Erw; and Hw are the energy operators of LM; 
| and LM,, respectively, # and F? are the coordinates 
1 through which LM; and LM,, respectively, couple to 
[ the TLS, and H®) contains the coupling term of LM; 

to any other systems, the energy operators for those 
* systems, and the coupling term of LM: with prescribed 
external forces; that is, H® refers to everything that 
; couples to LM» except the TLS under consideration. 
— < We pass now from the Heisenberg picture, which has 
< been used in the preceding discussion, to the interaction 

c . picture. All the operators are transformed into primed 
E epe in the manner® 


i Re ee (31) 


e noted £hat the interaction picture is obtained here 
th the Heisenberg piene. This accounts for the 

"in the right side of Eq. (31a), the opposite of 
1 starting with the Schródinger picture. 
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where U is a unitary operator satisfying the differentia] 
equation and initial condition 


ihU (t) - U ()H (0), (31a) 
U(0)=1. (31b) 


All the primed operators now satisfy the equation of 
motion 7 
ihÁ' — [A', H' — H O" H-ih (04'/010); (32) 


in other words, the effective Hamiltonian that enters 
into the equations of motion for the primed operators no 
longer contains any reference to the influence of external 
systems or forces on LM;. Formally, LM; and LM, 
appear equivalently in these equations. In this trans- 
formation, the effect of the external influences on LM, 
has been removed from the operators and transferred to 
the state vector, which becomes 


v'()- U(w. (33) 


It should be remembered that Y is independent of the 
time and would be the initial state if the Schródinger 
picture were used; Eq. (33) may be written as 


W' (i) =U ()v'(0). (34) 


Now, if there were no coupling between the TLS and 
LM,, Eq. (34) would yield the entire time development 
of LM;, and leave all the other systems to which refer- 
ence is made in the Hamiltonian of Eq. (30) unaffected, 
that is in their Heisenberg (initial) state. The presence 
of the 120,a,- F ? coupling term affects U only indirectly 
through the time development of H(t) [note that 
H O (0) does not contain this coupling term] and does 
not produce lowest order effects. The difference between 
V (t) and the corresponding state in which there is no 
coupling between the TLS and LM; is due only to the 
effect on LM; of the TLS, and is slight. We approximate 
by ignoring this effect in W'(/) and consider V'(/) to 
denote the state in which the TLS and LM; are un- 
affected while LM; has developed under the influence of 
those systems (and external forces) explicitly entering 
into H®, (This approximation is similar in spirit to 
approximations previously made for LM;, where LM 
variables in interaction terms were replaced by their un- 
coupled values. In the present instance, LM; may be 
uncoupled only from the TLS but not from the effect 
of H®. This is the essential difference between LM: 
and LM».) 

Analytically, the approximation just performed may 
be described as follows: Set 


V —VrsV rM LM2; 


(this implies that the coupling is turned on at 1-0), 
which is the same as 


Y (0) — Verr (0) Jr (0) Yume’ (0) - (35b) 


Consider H  (/) to be obtained from H® (0) (by means 
of the Heisenberg equations of motion) by neglecting 


(35a) 


= 


ct 


mn m Cho ct ct 09. UU rm* (b 


a M to 


saat 7 — 2-2 


b 


DISSIPA 
L2 1 ^ 

vihen H(t) and, therefore- 
a)—U (f) contain only LM, 


the F term in Eq. ( 
as is evident from Eq. 
variables. Thus, 


W’'(t)=rrs' (0) zi (0)U (Dcus (0) 
—wVrrs' (0)Vrwi (0) rus (2). (36) 


Since LM; and LM; are large and incompletely de- 
scribed systems, we will describe their initial states, as 
has been done previously for LM;, by ensemble aver- 
ages. For this purpose, it is more convenient to use 
density matrices instead of state vectors. We therefore 
write, in place of Eq. (36), 


P' (Ù= pris’ (0) oxi (0) oe! (Ù, (37) 


where P is the density matrix for the combination of 
systems, and the p's are individual density matrices. 
Both yw» (f) and prm? (t) describe the development of 
LM: in the absence of coupling to the TLS under 
consideration. 

We come now to the essential aspect, and significant 
approximation, of the analysis of LM». LM» is obviously 
not a thermal reservoir, since the individual two-level 
systems of which it is composed can interchange an 
amount of energy comparable to their own energy with 
the thermal reservoir and external forces to which they 
are coupled. LM? cannot, therefore, be described by a 
fixed temperature. We assume, however, that the 
randomness associated with LM» (due, mainly, to the 
random coupling among the individual two-level sys- 
tems) is sufficiently great so that at any given time LM; 
may be described by a temperature, this temperature 
being, in general, a function of the time. According to 
this assumption, the density matrix for LM; is given by 


pum? (1) — Z2 expL— Hyws/ KT (0) ], (38) 
where 


Zo(t)=Trace exp — H vw/ KT (1) ]. (38a) 


T (i) is left unspecified for the time being, but we assume 
that it changes slowly compared to exp (iut). 

Now, LM», like LM, is a large system with many 
energy levels closely spaced. It may therefore be treated, 
in the analysis of its interaction with the TLS, in the 
same manner as LM,, but with cognizance of the time 
dependence of the temperature. (It is to be noted that 
the temperature enters into the analysis only through 
the density matrix.) Our problem thus reduces itself to 
the analysis of the interaction of a TLS with external 
fields and with two LM's, one of which has a time- 
dependent temperature. 

Before this oben is considered, it is useful to look 
at a simpler situation, one in which there are two LM s 
with different, but fixed, temperatures. Very little 
analysis is required to notice that the change required 


i ture 
*Si is composed of two-level systems, its tempera 
may pee ege well as positive values, depending on the 
average energy of the two-level systems. 
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in the results obtained for a single LM is the replace- 
ment of n by 7-+n®, and the replacement of & by 
£O-F£O, where the superscripts (1) and (2) refer to 
LM; and LM,, respectively, and £(? and 7 are defined 
for each LM in the same manner as £ and n, respectively, 
by Eqs. (14a) and (14b). From Eq. (29d) we have 


«9 — 9/0; (39) 
thus Eqs. (29a)-(29c) remain valid provided we set 


—40-4-40, 40 
075 0-Fn ss a) 


i (59 oaa SS Pagi; 
We 134. Alos 4-6 


We return now to situation in which T@ is time- 
dependent. This time dependence affects several calcula- 
tions that lead to Eqs. (29). First, there is the computa- 
tion of expectation values of LM, variables. The ex- 
pectation value of an operator A'(/) referring to a single 
time (and uncoupled from the TLS) poses no problem 
since it is given by Trace pray’ (f) A' (D). The expectation 
value of an operator of the type {A1'(t1),A2'(/2)}, 
referring to two different times, requires, in general, 
another method of evaluation. In the present analysis, 
however, this type of expectation value is needed only 
in cases where the result is negligible unless ¢; is very 
close to £». [See, for instance, Eqs. (14) and (16). ] Bear- 
ing in mind that T® (/) is a slowly varying function, we 
can use previous results for this type of expectation 
value, with £9? and »®—which are functions of 7'0— 
evaluated at either 41, t2, or an intermediate value. Then, 
there is the question of time integration where £? or 
1? is part of the integrand. [See, for instance, Eq. (27). ] 
Here too, no formal modification of the final result is 
necessary, for in this integration the main contribution 
comes only from a very small neighborhood of a definite 
time, so that £O or 7 may be evaluated at that time 
and taken outside the integral. It is seen, therefore, 
that Eqs. (29a)-(29c) and (40) remain formally un- 
changed, but with 7® and es? being considered now 
functions of ¢ (the same argument as that of the a’s 
and f). 

Our final task is the evaluation.of T®, or, more 
directly, of oo) and n®, through which the behavior of 
the TLS depends on 7?. From Eqs. (39), (14a), and 
(14b), we have 


and 


—hofkT®)—1 
t (M) a 
exp(—fw/kT@)+1 : 


oo” 


This is the expectation value of the energy, in units of 


g average 

[iat con- " 

idé ntical = Y 
edm a 


energy (at time /) of the t 
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similar environment, this energy is also equal to cs(/)." 
Thus, we have the important result 


ey =o). (42) 
As far as 7 is concerned, no such simple consideration 
applies. The best that can be done within the scope of 
the present analysis is to observe that €? is not as 
strongly dependent on T® as cy? (or ®)—as can be 
seen from Eqs. (14)—and to approximate it by an 
average value 

From, Eqs.‘ (29a)-(29c), (40), and (42) we finally 
obtain °° : 

[M gk 

g,— (~oa 33)0s— naz sına" as(o3— 00) 


i: -Fas-fos—as:fos, (43a) 
G2= (w-I-5a1* a2)01— 7a o+ naz: as (03— 00) 
-F-as:fo4— ai: 105, (43b) 
637 — (aj?4-a5?)n(ea— oo) + a1: fr» — az for, (43c) 
where 
=n Hn, =n, cmo. (43d) 


In summary, these equations describe the behavior (in 
terms of expectation values) of a general TLS subject 
to external fields and coupled both to a thermal reservoir 
and a large number of systems identical to itself. The 
dipole moment is given in terms of the o’s by Eq. (2), 
and the energy is 37wo3. Two relaxation constants, 7 
and 7 (with 7 being a better “constant” than 7, which 
may be regarded as depending to some extent on the 
energy of the TLS) and an equilibrium energy oo enter 
into these equations. The equilibrium energy is that 
associated with the temperature Tı of the thermal 
reservoir, and is obtained by replacing superscript (2) 
with superscript (1) in Eq. (41).5 


II. 


A few special types of TLS are of immediate interest. 
As mentioned previously, the magnetic dipole type has 
31—1, a2= fj, a5—2, a4—0. Equations (43) become 


D= —003— Nrt Jz0z— [204 (44a) 
Cy=Wo2—Toyt feor— 202 , (44b) 
(44c) 


(EE —2m(o3—230)-- f22— fy92- 


1 Strictly speaking, there is an approximation involved in this 
statement, since LM expectation values are averages both with 
a respect to a thermal (canonical) ensemble—implicit in the density 
matrix employed—and a quantum-mechanical ensemble, while 
ui» expectation values are everages only with respect to the 
atter. 
3 Equations (43), involving two dissipation parameters, 7 and n, 
are more general than the LM considered in the above discussion. 


More general loss mechanisms may be obtained by replacing LM, 

— —. with several thermal reservoirs at different temperatures, some of 
these temperatures being, possibly, prescribed slowly varying func- 

~ tions of the time. Equations of the form of Eq. (40) can then be 


used 40 obtain new values for the two dissipation parameters with 


. Eqs. (43) remaining formally unchanged. 
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These are essentially the Bloch equations? with (251 
being the longitudinal relaxation time T', and 77 bein, 
the transverse relaxation time 75. If the spin-lattice 
coupling is regarded as coupling between the TLS anq 
a thermal reservoir, and spin-spin coupling as couplin, 
among identical two-level systems, then Eqs. (44) and 
(43d) show the contribution of each type of coupling to 
the relaxation process. It should be noticed that it is not 
in general, correct to refer to T» as the spin-spin relaxa- 
tion time (as is often done) unless n® is negligible com- 
pared to n®. If n® is zero, Eqs. (44) reduce to the 
results of I [ Eqs. (1.84) ]. 

As far as the electric dipole TLS is concerned, there 
are many possible types, depending on the choice of the 
a's. a4 does not enter into the equations of motion and, 
therefore, does not affect the behavior of the TLS. a4and 
a3 determine the “permanent” dipole moment, or the 
nonoscillating part of the (expectation value of the) 
dipole moment of the free TLS, for only e; can have a 
nonoscillating value different from zero when f=7 
=n=0 in Eqs. (43). Since a; plays no dynamic role, we 
will ignore it henceforth, and refer to the constant part 
of o3 (times uas) as the permanent dipole moment. A 
case in which the dipole moment has a particularly 
simple appearance is that in which a;=a3=0, so that 
d=ajo;. This is the electric dipole case treated in I. 
Taking a5?—1, a;-f- f, Eqs. (43) become 


01— —002, (45a) 
63—001— 102— fos, (45b) 
da= —7(o3—00) + for, (45c) 


which, for 7? 20, reduce to Eqs. (1.67). A permanent 
dipole moment may be added to this case, while still 
maintaining a relatively simple form of the equations, 
by considering as—0, a3=ea:. (Here, the permanent 
dipole moment has the same spatial direction as the 
oscillating dipole moment.) The result is 


017 —0023— E03, (46a) 
02— 901— oF efo1— fos, (46b) 
63=—7(o3—00)+ foz, (46c) 


where Eq. (46a) has been simplified in appearance by 
substitution from Eq. (46c). 

It is not the purpose of the present article to present 
equations for large number of special cases, although 
there are many others than the above three particularly 
simple ones that are of interest. Neither is it the purpose 
of the present article to discuss the solution of the above 
equations for given driving fields, which will be done In 
a forthcoming article. We will, however, discuss the 
solution of the equations in the absence of a driving field; 
that is, the free decay of a TLS from given initial 
conditions. For the sake of simplicity, we consider only 


°F. Bloch, Phys. Rev. 70, 460 (1946); R. K. Wanggness and 
F. Bloch, ibid. 89, 728 (1953). 
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the case in which a; is either perpendicular to both a; 
and a», or is equal to zero. From Eqs. (43) we have, 
under the above conditions, 


o= (—e-F 541: a;)0»— 5a5/o1 , (47a) 
637 (v-1-741* a2)01 — 241702 , (47b) 
G3= — (a! -a)n(ss— o9). (47c) 


The last equation may be solved immediately, the result 
being 


o3=o0+[o3(0)—o0] expL— (ai?2-as?)gt]. (48) 


Equations (47a) and (47b) yield the same equation for 
both cı and oe: 


G1,2+7 (a1 + a2") 61,2 
+ GET Caa? (21:22)? }}or,2=0. 


The solution of Eq. (49) is 
e1— exp[ —7(ai*4-az)t][ai cosQi-+B, sin], (50a) 
o2=exp[ —7(ai4-az)£ [as cosQi+ Bs singi], — (50b) 


where A; and B; are constants determined by the initial 
conditions, and 


V=% {1— G/e?)L (ai: as)?-r1(ag—a2)]]. (0c) 


We see that c; approaches exponentially its equilibrium 
value, and that c; and e» undergo exponentially damped 
oscillation. The frequency of oscillation € is of interest. 
For a; and as equal in magnitude and perpendicular to 
each other, 9 is equal to w, the frequency of the free 
TLS; otherwise © is less than e. Equation (50c) puts 
into perspective the results of I, where it was found that 
the frequency of the undriven damped oscillator was 
equal to that of the free oscillator for the magnetic 
dipole case but not for the electric dipole case. It is seen, 
now, that the magnetic dipole TLS has just that com- 
bination of a’s which yields 2=a, but the electric dipole 
TLS treated in I has a2=0, a;?— 1, giving 
go 1-167/8)], 

in accordance with the results of I.” 

One encounters occasionally the statement that any 


TLS is equivalent to a system of spin 5 in an appropriate 
magnetic field." While this statement is true in the 


(49) 


(51) 


0 A arison might be made between Eas. (47a) and (47b) on 
the SEd and ARE for the coordinate and momentum of 
a harmonic oscillator with dissipation on the other. If both coordi- 
nate and momentum couple to the loss mechanism, then the 
equations of motion for the harmonic oscillator variables (in 
appropriate units) are > 

p=—ag—mp, d-ep-—n4- 
i d to 

i if a; and as are orthogonal, c» and a; correspond t 
s s es momentum, respectively, of the harmonic 
oscillator. It can also be seen readily that the oscillatory Heuer 
of the freely decaying harmonic oscillator is equal to oe y E 
71735. Comparison with the dissipation terms 1n Eas. ( Run 
(47b) shows that this corresponds to our requirement (in addition 
to th hogonality) that a;^—02. : 

2 N ; pube gen d Y. R. Shen, Phys. Rev. 133, A37 eee 
A. Abragam, The Principles of Nuclear Magnetism (Oxfor 
University Press, New York, 1961), P- 36. 
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absence of dissipation, it is only an approximation when 
the TLS is coupled to an LM, as the above frequency 
consideration reveals, and as can be seen from the 
dependence of Eqs. (43) on the type of TLS considered. 
It was shown previously [Eqs. (44)] that the Bloch 
equations are a special case of Eqs. (43). The Bloch 
equations were developed originally for macroscopic 
matter. They also apply to the expectation values for a 
microscopic system, and, in this sense, correspond to a 
special case of Eqs. (43). One might ask how the present 
theory applies to macroscopic matter, or to a large 
number of systems identical to—but possibly, with 
different orientation than—the TLS under considera- 
tion. Can one, as in the case of the Bloch equations, 
regard a linear combination of the (four) o’s as the 
component of dipole moment along a given direction for 
macroscopic matter containing JV systems similar to our 
TLS? The macroscopic dipole moment D is given by 


N N 4 
D-Y, d?2Y, Y, aoa”, 


j-1 


(32) 


j=] a=l 


where the superscript j refers to the jth TLS. The above 
question is equivalent to asking if ¢, is independent 
(except for a multiplicative constant) of 7. It is clear 
that if the systems are all oriented in the same manner, 
then a, 0 = aa, c, =a, 


D=Nd, (53) 


and the answer to the above question is, obviously, in 
the affirmative. If the microscopic systems are not all 
oriented similarly, then the coefficients a;?-f in Eq. 
(43a) are different for differently oriented systems. If, 
because of symmetry conditions, the solutions of Eqs. 
(43) are essentially unaffected by a particular distribu- 
tion of orientations, then the answer is still in the 
affirmative, but if no such symmetry exists, then Eqs. 
(43) cannot be regarded as equations for macroscopic 
polarization. The macroscopic result must be obtained 
through Eq. (52), that is, by solving first the microscopic 
problem, and then superposing the solutions. 

The essential aspects of the orientation of a magnetic 
dipole TLS are determined by the tield that is respon- 
sible for the level separation. If this field is the same for 
all the two-level systems, Eqs. (43)—which reduce in 
this instance to the Bloch equations—may be regarded 
as equations for macroscopic polarization. Such is the 


case for macroscopic matter in an external (laboratory) r 


field. If, however, the field responsible for level separa- 
tion is an internal field and has different orientation for 
different dipoles, the above equations may be applied 
only on a microscopic scale. è 
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The Heitler-London method based on nonorthogonal atomic orbitals is applied to arrays of an infinitely 
large number of atoms. The electronic energy is given by a quotient with strongly divergent numerator 
and denominator, which until now has defied correct computation. By noticing some resemblance of this 
problem to the linked-cluster expansion in many-body problems, we have now developed a new method to 
compute the Heitler-London energy. Here, the numerator and denominator are divided simultaneously 
by a common factor, which leads to a set of recurrence relations between the normalization matrices F. 
'The matrices F are essential parts of the quotient to be calculated. When an overlap integral (E |) is repre- 
sented by a line starting from /; and ending at k, the calculation of F using the recurrence relations is carried 
out systematically by drawing diagrams consisting of connected loops. Since our present aim in applying the 
Heitler-London method is to compute spin-wave spectra, the calculation is carried out in the complete space 
of spin waves and the energy expression is given by a Hermitian matrix. This introduces additional matrices 
A. The computation of A is also carried out by the diagram technique, since A can be expanded into an 
asymptotic series using F. Finally the energy matrix is written as a sum over connected diagrams, in ac- 
cordance with the speculation obtained from the linked-cluster expansion. An error introduced by truncating 
the series of diagrams is also calculated. The present method not only ensures that the energy density in the 
Heitler-London method is finite, but also provides an accurate and practical way to compute the Heitler- 
London energy, which has never been accomplished previously. To calculate the ground-state energy the 
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Cluster Expansion in the Heitler-London Approach to Many-Electron Problems* 


technique is simplified since the computation of A can be completely eliminated. 


I. INTRODUCTION 


OR many years, the concept of exchange coupling 
has played an important role in theories of chemical 
bond and magnetism. This concept was first introduced 
in the Heitler-London theory! of molecular binding and 
applied by Heisenberg? to ferromagnetism. Although 
the Heisenberg theory has been formulated in a very 
attractive form by Van Vleck? and Mgller* and much 
theoretical work in magnetism has used this formalism, 
the method has been criticized in many respects. 

In this paper, we will limit our discussion to one of 
the difficulties inherited in the Heitler-London approach 
to magnetic problems; that is, the nonorthogonality 
difficulty in calculating energy spectra. The other 
aspects of the difficulties will be found in, for example, 
review articles by Herring. 

The nonorthogonality difficulty can be stated as 
follows. If we assume that there are no spin-orbit terms 
in the Hamiltonian, the calculation of the Heitler- 
London energy of the form 


E= fees J J W*vdz 


is reduced to that of the Heisenberg exchange Hamil- 
tonian - : 
, SC —2 X J58:8;, (1.2) 
uL: i>j 


r 


(1.1) 


007757 se = a ze E 


provided the wave function Y of this system is con- 
structed from orthogonal atomic orbitals. 


_____*Based oa work performed under the auspices of the U. S. 
— Atomic Energy Commission. 

i 1 W. Heitler and F. London, Z. Physik 44, 455 (1927). 
Heisenberg, Z. Physik 49, 619 (1928). 

H. Van Vleck," Theory of Electric and Magnetic Suscepti- 
A 'endon Press, Oxford, 1932). 

jller, Z. Physik 82, 559 (1933). 
erring, Rev. Mod. Phys. 34, 631 (1962); also J. Appl. 
1, 3S (1960). 
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The orthogonality of atomic orbitals makes the ex- 
change integrals J;; thus obtained all positive. Since a 
positive exchange integral in the Heisenberg theory 
corresponds to ferromagnetism, this approach leads to 
the obviously absurd conclusion that all substances are 
ferromagnetic. 

To utilize the Heitler-London method to full ad- 
vantage, nonorthogonal atomic orbitals should be used 
as the basis. However, the calculation of energy spectra 
becomes extremely difficult since both numerator and 
denominator of the energy expression (1.1) diverge 
when the number JV of electrons involved increases. It 
is not at all obvious if it is permissible to discard, from 
the energy expression, terms of higher order in overlaps. 
Such terms may appear N times, JV? times, or even more. 
Slater? and Inglis? pointed out this difficulty, which led 
to grave doubts as to the validity of the Heisenberg 
exchange Hamiltonian??? (1.2). , 

Previously,” we have discussed this difficulty. (This 
paper will be referred to as paper I.) The remaining 
problems will also be examined in detail and will be 
published elsewhere." The conclusions will be summa- 
rized as follows: The exact calculation of the Heitler- 
London energy can be converted to the eigenvalue 
problem of a spin Hamiltonian of the form 


Rspin= 2, P(— 1)7J pP", (1.3) 


where P is a permutation operator and P" the corre 
sponding permutation of spin variables. The exchange 
energy Jp for a single interchange P of electrons 15 
very nearly equal to the value one would compute by 
neglecting terms of higher order in overlaps. The con- 

6 J. C. Slater, Phys. Rev. 35, 509 (1930). 

7D. R. Inglis, Phys. Rev. 46, 135 (1934). 

2 J. C. Slater, Rev. Mod. Phys. 25, 199 (1953). 

* G. Heber, Fortschr. Physik 1, 707 (1954). 


1 T. Arai, Phys. Rev. 126, 471 (1962). 
u T. Arai and C. Herring (to be published). 


| 
| 


0 


Un CÓ) c[ ct (0 8&9 c& 0 cn H 


| IA A A o I p 0— ^ e [0D Mo (m [l 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


HEITLER-LONDON APPROACH 


tribution from the higher order terms in Hamiltonian 
Dr’ (— 1)FJ pP”, where the summation +p’ excludes 
all single interchanges, is small. 

This gives a rigorous justification for using the 
Heisenberg exchange Hamiltonian in the calculation of 
thermodynamical functions.? However, we have not 
succeeded in evaluating the value of Jp when P is not 
a single interchange. Although we have bounded the 
total energy E as well as the exchange energy J p for a 
single interchange by applying algebraic identities that 
exist between J p's, the values of the bounds are merely 
first order in the approximation. The method cannot be 
extended immediately to narrow the bounds beyond 
the single interchange approximation. This implies that 
the accuracy of the Heitler-London method so far is 
limited up to that of the Heisenberg model. 

If we would like to use the Heitler-London method 
as a rigorous mathematical tool to calculate many- 
electron systems, we have to provide an expansion 
technique with which the energy expression (1.1) as 
well as the expectation value of any observable quan- 
tity can be calculated up to a desired accuracy. This is 
the purpose of the present paper. 

'To achieve this goal, we have to carry out more de- 
tailed calculations of both the numerator and the de- 
nominator of (1.1). It is, of course, impossible to 
proceed with the calculation directly. However, we find 
that our problem has some resemblance to the linked 
cluster expansion for Green functions. In either case, 
the quantity to be calculated is a quotient with di- 
vergent denominator and numerator. Use of Dyson's 
equation reduces the numerator of a Green function 
to a factor multiplying all connected diagrams. Since 
the factor is exactly equal to the denominator and 
cancelled by it, the Green function consists of just the 
sum over all connected diagrams. 

In the Heitler-London calculation, a set of recurrence 
relations (3.14) and (3.15) for the F matrices will take 
the place of Dyson's equation. Here the F matrices de- 
fined by (3.1) are the normalization matrices and the 
essential quantities to be calculated. Since the recur- 
rence relations reduces the F matrices for N-electron 
systems to those for V—1, N—2, --- electron systems, 
the successive application of the relations leads to the 
complete expansion of the F matrices in terms of over- 
laps. The expansion technique is similar to that used in 
both classical and quantum-theoretical treatments of 
many-body problems, and use of diagrams will facilitate 
the computation. It will be finally found that F is 
written as a sum over connected diagrams, when gu 
overlap integral of the type (X| Xa) is denoted by a line 
starting from the center / of the atomic orbital X, and 


TR ral review of the theory of direct exchange see 
C. ering in Votes edited by Ge Resp and H. Suhl (Aca- 
SURE SE ver D. Pines, Nuovo Cimento 9, 


13 For instance, P. Nozi: 
47 (1958). I. Hubbard, Proc. Roy. Soc. (London) A240, 539 


(1957) and A243, 336 (1957). 
h P f 3 Dar iow Rev. 75, 486 (1949). 
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ending at k of X,. Following the prescriptions (i)-(iv) 
described in Sec. ITI (1b), diagrams for F can be drawn 
without knowledge of the expansion technique used. 
An error accompanied by truncating the expansion will 
be calculated by using a theorem which will be probed 
in the Appendix. 

Since the energy matrix should be Hermitian, it is 
necessary to calculate, in addition to F, the matrix A 
defined by (3.2). The matrix A is to correct the devia- 
tion of F from Hermitian form. Although A is close to 
a unit matrix, the straight forward calculation is very 
difficult. In Sec. III(2), we shall, instead, develop an 
asymptotic expansion of A and discuss the convergence 
of the series. The calculation can also be carried out 
automatically by drawing diagrams. 

By multiplying F by A, the energy matrix can be 
obtained and it will be found that the energy matrix 
is also written as a sum over connected diagrams. 

The significance of the present method will be sum- 
marized as follows. 

(1) By using this method, a rigorous calculation of 
the Heitler-London method including all overlaps can 
be carried out without difficulty. The effect of overlaps 
between distant atoms has been cancelled out rigor- 
ously. Such a calculation has never been performed for 
a many-electron system previously.!5:5 In fact, this has 
been considered impossible from the mathematical 
point of view.? 

(2) The awesome task of calculating the inverse of 
the overlap matrix" is completely eliminated. The 
calculation can be carried out just by drawing diagrams. 
Even for a system with a small number of electrons, 
therefore, this method will be more convenient than 
inverting the overlap matrix directly. 

(3) The fact that the energy matrix consists of just 
a sum over connected diagrams implies that the energy 
density is a finite number. This will, in practice, elimi- 
nate the nonorthogonality difficulties. Actually, a rigor- 
ous proof for this problem will be given in a forthcoming 
paper." 

So far, the calculation was carried out in the total 
space of the Heitler-London wave functions aiming at 
the computation of spin-wave spectra. The wave func- 
tions are naturally described by a vector in that space 
and the energy matrix is given in the form (2.26) or 
(2.27) rather than (1.1). This is why the matrix A 
appears. If, on the other hand, one is interested in the 


18 The only exception is the calculation by F. Takano, J. Phys. 
Soc. Japan 14, 348 (1959). He has calculated the energies of the 
ferromagnetic ground state and the excited states obtained from 
the ground state by reversing only %ne spin. 

16 Also see W. J. Carr, Phys. Rev. 92, 28 (1953). . 

11 The overlap matrix D in this paper is defined by (2.4). Here 
the ij element of D is given by the overlap between N-electron 
wave functions V; and V;. Under this definition of D, neither the 
inverse of D nor D itself has ever been calculated correctly when 
N is large. When the overlap matrix A is defined by overlaps 
between atomic orbitals so that its & element is given by (|t), 
the inverse A~! has been expanded into a power series of & by 
P. O. Löwdin, Advan. Phys. 5, 1 (1956). However, there is very 
little relation between A and D. 
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ground state carrying a suitable spin function 0, the 
wave function is simply written as (2.1) and the calcu- 
lation of A can be bypassed. “This will simplify the 
calculation considerably as we shall describe in Sec. IV. 

Most of the present discussion will be carried out as 
if we are dealing with arrays of one-electron atoms. 
However, this limitation is merely an aid to keep track 
of the formulas and manipulations. In Sec. IV(3), we 
shall give a brief discussion of problems which arise 
when arrays of many-electron atoms are considered. 

Before we start to calculate F and A in Sec. III, we 
shall review the method used in the previous paper I 
in Sec. II. 


Il. THE HEITLER-LONDON METHOD 


Let us consider an array of V one-electron atoms. The 
Heitler-London wave function for this system is given 


by 
N! 

y= (V es 1)? P'A (rts, P SIN) 
12) 


X P70 (1,02, e on) , (2.1) 


where A(ryrs--,rw) is a product of occupied atomic 


orbitals X;(r;), 

A (ry5: rN) X3 (1)X2(12)---Xw(ry), (2.2) 
QO(s1,05,-:-,0N) is an arbitrary function of the spin 
variables, and P” and P" are permutations P of co- 
ordinate and spin variables. (P7 and P" are considered 
as corresponding to the same abstract permutation P.) 
The summation J p in (2.1) includes all N! permuta- 
tions P. 

There are many ways to construct O(01,02,---,o) 
of N spins, but, in any case, we will find 2” linearly 
independent and orthonormal functions Qj, Os, --- 
Starting from an orbital product A (ri,1»,- - -, ry), there- 
fore it is possible to generate, in principle, 2” linearly 
independent functions V, V», --- of this array. Let © 
be the row matrix @= (05,05,- - -), whose elements are 
2N linearly independent and orthonormal functions ©; 
of N spins. Similarly W is the row matrix W 
= (Vi Ws,---). Then the wave functions Y; introduced 
by (2.1) are written as 


a N! 
w= (N 17922 37 (—1)P PAPO (2.3) 


by using the matrix representation. 
The wave functions W= (VY,,V;,---) thus obtained 
- are not normalized because of the nonorthogonality of 
< the atomic orbitals X, used as basis. Instead, it will be 
found that the overlap between Y; and V; is given by 


Dij J WW dr 


= i; [2:(—1) ?P'AP*O; FAO;d7 (2.4) 
í Jg 


== (PA|A)U (P), 
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where (PA|A) represents the integral over the coordi- 
nate variables 


(PA|A)= Í PrA(ryro, ry)" ruts-*,rw)de, (2,5) 


while the integral over the spin variables is denoted by 
U;;(P) as follows: 


Ü(P)e (- 1^ Í Porod Om 


From (2.4), it is clear that the normalization constants 
Dj; can become very large, which leads to the difficulty 
of calculating the Heitler-London energy. 

The overlap integral Di; can be regarded as the ij 
element of the overlap matrix D, and similarly Ü(P) 
as the 77 element of matrix U(P). Using the matrix 
notation, Eqs. (2.4) and (2.6) are written as 


D= [eva (PA|A)U CP), (2.7) 
and 


T(P)= (-1)7 | Prot-@do. (2.8) 


It is easily proved that the matrix Ü (PQ) corresponding 
to the successive application of two permutations Q and 
P is given by the product of two matrices U(Q) and 
Ü (P) as follows!?9: 


Ūro) - 0 (9). (2.9) 


18 The matrix U(P) introduced by (2.8) is unitary and is a 
representation matrix of the permutation group. Use of repre- 
sentation matrices of permutation groups in the calculation of 
electronic energy levels is originally due to R. Serber [Phys. Rev. 
45, 461 (1934)] and T. Yamanouchi [Proc. Phys. Math. Soc. 
Japan 18, 623 (1936)]. Also see the review by M. Kotani in 
Table of Molecular Integrals, edited by M. Kotani et al. (Maruzen 
Co., Ltd., Tokyo, 1955). In the present paper, however, we shall 
not use any group-theoretical argument except in (2.9) and (3.39). 
It is more convenient to understand that U(P) is merely a nota- 
tion representing the integral appearing on the right of (2.8) and 
has the property described by (2.9). A Y 

9 The relation (2.9) will be obtained as follows. Since 2 
linearly independent and orthonormal spin functions Oi, for i=l, 
2, ---, 2N, are complete in the spin space, we can expand P'O; in 
terms of O; as follows: 


P*0;—- (—1)* Z 0;U;:(P)*, à 
7 
where Uj;(P)* is merely an expansion coefficient but it is easily 


recognized that U;;(P) defined above is equivalent to U;;(P) in- 


troduced by (2.6). Here U(P) is the transpose of matrix UG ). 
Let us apply two permutations Q* and P” successively to Oi. The 


P*Q*9;— (—1)0 > P*O&;U&(Q)* 
= (—1)70 2 Z eU (P)*Ui (*. @) 
2 
On the other hand, we can write this as follows: 


P«Q*0;— (—1)*? Z GU; (PQ)*. P 
7 


Comparison of the above two equations yields 7 
U(PQ)- U(P)U(9) ( 
which is equivalent to (2.9). 
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The orthonormal wave functions Wy of the array are 
then given by 
Wy=w)?, (2. 10) 
Our problem is to calculate the Heitler-London 
energy of a Hamiltonian of the form 


N N N ; 
Lo Jiti DV" glm Em), (211) 


hy=1 h2=1 


s 
where the summation >>’ excludes /;—/n. For this 
hr=1 

purpose, it is convenient to split off, from the W- 
electron wave function W, one or several atomic 
orbitals, which are of particular interest for the manipu- 
lations we want to perform. This will be carried out as 
follows. An antisymmetric wave function of N electrons 
is constructed by taking the summation over JV! 
permutations P as is shown in (2.3). Here the JV! 
permutations P can be divided into two sets; the first 
set includes (V — 1)! permutations Q[ 7] of W—1 elec- 
trons 1, 2, ---, 4—1, h+1, ---, N, while the second one 
consists of V permutations P (k +— h) taking electron 4 
to orbital k, where k=1, 2, ---, N. The original N! 
permutations are given by Q[h]-P(k<k) and the 
summation over JV! permutations P in (2.3) splits into 
two; the summations over (V— 1)! permutations Q[/] 
and over £s. If we take the summation over (V — 1)! 
permutations Q[/] after operating a particular permuta- 
tion P(k<h), this portion of the wave function is 
antisymmetric with respect to electrons 1, 2, ---, h—1, 
h4-1, ---, N, but electron 4 always occupies orbital &. 
To emphasize this situation, let us denote the sum by 
Xy (ri) WL (2) ] where $— 1, 2, ---, N. The original wave 
function W^ is then written as 


w= No Xlr) VRH], (2.12) 


where W[k(h)] is the row matrix whose elements 
V, [& (A) ] are functions of N—1 orbitals and JV spins as 
described above. More explicitly, 


w[k(h)J=LW—-1) ! T7 7 LE (—1) eta Po» 


XOARA k hO, (2.13) 


and A[k(h)] is a product of N—1 atomic orbitals, ob- 
tained from A(T,» - :,rw) by applying P'(k «— h) and 


She GEORG TT APP Eod 


TO MANY-ELECTRON PROBLEMS 


A827 


deleting X;(r5) as follows: 


ALR(h) = Pr(k — A)A( t,t: - TN): 


(2.14) 


Xi(Ta) 
Similarly we introduce 
MP Ur) Rs (02) ], 2+, WD Uri) - + Es (s) ]. 
For example, 
W^ :) Ro (12) | 


-[(—2)]9^ Y; (—1) 0AP kr un) 
QUA 
>< Q Chih JAC ki 231210238027 he | 
XP” (kiko = hıh) © ; (2.15) 


where 
ARs (hi) Rs Ui) ] 


P” (Riko c hiha) 


B us 
XA(ryrs:-::,rw), (2.16) 


and Q[/ihe] represents (V—2)! permutations not in- 
volving electrons /; and fz, while P (kika — hiha) is a 
permutation taking electrons /; and / to orbitals £i 
and kz. The functions 


Wr n) ks (2) ], — PA Ue) es 2) Ra (15) ] , 


etc., are related to the original wave functions W^ as 
follows: 


w=[NV(N-1) x 2 Xi (Th) Xtea( The) 


WRC) ke (he) ]— etc. (2.17) 


We further define the overlap matrices 
S[k- -knl hi: E - ha] 
between the functions W[4i(/n)- - -k,(h.) ] as follows: 


Ski: - -kal ha: Ae ftit «+ Raha) ]t 


xw): - E.) )r.: (2.18) 


The method used in obtaining (2.4) is applied to the 
above matrices, which leads to 


LI 


SEki: - E. [I d] ES KOS Js IAL Ra QR) + - -kn (tn) | AUR Q3) - - 00) J) 


KÜL: - In ]P (ki: -kn Inde). (2.19) 


Those avek matrices are again related to the overlap matrix D between the original wave functions W intro- 
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duced by (2.4) as follows: 
D= SEM HEP (RASTA A] * (2.20) 
= SEke] lh ]4- (1| he) (he| hy) SUI | hıh] 5 
TX | hy) EZ This ]2- (| he) SL | h wig |+ (k| This) s] hy) SUnsk | hys ]H- (& | hhl hz) SL Rhy | hyhs ]) ] 
k | 
; TM (Qu) h2} SERES | hıh] (2.21) 
v1^2 
=etc., 
where r 
(k|h)= Í xin) (ni)dra , (2.22) 
0 
i 
and Y, excludes k=h, and Dian,” excludes ki—/n, using the orthonormal wave functions Wy defined by p 
ko=he and also ki=/o, k2=hı. These results can be (2.10), are given by 
proved by inserting (2.19) into (2.20) and (2.21). 
More generally, it will be found that (£e Qr y| » fG) | Wy) I 
( 
S[k: - -knl fa: + Fn] =F (h| f(n) |h) D28Sp&| 4D o 
= 5(hnx~k1, gts Rn—1) SLA: o Escala hi: Q hnil] h | F 
1 —1/2 —1/2 
Tio (Es | hs) SL: * nasa ais], (2.23) tL (k| f(x) | i) D2SCE| W JD“. (2.26) | J 
bere Similarly the expectation values (g) of two-body inter- : 
òla ky s Eua) d, if hn is not included in k, +++, action 1974,22, g(rir&,) are obtained by 
* n—1 
| =0, otherwise (9e (rl? E X (mg) |) | 
1 2 í 
and, in the summation X r ,™, we exclude kn= kı, ko, A 
id à kni, tn. Furthermore, the following equality will m 2 2 x 2 (kika 8 (rm21) | ale) | 
oid: | 
; xD3PSpuk|ma]D^, (2.27) - 
20 SElas ss | altos + ha] [hike] hha] | 
m —Ü(PJSUnuh. hus. hu], (2.24) where hı and hz are also included in the summations | 
p : has : d implicit | 
— where P; is a permutation in which //;: - -hn are taken 27127 tor simplicity. T | I 
| to orbitals hah thu. This saeia cal be proved Here (f) and (g) are both matrices and can be calcu | 
. easily by ssi P(hy- bs Mm: hu) in (2.19) by lated if the normalization matrices D-^S[&|1]D i 
EE Olah). as sd A are evaluated HM 1 
JU(Q[/n- - -hn ]) by using the relation (2.9). The calculation of the normalization matrices 19 the 
ow we are reddy to calculate the energy matrix main subject of the present paper and will be carried | 
the Hamiltonian (2.11). Since the Hamiltonian is CUt™ the following section. 
) symmetric and commutes with the antisym- 
er Z p(—1)?PP in the wave functions (2.3), the II. CALCULATION OF THE NORMALIZATION 
sed in obtaining (2.4) is again applied to this MATRICES 
on and it will be found that Here we shall expand the normalization matrices 2 
the type D2^S[- --k;|I;:--h,]D?^ in ae 9 
\— uw Ü (P) and evaluate the expansion coefficients. 10 pro- 
=| wt xp Tos 
a | Be E oar ceed with the problem, let us divide the expansion into Y 
h two steps and first consider the calculation of matrices i 


Fiki: Esa -hn ]= SUI Es hi < -An D> 
=F folki -knl m JOP) 
P R 
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The second expansion to be calculated is of the form: 
A) (R)= D3PÜ(R)D'?—3^ ap (R)Õ(P). (3.2) 
P 


The original matrix is thus expanded as 
D^? S[R; - -kal hr- -h JD? 
= DPS Eki RI + -hn JD. D’, 
=} foli: Es In -ha D0 (0) D", 


=E X folki -kal hh lap (QŬ(P). (3.3) 
P Q 


Previously,?? we have shown that, as long as atomic 
orbitals x used as basis are linearly independent, the 
inverse D-! exists and can be expanded as (24) of 
paper I, that is, 


D“=> Tpu(P). (3.4) 
P 

Multiplication of this expression with (2.19) yields 

(3.1). The second equation (3.2), which is similarly 

obtainable, has been introduced previously by (68) of 

paper I. 

Our problem now is to expand the coefficients 
Jelki: -kn| ħi: 5] and ap? (R) in terms of overlap 
integrals so that fp and ap? (R) can be evaluated 
explicitly. 
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(1) Expansion of the F Matrices 
(a) The Recurrence, Relations for the Expansion 


Let us first consider the matrices of the form 
F[;- - -halki --5,] and show that, by using the rela- 
tions (2.20), (2.21), and (2.23), the matrices can be 
decomposed into a sum of products of two parts: The 
first ones denoted by F[P] are made of overlaps be- 
tween electrons /,---h, and the other electrons, and 
can be calculated up to any desired accuracy explicitly 
while the second parts are equivalent to the F matrices 
originally considered except that the new matrices are 
for a new array which can be obtained by removing 
electrons %1- - -hn from the original array. Therefore we 
can repeat the same calculation in the second parts. 
This expansion will converge since overlap between 
electron 7 and a distant electron decreases exponentially. 

For example let us consider the simplest case, the 
matrix F[h|4]=S[h|h]D—. By inserting (2.20) into 
F[A|£E], we find that 


FLA|hJ=S[A|A]{ SLA] a222 (k| b) SEE| 5h] 


X[hJ= M (h| e) i | E) SERE | bx ]SER | & + 


FX" X (h| Es) (Ra | ki) 3| 4) SER | hkika ]SER | 4 


TX Y" Y (| Ro) | Ba) en kı) 08 E) SE ees | hkikaks SEA | k + 


ka ko kı 


duces, 


Let us introduce the notation F,[A1---Rn|#1---Rn] by defining 


S[k: - Es Mas - ESSERE] Fas + En a Es. 


Use of this definition as well as (2.24) leads to 


X[k]- X-' (4|) 4) (x) EXE] k] 


2253/2)94/07/210719 221621] h)U (Risk) Fs[ Esa] eo] 
ks kı 


TU Dd s (h| 129162 | 1291623 1210200110222) Fa[ EiEska| kikeks | 


k2 


kı 
dices. 

wh kiko- + -knh) is a cyclic perm 

35 ce In (3.9), terms such as (h| E) (Ro | ea 

Falki” + -Ral Ri: kn] belonging to 

can be calculated by (3.5) and (3.9) exce 


2 The pfesent method can be ext 
also Appendix B of paper I. 


3.5) 
=(1+- X[k], 
where 
X[AJ=>! (k| E) SER] ASA hy. (3.6) 
k 

Repeated use of (2.23) in X[4] yields 
(3.7) 
(3.8) 
' (3.9) 


utation in which electrons A, kı, ---, &s—1, Rn are shifted to orbitals i, k2, 
|4)Ū (Kxksi) belong to the first part TCP], while the matrices 
the second part are equivalent to F[k1- - kalki: --kn]. For instance, F;[£|&] 
pt that electron 4 should be excluded from the summations. 


ended to the case where basic atomic orbitals are not linearly independent. See Sec. IV (3) an 


, 
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The equation (3.5) can be expanded formally as 


F[2|2]- 1— XE ÁH- XE p— XER (3.10) 
This expansion converges as long as «=D »|s pLA]| <1 where xp[/] is the expansion coefficient of X[ 4]: 
X[AJ=>d x»[A]O (P) . (3.11) 
1p 


This will be proved in the Appendix. 
Similarly F[7/| fake] is expanded as 


F[Ayhe| Aiko |= (1+ X Unh: ]) ? 
= 1—X[yho]+ Xinh F— X hh P4 - (3.12) 


where 
Xhe |= Qu | o) (o | a) 8 (raiz) 
FE (| ka) (ka| M) (kih) + lhe] kilki | ha) U (kah) + (hel ki) kı| h) h| ho) U (khh) 
^ + (h| x) (s I2) o | 13) (ook a) ) Fans Ls | k] 
oF 2 (s | eo) eo | ex s | 3) es) + o | ko) (| Ex) es | I2) Qao) 


J (la| s) (Rs | Ex) s Les) a | o Gees) + Qa | Re) (Ro | k1) a | 2) Qno a) O rss eaa) 
(s |a) es fix) Qro | 2) (Ro | 12) 0 (Qa) eng) 
2207112916212 1021522 102112223 1627227729] 8 FranglRike| kiko] 
Te (3.13) 


The matrices F[/a |m], F[J/is | Aho], «++, obtained by (3.9), (3.10), (3.12), and (3.13), can be inserted into 
replace (3.9) and (3.13). Thus the expansion develops further. These relations are more conveniently summarized by 


F[A(n) ]- (1+X[h(n)))+ 


=1— X[A(n) H-XEh(») P— Xk (nm) Fo - - - , (3.14) 
| and 
XD Q0) ]- 92 E TER GR) JF coL Om), (3.15) 
m k 
| where h(n) and k(n) are abbreviations of letters appearing in brackets of the matrices F[/n:--/s [a -ha), 
| X[k: -- 1, ], etc., and n specifies the number of letters involved. 
So far we consider matrices of the form F[/n- - -hn| Mı- - -hn]. More general ones Fk: - -knl hir- hn] introduced 
by (3.1) can also be reduced to the standard form by repeated use of (2.23). For instance, 
F[k|A]- (h| k)U (hk) FLAk | hk.) 
22,3402 02)1021210/02212) 18222112228 
ki 
y 
TEC 2 (h| ko) lkol kx) (R1| k) Ü (RRikoh) 98722121072 20231 
v2 1 
Rt : A (3.16) 
gg pee) ee 


> (b) Diagram Description example, the expansion (3.9) of X[4] is shown in Fig. 1, 
To proceed with the calculation in practice, it is where, as a convenient way of explaining how diagram’ 


nient to draw diagrams. Let us denote, by a solid are built up, points are classified temporarily as t e 
following two kinds: a solid dot @ indicates the starting 


point of X[/] and no line comes back to this point any 
more in the course of expanding X[/], while an poe 
dot O indicates that the expansion is not completed 2 
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this point but a new F matrix starts from here. The first 
term has one open dot and therefore F,[/4|£;] will 
follow. Two open dots of the second term show that 
Fi[£ieo| kik] follows. Since broken lines for permuta- 
tions are the same as the solid lines, we have omitted 
them. 

Similarly the expansion (3.13) of X[44/1;] is described 
in Fig. 2. There we have omitted terms obtained by 
interchanging h and 7. 

To calculate Fj[ 7; | & ], Filkike| £152], etc., we further 
draw X,[4i| £i], Xu [io | bis], etc., starting from open 
dots. For example, the second term in Fig. 1 includes 
the series of diagrams in Fig. 3. 


ki ky k2 k2 
h h h 


Fic. 1. The expansion of X[/] described by Eq. (3.9). 


Solid lines can come into a point many times, but the 
representation of electron permutations should be re- 
duced into the simplest form so that we can distinguish 
a permutation from others immediately. This implies 
that within one diagram, any one electron should not 
be shifted more than once, and hence a broken line can 
come into a point only once and come out only once. 
For instance, the second term in Fig. 3 represents the 
permutation (Ries) (&1&s/rn) which shifts electrons /n, kı, 
kə to kı, kə, hı and, after that, again interchanges kı and 
ko, but this process is equivalent to a single interchange 


k ki ki k2 
3 hi h2 Ü exi OF AN rs i e. 
hi ha hi h2 hy hg 

ki k2 ky k2 kı k2 

hy h hj he hy h2 


Fic. 2. The expansion of X[/42] described by Eq. (3.13). 


of hı and E». Therefore the broken loop in this diagram 
illustrates the interchange (/3&») only. The second term 
represents the quantity È 


| s | Bo) | a | Rn (Ro | ex) a La) O Qs) - 


Since (Ej) (Rika) = (hıkı) is a different permutation 
from the second term, it is important to remember the 
order in which these loops appear in a diagram. 

In order to calculate F[4]4] by (3.14), we need to 
evaluate higher order terms X[/}, X[L f, ---. This 
will be carried out by multiplying diagrams we have 
generated. For instance, X[/ will be obtained from 
diagrams.in Fig. 1. In case permutation (kyksy) ap- 
pears first (on the left of X[4 ]X[ ]) and then permuta- 


TO MANY-ELECTRON 
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Fic. 3. Further expansion of the second term in Fig. 1. 


tion (/£;) follows it [(/n&;) appears on the right of 
X[A]X[A5]], we obtain the diagram in Fig. 4, which 
corresponds to the quantity 


[| ki)? (l| Ba) (o | e) Us|) O (kiko) . 


Since all diagrams in X[A] start from A, solid lines 
come into /; many times after the multiplication and the 
distinction between solid and open dots becomes ob- 
scure. Therefore, we shall not make this distinction in 
diagrams any more. 


ki - saa ke 
Tic. 4. The diagram of (h|k2)(k2|R1) 


X (kı L1) Qe) X (| x) 0 | A) (kih). (0) 
This term comes from X[/r F. 


Similar multiplication should be added in those 
appearing in Fig. 3. Here we have drawn only 
Xi[£ieo | 5i» ] instead of F,[ ££; &1&]. The expression 
will become complete if we include higher terms such 
as X,[Àio | kiko P, Xo [yo | Eye P, etc. The contribution 
from Xi[£1£»| £1&» includes, for instance, the diagrams 
in Fig. 5. The first one is obtained by multiplication of 
the first and second diagrams in Fig. 2. The second one 
is obtained from two of the type appearing in the 
second term, but %ı and kz are interchanged in the 
first one. : 

In (3.14), odd powered terms X[A(z)]"*' carry 
minus signs, which should also appear in diagrams. It is 
possible to associate a sign with the number of loops in 
a diagram. For instance, diagrams in Fig. 1 should have 
minus signs in F[/4|/4] since they are terms appearing 
in X[/4]. In fact, each of them consists of one loop. 
Those in Fig. 2 should also be minus. However, the fifth 


Fic. 5. Some of the 
contributing terms 
from X,LAike| kika F. 
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Fic. 6. The diagram of (|i) (ka [) UJ) 
X (h| kilki |b) U Qa). 


term consists of two loops. This will be explained as 
follows. The same diagram can be constructed from 
X[fi; P. by multiplying two of the type appearing in 
the second term in Fig. 2, as we have already explained 
in connection with Fig. 5. There are two ways to con- 
struct the diagram by the multiplication and both 
carry a plus sign. Therefore, the total sum is 2—1=1 
in accordance with the proposed criterion. Matrix 
X[Jahohs | will include diagrams with three loops. How- 
ever, the same diagram will be constructed from 
X(Maliohs P 6 times with plus sign and from X[ hhh 
6 times with minus sign. The total is — 1+6—6= (— 1): 
We can test this rule further. Let us assume that, up to 
i loops, total sum of all possible contributions to a 
particular diagram with r loops is (—1)", where r<i. 
A diagram with t+ 1 loops is constructed by multiplying 
a diagram with x loops by all possible diagrams with 
total i-x-+1 loops. By considering all possible ways to 
construct diagrams each of which carries x loops and 
by adding all possible cases starting from x=1 and 
counting up to x=/+1, we obtain the total sum of all 
possible contributions to the diagram with ¿+1 loops. 


We can construct i diagrams, each of which carries 


x loops, while the total sum of all possible contributions 
i to a diagram with /—x-F1 loops is (—1)*71. There- 
fore, the total sum of all possible contributions to the 
desired diagram carrying /4-1 loops is 


tk Na! 
Eye eco. 
X 


x=1 


This proves the criterion on assigning signs to diagrams: 
We count the number t of loops in solid lines in a diagram. 
The diagram should carry sign (—1)'. 

Thus we have expanded the matrix F[Ja- - -n| In: - - hn] 
in terms of overlap integrals and U(P). Comparison of 
the expansion with (3.1) yields that 


felm - -Aalha - hn] 


È aS 
I => YL (-02'SmSmS», (3-17) 


t=1 PiPo--.Pt 


where P; is a cyclic permutation and Sp,— J^ P;A*Adv 

represents a loop. The summation takes all possible 

Ps, --:,P, under the condition P-—JPiP.L;-::P; 
at least, P will include some of //i: - his. 

. From (3.17), we can calculate felh: - Ja |a - ha] 

x any P. The computation will be simplified if we 

diagrams as follows: 
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loop. A loop must never go through a point more than 
once and always come back to the starting point. 

(ii) By adding more loops, we can calculate terms 
in high order. Loops appear by successive expansion 
(Fig. 3) of F and by multiplication (Figs. 4 and 5) 
of X. Number the loops according to the order with 
which the loops appear and attach sign (—1)' to the 
diagram, where £ is the number of loops in the diagram, 

(iii) After completing solid lines in a diagram, we 
describe the electron permutation by broken lines. The 
broken lines never go through one point more than once 
in a diagram and always come back to the starting 
point. If the line is equal to P, retain the diagram. 
Otherwise discard it. 

(iv) We can draw all different diagrams, including 
differences in ordering of loops. If, however, loops are 
commutative, we should take only one of the diagrams 
we find by numbering the loops in different ways. An 
example of such a case is the second diagram in Fig. 5, 
where we have already counted all contributions when 
(—1)! was assigned to it. Another example is shown in 
Fig. 6, which, in fact, appears only once in the 
expansion. 


(c) Accuracy of the Expansion Technique 


In practice, the expansion described in this section 
cannot be carried out indefinitely and it is necessary to 
truncate the series. The upper bound to an error intro- 
duced by such truncation of the series can be calculated 
by using the inequality (A7) for sum La of absolute 
values of the expansion coefficients f pL) ]. 

Let us consider, for example, the series in Fig. 1. 
The first term represents 


(oes) es | I) U (exh) Fas | ki]. 


Suppose we do not pursue the expansion beyond point 
k, any more. Thisimplies that we approximate E47] kı] 
by 1, and hence it introduces an error to the series. 
From (A7), it is easily found that the error is less than 


Y b) HEA- OAA) 1]: 


From the second term in Fig. 1, we get an error of 


X X (hl ka) (kel b) IE — 0 (A49) LH; 


by not taking further expansion terms beyond kı and 
kə and so on. Here the factor [(1—0(AA9)7—1 
will increase when increases, and the first term 
(1—O(AAo))~ will eventually become dominant. Un' E. 
such circumstances, there is no reason to write the larg? 
loop without further expansion, since the loop 
a minority term in the further expansion. They show’ 
be neglected altogether. For instance, if we neg 


c ———— 


ig merely - 


lect, 1? a 


aA 


ü 


o: 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


HEITLER-LONDON APPROACH TO MANY-ELECTRON PROBLEMS 


the series in Fig. 1, the fourth term and higher, the 
error will be 


A5(1—O(AA9)-t-- A*(1— O(AA9))^5 
-FA'(1—0(AA9))-*-- --. 


A similar observation can be made for the series in 
Fig. 3. If we retain the first term and neglect all the 
others, the error will be 


AT (1—0(AA9)?—1]. 


If we consider all diagrams that appear in Fig. 3 and 
neglect the higher orders, we have to estimate the error 
for each diagram one by one. This process is not so 
difücult. The first two diagrams are complete and 
therefore introduce no error. The next three carry an 
error [(1—0(AA9))-1— 1] times their values and so on. 
In addition, an error will be introduced by not consider- 
ing terms such as X;,[ (2) P, X;[& (2) f, etc. 

In any case, it is possible to calculate the upper bound 
to an error introduced by truncation of an expansion 


of F. 
(2) Calculation of the A Matrices 


(a) Evaluation of A? (R) 


The calculation of matrix AU/2(R) introduced by 
(3.2) involves more difficulties. In the present sub- 


(hı | ky) (Ri | hj (h 1kı) SEI, | h iok JU (h it2) Do 


A833 


section, we shall compute, instead of AC/2(R), the 
matrix A“ (R) defined by 


A(R)=D“T(R)D=¥ ap (R)Ü(P). (3.18) 
= 


In subsection (b), the matrix A(R) will be calcu- 
lated by using the expansion coefficients a(R) ob- 
tained here. 

We shall proceed with the calculation of A (R) by 
observing from (3.18) that A“ (R) can be expanded in 
terms of matrices F and consequently ap? (R) in terms 
of fp. To facilitate the expansion of A“ (R) in terms 
of F, we shall consider the Hermitian conjugate of the 
expression (3.18), that is, 


AO(R-!- DÜ(R)D2—Y ap (R—3)*Ü(P). (3.19) 
P 


Let us first consider a case where R is a simple 
interchange (Jz). Use of the expansion (2.21) of D in 
DU(R)D- decomposes the matrix A®(hyh2)t into a 
sum of matrices of the type 

(3 I) SERVI | hiha JU (hh) D7. (3.20) 
When the method used in obtaining the expansion 
(3.16) is applied in S[Eiz;| hike], the above expression 
ls rewritten as 


TU (hi | 129102 | ky(hi | h)U 212123) Sis Reik | hyhskyks |Ú (hiha) D~ 
k2 


db my DEA (hi | ks)(ka| k»)(s] ky | 1793910702022 29 518/2072 21212: | hyhok yoke ]U (nihi) D- 
k3 ko 


= (| khi] T) 0 (bj iz) 108 27/2721 | hyhoky | 


TU (hy | ks)(Rs] 1291621 hU (hikikzħ2) F[hihokikz | hyhokike | 
k2 


Z” E" | Es) (Rs Ba) (Re | a) 3] hı)Ü (hıkıkzksh2) 18/70/72 20:2 21 hıhzkıkaka] : . 


ka ka 


Aisone 


Similarly D in the expression Ü (mh) DD is expanded, where a term corresponding % (3.20) yiele 


similar to (3.21), but Ü (hhk), Ú (tse), etc., will appear instead of Ü (Iis), Ü (Ius), e 


The matrix 


(3 | s) SEI | ext JU Qa) D+ n E 


will also produce an expansion obtained from (3.21) by interchanging /n and ha. | 
‘By CUTE all Miis appearing in the expansion of D and by taking the difference bet 
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and U (hh) DD- =U (daha), it will finally be found that 
AQ (uh 2) E DÜ (h iho) D~ > 


ARAI 


=U (hh) 3-32 (Ua e |) — (he | ky)(Ra| ft2)) EO Unki) — U (hkh) ] Fio | his] 


+ Qul ky){Ra| he) (he hy)— lha] bii] his | 12189102720) —U (Ink) ] FUdtski] hyhoky | 


TES 27 Ql ho) (Rs | x) a | ts) — Qro Ro) s [202 
x cu (mkıkzh2) = Ü (hak Rohs) \FLhshokike | h tok 1s | 


tE” E (Ui (iss) (ka s Gre) (a| Ree Ca | ha) 9 


X [Ù (sik) —U ries) ] FUtisesa | hikokrke | 


+3 1 Z" (l iy ltr) (s | eo) (o | 2) — Qu | ki) (k| his) (Ia | eo) (Rs | 13) 
X EO (retoez) —U (rola) ] FLtiisfyeo| afta ] 


+3 D 27 (Ual ky)(k1| ho) (s | 2) eo | 3) — (hz e) 08 21) | ka) (Rs | io) 
PX [Ū( (nko) (t5k3)) — Õ((hıkı) (hok2)) | 198207272021 hhskykz | (3.22) 


By using the expansion technique of the F matrices 
developed in the previous subsection (1), therefore, the 
matrix AO (Mh) can be calculated. The inequality 
(A7) ensures the convergence even if higher order 
terms in n of F[Z(n) ] are neglected. 

Let us insert (3.1) and (3.17) into (3.22) and write 
the series in terms of loops Sp and U(P) as follows: 


AQ (Rig)? 
=Ü (hh) t} 2 » (= 1)'CL (hiha) ; Po] 


t PoPi---Pn 
XSP Sino o oS (02) (3.23) 


where the summation Y]p,r,..»," takes all possible 
PoPa: + - P, which satisfy the relation 
P=P,P_1-:-P2Pi(iahe)Po, 


and C[ (Ike); Po] represents a quantity in parentheses 
in (3.22). For instance, 


CL (sha); (likes) Qs ba) s Un) — Qa a I) 


CE (he); (ratos) \= Qna |a) ea e) o | h) 
m Wia| ka) (Pea fx) |A). (3.25) 


Comparison of (3.23) with (3.19) yields that 


Gp) (Iishe)* 


(3.24) 


E t P 
zx maa PIL > Y; (—1)'CL Uke); Po] 


oP1:-- Pt 


See G20) 


re CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


By noting the relation Sp*— S p^, the above equation 
can be rewritten as 


apO (hiho) =6 chha), P 


XE o X! (DC inka); PeT 


t PoPi---Pn 
XNP Pat: Š ogg 


where the summation ?]p,m,.», takes all possible 
Po, P3-- - P, under the condition 


R= (hih) PrP: PO. 


The equation (3.27), however, is less convenient than 
(3.26). To evaluate the electron permutation P as 
given by (3.28), we have to start from the last loop 
Sp, and to count the cyclic permutations in reverse 
order. On the other hand, computation of a pO (mha) 
by (3.26) and (3.24) is exactly the same as the expan- 
sion (3.17) of f»[A(»)] and the electron permutation 
P can be obtained by multiplying cyclic permutations 
following the same order in which the loops appear 
It is easily found that, after completing a diagram 0 
Ap (Inks)* by (3.26), a corresponding diagram for 
Ap (hıh) can be obtained by reversing the direction 
of arrows in all solid lines as well as broken lines. 

The calculation of A® (R) just described can be n 
tended to any permutation R, and it will be found tha 


(3.27) 


(3.28) 


ap?) (R)*= òr pP H = (—1)'CLR; Po] 


t PoPi---Pt 


XSP SP "Sp, 


(3.29) 
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Diagrams for ap? (R~) can be constructed from those 
obtained by (3.29), by simply reversing directions of 
arrows in all solid lines and broken lines. 

The only problem here is to develop A* (R-)t in 
terms of F and get an equation similar to (3.22), 
which, in return, gives explicit expressions of CLR; Ps]. 
This can also be carried out automatically by drawing 
diagrams of C[R; Po] as follows: Let R be a cyclic 
permutation R= (hıh: --h,) and consider all possible 
permutations Po which start from a point on the loop 
(liha: ++h,). The permutations Po may consist of many 
unconnected loops Poi, Po», ---, Por, but each of the 
loops must have at least one common point with the 
original loop (J/m---A,). The desired coefficients 
CLR; Po] will be given by 


CLR; Po] Spa S Pos" 5 S Por 
—Sr py rS R PyR SRP, (3.30) 
where Po= PorPor1' + * PosPoi. 
In case R is a single interchange, we had the relation 
CLR; Po J=—CLR; RPR], (3.31) 
which leads to factorizations of the type 


CLR; Pol[U (RP) —U(PoR)], — (8.32) 


as has been shown in (3.22). When R is a permutation 
involving more than two electrons, the relation (3.31) 
is no longer valid and each matrix Ü(R-1P;) may carry 
a distinct coefficient CLR; Po]. Therefore AO (R-)t 
will not be written in the form (3.32). 
Use of the inequality (A7) in (3.22) leads to the 
conclusion that the sum 
Lx =F | ap (Iis) —5 nz), pl (3.33) 


P 


is also bounded and small as compared with unity as 
long as A is small. However, the quantity 


DL, => | ap (R )— ôr, P| (3.34) 
P 


will increase in proportion to the number s of electrons 
involved in the permutations R.. This will be seen if 
we calculate some of the dominant terms in A (R,)f. 
For instance, let R, be a loop (hik2---h.) of equally 
spaced s electrons in a crystal with translational sym- 
metry. The matrix AO (R7) will include s terms of 
the form 


CER; (iki) JUCRsA1(eki)), for 7=1,2,---,2, 
whose s coefficients 
CER, ; (mik) ]= (esl bi) I) — rea E a) 
are equal to each other. Terms of the form 
CER. ; (riki) (hjk) TUGR (aks) Qu) 


will appear is(s—1) times in 409 (R,7)', since all 


possible relative positions of loops (Ak; and (Ej) 
should be considered here. 

This divergence of L, will lead to difficulties in 
proving the convergence of the expansion of A“)(R) 
as is seen in the following subsection (b). The origin of 
the divergence is due to the fact that diagrams in 
A“ (R1)! are not connected with solid lines (overlaps). 
It is expected that, if the quantity Sz,A“(R,)! is 
considered instead of A (R,-*)t, the sum of the abso- 
lute values of the expansion coefficients Sp,ap (R;-) 


M=) | Sr Lae (R7)—85,-tp.]l 
= 
[S |Sr, | Ly? (3.35) 


will converge and be very small. In fact, there is no 
essential difference between AW (R;3)t and F[A(z)], 
and their expansion coefficients are calculated similarly 
by (3.17) and (3.26). Therefore the argument used in 
Appendix for bounding F[A(n)] wil be applied to 
AO(R,-)f too. 


(b) Asymptotic Expansion of A"? (B). 


The expansion coefficients ap“)(R) obtained in the 
preceding subsection (a) are related to the desired ones 
ap 0/2 (R) by 


ap) (R) 25, ag  (R)ap ? (Q). (3.36) 
Q : 


This relation is proved as follows. The matrix A? (R) 
defined by (3.18) can also be expanded as 


A®(R)= D-2D-2Õ (R) DD: 
=F ag” (R) D 2Õ (Q) DU? 
Q 


=> ¥ ago? (Rjap® (Q)Ü(P). (3.37) 
RP Q 


Comparison of (3.37) with (3.18) yields (3.36). 

Let ap™®(R) and ap™(R) be RP elements of 
matrices a“/2) and a®, respectively, and rewrite Eq. 
(3.36) in terms of the matrix representation as follows: 


aP = a22, (3.38) 
Use of the orthogonality_relation of the irreducible 
representation matrices U(P) in (3.2) leads tot? 
Ja een UE 
ap) (R)= E —(D-2),,0 Wem (R) 
km | [! $ m 
à X (G3), DU G(P)t, (3.39) 


?1 As we shall show in (4.1) and (4.2), it is always possible to 
consider a representation where D is diagonal. 

2 The N! matrices Üp) introduced by (2.8) are representation 
matrices of the permutation group. By choosing’a suitable set of 
N-spin functions ©; as basis, U(P)'s can be made irreducible. Let 
U(P) be the matrix in the irreducible representation p. The km 
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where f, is the dimension of the subspace (o). Since 
(D-¥2) x6) and (D!?),,,0? are real, we find the relation 


ap“) (R) - ag 0 (P)*, (3.40) 


which shows that the matrices a“) and aC are both 
Hermitian. Hence, there exists a unitary matrix V with 
which a?) and a® are simultaneously diagonalized 
such that 

Via) V=1+y, diagonal, 


Vta92) V= (1+7), diagonal. 


(3.41) 
(3.42) 


Eigenvalues i-+-nx and (1+7x)! of matrices a? 
and a“/2) are all positive, and there is no case where 
(1++7)"? on the right of (3.42) carries a minus sign. 
This will be proved as follows. Let us introduce matrix 
AGI) (R) and the expansion coefficients ap“/ (R) by 


AG (R)=D740(R)D!4=> apto (R)U (P). (3.43) 
12 


As before, a Hermitian matrix a(/? will be constructed 
from the expansion coefficients a /? (R), and the fol- 
lowing relation between a“/#) and a% will be found 


aD = (1/4) a Q4) | (3.44) 


| 


This relation shows that a“/4) is also brought into a 
diagonal form by the unitary matrix V used in (3.41) 
and (3.42), and that the eigenvalues are square roots 
of eigenvalues of a“/?). This implies that, if some eigen- 
values of a€/2 carried minus signs, the corresponding 
eigenvalues of a€/2 would have to be imaginary. This 
is contradictory to the theorem that the eigenvalues of 
a Hermitian matrix are real. Therefore, eigenvalues 
1--7x and (1--5x)/? in.(3.41) and (3.42) must be all 
positive. 

Since off-diagonal elements ap? (R) are small, the 
relations (3.41) and (3.42) suggest that the perturba- 
tion technique can be applied to calculate the eigen- 
values 1+-nx and the unitary matrix V of a”. The 
desired matrix a“/?) can then be obtained from 


ad) = V(12-3)2Vt. (3.45) 


. It turned out that^this approach is not practical since 
the matrix a has an infinite order of degeneracy. For 
instance, diagonal elements aor ™ (hiki), for i=1, 2, 

=... Ñ, are all equal to each other, if we consider a 
crystal with translational symmetry. 


€ ement of matrix A(/)(R) in this representation is then written as 


[A G(R) Jn = (D=) rr Drm P (R) (D)mm® 6 

Fo ree =F aqP(R) Uit? (Q). (1) 
the orthogonality relation of the irreducible repre- 
Bim (0 (0) Tim ® (P)* — or, Q) 


ying (1) by Ui, 9 (P)* and sum- 
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'To eliminate this difficulty, it is more convenient to 
use an asymptotic expansion. Let us expand (1+1)12 
in (3.45) in a power series of n. The matrix a) i 
then written formally as 


at VIH-Ii— st HX G/6)g— ---) Vt 
=14+4b—}X4b+4X4X (3/6)b*—- +, 


(3:46) | 


where 


b= Vn Vi=a—1. (3.47) 


These equations (3.46) and (3.47) are written more 
explicitly as 


ap“!)(R)=6ret+3brp—2 Xt x bnobopd-:--, (3.48) 


and 
brp=ap™(R)—Sdre. (3.49) 


Since brp is obtained by (3.29) and (3.49), the expan- 
sion coefficients ap? (R) can be calculated by (3.48) 
provided the series converges. 

In practice, there will not be any difficulty in using 
the expansion (3.48) for the calculation of a.p (R). 
This will be explained as follows: Since it is not possible 
to calculate brp for all R and P exactly, we need to 
truncate and approximate the matrix b by neglecting 
permutations involving s+1 electrons or more. Let b 
be the truncated matrix. The RP element brp’ is then 
given by an approximate brp when R and P are both 
permutations of s electrons or less, while brp’=0 if 
either R or P involves s+1 electrons or more. Here s is 
chosen so that L, as defined by (3.34) is less than one. 
This implies that 


DY |brr’|<e<1, forall R. (3.50) 
P 


When all of brp in (3.48) are replaced by brr, the 
series (3.48) converges and the quantities ap”) (R) 
thus calculated satisfy the following inequality: 


Y. | ap) (R)| «2— (1— 9:2 « (1 977. (3.51) 
x 


The relation (3.51) will be proved in exactly the same —' 
way as (A6) has been obtained. l 

When the number s increases, however, the value of — 
L,® increases indefinitely, and the convergence © | 
(3.48) will not be proved easily by an exact language of 
algebra. Nevertheless, it is expected that the series 
(3.48), multiplied by Sz 


Sa pul» (R)-— Snbnp-I-35 nbn» 
—3x3 Y, Srbrobor t: (3.52) ; 
Q f 


converges, for the following reasons. Quantities such a5 

Sprbrp or X oSrbro will be finite and small as we 
have discussed in (3.35). Higher terms DoeSabrabarr 
Yad eS nbnojbo0,bQ,p, etc., consist of connected dia 3 
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grams, which build up over Srbpo. This implies the 
convergence of the series provided the same diagrams 
do not accumulate. When a diagram appears repeatedl 
we can count the number of times the diagram appears 
as we did for the F matrix in Sec. III (1b), and it will 
be finally found that they cancel each other in the 
course of the summation in (3.52), so the net contribu- 
tion is a small finite number. 

The calculation of SoSrbreber is, in fact, very 
similar to that of Sz, F[(2)] described by (3.15) or 
(3.9), where bro corresponds to T[k(n)] and bap to 
Fic) (m) J. Sr, FLA (1) ] may contain very large terms 
FinyLk(m)] when m is large, but those are connected 
by Sp, l[k(m) ]. Large terms bo, p for large m are also 
connected by Srbre,,. To evaluate higher terms 
2 Q2 QS nb no;00,0,00; P, etc., corresponds to expanding 
Fic e (m) ] in terms of Ficnyzcmy[l(s) ], etc. As we have 
discussed in the Appendix, the value of Sr, F[h(x)] 
vanishes when 2 becomes large. 

The higher order contribution, for which L;(?» 1, 
can also be calculated by combining a perturbation 


y, 


ki ky 
OAOD © 
pm P "y ar AN + AN 
h : — aa 
2 h k2 h ko OE 
© ki k; 
+ fd + AN 

h = ko cS ko 


Fic. 7. The expansion of F[Z|]. 


technique with the asymptotic expansion described here. 
We shall not describe the details but we will again reach 
the same conclusion that SpA“)(R) is expanded in 
terms of connected diagrams. 


(c) Role of AG? (R) 


We have described the matrix A“/?)(R) as being very 
close to one and merely a correction factor of the F 
matrix. Here we shall illustrate, by an example, an 
essential role of A“) in our calculation. 

Let us consider a Hermitian matrix 


Sy = D3^S[Z|z ]D3:5, 
which, according to (3.3), is expanded as 
Sy = D32S[7|5 ]D-1? 
=Z & folh| ost (ODP). (3.53) 


By using the prescription (i)-(iv) in Sec. III (1b) for 
drawing diagrams of matrix F [h|h], we find that 
F[h|%] contains among others the diagrams illustrated 
in Fig. 7. 

If we set AU (R) — 1 and hence ap? (R) —ón», the 
matrix Sy becomes equivalent to F| [alk] described in 
Fig. 7, which is not Hermitian. For example, Hermitian 


TO MANY-ELECTRON PROBLEMS A837 
aN N pu 
Re 1 + L £M i Hie 
A 2:-/* or TI ok 
iS " LN ZY * 
h k2 h ko <> 
ki ki & 

^ D 

<> (hee 

h k2 h k2 


Fic. 8. The expansion of A@/)(Ak;). 


conjugates of the fourth and seventh diagrams become 
equivalent to, respectively, the second and fourth dia- 
grams in Fig. 9. However, those diagrams cannot 
appear in F[4|4], since, to draw either of them, we 
have to start from loop (kık2). All diagrams in F[ 4|], 
however, should start from /. 

On the other hand, matrix A“/?)(42;) includes the 
diagrams in Fig. 8. Diagrams for A“/?) (hkz) are similar 
to those in Fig. 8 and obtained from them by inter- 
changing kı and ks. If we multiply A“ (hki) by the 
second term in Fig. 7 and A“)(hk2) by the third, we 
obtain the expression for Sy, which is similar to the 
series in Fig. 7, but the fourth and seventh terms are 
replaced by those in Fig. 9, and matrix Sy becomes 
Hermitian. 

The present example clearly shows that the matrix 
A“/?)(R) is essential for retaining the Hermitian prop- 
erty of the energy matrix. 


IV. DISCUSSION 
(1) Evaluation of the Expectation Values 


According to the description in Sec. IIT, any matrix 
containing an operator f(r;) or g(r;r;i,) can now be 
evaluated by drawing diagrams. Let us denote an 
integral of the form (k| f(r) | £) by a heavy line starting 
from / and ending at k plus a wavy line indicating the 
operator f(ri) as shown in Fig. 10, where the second one 
corresponds to (4| f(r;) | £z). The integral 


(21211678 7911779 


is also written by diagrams as in Fig. 11. Here the 
second one represents a Coulomb interaction, the third 
one an exchange interaction. 

From (3.16) and (3.17) together with the discussion 
at the end of Sec. III (2b), it is clear that any éxpecta- 
tion value (f) or ($) given by (2.26) or (2.27) can be 
written as a sum of connected diagrams. This provides 
another indication that the nonorthogonality diffi- 
culties do not appear and that the Heitler-London 
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Fic. 9. Terms appearing in the expansion of 
Sy = D~S[h|h]D~. 
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Fic. 10. Diagrams 
for (R|f(ri) |!) and 
(Qu fa) Un. 


method can be used rigorously in calculating energy 
spectra or other quantities. 

The calculation can be carried out as follows: 

(i) First, weshall writean interaction term (k| f(x) |h) 
or (ksks| g(rira;) |Ak) by using notations introduced 
in Figs. 10 and 11. 

(ii) The matrix D~?S[k|h]D~” or 


D=2S[kake| hh JD 


should follow the diagram just obtained in (i). These 
matrices are expanded in terms of f»[A4(»)] and 
a.p QI (R). 

(iii) Diagrams for f»[h(»)] are all constructed ac- 
cording to the prescriptions (1)- (iv) in Sec. III (1b). 

(iv) Before computing ap? (R), we need to draw 
diagrams for ap® (R). This will be accomplished accord- 
ing to the instructions appearing between (3.29) and 
(3.30). 

(v) The desired coefficients ap“ (R) are computed 
by the asymptotic expansion (3.48) and (3.49). 

(vi) Finally, the matrix D-'?S[£|/: ]D7^ or 

DS Rike| Mh D? 

is calculated by (3.3). 

(vii) Repeat the process for all k. Thus, the calcula- 


tion of (f) or (g), including the inverse of the overlap, 
is now completed. 


| 
: 
: 


(2) Calculation of the Ground-State Energy 


So far, our aim was to calculate spin-wave spectra 

of the Heitler-London space. The wave function W 

defined by (2.3), therefore, was a vector iu the complete 

space G of spin waves, and the energy matrix had the 

dimensions of the space G. This matrix representation 

has introduced the matrices AQ/2(R), which are diff- 
cult to calculate. < 

If, on the other hand, we want to compute the 

ground state of the Heitler-London method by assum- 

ing a definite spin function ©° for the system,” we can 

bypass the difficult calculation of A% (R) and we need 

+ to compute the F matrices in one dimension, only. 
- The 2” wave functions Y; defined by (2.3) can be 
regarded as orthogonal and the overlap matrix D as 


% For a ferromagnet or an ionic crystal with closed-shell struc- 

ture, the spin function for the ground state is known and we can 

* use this simplified method without introducing further approxima- 
ions. For any other type of ground state, we have to assume a 
model function ©? for the spins to compute U(P), since the exact 
unction © is not known. This introduces an additional approxi- 
‘to the wave function Y defined by (2.3). In (2.3), we have 
ed the product A of atomic orbitals, but O can, in principle, 
btained by solving the spin Hamiltonian of the type (1.2). 
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diagonal. If D is not diagonal, there exists a unitary 
matrix V with which D can be brought into diagonal 
form such that 


D= viv- f eevyrerva:. (4.1) 


This implies that, instead of W, we should use the 
orthogonal wave functions W'V defined by 


wV= (N) 5: (—1)?P7X (riro, - rw) 
P 


X PO (o1,05,* - -,oN)V , (4.2) 


and, instead of ©, spin functions OG V should be used 
as basis. 

Use of the diagonal matrix D in (2.26) and (2.27) 
leads to 


=L (h) fi) | SER | h Dic 
TL 2| f) [OSEE aD", (4.3) 


(3 4 


hy ho» ki 


 (Riko| graria) | trie) 
k2 


«E2171 h yt» )a5Dii ; (4.4) 
where S[E|A]a4D; and Spies ]uD;* are di- 
agonal elements of F[k|4] and F[ ko | hıh]. This 
shows that, to calculate (f);; and (g), we need to 
compute the F matrices only. 

The overlap matrices D and S introduced by (2.7) 
and (2.18) can be regarded as if they are one dimen- 
sional. Hence matrices F and X introduced in Sec. III 
are all one dimensional and U(P) is a number defined 
by (2.6) where i=j. The description of drawing 
diagrams of the F matrices in Sec. III (1b) is now 
applied to this “one dimensional" case, and the calcu- 
lation will be simplified considerably. 


(3) Intra-Atomic Terms 


As we have remarked in the end of Sec. I, our method 
can be applied to a case where each of the atoms carries 
more than one electron. We need to make only two short 
remarks for this case. 

Since atomic orbitals centered at a nucleus 4° 
orthogonal to each other, overlap integrals between 
them vanish and we need to draw lines between thos? 
belonging to different atoms. It is convenient, therefore; 


ki k2 
| ( hi ho hi SS hg 
hy he 
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Fic. 12. An example of dia- 
grams for arrays of many- 
electron atoms. 


to group electrons belonging to a single atom and draw 
lines only between groups as is shown in Fig. 12. Of 
course heavy lines introduced in Figs. 10 and 11 repre- 
senting interactions can connect electrons belonging to 
one atom. 

The second remark concerns the case where two elec- 
trons occupy the same orbital with up and down spins. 
Here the overlap matrix D no longer has an inverse 
when the entire space G is considered. As we have 
discussed in Appendix B of paper I, however, there is a 
smaller space in which D~ exists and our method can 
be used without any further difficulty. The smaller 
space is in fact a complete space of spin wave spectra 
for this case. We also do not need to consider V! 
permutations but take one out of those generated from 
each other by simply permuting electrons in the same 
orbitals. For example, we need to consider only one of 
the diagrams in Fig. 13 when /; and se are the same 
atomic orbital. Detailed discussion of such techniques 
will be found elsewhere? and therefore we shall not 
discuss it any further. 
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APPENDIX: EXPANSION OF (1--X[A(n)]) 


In this Appendix, we shall discuss the convergence 
of the expansion (3.14) or (3.10). First, it will be shown 
that, if the inequality 


e=} xp[A(n)]| «1 (A1) 


is valid, the series (3.14) converges. 
Both sides of (3.14) can be expanded in terms of 


Ü (P) and it will be found that 
feh] = re (xl h(n) De-- GERQ) P)e— ---, (A2) 


where 


(a[h(n) ]") p 
=p ties >D xoxo Lt]: : “XPQma7 Lh ]- (A3) 
Qi Qmn-1 


^ M. Kotani et al., Ref. 18, also see T. Arai, J. Phys. Soc. 
Japan 18, 718" (1963). 
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Taking the sum of absolute values of (x[4(n) ]7)p, we 
find that 


EAI SE zh] n er. (A4) 
P Q 


Inserting (A4) into (A2), we obtain 


frLhQ))]€gp-Fe-8-----—8gpd-e(1—6e)-. — (AS) 
and also 
2. fe[h(0)]| & 1— 97. (A6) 
I 


"These results prove the original statement. If we trun- 
cate the series after the mth term of (3.14), errors in 
frlh(n)] and F[k(n)] will both be smaller than 
e" (1— e), 

In the forthcoming paper, we shall introduce the 
theorem that the sum L, of absolute values of the ex- 
pansion coefficients fp[(n)] is bounded as 


[1--O(AA)T"X L,X[1—O(AA)] "^, (A7) 


where 


L,=max on h’s of } | fe[Jri- - -hnl hi --ha]!, (48) 
P 


A=maxon h of 3: | (3|/)| , (A9) 
jAh 
Ao=max of |(j|4)| , (A10) 
and 
- AMe[. 1 4449 1.37 AAA, =F 
MED Lh. 
1—A 4(1—A)? 4.6L(1—A)? 
(A11) 


Here we shall show briefly how this theorem can be 
proved. The expansion formula (2.23) of matrices S 
leads to f 


F[-- 4. A] 
= Els] 
POM uu) Ful i), (A12) 
A 


and 
E[ opener j 
= (hn | hny) EL- T, linyn | Voir hrbtner | 
AYO) | ng) FL- - pa] - egi], (A13) 
h 
where --- in the square brackets indicates At»: - 41s 4. 


When the matrices F in (A12) are expanded in terms 
of fp and U(P), the equation ds converted into a set 
of N! equations for fp as follows: 2 


fel 6 -hal du] 
= fpl- » nhng] SHS hla] 2 
HOY hari) foL: - hu] -+ hiin], (A14) 
À 
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Fic. 13. An example of a case of two equivalent diagrams. Here 
electrons / and /i? occupy the same atomic orbital with up and 
down spins. We need to consider only one of them. 


for all P. By taking absolute values of each term in 


(A14) and summing over N! permutations P, we ob- 
tain that 


Sl fol hd 
SEILE tng hid E99 (| 
xX fel: + Tinh | ++ -Tinltngr |. (A15) 


Since the above inequality is valid for any /’s, we find 
that 


1 DAS ee Nt (A16) 
where 
L—max on h’s of X | feL: - ah]: -hnhnill , (A17) 
Ip . 
and h is not involved among hı: - tsi. 
Similarly transposition of (A12) leads to 
Loa Ls Ala, (A18) 
and, from (A13), we also find that 
TEE Noll eet Ne (A19) 
If the relation 
lm <[0 (AAo)/A\Ln41 (A20) 


- js valid for any n, the present theorem will be proved. 
‘In fact, use of (A20) in (A16) and (A18) leads to 
| -EO(AA)]3L.ELsaX[1—-O(AA)]3L, (A21) 


n. If n becomes zero, the matrix F becomes 
=1 and Zo=1. Inserting this result in (A21) 
, we find the desired bound for Lı. Repeated 
1), by increasing 7 one by one, will lead to the 


ne relation (A20) will be proved by an inductive 
process as follows. When »=N—1, Eq. 


(A22) 
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since the second term on the right of (A13) vanishes 
From (2.19), it is clear that FL---hy_i|---hy ] 
= F[- - Asc |: ash], and hence Ly_1= Ly, This 
implies that (A22) is written as 


ly a € AoL xA. (A23) 


Since Ao is smaller than O(AA9)/A, the above relation 
proves the inequality (A20) when n— N— 1. 

Let us assume that (A20) is valid for n=no. Use of 
(A20) in (A18) and (A19) yields 


[1—O(AAo) JL nop € Lm (A24) 
and 
Ingt X [Ao +O (AA) ]Lagai- (A25) 
From (A24) and (A25), we obtain 
Int <[AotO(AAo) J[1—O(AAd) J2La,. (A26) 


Inserting the expression (A11) for O(AA9) into (A26), 
we will find that the coefficient 


[Ao +O (AA) L1 0 (449 T? 


on the right of (A26) is equivalent to O(AA9/A. 
Hence (A20) is also valid for n=0—1. This shows 
that (A20) is valid for any n. Thus the present theorem 
is proved. 

Use of the inequality (A7) in (3.9) leads to 


ec E lerh] «1.0 (849) 
-A(1—0(AA9))? 
-EA((1—0(AA9)) 


Therefore, the inequality (A1) is satisfied when n=1 
and the expansion (3.10) is valid as long as A is small 
as compared with one. 

Use ofthe inequality (A7) in (3.13) will lead in the 
case n=2 to the same conclusion and it will be found 
that the series (3.12) converges. If, on the other hand, 
n increases, the upper bound (1—O(AAo))-” and e in- 
crease indefinitely and therefore the expansion (3.14) 
is no longer valid. However, it will be found that, 1 
calculating an expectation value (f) or (£), the matrix 
F[A(n)] appears in the form Sz, F[h(~)], which van- 
ishes when 7 increases indefinitely since 


Sg, F[A (n) ] X A*(1—0(AA9)" — 0 


as long as A is small as compared with unity. Here Ra 
is a cyclic permutation with which electrons hh i 
hn-shn are shifted to orbitals Jio: - : Jii and Sra $ 
an overlap of the type J R;A*Adv. 


(A27) 


(A28) 
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In this paper we show that the Thomas-Fermi (TF) theory for a neutral atom is the zeroth-order solution 
of the full N-body problem if (a) the following assignments of smallness are made: N-13— e, e—e;d2, 
m-— mo/ €, e — 0; (b) the singlet density matrix in the x representation, p1(,z), is a fast oscillating function 
of the off-diagonal elements, of the type sin (x/—x)/e]; (c) the higher-order density matrices are deter- 
minants of the singlet density matrix as e— 0. The higher-order approximations, however, cannot be ob- 
tained by a simple power-series expansion in e, since the solutions contain e in a nonanalytical fashion. 
Taking into account the exclusion principle to zeroth order in e, and solving the equations of motion for the 
singlet density matrix to the next-higher-(second-) order approximation, we obtain the equations found by 
Kompaneets and Pavlovskii, and Baraff and Borowitz. These contain the Dirac and Weizsäcker corrections. 
Finally, we offer some conjectures about possible improvements of the approximation scheme, 


1. INTRODUCTION 


HE Thomas-Fermi (TF) model of the atom is 
the most successful attempt to account in simple 
terms for the characteristic features of complex atoms. 
Many papers deal with the application of this theory 
to different systems, and with its heuristic improve- 
ments. However, relatively little attention has been 
paid to the systematic construction of a statistical theory 
of complex atoms. The original derivation of the TF 
model was heuristic, and it is hard to tell what are the 
approximations involved, and how one can improve 
upon them. Such a more systematic treatment can be 
obtained in two different ways: expansion in terms of a 
small parameter, or a variational treatment. 

There were three major efforts utilizing an expansion 
technique, all starting essentially from the Hartree- 
Fock (HF) scheme, expanding (1) in Planck's constant, 
7,?; (2) in inverse powers of the number of electrons 
(N) in the atom?; (3) in the off-diagonal elements of 
the density matrix. 

There are, however, two major difficulties associated 
with any of these attempts. First, it is not clear in what 
sense one should take h, 1/N, or the magnitude of the 
off-diagonal elements of the density matrix as small. It 
cannot mean that in the limit when the chosen parame- 
ter tends to zero we obtain a rigorous solution, since if 
h goes to zero the exclusion principle must become in- 
operative, and if JV goes to infinity the atom acquires 
an infinite radius. Clearly, a coupled limiting process 
is involved and one has to determine which parameters 
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tInvited paper at the Washington meeting of the American 
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“P, Gombas, Die Statistische Theorie des Atoms und ihre 
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G. A. Baraff and S. Borowitz, Phys. Rev. 121, 1704 (1961). 
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Fiz. 31, 427 (1956) [English transl.: Soviet Phys.—_JETP 4, 328 
1957) ]. 2 id 
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tend to zero and at what rate. Second, starting with the 
HF scheme the following objections are left unanswered. 
(A) One can show at most in this way that in some 
limit the TF model is as good as the HF scheme; 
actually it might turn out to be better. (B) We may 
indeed find that in some limit we obtain from the HF 
scheme the TF solutions; it also may happen that in 
this limit the HF scheme itself is a very poor approxi- 
mation to the actual N-body problem. (C) If we intend 
to improve systematically the TF,model starting "with 
the HF scheme we have no assurance that the correction 
terms found are not of the same order of magnitude as 
the error committed by starting with the HF scheme 
in the first place. 

Another very popular approach makes use of the 
fact that the HF scheme can be obtained as the Euler 
equations of a variational principle.’ Write out the 
energy of the atom as a functional of the one-particle 
density matrix. Stipulate as auxiliary conditions that 
the one-particle density matrix or singlet density matrix 
normalized to JV has a fixed trace and is idempotent. 
The Euler equations of this variational problem give 
the HF scheme. One is then tempted to go into this 
variational problem with a suitable trial function and 
determine its best form in the usual manner. However, 
a very serious trouble occurs (apart from the fact that 
the accuracy of the approximation cannot be assessed). 
The one-particle density matrix and hence the trial 
function has to obey an additional condition which 
cannot be given a simple mathematical form without 
solving the whole N-body problem: Thé singlet density 
matrix is to be of the form which can be obtained by e 
integrating N—1 times the JV-particle density matrix.» 
While it is obvious that for the trial function this 
implies “smoothness” in sonte sense, it is by no means 
clear how to formalize this condition. Without satis- 
fying it, however, the variational approximations 
obtained can be void of any physical meaning. 


5 J. E. Mayer, J. Chem. Phys. 100, 1579 (1955); H. Koppe, Z. 
Physik 148, 135 (1957). FS 
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For this reason we decided to investigate the possi- 
bility of an expansion in terms of a small parameter. 
Of course, it is possible that the wisest approach will 
eventually combine both methods. "Throughout this 
paper we confine our interest to neutral atoms. 


2. OUTLINE 


Our plan will be as follows: Eirst, we find the small 
parameter involved; then we show heuristically that 
in the limit when this parameter goes to zero the TF 
model is to be expected to give a rigorous solution of 
the N-body problem in question, and we „assess the 
orders of magnitude of the different correction terms 
usually appended to the original TF theory (see Ref. 
1). Then we start from first principles, set up the 
appropriate equations and symmetry conditions and 
show that indeed to zeroth order in this parameter we 
rigorously obtain the conventional TF theory. Next 
we shall show that the actual solutions are nonanalytical 
in the small parameter, hence the next approximation 
cannot be obtained by a simple power-series expansion. 
If, notwithstanding we attempt to do this in an in- 
consistent fashion, by improving the solutions of the 
differential equations to the next order, while satisfying 
the exclusion principle only to zeroth order, we obtain 
the usual exchange (Dirac) and inhomogeneity 
(Weizsücker) correction. This will demonstrate, then, 
that these correction terms are probably not quite 
correct, and that for a consistent improvement over 
the original TF results it is necessary to devise an 
approximation method which can be used if the solution 
is nonanalytical in the small parameter. 


3. THE SMALL PARAMETER 
Characteristic Quantities 


| To construct the small parameter we shall first need 
the characteristic quantities involved in the TF theory. 
A short, dimensional analysis wil accomplish this. 
Assume that the electron cloud around the nucleus is 
confined to a volume Z’; let this be accomplished 
entirely by an electrostatic potential U in cooperation 
with the exclusion principle. (This, of course, may not 
be possible, but if it is, the argument runs as follows.) 
Let there be JV electrons confined ; denote the maximum 
momentum at a given point inside L* by po, the mass 
and charge of,an electron by m and e. 'Then, from 
Poisson's equation 

U/D.—eN/D, (1) 
since the density is =N (Li. The condition that po is 
the maximum momentum reads 


pè/2m=U. (2) 


The exclusion principle stipulates that each phase cell 
should contain at most one electron with a given spin; 
thes condition that the atom be in its lowest energy 
- state requires that no phase cell having momentum 
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less than po be empty. The two conditions together give 


po/=N/L. (3) 
From (1) and (2) it follows that 
U=Ne/L=p?/2m, (4) 


while (2) and (3) give 
poL- hNVS, L-a/NW?; (a- Bohr radius). 


Thus, we obtained the three characteristic quantities | 


po, U, and L as functions of N. (Observe that the 1/ N13 
dependence is connected with the three-dimensional 
nature of the problem. One- and two-dimensional atoms 
composed of charged sheets, or charged filaments, 
respectively, would have characteristic quantities with 
a different VV dependence.) 


Assessment of the TF Approximation 


Suppose you have solved the TF differential equa- 
tions. How can you tell whether the answer is, or is 
not, a correct solution of the full N-body problem? 
The problem is not trivial since the TF solutions do 
not furnish immediately the wave function or the 
density matrix of the N-body problem. There are two 
avenues open. First, the TF solution gives the number 
of electrons with given positions and momenta ; from 
this we may construct in the well-known manner the 
singlet density matrix. In turn, from this we may con- 
struct the density matrix of the whole system, provided 
further assumptions are made about the relation of the 
singlet density matrix to the complete density matrix. 
Once this is obtained, we may verify how far this matrix 
obeys the equations of motion and the exclusion 
principle. This method has the advantage that both the 
satisfaction of the equations of motion and the ex- 
clusion principle can be tested. However, from the TF 
theory it is not immediately clear what are those further 
assumptions necessary to construct the density matrix 
from the singlet density matrix. For this reason we 
choose the second approach. This will test to a certain 
degree the satisfaction of the equations of motion, but 
not that of the exclusion principle. The TF theory 5 
semiclassical theory, consequently it assumes that the 
local de Broglie wavelength varies slowly from point 
to point. Is this assumption satisfied everywhere by 
the solution? If we plot according to the TF theory the 
number of electrons contained in a spherical shell o 
radius r and unit-thickness around the nucleus, 
n(r)4mr2, where n(r) is the density at the point 7, we 
find a function as shown in Fig. 1. For small values © 
7 the function starts as 4/7, it has a single maximum a 
about a/N/3=L, where a is the Bohr radius and N 3s 
the number of electrons in the neutral atom; for large 
values of r the function goes to zero as 7 *”. Associate 
with this density there is an electric potential energy 
U(r) which goes as JVe?/r if r«&L, and as ear for 
r>L. Finally we can define a local momentum 
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Fic. 1. Qualitative dar <1 
graph showing num- 
ber of electrons on 
a shell of radius 
r as given by 
the Thomas- Fermi 
theory. Approxima- 
tion good within 
region where the rate 
of change of the local 
de Broglie wave- 
length is small. EAA 


nr? 


LO s 


b()- [2m (U — E) ]?. The validity of the semiclassical 
approximation requires that dX/dr&1, with A—7J/. 
Write dA/dr —Xd Inp/dr; put approximately Inp~4 InU 
+const with U~7r-”; n=1, rKL, n=4, r2»L. This 
gives An/2r<]1 as our condition. The latter is certainly 
violated if 4n(A/r)&in(h/pr) is equal to 1. There are 
two roots to this equation: a/2N and 2a. Between then 
lies the maximum of the charge distribution (see Fig. 1). 
Hence we may say that for the majority of the electrons 
the TF solution is a good approximation to the Schró- 
dinger equation. 

Is the exclusion principle satisfied? Seemingly yes, 
since the solution was so constructed that one, or no, 
electrons of a given spin occupy each phase cell. How- 
ever, this elementary form of the exclusion principle 
does not hold any longer if density gradients are present 
within the system and a more complicated condition 
has to be satisfied in its place. The derivation of this 
condition requires more preparation. In Sec. 7 we will 
show what this condition is, and how well the TF 
solutions satisfy it. 


Construction of the Small Parameter 


Now let us find the small parameter of the problem. 
Obviously we seek a limiting process such that in this 
limit the equations and the exclusion principle should 
be satisfied everywhere, while keeping the characteristic 
features of the solution unchanged. The usual solution 
is believed to be correct for a/2N «r« 2a. The charac- 
teristic features of the atom represented by this solution 
are the radius of the atom L, and the energy per 
electron We?/L. (According to the virial theorem for 
charged particles the kinetic and the potential energies, 
hence the total energy have the same order of magni- 
tude.) Thus we seek a limiting process where a/.V goes 
to zero, a goes to infinity, while L and Ne*/L is being 
kept fixed. If we keep /; fixed, this can only be accom- 
plished by letting jV go to infinity, e going to zero as 
AN-3?, and m going to infinity as V7", thus IN >œ, 
e= egIV-1I2, m= mN!, eo, mo being the original values of 
e and m. In this limit then the equations are satisfied 
everywhere, and, as we shall show, so is the exclusion 
principle. Formally, we may consider 4 as variable, 
instead of m. Then the same result can be obtained by 
letting #.go to zero as N, i.e, N=, e= eoN 12, 
h=ħ N=. Since we are employing a limiting process 
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in which .V tends to infinity, the question arises as to 
the computation of extensive quantities. The rule is as 
follows. First, compute the value of the associated 
intensive quantity in the limit as V tends to infinity. 
(E.g., if you are interested in the total energy, compute 
first the energy per particle as V tends to infinity.) 
Next, multiply this quantity by the actual number of 
particles. 

Finally, we would like to stress that this selection 
of a small parameter was specifically designed for the 
purposes of the TF problem in such a manner as to 
make the semiclassical approximation valid for the 
entire atom in the zeroth order. For this reason it has 
less generality than the usual method where one treats 
the difference between the actual and the self-consistent 
potential as small; on the other hand, because of the 
more specific nature of the choice it tells us more about 
the approximation. 


Orders of Magnitude of the Correction Terms 


The usual exchange correction to the energy per 
electron is given by ~e?o/h, i.e., the Coulomb inter- 
action between two electrons at a de Broglie wave- 
length 4/po apart. The ratio of this quantity to pi?/m 
is of the order 1/.V?*— €. The inhomogeneity correction 
to the energy per electron is given by —e'a[ grad logn J^; 
its ratio to po?/m is again €. Thus, if we accept eas a 
relevant small parameter, these terms are correction 
terms of second-order smallness. The Fermi-Amaldi 
correction, replacing V with V—1, gives obviously a 
correction of order 1/N-—&, a term of third-order 
smallness. Finally, the correlation correction, intro- 
duced by Gombás, ~e?n"?/ (aj ?-1- 10) is of the order 
etbo/m. 

4. THE BASIC EQUATIONS 


In this section we derive the basic equations of the 
theory, the equations of motion for the reduced density 
matrices. We shall work with density matrices for the 
following reason. The basic aim is to set up expressions 
in which the coordinates of one or two electrons appear 
only, the coordinates of the other electrons being inte- 
grated out. However, the state of one electron inter- 
acting with the others cannot be described by a wave 
function (not being a pure state) while it can be 
described by a density matrix; hence the latter is the 
proper tool. : 3 

Envisage a neutral atom with N electrons, and a 
fixed nucleus with charge number V. The Hamiltonian 
of the system is given by 

is ¢ 
H-Y,(Kc-V)t © 9(x—2z9, 


1«icjeN 
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The state of the system is described by the antisym- 
metrical wave function W(a,:*:,x,D). (In principle 
the spin coordinates should also be included; however, 
the absence of spin variables in the Hamiltonian has 
the effect that the existence of the spin coordinates 
changes only certain factors in the equations. These 
can be put in any time, practically by inspection. 
Hence, for simplicity, we omit them throughout the 
paper, and shall point out when necessary which terms 
acquire different factors.) The density matrix for the 
whole atom in a pure state is defined as 


px (3, F7 ty’ A Evt) 
—W as, joe x SD) (s yy). (6) 


Differentiate (6) with respect to ¢ and express the time 
derivative of Y through the Schrödinger equation. We 


obtain 
ifidpy/01— (H — H^)ow , (7) 


where H’ acts only on the x’ variables. (This equation 
holds true even if py does not represent a pure state.) 
Define now the reduced density matrices p, by 


: 
(nts nlt [nec 


; d S. EE 
x | ente yt 13s stl * Sy UN; 


Xp (8) 
Contrary to the usual definition we do not include any 
combinatorial factors in the definition of the ps. Con- 
sequently, all the density matrices plGHil, 9 25, i) 
are normalized to_unity. The reason for this departure 
is due?to the fact that an N-dependent normalization 
would,;make a limiting process V — c» more unpleasant. 
In Eq. (7) put x43! =s Xot? —3X» `s ty’ — XN, 
and integrate over these variables. We get 


a S t,t) . 


Ops 5 
Her [Gc — K,)- (Y, V 


r=) 


+ X [$(5-25)—9(x—5/)lo«t Q —5) 


1<i<j<s 


x X | Pap lb@i—vo41)— 9G? —2)] 
1€i«s 


Xpsa(x E ys e413 Xi Ja 1) : (9) 


Tf the atom is in a stationary state, the time derivative 
eq al to zero. The exclusion principle is incorporated 

he symmetry conditions on the p,; an interchange 

two primed coordinates, reproduces the function 

osite sign. (Since we shall discuss stationary 

y choose, if we so desire, all density matrices 
"M 


1e new dimensionless variables 
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and the new dimensionless functions of these variables 
"xl AU a ek) — oun 9 
[COLLE TP THE Ws) =R (Yn yes 21° ym)/ Lh, 
Equation (8) becomes 


Rs(y1,° LI Ys; 21," . - 2s) 
zi J ets: t3Xysysijf515.xe0). + 


Write ®(x)=e/L and observe that K,(x)—K,(q’) 

= }12/mL?(6?/dy,02,). Substitute all these back in Eq. * 
(9), and divide by Ne’/L; we obtain (putting dp,/ 
di=0) 

E {(a/LN)(8?/9y.82:) 


--[6(yi-2i/2) 6 (y; 24/2) DR: 
—AN X (eDu—»ytGic2)2] 


1«:«j«s 


—dé[yi—y;— (z,—2))/2]) R.— (1—s/N) 


oS 


1«i«s 


d*ysa[$ yi— Yep + 2)/2) 


—é(yi—322—5/2) JR 55s 
Z7 7,240) 70. 
Introduce N= €? and L/a=e according to the previous 
section. The first term acquires the factor €. Absorb 
one e in the new variable (=2/e, and divide with the 


other e. This way finally we obtain the set of equations, | 
€? in number, we shall work with: 


Qi 
1«i«s Ez 

XR: Fol €) 

-e X, (/9(—»r tC ()/2] 


1<i<j<s 


—e[yi— y;i— «(6:7 52/2] RO: 8 


+ /9[6(y;- «/2)—-66— «23 


eti el) es) » d*ys i (1/9) 


1«i«s 

X[6(yi—9s44- «/2)— pyi Y+ q/2)] 

XReaa(yu: 7 sys; £3: S44019— 0, 
(SUDO 

R 


Here we have explicitly indicated the fact that the 
functions now can depend on e in two different ye í 
first, due to the change of variables from z to m gu 
second, on account of the fact that originally p» 97^ 


>. 


hence R, involved N(—1/&) parametrically Grd 
they were obtained from py or Ry by integration). io 
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rewrite Eq. (10) in a more transparent form. Introduce 
the difference operator D,; it acts on the argument y, 
of a function F to which it is applied ; 


DF (yyys;* Yr :) 
— (/9LF (Y1, Y2, +++, yet t2, +++) 
—F (yy Vay 7773 Vr EA Ne 


Thus in the limit e— 0 it gives ¢,-0F/dy. Equation 
(10) reads now as 


9? 
| =| +9) |j 


Oyi06; 
—d x LOCI 66 ym 
1«i«j«s 
—(1—és) © Py. | P842 
1<i<s 


XLC(DitDer boise) ] a50. (11) 


For sake of symmetry we introduced the delta function 
6(¢s41) and an additional integration over ¢s+ı. The 
functions f, are defined as: 


Ryu: 7:75 enekel) = felyn Ys; En e kale). 


These equations must be supplemented by conditions 
on the f, functions. These are the recursion relations: 


fossil | Bafana 


X (yn 1 sysaifu 65019; (12) 


the normalization conditions: 


ILES 3350, ::0]9-1; (13) 


e) (14) 


must change its sign upon interchange of x and £r, 
or x and xr, kzÉr. 

We wrote out the latter in terms of the variables x, 
x’, to simplify the conditions. Equations (12) and (13) 
are not independent. If we define fo=1, the normali- 
zation conditions are the consequence of (12) for s=0. 
Hence, in the future we need not mention (13) 
explicitly. 

Then our aim is to solve the equation-system (11) 
for the set of f, functions satisfying conditions (12), 
(13), and (14) in the limit e— 0; moreover, if possible, 
to devise a scheme of successive approximations for 
small, nonzero «. In addition we may require that the 
solution®should give the lowest possible value for the 


the symmetry conditions: 
xx, Xy — x £ — 4s 


C 
Lj 2L , ) 2L D AL 9 5 E 


Fic. 2. Relation between old (xx) and new (y,z) labeling of 
elements of singlet density matrix p1(x^,x). 


mean energy per electron. In that case we seek the 
solution for the lowest energy state of the neutral atom. 
If the atom is at temperature T=0, we must require 
that py, or the associated fy should represent a pure 
state as indicated by (6). [ For nonzero temperatures 
all equations remain unchanged except for (6).] The 
purpose of the new variables y, z, and ¢ is the following: 
Envisage pi(4/,x) defined at the lattice points in the 
(x',x) plane, then the y, z labeling of these points corre- 
sponds to a 45? rotation, and a change in scales (see 
Fig. 2); y labels a distance along lines parallel to the 
diagonal, z the perpendicular distance from the diagonal. 
Since our limiting process shifts the problem in the 
semiclassical region, the off-diagonal elements must de- 
crease in size and oscillate fast; thus only small values 
of z become of importance. For this reason we introduce 
the new variable ¢=2/e, which amounts to introducing 
a new unit of length along the z axis, which is e times 
the previous one, using thereby a yardstick more 
adapted to the problem. This is also evident from the 
fact that in terms of these variables the coefficients in 
the equation-system (11) contain positive powers of e, 
and as e— 0 the coefficients become independent of e. 
(At this stage one could take the Fourier transform 
with respect to z or ¢, a natural step to do for oscillatory 
functions; this introduces the so-called Wigner func- 
tions. However, the general treatment seems to be 
simpler without this step, and consequently we shall 
formulate the theory in terms of the f functions, and 
will use Fourier transforms only as a tool to solve the 
resulting equations.) f 


5. CHARACTERISTIC FEATURES OF THE EQUATION 
SYSTEM AND THE SUPPLEMENTARY CONDITIONS; 
NONANALYTICITY DIFFICULTIES 


The solution of (11) subject to conditions (12), (13), 
and (14) is equivalent to the full, correct solution of the 
problem; in order to obtain an approximate solution 
we must study the characteristic features of the 
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problem for small e. As e tends to zero, in (11) all 
coefficients of f, and fs+ı become independent of e; 
moreover, we observe that there are no terms linear in 
e, and two of the terms proportional to e disappear 
entirely. These terms have a simply physical inter- 
pretation. The set of s particles described by f; interact 
with the external field [second term of (11)], with 
each other, and with the IN — s other particles; as e tends 
to zero the interaction within the set of s particles 
becomes negligible compared with the other inter- 
actions, being proportional to e; consistently with this 
the interaction with the N—s other particles can be 
approximated with the interaction of V other particles. 
Because of this, the s particles within the group behave 
as if they would move independently of each other, 
their behavior being determined by the interaction of 
each with the particles outside the group. Thus we 
expect that up to and including order € the fs functions 
should be expressible in terms of fı. Thus the structure 
of the Eqs. (11) suggests that (a) using the variables 
y and t, fs will depend only on positive powers of e 
(this, of course, was just reason to introduce these 
variables); (b) there shall be no terms linear in €; 
(c) up to &, f, shall be expressible in terms of fi. 
Introduce now functions f, (yn: "Yafo S69) in- 
dependent of e, where k denotes the order of the approxi- 
mation. Then our expectations can be formulated in 
the expression ‘ 


fiov: itu Fl 97 22 ef, (15) 


with f,O—7[f,], feP=0, fP =G]; F and G 
are unknown functionals, independent of e. 

'The following questions are to be faced: (a) Does 
(11) have solutions of this form at all; (b) if so, are 
these solutions compatible with the supplementary con- 
ditions; (c) if so, how to determine F and G together 
with f,99, f, from the equations and supplementary 
conditions? The difficulty we encounter in answering 
these questions is due to the fact that the introduction 
of the small parameter gives no clue as to the con- 
struction of the functions F and G, while the existence 
of the solutions will depend on this construction. 
Omitting in (11) the term containing the double sum 
on i and j, which is proportional to e, we see that the 
equation contains only a sum of operators each acting 
on one set of y, ¢ variables with a given index. This, 
then suggests that the functionals should simply be the 
product of their argument functions each carrying 


^ variables with a given index. Indeed, it is easy to show 


that one can find such ssolutions to (11) [see next 
section, Eq. (27)], however, they violate the symmetry 
conditions (4) for s>1, since they correspond to solu- 
tions which obey Boltzmann statistics. It is natural 
that the symmetry. condition will cause trouble if we 


E 


is rather interesting that such a solution should still exist, 
in constructing the small parameter e we made already 
t use of the exclusion principle. 
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use the variables y, ¢. In term of these variables it js 
impossible lo express the exclusion principle in an 
e-independent way. Thus the variables most suited for 
the discussion of the equations of motion are not so 
useful when it comes to express the conditions implied 
by the exclusion principle. Indeed the command: ex. 
change xy and xy, keep x1, x» fixed, amounts to re- 
placing yı with (yry) 3-1 (F1— £2) 5 ye with $ (yiyə) 
—le(nh—2); Ga with (ya—y)/ et-$ Gi - £2); $2 with 
— (yo—yi)/e+3 (61 +2). This explicit appearance of e 
casts doubt on the possibility of expressing f.(|e) asa 
power series in e with only positive powers, if the sym- 
metry condition is to be satisfied. Indeed, for s=2 the 
symmetry condition reads 


faGuys; $1,821 9 
=— fol orty) (6/4) Eit), 2 rr y) 
— (4/4) (S1—$2); (1/6) —»0--3 i FS), 

— (1/9) (y2— y) +3 (Gr +2) ]. (16) 
Now we see why f» cannot be a power series involving 
only positive powers of e, with coefficients which are 
functions of y and ¢ only. f» will be oscillatory in {1, 
and f», ie, in the third and fourth variables. On 
account of (16), this means that it must also be oscil- 
latory in (yı—y2)/e thus it will have to contain, in 
general, arbitrarily high negative powers of e. This does 
not preclude the possibility that there should be a 
solution of e— 0; it only asserts that the e dependence 
of the solution cannot be analytic around e=0 if it 
also satisfies the exclusion principle. This squashes our 
hopes of finding a solution of the form (15), and we 
must go about the solution differently. First, let us see 
whether there is a solution for e=0; next, investigate 
the possibility of an approximation scheme which is 
not a simple power-series expansion in e. In the re- 
mainder of the paper we will discuss the e=0 approxi- 
mation and only touch upon the more difficult problem 
of the approximation scheme. 


6. THE ZEROTH-ORDER APPROXIMATION 
The Reduction of the Hierarchy 
For e— 0, D, t, (8/8y;), and the equations read 
| o? 86(y) 


ah 
9y1061 yi 


n 191,51) 


ð 
= [e| iem [ors t00; (17a) 
ôyI 
9? 8$ (y) 
xj 
i<igs 


Fir n 51385 EE) 
o mm Je ID 


06 (y:— 941) 
xE Bye i] 
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X faa Ya; 7 +5620) =0, ? 
(s=2, 3, °° ae (17b) 


NE 


aw Tr eer 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


THOMAS-FERMI 
with 


fO: i outysto [yan 


XOv:-sysxaiifu:s£40); (18a) 


where 


xy +a, Xs -Fts xy — x x,—2, 
s EAA ; TE (18b) 
2L 2L eL 


eL 


must change its sign if we interchange any two primed 
variables, x/, xj, say (i21, ---,5; j—1, ---,5; ij), 
or any two unprimed variables, x; x; say; all f, must 
be normalized to unity, the f, must describe a pure 
state, the energy must be a minimum. 

Now we construct the functional F, guided by the 
Hartree-Fock scheme. To do this we temporarily return 
to the density matrices ps. Let fi?(y,ti) be inde- 
pendent of e, qua function of y and ¢; a p1(2^,x) is now 
defined by 


Lp; (21',1) -fe( 


xi +i ==) 
Eu de 


pı, of course, will depend on e in a complicated way. 
We proceed to f, and ps, (s? 1) in a similar manner. 
With fs(y1,° 259/551 555 s S£] €) a ps (3i, “ 25545 Xn "ie, 
is associated by 

fu “tys; (ir à +f.) =L* Gato Y gig e Xp ott) 


where 
A p 1 
xr /L=yrt zett, tk=yr— ekk. 


The functional relation is now obtained as follows: 


First, construct p, as a determinant, 


ps(x1',° . “de Ab . + Xs) 


N:(N—s)! pii a1) m Goa) 
xU : : (19) 
N! pilEs 81) + pil%s sXe) 


Next, express all the «’s through y and ¢, and put 
N=. Finally, let e— 0 wherever it still appears, 
after everything is expressed in terms of y and ig (not 
x). The resulting f, will be our zeroth approximation. 
(The appearance of the W* factor in addition to the 
more usual combinational term is due to the fact that 
here p; is normalized to unity and not to N. This way, 
as N—oo the combinatorial factor goes to one.) These 
functions should satisfy the supplementary conditions 
and the equations of motion. Let us start with the 
supplementary conditions. The recursion relation be- 
tween p» and p; reads as 


; | eto mtn. 
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According to (19), p; is given by 


p2 (xi sl j X12) [N/(N— 1) Jos Coi x) (s 2) 
—pn Gi yieo)pyGr 2) ]. 


Substituting this in the previous equation we obtain 
the integral condition 


y [es (x^ x" )pi(x" ax) = pua). (20) 


Now we show that if (20) is fulfilled, the other recursion 
relations for s22 are also satisfied. Take po, with 
X, =%, and expand the determinant according to 
elements in the sth column; multiply the coefficients 
into the last row and integrate over x,. The integration 
can be performed using (20) and we indeed obtain 
p:—ı as defined by (19). This shows that all recursion 
relations are fulfilled if (20) 1s satisfied. Let us demon- 
strate now that if (20) is satisfied the system is in a 
pure state, Le., it is at the temperature zero. The 
system is in a pure state if py is idempotent, i.e., if 


Hn 7 H n eee P NI 
[es xu" pn (ai, EN 5X1 ,- +N) 


Xen (x1, mur ^ QN) 


= px (xi: $ XN js, *yXN)- 


Again, we can perform the integrations using (20) and 
we find that px is idempotent. Thus, the only conditions 
not yet satisfied through (20) are the normalization 
condition on fi, and the minimal condition on the 
energy. We now express (20) and the normalization 
condition in terms of f; ?. Substitute in (20) 
pi fi (y )/ D. with a//L— y--À3et, x/L— y—3e( and 
put N— €, Change the integration variables from x^ 
to s by (x"—x)/eL—5; one gets 


J emos £—5)fiO [y—5e($—5), s] 
-f9(). (21) 


Letting e— 0, we obtain 
/ Bs fr (y, € -9/ 0.3) iQ ). Q2 

The normalization condition reads 
i: fu (,9)8y-1. 


Thus, our problem is reduced to show that the so con- 
structed f, functions satisfy (17), (22), (23), and the 
minimal condition on the energy. We shall accomplish 
this in two steps. First, we show that the f,“ functions 
for s>1 satisfy (17b) if fi satisfies -(17a); next, ave 
show that the set of equations (17b), (22), and (23) for 
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fi, coupled with the minimum condition is nothing 
else but the TF theory. To accomplish the first step we 
need the f. functions in terms,of f. 19. Let us start 
with fo. Before going to the limit «=0, we get from 


(19) : 
falyryy23 atel 9 =o (yni) fa (2,52) 
FAG 


1 € m= 1 
TO ort £2); tt) 
2 4 e 2 


E (1—»3) 


+t) . QU) 


€ 


In the semiclassical region the off-diagonal elements of 
pi decrease in size and oscillate fast. In terms of fi 
this means that fi for a fixed value of y must tend to 
zero and oscillate fast as ¢ tends to infinity. Thus we 
i seek solutions for which fi(y,9)-—0. Now let e tend 
1 to zero. The second term is then always zero, unless 
317 y» in which case it has the finite value 


fi Dy (34-12 AOL, 3 (3-12) ]- 


Hence, 


fo (yy,yo; $182) EA (1,61) fa (ys t2) 
—Óy nofi Drs 2 (att) i Dys, 2 ((i+f2)], Q3 


where 6,,0 is a Kronecker ô and not a Dirac 6. [We must 
add the stipulation e — 0 since the symmetry properties 
required by the exclusion principle are expressed 
through the variables a^, x and their relation to y, ¢ 
involves e. This makes e appear each time explicitly in 
the right-hand side of (25) whenever we interchange 
x’ and «.] This result can be easily generalized for the 
higher f. functions. The rule is as follows. Think of 
the expanded determinant in (19); each term in the 
sum will contain sp: factors; of these, some pı factors 
will have the same index on both the primed and un- 
; primed x variables, e.g. pix xx); the others will 
. Carry variables where the two indices differ, e.g., 
t /,xj). To get the corresponding fs, replace each 
ere the indices are the same, say k, by fi (yt); 
lace each pi(xy/,xi), where the indices on the 
ariables are different (k and 1, say) by vru ofi 
(cetti). Now we are prepared’to solve (17) 
it e— 0. Write (17a) as 


argument, e.g. 


291,925 S182) - 


L.: 
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The set (17b) reads then 


Se (rs) 


r=l 


is Jrssilfen(L 777, 52-1) 5 00 (0) 


r=l 


We first show that (27) is satisfied if f, is a product of 
fis, provided fi satisfies (26). [By (25) f, is this 
product if all y's are different.] Then fi, s) 


= fy(1)---frO(s), and Jr eal fori, SENI 
=f (1) f (s)Jr,e41L fi (s+1)]. Equations (26) 
and (27) become 
Afr OHAL Hie 9 2) ]70; (28) 
1509 (2)- -- fa ())(4 fi (A) 
+ fr (Sr saLA9 6+1)]} 
+ fr (01509 (3) -- fr () (4:9 2) 
+ fi (2)J2, aL fi? (s--1)]) 
--same for index 3--: +- 
+same for index s—0. (29) 


Thus, if fi(1) satisfies (28), then (29) is auto- 
matically satisfied for any s. If the y’s are not all 
different, we proceed as follows. First, we observe that 
in (26) the second term in fz [see (25) ] does contribute 
nothing, since we integrate a function, which is zero 
everywhere except at à point, where it is finite." Next, 
we observe that for the same reason the integral in 
(27) contributes nothing as long as in the integrand 
there is a Kronecker ô present which carries a Ys+1- 
such terms must carry a factor fi (s+1) according 
to the construction; thus they must be pi (ssi sent) 
times the minor of p1(x,41i,443) in the determinant. 
Consequently, under the operator J, s41 We can vorile 
fel, QE st+1)=f.(1,- = 5) fi (s+1), and (27) be- 


comes 


X Ad sse fuss) 


r=1 


XS TrelfOls+YJ=0, G=1,2). G0 


T—l 


[This expression also contains the product of all the 
diagonal elements, for which we have already shown 
the truth of (30). ] Now interpret 8,9 as an even function 
of y, then its derivative is zero, and A, does not act 02 
the deltas in the product; this makes all deltas a mult 
plicative factor in (30). If all the y’s are not equal to 
each other, this factor is zero, and the equation 15 
satisfied; if all the y’s have the same value, cance the 
5 factors, and we are to satisfy the set (30) with the 


7Tt is easy to see that the (y1—92) ? type of singularity due to 
0$ (91 —92)/8y1 does not affect this argument. Ns 
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stipulation y— ys— - -- =y, Now again, as before, the 
equation breaks up into a sum of terms, each of which 
vanishes, if (30) is satisfied for s— 1. This is, however, 
satisfied if we require (28) to be true. Thus, in the limit 
e=0 we have solved the whole hierarchy of equations 
if we can solve (28) for e=0. 


The Reduced Hierarchy and the Dirac Scheme 


The problem is then solved if we find a solution of 
(28), (22), and (23) which minimizes the energy. 
Equation (28) is the equation of motion of the singlet 
density matrix; (22) expresses the two auxiliary con- 
ditions that the exclusion principle should be satisfied, 
and that the system should be in a pure state; (23) is 
a normalization condition. This set of equations are 
related to the set which formed the starting point of 
Dirac's investigations? Dirac introduces a singlet 
density matrix, normalized to N, and stipulates that 
this density matrix should be a solution of the Hartree- 
Fock scheme with the subsidiary condition that the 
singlet density matrix should be idempotent, and that 
the energy be a minimum. Then he proceeds to solve 
this set of equations in the limit as Planck’s constant 
tends to zero. Equation (28) is related to the equation 
of motion in the Hartree-Fock scheme, (22) to the 
idempotency condition, (23) to the normalization con- 
dition. However, in the further development there 
appears a disparity. Dirac solves in this limit the 
idempotency condition by an heuristic assumption on 
the form of the singlet density matrix; using the so 
resulting density matrix in the equations of motion he 
finds the TF solution, amended by exchange terms, but 
with no other corrections. This is astonishing for two 
reasons. First, according to our estimates in Sec. 3, the 
exchange terms are second-order corrections. Second, 
there is another second-order correction, the Weizsacker 
correction. Thus, either no correction term should 
appear, or both. (Indeed, one of the more mysterious 
features of the Weizsücker correction has always been 
that it somehow failed to appear in Dirac's approxi- 
mation.) The inconsistency arises through the heuristic 
assumption used in solving the idempotency condition. 
This is a correct lower-order approximation; however, 
to this approximation the exchange terms in the equa- 
tions of motion do also vanish [see (25) ], the same way 
as the terms containing the Weizsäcker correction, and 
which indeed were tacitly put equal to zero by Dirac, 
when he equated the kinetic-energy term with its 
classical limit, p°/2m. 

For this reason in the next section we shall analyze 
the solution of our set of equations with somewhat 
greater care as usual, to demonstrate that in zeroth 
order the TF solutions are only obtained. To accom- 
plish this aim we solve the equations in a manner which 
exhibits the similarities with the usual TF derivation 
as much as possible, instead of striking out along the 


path suggested by Dirac. 
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7. THE THOMAS-FERMI SOLUTION 


We seek the solution of (28), (22), (23) which makes 
the energy a minimum. Let us start with (22) and (23). 
Fourier analyze fi? in ¢: 


fi (y,6) = Qr)? J do (y,R)e* 5. (31) 
Equation (22) asserts that 
wy, by — wy), (32) 
or 
wy,k)=1, or O. (33) 


The location of the boundary h(y,k)=0 separating the 
w=1 and w=0 domains (which can be multiply con- 
nected) is left undetermined by (22), while the nor- 
malization condition (23) merely requires that the 
volume of the w=1 region be (27)?. Observe that the 
determination of the boundary 4=0 completes the 
solution of the problem. This is furnished by Eq. (28) 
and the minimal condition. Equation (28) written out 
in full reads: 


3 dgly) 
| +i lpo €) 
9yo(  ðy 
À db(y—y2) CS 
— (8) | By E (920) =0. (34) 
-0y 


Substitute (31) into (34) and observe that we 
=—iw(de™5/dk) — 1(dw/dk)e** by partial integra- 
tion. We obtain: 


k(dw/dy)— (dV/dy) (dw/dk)=0, (35a) 
¥(y)=—469) + | Py (y) ] Pkgo(ys o); 
w(y,k)=1, or 0. (35b) 


Equation (35) is the same as the classical Liouville 
equation for a one-particle distribution function w(y,£), 
k being a dimensionless momentum variable, y a di- 
mensionless position variable, and Y a dimensionless 
potential energy. The latter is a self-consistent potential 
since it depends on fd*kw(y,k), which is proportional 
to the (yet unknown) density: thus (32) is the self- 
consistent field problem of Thomas and Fermi. Equation a 
(35a) gives the equation of motion, (35b) the exclusion, 
principle at T equal to zero. This is to be solved with 
the auxiliary condition that the total energy Jd? 
xJ'diy(BE-W)yw be a minimum. What* restrictions 
are placed by (35a) on the location of the boundary 
h=0? A straightforward substitution shows that 
w(y,k) can only be a function of the combination 
34+W(y). (In a stationary state. the distribution 
function can only be a function of the constants of | 
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motion; the angular momentum being by hypothesis 
zero, only the energy £—5-F Y remains.) Hence, the 
boundary #=0 must be a £= const surface, or several 
different energy surfaces pieced together. 

To determine the boundary we must know (a) which 
energy surface, or surfaces, will be used, i.e., the deter- 
mination of the constants in the E=const equation; 
(b) the potential energy Y. 

We first answer (a). In thé 6-dimensional (yk) 

space (Fig. 3) one constructs the H=const surfaces; 
they will be spherically symmetrical in both the k 
three-space, and y three-space. The former is a con- 
sequence of E containing E only; the latter is due to 
the fact that V can only depend on |y], since ¢ is a 
function of |y|, and so is ^ d*kw(y,k) by symmetry. 
Were the angular momentum of the atom different 
from zero, a preferred plane would exist. (In the one- 
dimensional situation depicted in Fig. 3. This symmetry 
means that the E= const surfaces are symmetrical with 
respect to both the y and k axis.) By assumption only 
bound states should exist; thus w(k,)y=W (E) must 
be zero if E>0; for E<0 W (E) must be zero or one. 
In other words, we could have a W which consists of 
layers of 0 and 1, each layer bounded by Æ= const 
surfaces. (E.g., we could have on Fig. 2 W= 1 between 
the E=0, E— E! surfaces, W=0 between the E-E' 
and E= E" surfaces, etc.) 

Now, it is possible to show that (a) the minimum 
energy obtains if there are no gaps, thus W (E)—1 if 
E<E-<0, where E. is a critical energy; (b) the 
equations have no solution unless E.=0. (The layered 
distributions correspond to excited states.) Thus 


W(E)1 ESO, 
W(E)-0 E»0, 
E=3P+W(y). 


To answer (b) we proceed as follows: Take the 
Laplacian of VY with respect to y. We obtain 


(36) 


AS — An (y) — 4n | ato, (37) 


since Ad=A(1/|y|)=—475(y). The integral over w 
can be simply performed. Fix y; for each y, k can range 
over all values less than or equal to km(y) defined by 
Ikm +Y —0; for all these values of k, w is equal to one; 
hence the integral is (47/3)&,?. (The shaded area is 


Fic. 3. Qualitative pic- 
ture of (k,y) space, reduced 
n space of electron, k-re- 
duced momentum,  jy-re- 
duced position; several con- 
stant-energy surfaces are 
shown. 


I 
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Fig. 2.) Thus we get the two equations 
AW = — 4mó(y) — 4 (4r/3)km (y) , 
bkw +Y -0. 38) 


Equation (38) constitutes the standard set of Thomas- 
Fermi equations in dimensionless variables. The nu- 
merical factors are different. This is due to two trivial 
causes. First, we did not scale out 4m's and other 
numbers with our choice of L; second, as we mentioned 
we omitted the factor 2 arising from the additional 
summation to be performed over the spin states. This 
then shows that in the limit e— 0 the TF theory is a 
rigorous solution of the N-body problem. 


8. THE INCONSISTENT NEXT APPROXIMATION 


We have seen that the simple power-series expansion 
(15) does not exist. In spite of this it may be worth- 
while to pretend temporarily its existence, in order to 
observe through what anomalies the equations inform 
us that we are attempting the impossible. 

The pattern is clear. First, we have to assume a 
functional G which connects the fs with fı to the next 
order in e, still satisfying the exclusion principle. This, 
with the recursion relations gives a condition on fi 
which must be solved together with the equations of 
motion for the fs. Let us assume that G is of the same 
form as F; thus, we extend to second order in e the 
determinantal construction of the f, from fı. Then the 
recursion relations give again Eq. (21) as a condition; 
with fi(y4 = fi (T € fi (y,¢) in place of fi, and 
with y--iet, y—3e¢ appearing in the arguments. Ex- 
panding it to second order in e and equating the co- 
efficient of & to zero, we get an inhomogeneous linear 
integral equation for f®, where the homogeneous 
equation has only zero as its solution. Hence the in- 
homogeneous equation has, at most, one solution. This 
means that if a solution exists we have found f: to second 
order without ever investigating the equations of motion 
to second order. It is clear, however, that such a solu- 
tion must be incompatible with the equations of motion 
for the following reason. We can modify the functional 
G in an infinite number of ways to give the same I€ 
lation between fı and f», but different relations. be- 
tween fı and fs, fs, etc. This leaves the hypothetical 
solution unchanged but will modify the equations of 
motion. Thus, this specification of G is too restrictive. 
The solution, however, does not exist. If we formally 
solve the equation we find that the solution diverges: 
An inconsistent solution can be obtained, however- 
Assume that f» is a determinant, put 


fro fi 0,045659 0,0 


and expand it to second order in e. Insert it in the 
equation of motion for fı, and equate to zero separately 
the coefficient of 2. This way we obtain the previous 
zeroth-order equations for fi), and an inhomogeneous 
equation for fı®. This equation is the one found by 


j 
d 
1 
4 
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Kompaneets and Pavlovskii, and Baraff and Borowitz 
containing the correction terms introduced earlier i 
a more heuristic way by Dirac (exchange correction) 
and by Weizsücker (inhomogeneity correction). As we 
see, this result is then inconsistent in the sense that the 
exclusion principle is satisfied to zeroth order, while the 
equation of motion for f; is satisfied to second order; 
moreover it is left open how far the other equations of 
motion for f, with s2 1 are satisfied, since this involves 
a decision about the functional dependence of faon fi- 


9. CONJECTURES 


As we see, the solution of the hierarchy of equations 
for the reduced density matrices involves three steps: 
the discovery of the proper small parameter; the 
establishment of a functional relation between the 
singlet density matrix and the higher density matrices; 
the invention of a calculational scheme to improve the 
approximations. We shall now comment on each step, 
bearing in mind what we have shown before. First, we 
observe the general difficulty that there is no general, 
all embracing postulate from which all three steps 
would necessarily follow. We first picked a small 
parameter such that in zeroth order we should recover 
the TF model; this of course does not make the small 
parameter unique. However, the parameter of smallness 
gives no clue as to the construction of the functional 
connecting the higher density matrices with the singlet 
density matrix. Moreover, this functional relation 
(chosen in such a way as to satisfy the exclusion 
principle) forces the small parameter already picked to 
appear in a nonanalytical manner. What we are missing 
is a postulate which connects the two separate assump- 
tions about the small parameter and the functional 
relation. The basic feature of the TF model is that the 
energy is a functional of the density only. This unique 
feature should be incorporated from the beginning in 
some manner, as for example to seek those solutions of 
the hierarchy for which the singlet density matrix is a 
functional of the density (its diagonal elements) only, 
the pair density matrix a functional of the singlet 
density matrix, etc., and trying to determine the un- 
known functionals through the expansion in terms of 
a small parameter. Sidestepping this basic predicament, 
the present treatment suggests two basic improvements. 
We observe that the present small parameter essentially 
expresses the smallness of the off-diagonal elements of 
the singlet density matrix, but it does so in a very crude 
way. Think of the singlet density matrix m the x 
representation; our expansion says that only those 
elements are of importance which lie along the diagonal 
in a strip of uniform width e. In reality this strip will be 
far from being uniform. As we go (in the x represen- 
tation) along the diagonal, the diagonal elements p(xyx) 
give the density as a function of the radial distance 
from the nucleus. The heuristic considerations in Sec. 
3 show that the TF approximation is good around a 
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distance L from the nucleus but poor everywhere else. 
Close to the nucleus the field is changing too rapidly ; far 
away the density is insufficient. Thus, this strip is 
rather wide at both ends, becoming narrow in between. 
Thus, it is unwise to assign the small parameter e uni- 
formly in the (y,¢) space; most likely there will be 
different asymptotic regions in this space with different 
correct assignment of smallness. One way to accom- 
plish this is to introdüce the unknown density distri- 
bution in the scale, and hence in the small parameter. 
Thus the small parameter must be specified implicitly. 
This already may make the nonanalyticity difficulties 
disappear. However, it seems to us that another remedy 
may also be required. In the present scheme, as it is 
usual, we insisted that the temperature of the atom is 
absolute zero. This may be too restrictive a stipulation. 
For, is it not rather preposterous to state, that you 
wish to determine the energy of the atom within an 
accuracy of, say, 20%, and at the same time to insist 
that you know its temperature exactly and not only 
to the same accuracy as you know the energy? It is 
likely that a coupled limiting process is required, in 
which the temperature is treated as an additional 
small parameter. Preliminary work suggests that this 
does indeed eliminate some of the difficulties, since at 
finite temperatures Eq. (22) acquires an additional 
term kT0f/dp (u being the chemical potential). Thus, 
we have essentially a “boundary layer" problem in pz 
around the Fermi surface, kT being the thickness of 
the boundary layer. It is a well-known fact of asymp- 
totic analysis that one must of treat such cases by 
putting simply the boundary thickness equal to zero, 
since this obliterates the highest derivative in the 
equation. 


10. SUMMARY AND ‘CONCLUSIONS 


We have shown then, that there exists a small 
parameter and a functional relation between the singlet 
and higher density matrices such that in the limit as 
this small parameter tends to zero, the TF solutions are 
exact solutions of the full V-body problem, for neutral 
atoms at absolute zero temperature. One surmises that 
the same would hold for ions as well, if under ionization 
we mean that a fraction of the total number of electrons 
are removed (and not a given number). The problem, 
however, involves the small parameter in question in a 
nonanalytical fashion and for this reason a simple power- 
series expansion cannot be used to go to the next ap- 
proximation. If we are content to satisfy the exclusion 
principle to zeroth order, and te equations of motion of 
the singlet density matrix to next order, we ebtain the 
equations found by Kompaneets and Pavlovskii, and 
Baraff and Borowitz, which contain the exchange and 
inhomogeneity correction terms. Finally, we suggest 
possible steps one could take to get rid of the difficulty 
generated by the nonanalyticities. 
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A convenient method is devised for the calculation of magnetic hyperfine constants in atoms, molecules, 
and metals taking into consideration the exchange interaction between the core electrons and the unpaired 


valence electrons. In this method, the core-electron wave functions are perturbed by the nuclear mag- 


states, we obtained values for the core contribution 


I. INTRODUCTION 


NUMBER of recent detailed and careful investi- 

gations!” of the magnetic hyperfine interaction 
in atoms and paramagnetic ions have clearly demon- 
strated the important role of exchange polarization of 
the core electrons in contributing to the magnetic 
hyperfine interaction constant. In all these investi- 
gations, the method that has been employed has come 
to be known generally as the unrestricted Hartree-Fock 
(UHF) method. In keeping with the recent attempt of 
standardization of nomenclature," we shall call the 
unrestricted Hartree-Fock method the spin polarized 
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netic moment via the Fermi contact term, and the energy of the system is then calculated in the Hartree- 
Fock approximation using the perturbed core wave functions. The present method is closely related to the 
exchange perturbation method of Cohen, Goodings, and Heine. However, the former has the advantage of 
being more flexible in the sense that the same perturbed core-electron functions may be used for the ground 
and excited states of the atom and for metals without significant error. For lithium atom 1s°2s and 15225 


to the hyperfine constant a (in al- S) of 83.76 Mc/sec 


and —8.9 Mc/sec in good agreement with the earlier values of Cohen, Goodings, and Heine. We have 
applied this method to a calculation of the core-polarization correction to the Knight shift in lithium metal 
using recent wave functions of Kohn and Callaway. The core-polarization corrections produced by the s 
and ? parts of the conduction-electron wave function are nearly equal but opposite in sign, while that pro- 
duced by the d part is an order of magnitude smaller. 
direct contribution to the Knight shift from the conduction electrons. 


This results in a net correction of about — 5.3% of the 


method (SP). Similarly, the projected unrestricted 
Hartree-Fock method will be denoted as the projected 
spin polarized method (PSP). Some of the investi- 
gators-^9? have handled the SP method self-con- 
sistently, while others have used a perturbation 
approach. To avoid confusion we shall refer to the 
method of treating the exchange potential as a. per- 
turbation, the exchange perturbation method (EP). 
In recent papers, Nesbet,” Marshall,’ and Heine!’ have 
discussed possible errors that can occur in the results 
of calculation by the SP method, because the many- 
electron wave function used in the SP method is not an 
eigenfunction of S? where S is the total spin of the 
atom. From the investigations of these authors, one 
arrives at the conclusion, that for paramagnetic ions 
and atoms, this limitation of the SP method is not à 
serious source of error; but in extending the SP method 
to metals and molecules, one has to be more careful 
about the influence of this source of error. 
Meee We*are interested in a perturbation 
EE SeA ch has the advantage of flexibility over the 
od while not sacrificing accuracy. The accuracy 


12 
miei In qe ERod should be distinguished from the SPP 
after an SP ene one applies the projection operator 
of S? On is performed to obtain an eigenfunction 


13 V. Heine (to be published). a 
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of the method can be easily compared with other 
perturbation calculations, which we shall show to be 
exactly equivalent in principle to our method. Specifi- 
cally, what one does is to perturb the core electrons 
with the nuclear magnetic moment (u) via the Fermi 
contact interaction ; and with these perturbed functions, 
one then computes the energy to first order in u, which 
is directly related to the hyperfine coupling constant. 
This method, which we shall call the moment per- 
turbation method (MP), differs from the EP approach 
only in the order of application of the perturbations, 
since there the core electrons are perturbed by the 
exchange potential rather than the nuclear magnetic 
moment. The flexibility of the method arises by letting 
the nucleus perturb the core functions and approxi- 
mating (which we demonstrate by our lithium atom 
calculations to be a good approximation) these per- 
turbed functions as being independent of the configu- 
ration of the outer valence electrons. Thus, we assume 
that the interaction between the magnetic moment of 
the nucleus and the core electrons of the atom is not 
greatly altered when one goes from the atomic ground 
state to an excited state, or to molecular and metallic 
states. This has the advantage of enabling one to 
calculate exchange-polarization effects when the system 
is in a variety of environments without having to solve 
separate differential equations in each case. In order to 
demonstrate the practicality and flexibility of the MP 
method, we have calculated the hyperfine interaction 
constants for atomic lithium in the 1s?2s and 1s?2p 
configurations, and compared them with the results 
of earlier calculations. We then apply the MP method 
to calculate the contribution of exchange polarization 
to Knight shift in metallic lithium. In all these calcu- 
lations, the perturbed core wave functions used were 
those calculated for atomic lithium 1s?25 configuration. 
This procedure“ is analogous to similar cross pertur- 
bation methods that have been used for the quadrupole 
antishielding factors for ions? and for the calculation 
of nuclear magnetic shielding coefficients in molecules." 

Tn Sec. II we describe the details of our method and 
its correspondence with the EP method. In Sec. III the 
method is applied to lithium atom 1s?2s and 15725 
states, and the results are compared with earlier EP 
calculations. In Sec. IV we calculate the core-polari- 
zation correction to the Knight shift in lithium metal 
using the recent Wigner-Seitz wave functions obtained 
by Kohn and Callaway.!" In Sec. V we shall discuss the 


14 While this work was in progress, we received a preprint of a 
paper by Heine (Ref. 13) in which he mentioned that E. Simánek 
had independently proposed a similar method to Heine to avoid 
some of the inaccuracies of the SP method that were discussed in 
Ref. 13. 

15 R, M. Sternheimer, Phys. Rev. 84, 244 (1951); R. M. Stern- 
heimer and H. M. Foley, ue 102, 731 (1956); T. P. Das and R. 
Bersohn, zbid. 102, 733 (1956). k: 

16 T. P. Das and R. Bersohn, Phys. Rev. 115, 897 (1959). 

1 W. Kohn and J. Callaway, Phys. Rev. 127, 1913 (1962). 


limitations of the MP method and the problem of self- 
consistency. 


THEORY OF THE MP METHOD 


The general procedure followed here is similar to that 
employed in some other perturbation problems. How- 
ever, for the sake of completeness, we have presented 
here an adaptation pértinent to our problem at hand. 
Let us consider a general system whose unperturbed 
Hamiltonian is 5) and whose ground-state wave 
function is Yg. 


Foko = Eo 5 (1) 


Let the system be perturbed by two general first-order 
perturbing forces described by Hamiltonians 36g and 
iy. If ôF g and 6¥y represent the first-order changes in 
the wave function of the system due to J€g and JCy, 
respectively, then the energy of the system correct to 
the second order is 


(Vo-5Y et by | 3654-3€ p -3Cy | Vo - 0 p 1-0 y) 
(tott g- 0 v | Vo -0V g-I- y) B 


5 


Q) 


On simplication, after making a binomial expansion 
of the denomintaor, one gets 


BZ {Eo HE+ Ey Yo | Re [ôF g)4- 2(v, | Tey | ôV g) 
+2(Wo| 3€ z | V nH Fol Fy Ey) tF g | C | OV ez) 
+ôYy | gc | OV y) + 2(9Y E| Ho | ôF x)) 

— {(6Vz|6V2)—6Vy|dVy)—2(6V2|6¥y)}. (3) 

After omitting terms higher than second order, Eq. (3) 

becomes 

E= Eod Ecl- Ex - (V s |3€9— Ec | Wy) 

+(OV g | 69 — Eo | OW z)+ 2(8V g|3€9— Eol ôl y) 
+2(% | zg | 6Vz)+2(Yo | Iy | OW g) 
+2{Vo|ICz|O¥y)+2Vo| Cw |o), (4) 


where we have assumed that the perturbed wave func- 
tion is normalized to first order. That is, 


(Tol 9 z) — (Vol d¥y)=0, (5) 
and the first-order changes in energy arè given by 


E.=(Vo|5z| Vo) , 


Ex (Sl Sv | Wo). e 


The second-order change in energy is given by 


EO —2(Vo|3g | dV g)-- (OV z | Ho— Eo] ôF g) 
4-2(Vo| SC | y) H- (9 | €9— Eo | rw) 
4-2(Vo| € | Wy) - 2o | Cy | Ve) - 
+ ôF g| 3€o— Eo] OV) > (7) 
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The terms in the second-order change in energy can be 
regrouped!’ as 


TOZ Pol 3C g | NV 2) + (V | 9 — Eol OV) ; 
En ES 2(Vo| In | 6Vy)+ (OV Kir Eol dV) 3 


Een = 2(Yro | C | Sr y) H- 2(Vro | Cx | VV £) (8) 
+ XOY g | Jom Eol ON x) . 


The quantity 7, is the second-order change in energy 

due to JCs alone; Ex is the second-order change in 

energy due to 5Cy alone; and Eey © is the second-order 

change in energy due to the interaction of 36g and Sty. 
The first-order equations for 6¥z and 6Wy are 


(5y— Eo) Y r= — (3€ x— E.) Yo, (9) 
(305— Ey) OV — — (Cy — Ev)Wo, (10) 


which may also be derived by minimizing £,? and 
Ex with respect to Vg and 6Wy, respectively. From 
Eqs. (9) and (10) it is easily seen that 


(OWy [ICE | 7o) — (Fz | Cv | Vo) 
= — (BV g|3€9— Eo | Vy). (11) 


Hence, using Eq. (11), Eey ? reduces to 
Ey = (ôV g | Cy | Vo) J (12) 
Ew = (Wy |3C y | Vo). (13) 


One could now apply this result to our problem of 
core polarization. For the perturbing Hamiltonian 5C, 
we take the exchange potential at the position of the 
core electron due to the unpaired valence electron and 
for 3Cy, the Fermi contact interaction due to the nuclear 
magnetic moment. 


ree he) Í V2) : Veo (2)d 1 

HL va —_¥Yeore T2; 4 
Weore(1) ; 712 
16m 

ing S0 . (15) 


In Eq. (15), y. is the magnetogyric ratio (e/2 Mc/sec) of 
the electron, yx is the magnetogyric ratio of the nucleus, 
and I and S are the respective spins of the nucleus and 
electron. Equation (14) requires some explanation : 
In the presence of an unpaired valence electron, with 
spin “up,” Eq. (14) represents the difference in exchange 
^ potential seen by the up and down core electrons. If the 
^restricted Hartree-Fock Hamiltonian for the core states 
were used as 3Co in Eq. 49), we should consider two 
first-order equations of the type of Eq. (9), one for the 
up core state and one for the down. The equation for 
the up state would involve +5¢€z/2 on the right and 
that for the down state —3€z/2. For those effects which 
involve 3Cz to first order, one can consider the down 
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state to remain unperturbed and the up state to be 
perturbed by 3€z as given in Eq. (14). 

Besides Eey, the various higher order terms that 
could contribute to the hyperfine interaction are 
ICE ICN, ICRI, Ieren, `- These terms are expected 
to be negligible in heavy atoms where 3Cz is a small 
perturbation compared to 3Co. For light atoms there 
may be a non-negligible contribution from such terms. 
We have not investigated its effect in lithium atom 
since our major interest is in Knight shift calculations. 
Tt will be shown in Sec. IV that these higher order terms 
in 5Cg do not have to be considered in core-polarization 
contributions to Knight shift. Higher order terms like 
JCN? g, Ier Ie, --:, lead to terms in the energy non- 
linear in nuclear spin and are of no interest to us. 

For metal 4,1 becomes Weona, the wave function for 
a conduction electron which extends over the entire 
crystal and therefore, involves more than one center. 
The perturbation equation for Vz [Eq. (9) ] cannot, 
therefore, be separated into radial and angular parts. 
Cohen, Goodings, and Heine? attempted to meet this 
limitation by expressing Weona around the nucleus in 
question as a linear combination of spherical harmonics 
and used the linear independence of spherical harmonics 
of different orders to separate Eq. (9) into a number of 
independent differential equations. This procedure is 
rather cumbersome because one now has to solve a set 
of differential equations in place of Eq. (9). In the MP 
procedure, one considers the perturbations in the reverse 
order. This is justified because of the two alternative 
expressions (12) and (13) for E.v®. Instead of first 
considering the perturbation óYWg due to 3Cg, let us 
consider the perturbation of the wave function 0Vw 
due to iy. Thus, we can determine 6V¥y by solving 
Eq. (10) and obtain E.y® from Eq. (13). Since 3Cy 
is a localized perturbation by a point source and wo 
refers to Weoro (the wave function of the core electrons) 
which is, to a very good approximation, localized and 
centrosymmetric, Eq. (10) can be separated into radial 
and angular parts. It should be noted that when the 
wave function for the conduction electron is expanded 
as a sum of spherical harmonics, the MP method 
obviates the necessity of having to solve a set of 
differential equations; but instead, one has to evaluate 
a set of integrals arising out of Eq. (13). In principle, 
the two procedures for calculating E.y ® should lead 
to identical results ; but in practice, this will not be 
exactly true. In the process of obtaining numerical 
solutions of the differential Eqs. (9) and (10), one 
poene various regions of space differently because of 

e different. natures of the two perturbations 3Cp and 
wae ES E c clear in Sec. IIT, where we 
and Heine's EP ue Sonik 5 a 
a a AP: on lithium atom. We shall 
solutions are oni = m cenae 
the two method «wath both EP and MP methods, 

ethods do, in fact, lead to identical results. 
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With these general remarks, we now proceed to the 
detailed forms of the equations that one has to solve 
for the MP method. Since, in a general case, one may 
have more than one core state to perturb, a formulation 
such as Dalgarno’s for the perturbation of many- 
electron systems" is helpful. We will follow Dalgarno’s 
method IT (i.e., non-self-consistent perturbation) and 
obtain the first-order perturbation equations for px 
by minimizing the second-order energy [Eq. (59) in 
Dalgarno's paper] with respect to dpy. 

The first-order perturbation equations are then given 


by 


(9C;— ej); v — 955 (ej— e) | Vs); 
— —S9 v5; (Sev |vov;, (16) 


where 3€; is the one-electron Hamiltonian, and v;, v; 
are the one-electron wave functions in the Hartree- 
Fock approximation, and «e; and e; are the corre- 
sponding eigenvalues. The first-order change in the 
wave function for the ith one-electron state due to 
Hy is Oy; w. For a core ns state, Eq. (16) for ój,.y 
takes the form 


(30,,— €ns)OWns,N= Dx V€n^s — Ens) urs | OV ns, NY ns 


n's 


—IenYns t Dy (us 


n's 


Ien | Yn Wns. (17) 


One has to solve Eq. (17) for all occupied core ns states, 
since the s states are the only ones which do not have 
nodes at the origin and, hence, can contribute to the 
hyperfine interaction through the Fermi contact term. 
The summation terms on the right-hand side extend 
over all the core s states. 

The zero-order one-electron wave function Yn, satis- 
fies the equation 


(Iens — Ens)Yns=0. (18) 
That is, in atomic units 
(— Vr Vne—€ns)Wns=0, (19) 


where V,, is the one-electron potential seen by an 
electron in the zs state in the Hartree-Fock approxi- 


* mation. Therefore, 


(s Ens) = V5jas/ V/ns . (20) 


Substituting this in Eq. (17), we get 


Vins 
(m Jie 


ns 


zm 2 (€nts— Ens) rs | BV as Nurs 
—GS6xVasl- 22 Qual Ien Ynna- (21) 


————— 
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If we now introduce the new function $55, N, defined by 


A Yna Pna,! 
Bj —— — t FOA), Q2) 


dr r r 


where 
16r m 
A= nil 
B aoh? 


and remembering that V?(1/r)=—4rô(r), Eq. (21) 
reduces to 
pny A A 


Una 


dr? 2rr 


dit; , Pns.N Pune 


27r? dr Une dr? 
=> (€nts—€ns){(Unrs|One,nUn’ s) 
+A Z Ynra(O)Pna (0), ,, (23) 


where we have used the notation 
Vas [25s (r)/r]Y 9 (6,0) . 


Because of the first term on the right-hand side, 
Eq. (23) is an integrodifferential equation and must be 
solved self-consistently. This term represents the in- 
fluence of Pauli principle on the perturbed core states. 

After determining óV,, w by solving Eq. (23), one 
can get the second-order energy E,y using Eq. (13). 
Thus, substituting for the determinantal function 6Vy 
in terms of the one-electron functions óV,,,v and using 
Eq. (14) for 30g, we get 


E. 0 —2 »E | [tss COP Wa) 


1 : 
X—dridri— 5 Wear 1) as (1) War (2)Ynee (2) 


712 
£ | 
x dridrs eese «(QD 
712 


From this equation the correction a, to the hyperfine 
constant @ from core polarization is obtained by the 
equation T: 
a, — En O / IJ2zh. (25) 


In solids and molecules, J is equal to the total spin S; 
whereas, for atoms, J is the total angular momentum 
of the valence electrons. 


IH. RESULTS FOR LITHIUM ATOM 1s:25 
AND 1s°2s STATES 


For the lithium atom 1s72p (J—2) state Eq. (23) 
for ¢1;,v takes the form 


A ditis Qis. N du, 


Hia us ae NE 
Qxr* dr Urs dr - 


—-Al (Q0). (26) 


Lis. N A 
dr? 22r 
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To illustrate the nature of the solution of Eq. (26), we (28), and (31), and performing the necessary integra- 


first assume a hydrogenic form for tys for which an tions, we get 


} | exact solution of Eq. (23) can be obtained. The Coulson- a; 1472 15 273: 

i 1 Duncanson functions? represent such a choice. E.y®=——BE al 

pit el " LEG z 1 

| AN ENT pu? 4(z4-E)* 8( HES 8(z-+é) | 
n Vu (-) eT, Vas e reč" cosð (27) 2izIn(z--E) 48z1n2 21zlnz "m 
n z d z 50) 
n 2G--D' C+E 20+) | 

hes! and | 
| Im z=2.69, £—0.525. Substituting the values of z and £, one gets P 

dad] 

id The solution of Eq. (21) is then found to be™ E.n O0 = —0.0540£. (34) | 

E A 3 a j ee | 

fa 7 From Eqs. (25) and (32), remembering 7—J-—2, it 

Í danm e Inr-+2er)ttre(7) (28) then follows that : 


å 3 2 0 
To find a solution 5J4,,y which satisfies Eq. (5), we Sunnolen 


use the Schmidt orthogonalization procedure and UE 3asIJ (2x3) B 
obtain from Eqs. (22) and (28) 


=— 5.24 Mc/sec, (35) 


where uy=yxIh is the magnetic moment of the Li? 


MEN nucleus and yo is the Bohr magneton. 
Bas LFu- Wael F 14|¥1s) Wie, Since 41,(r) involves a single exponential for the 
where (29) Coulson-Duncanson function, one can obtain an exact 
1 A analytic solution for the perturbation equation (16) 
Fis= ——+2z nr 27r. for the EP method, with óV;y and 3€y replaced by diz 
i and 3Cz, respectively. 
From Eq. (14), the exchange perturbation JCz due to Sow Hum AS oe 
the 25» electron is given ps Bike =-(-) GU per TE 
Se [ os 9\r/ HE 44 
=— — | uzp(S)ti(S)sds 5 4 5 3 z 4 
Suy (r)-r? Jo z pens yak +6) Je E 
5 1 Ap 2E sp | 
+] ses) | . (30) O(z+£)> 51 (z+)? 15g | 
r S + -+ Je 
Ag 4£ 4r j 


Hence, from Eqs. (24) and (30) it follows that 


: $ 24e0-897T 755 ge eer 
TE Ee iB i tsp (r)0u1s v (r)dr mu» r Cd ji 7 ar 
boi 0 Ne. 
M r 
1 dom 28 E ) gie ( 1 1 | 
j ja ; ol ae r--— Inr— ) (36) | 
| : a i} U15(S)tU2p(s)sds aXe GHE z m Lo 


Substituting 9/1, g from Eq. (36) into Eq. (12), one | 


co 1 . 
HE | are Vet) ze . (31) obtains, after some manipulation, exactly the same 
5 E expression for Esy® as in Eq. (33). 


where The value of a, in E 5) i 
: ; 3 - o q. (35) is about one-half of 
: ut A e _ fared I, Goodings”4 calculated value (—10.7 Mc/sec) by the 
ir ao 3 ag ? SP method and is in somewhat better agreement with 
i Cohen, Goodings, and Heine’s® value (—8.5 Mc/sec) 
es dub CLONE E by the EP method. However, one does not expect very 
"NT Ja, N- (32) good agreement with these earlier results because the 


EX : single-exponential hydrogen functions are not good 
- the Coulson-Duncanson functions using Eqs. (27), PProximations to the Hartree-Fock wave functions. 
omo PE NNEM To m a result by the MP method which could be 
5 mu iu: - Duncanson, Proc. Roy. Soc. properly compared in terms of accuracy with Cohen, 
artz, Ann. Phys. (N. Y.) 6, 156 (1959). 7 Per-Olov Löwdin and K. Appel, Phys. Rev. 103, 1746 (1956). 
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Fic. 1. Plot of unperturbed (u+) 6 

wave function and first-order per- 4 
turbation ôy as function of r i 

for lithium atom 15725 configura- $ 

tion. o 


Qao 0.5 


Goodings, and Heine’s EP calculation and Goodings’ 
SP calculation, we next solved Eq. (23) using Hartree- 
Fock wave functions. The solution has now to be ob- 
tained by numerical integration of Eq. (23). In Fig. 1, 
the function 821, [defined in Eq. (32)] is plotted, 
together with Goodings’ Hartree-Fock wave function? 
tts, Which was employed in our calculations. The 
function 6z1.,y resembles a 2s type function, in that, 
it has a node, but àji,y-— (4/47) (0w1,,v/r) goes to 
negative infinity at the origin. 

Using Eq. (31), together with Goodings’ HF wave 
functions 14, 1/25, and 611.,n of Fig. 1, it follows that 


Ev = —3B (0.137). (37) 
From Eq. (25) this leads to 
0,— —8.9 Mc/sec. (38) 


This value compares very favorably with Cohen, 
Goodings, and Heine's EP value (Table I). This is 
satisfying in light of our Eqs. (12) and (13) and our 
earlier demonstration that EP and MP methods lead 


Tape I. List of contributions to hyperfine constants in Mc/sec for lithium atom 15'2s and 15725 statés. 
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to identical results whenever exact solutions of Eqs. (9) 
and (10) are obtainable. 

Both ours and Cohen, Goodings, and Heine’s values 
for a, are in slight disagreement with Goodings’ value 
(Table I) obtained by the SP method. However, no 
measured value of a, is available, so it is rather difficult 
to assess the importance of this small disagreement 
between the values obtained by MP (and EP) and SP 
methods. 

The feasibility and accuracy of the MP method has 
now been demonstrated by its favorable comparison 
with the results of the EP method for the lithium atom 
15?2p state. We would next like to show the flexibility 
of the MP method by a consideration of core polari- 
zation effects for the 1s?2s state of lithium atom. The 
perturbation equation to be solved to get 6~1.,y (or 
$e.) in this case is exactly the same as in Eq. (26) 
except that 11, may be different from its value for the 
1s?2p state. Available tables? of 11, for 1s*2s and 1572 
states show that there is only a very slight difference 
between the values of t1, in the two cases. One could 


15225 152p 
Direct 2s Core Direct 26 
Wu contribution Total polarization €ontribution Total 
—10.7 0 e—10.7 
Gooding 106 284 390 Iy = 
(Galton, (Camila 86.74 286.26 373.0 8.5 0 HE 
and Heine* S 
i 84 367.76 —89 0, 8.9 
This paper 83.76 2 ; 
Experimental 401.786 
of Ref. 5. 


a The direct 2s contribution is taken from Table I 


* D. A. Goodings, Ph.D. thesis, Cambridge University, 


Cambridge, England, 1961 (unpublished). 
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To illustrate the nature of the solution of Eq. (26), we 
first assume a hydrogenic form for ıs for which an 
exact solution of Eq. (23) can be obtained. The Coulson- 
Duncanson functions” represent such a choice. 


TY g 12 WIE 
| w= (=) Qu von=(~) re-*'rcosü (27) 
du T T 


qu and 


I 2—2.60, £—0.525. 


The solution of Eq. (21) is then found to be” 
A 
drew - Qs Inr-++22*r)114(r) - (28) 
T 


ME To find a solution óVi,x which satisfies Eq. (5), we 
Tay use the Schmidt orthogonalization procedure and 
i obtain from Eqs. (22) and (28) 


A 
Sas LP Wis | Fi, | Vas) Wis ) 
T 
where i (29) 
Fy,=—-+ 2z Inr4- 22r . 
r 


From Eq. (14), the exchange perturbation J€g due to 
the 2p electron is given by 


2p 1 T 
E I tsp (S)tt1s (s)sds 


r2 


E» 1 
+r | ws us) | . (30) 
r $^ 


Hence, from Eqs. (24) and (30) it follows that 


E,y®= —3B | I 5 Uop(r)òttis,y(r)dr 
0 


=e 


x i : U15(S)top(s)sds 


m [ i ss s] ; Gn 


2 


Ar to 3 a 
4T 
6t1s,v= ahy o (32) 


Duncanson functions using Eqs. (27), 


he 


GASPARI, SHYU, 
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(28), and (31), and performing the necessary integra- 
tions, we get 


1472? 15 273z 
4(-D* BEHE 8GTE' 


21z Inz 
i | . (33) 


32 
9 


2izln(z4-£) 482 In2 


2(2-£") — (FB! 2(2+é)" 
Substituting the values of z and £, one gets 
E.n = —0.0540B. (34) 


From Eqs. (25) and (32), remembering 7—J =2 it 
then follows that 


AuypoLen™ 


ag — 5.24 Mc/sec, (35) 
3agIJ Qai) B 


where uy —yxIh is the magnetic moment of the Li’ 
nucleus and uo is the Bohr magneton. 

Since 4;,(r) involves a single exponential for the 
Coulson-Duncanson function, one can obtain an exact 
analytic solution for the perturbation equation (16) 
for the EP method, with àj;y and 3€y replaced by Oy;z 
and 3Cz, respectively. 


YANE £6g—7 (3 (z+ Eire tr 
Wane ) | 
CHE 4f 


9 
3(s--5* 9(-r£y 3(z4- £) 
+| (r£) jews] (2+ &) 
d 2E 8g 
9 3 51 2 Sor 
a (z--£) fe (z-- E) J 


AE 4E A» 


T 


24e-Pr 75g fetr ec-9r 
a 5 I dr+48z | ir} 


(f Y T 


28 (= J zé5e =n 1 1 
decr -=== WO 
3 NT mud Z ea =) e» 


Substituting 6Y1.,z from Eq. (36) into Eq. (12), one 
obtains, after some manipulation, exactly the same 
expression for E,y ? as in Eq. (33). 

The value of a, in Eq. (35) is about one-half of 
Goodings” calculated value (— 10.7 Mc/sec) by the 
SP method and is in somewhat better agreement with 
Cohen, Goodings, and Heine’s® value (—8.5 Mc/sec) 
by the EP method. However, one does not expect very 
good agreement with these earlier results because the 
single-exponential hydrogen functions are not good 
approximations to the Hartree-Fock wave functions.2 

To get a result by the MP method which could be 
properly compared in terms of accuracy with Cohen, 


D 


2 Per-Olov Löwdin and K. Appel, Phys. Rev. 103, 1746 (1956). 
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Goodings, and Heine’s EP calculation and Goodings’ 
SP calculation, we next solved Eq. (23) using Hartree- 
Fock wave functions. The solution has now to be ob- 
tained by numerical integration of Eq. (23). In Fig. 1, 
the function 84i, [defined in Eq. (32)] is plotted, 
together with Goodings’ Hartree-Fock wave function? 
tys, Which was employed in our calculations. The 
function óuj,,w resembles a 2s type function, in that, 
it has a node, but yu, = (A/4m)(011,,v/r) goes to 
negative infinity at the origin. 

Using Eq. (31), together with Goodings’ HF wave 
functions tis, 2/25, and óu1,,v of Fig. 1, it follows that 


E,y 9 = —$B(0.137). (37) 
From Eq. (25) this leads to 
0,— —8.9 Mc/sec. (38) 


This value compares very favorably with Cohen, 
Goodings, and Heine's EP value (Table I). This is 
satisfying in light of our Eqs. (12) and (13) and our 
earlier demonstration that EP and MP methods lead 
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to identical results whenever exact solutions of Eqs. (9) 
and (10) are obtainable. 

Both ours and Cohen, Goodings, and Heine’s values 
for a. are in slight disagreement with Goodings’ value 
(Table I) obtained by the SP method. However, no 
measured value of a, is available, so it is rather difficult 
to assess the importance of this small disagreement 
between the values obtained by MP (and EP) and SP 
methods. 

The feasibility and accuracy of the MP method has 
now been demonstrated by its favorable comparison 
with the results of the EP method for the lithium atom 
15225 state. We would next like to show the flexibility 
of the MP method by a consideration of core polari- 
zation effects for the 15?2s state of lithium atom. The 
perturbation equation to be solved to get ài,» (or 
Qu.) in this case is exactly the same as in Eq. (26) 
except that 2, may be different from its value for the 
15225 state. Available tables? of 14, for 1572s and 1525 
states show that there is only a very slight difference 
between the values of 21, in the two cases. One could 
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a The direct 2s contribution is taken from Table I o 


2D. A. Goodings, Ph.D. thesis, Cambridge University, 


Cambridge, England, 1961 (unpublished). 
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therefore use the same ¢1s. for both 1s?2 and 1s?2s 
states. If using this approximation we find good agree- 
ment with results obtained by Cohen, Goodings, and 
Heine’s EP calculation (which is not subject to the same 
approximation), we can indeed justify our claim about 
the flexibility of the MP method. For heavier atoms, the 
alteration in core wave functions :s5, for different 
atomic configurations would be even less than for 
lithium; and so the flexibility approximation will be 
more justified. 

The second-order energy expression Eey? for the 
lithium atom 1s?2s state is then given by 


Ex = -2| | tss (r)1, v (7) 
0 


1 T 
AE ttis (s)uss (s)ds 
T Jo 
ES 1 
+f ws Gs) |r| . (39) 
3 S 


On evaluating the integrals in Eq. (39), one gets 
E.n 9 —2B (0.1434). The hyperfine constant is therefore 


0,— 83.76 Mc/sec. (40) 


The comparison of this value with Cohen, Goodings, 
and Heine’s and Goodings’ values (Table I) is very 
satisfying. 


IV. CORE CONTRIBUTION TO KNIGHT 
SHIFT IN LITHIUM METAL 


As another important application of the MP method, 
we shall consider the magnetic field produced at the 
Li? nucleus in lithium metal as a result of the exchange 
polarization of the core 1s electrons when the con- 
duction electrons are polarized by the external magnetic 
field. The usual expression for the Knight shift due to 
the conduction electrons is given by? 


H 8 


o XXV 0) |a, (41) 


where X, is the spin susceptibility per atom, and 
{|¥r(0)|?)av is the average electron density at the 
nucleus from electrons at the Fermi surface. Equation 
(41) can be interpreted in the following way: Of the 
conduction electrons near the Fermi surface, a fraction 
($-- X,H/2po) have theimspins parallel to the field, and 
a fraction (;—XpH/2y0) have their spins antiparallel 
to the field. When these electrons interact with the 
- nuclear moment through the interaction term 3€y, we 
TON get the effective field AH in Eq. (41) out of the difference 
n populations in the parallel and antiparallel spin 


cmi 


Knight, in Solid State Physics, edited by F. Seitz and 
ull (Academic Press Inc., New York, 1956), Vol. 2. 
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states. This population difference between the parallel 
and antiparallel conduction electron states also leads 
to an exchange polarization potential to the core 


electrons, given by 


X H yu (1) 1 
Ier pH o | Wi(2)—Weore(2)dr2. (42) 
Ho Weore(1) 712 


It is this potential 3€z which enables the core electrons 
to produce an additional field at the nucleus. 

Since 3€g is now a weaker perturbation than in Eq. 
(14) by a factor X,H/uo, the perturbation methods EP 
and MP are more justified than in atoms. Also, it is 
clear from Eq. (42) that terms of the order 3C773Cy, 
5Cz*ICy, ---, would lead to contributions to the Knight 
shift which are proportional, respectively, to the first, 
second, and higher powers of the field. However, there 
is little current evidence for such contributions to the 
Knight shifts in metals. 

To calculate the contribution to the Knight shift 
from the core 1s electrons by the MP method, we then 
proceed in the same manner as in Sec. III to calculate 
E.w® as given by Eq. (13). For the perturbed core 
wave function, the function às;1,,w (Fig. 1) for the 
atomic lithium 1572p state was used. This is again an 
approximation because there may be a small difference 
between the wave functions tıs for the metal, and for 
the atomic 1s?25 state. However, our experience with 
the lithium atom 15s?2s state in Sec. III indicates that 
the error due to this approximation has little effect on 
E.y™. We would also like to point out that this error 
would be expected to be smaller for the metal than for 
the 1s?2s atomic state because the conduction electron 
wave functions appear to have predominantly p 
character at the Fermi surface.!? 

Making use of Eqs. (13), (25), and (42), and re- 
membering that we have to average over the Fermi 
surface in calculating Knight shifts, we get 


AH 8x 2 Ts 
LE A eee) n 
"ELS TU Yar (D)e (1) 


, (43) 


Fermi surface 


1 
XWer* @yu(2)—drdrs) 


T12 


where Wx, is the wave function for a conduction electron 
at the Fermi surface, 7, is the radius of the Wigner- 
Seitz sphere, and A, is the area of the Fermi surface. 
Equation (43) can be evaluated using available wave 
functions Wx, for lithium metal. Of the published wave 
functions for lithium, those which are available in a 


readily usable form are the recent ones by Kohn and 
Callaway.17 


gikr 


Y= 


[uo(r)+iku, (r)Pi (c0s6,.,) 


(N) 1/2 - 
-EEus(r)Ps(cos,)H-(7))]. (44) 
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The potential used in obtaining these results was an 
empirical potential which is expected to be more ac- 
curate than a calculated potential based on some model. 
In Eq. (44), 62, is the angle between a radius vector r 
and wave vector k. The functions we(r), (r), u(r), 
and u(r) were given in tabulated form and ÑN is a 
normalization factor calculated up to kr? term (where 
ky is the Fermi momentum in the spherical Fermi 
surface approximation). 


N= 4n (1++2.79333k r?) . (45) 
Since the Fermi surface in lithium approximates a 
sphere quite closely, we can take the magnitude of kp 
in all directions as the same. The details of the calcu- 
lations for evaluating the integral in Eq. (43) are given 
in the Appendix. It is shown there that including up to 
terms of order kr? [ Eq. (A4) ] 
Wrp(r) = As(Rr,r)Po(cos0s.)-- Ap(R pr) P1(cos6,;) 
FAd(R p,r) PX(cosb;,) , 
1 . . 
Aseo Herul) bein 


m)” 


+b (Julbo) — jalkr,r)ue(r) 


2.79333 ' ; 
Sar Jienna) |, 
A (k A= [Sujit Gun je) 
p Z (47)!? 
(46) 
(eie 8jam 
3(2.79333) é 
DE] > 
2 
Ad(kp,r)= ES 
(4r)? 


10 PALSA 
teneo nn Seb t= it 


] function of the 


Where à (r,r) is the spherical Besse new 


order s. Using Eq. (46), it is shown in the 
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LEq. (A15), that one obtains after some manipulation 


AH 8«X, 1 


eei e 6 8,7 ( at » 
H 3 IL Na)as(ra) 


(4g — 


| As(Rp,ri) As* (Eg,rs) 
x ENA 
rs 


dr 
ex) A plke,niyA p* (krr) 


T< 


rj 
[EY r<] 
+5( ) A d(kp,r)Ad*(kp,r2) z praia E (47) 
T> 


5 


The contributions to AH/H from the three terms in 
Eq. (47) involving As, Ap, and Ad refer, respectively, 
to the s, p, and d parts of the conduction electron wave 
function at the Fermi surface. These were evaluated 
by numerical integration and are listed in Table II. 
It can be seen that (AH/H),, is almost equal to 
(AH/H)., but opposite in sign. In contrast, (AH/H)ea 
is about a factor of 25 smaller than both (AH/H)., 
and (AH/H).:. This leads us to expect that the con- 
tributions to the core polarization from f and higher 
angular momentum parts of the conduction electron 
wave function would be negligible. Unfortunately, it 
is not possible to test this point using Callaway and 
Kohn's wave function, because they did not give higher 
angular momentum components beyond d. The value 
of the total core contribution, (AH/H), in units of 
(647.5 Xp), namely —0.00544, is to be compared with 
the value of 0.10280 in the same units, for (AH/H) direct 
that one gets using Kohn and Callaway's wave functions 
and Eq. (41). So we find that the core polarization 
contributes about —5.3% correction to the Knight 
shift. Also it is to be noted that, while the s character 
of the conduction electron wave function is alone im- 
portant, for the direct contribution to the Knight shift 
as given by Eq. (41), the core polarization depends 
sensitively on the s and p parts of the wave function, 
since there is a near cancellation between (AH/H)-. 
and (AZH/H)., It is therefore imperative to know 
accurately the relative amounts of s and p characters 
in the wave functions for other metals, since the core- 
polarization contribution may be a more Significant 
fraction of the direct contribution. This point has also 
been noted by Cohen, Goodings; and Heine.5 These , 
authors assumed that the conduction electron wave 
function can be well approximated by a combination 
of 2s and 2p atomic functions. In such a case, the core 
polarization can be calculated by taking a weighted 
combination of the core polarizations for atomic 15?2s 
and 1s?2p configurations. However, in Figs. 2 and 3, 
we have compared As and Ap with. the 2s and.2p 
atomic wave functions, respectively, obtained by 
Goodings.! It is seen that, while there is a reasonable 
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Fic. 2. Comparison of the s part 
of the conduction electron wave 
function (at Fermi surface) with 


As Goodings’ 2s wave function (ttz) 
for lithium atom 1s?2p configura- 
tion. 
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similarity in shapes between the metallic and atomic 
wave functions, the correspondence is far from exact. 
To make a quantitative application of the EP method 
comparable to our MP calculations, one would have to 
solve two equations of the type of Eq. (9) using Eq. 
(42) for 30g and then calculate (AH/H), using Eq. 
(12). With the MP method, however, because of the 
flexibility in using ôVıs,v as long as the core wave 
functions do not vary substantially, we did not have 
to solve any new differential equations to obtain 
(AH/H),. This is an illustration of the computational 
advantage of the MP method. 

Tn our numerical calculations leading to the results 
in Table II, we had to make use of the expressions in 


TABLE II. List of contributions to Knight shift in lithium metal. 


Contributions? to AH/H Value 
Core-polarization s part, (AH/H) cs 0.02355 
Core-polarization p part, (AH/H)cp —0.02813 
Core-polarization d part, (AH/H) ca — 0.00085 
Total core polarization (AH/H). — 0.00544 
Direct contribution? (AH/H) direct 0.10280 
Grand total (AH/H) tots 0.09736 
Experiment® 0.101 


a The tabulated numbers are in units of ao^? (atomic units). To correct 
to the usual dimensionless manner ( 95) that Knight shift data are obtained, 
multiply by 647.5 Xp, where Xp is the spin susceptibility per gram. 

b Calculated using Kohn and Callaway's wave function (Ref. 17). 

e Taken from Table IV, Ref. 24. 


Eq. (46) correct to kr’. This was necessary because of 
the form of Eq. (44) in which Callaway and Kohn's 


3.0 34 r 


wave function was available. However, Eq. (47) can 
be used for any general wave function (such as orthogo- 
nalized plane wave and augmented plane wave), since 
one can obtain As, Ap, and Ad by expanding the con- 
duction electron wave function in spherical harmonics. 


CONCLUSION 


As we mentioned at the outset, our motivation in 
developing the MP method was to find a procedure for 
investigating the contribution from exchange polari- 
zation to Knight shifts in metals and alloys. While the 
EP method as developed by Cohen, Goodings, and 
and Heine can, in principle, be applied in any situation, 
one would like to have a method that is computationally 
simpler and flexible enough to apply for a nucleus in a 
variety of environments. Our experience with lithium 
nucleus in the three situations discussed here leads us to 
believe that the MP method can be extended to give 
reliable results in other cases. The necessity for careful 
investigations of core polarizations is strongly felt at 
the present time, because a variety of Knight shift data 
in metals, alloys, and internal fields in ferromagnets, 
are now available; and it appears that direct contri- 
butions from conduction electrons cannot quantita- 
tively explain such data. 

We would like to conclude by making a few comments 
on the self-consistency problem in core-polarization 
calculations in general and specifically as pertinent to 
the MP method. In an atom containing an unpaired 
electron, the many-electron wave function has to be 


0.45 
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a Fic. 3. Comparison of the p part 
0.25 of the ORA wave 
BE 1 unction (at Fermi surface) with 
P Goodings' 2p wave function (125) 
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tion. 
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an eigenfunction of S?, As has been pointed out by a 
number of authors, in a correct application of the 
Hartree-Fock method, this condition must be built in 
as a constraint. This is the case with the PSP method. 
The SP method, however, does not have this constraint, 
and the results from both SP or SPP methods are 
therefore somewhat suspect. Heine has pointed out 
that it is the self-consistency procedure in the SP 
method which is responsible for its dangers because it 
leads to an uncertain admixture of correlation and 
exchange effects and may in fact overemphasize the 
correlation unduly. The EP method, which considers 
the exchange polarization as a perturbation and has 
no self-consistency built in, is free from this danger 
although it is evidently less accurate than PSP. In the 
MP method, however, the perturbed state is not an 
eigenfunction of S? because of the nature of 5Cy; hence, 
there is no condition of constraint regarding the total 
spin. Self-consistency can therefore be used in the MP 
method without any fear of spurious contributions 
from correlation. However, our aim was to extend this 
method to metals where self-consistency is rather 
difficult to apply even for restricted Hartree-Fock 
calculations. We have therefore not explored the effects 
of self-consistency (Dalgarnos’ method I) here. 
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APPENDIX 


From Eq. (24), the second-order change in energy 
due to the exchange polarization of the core electrons 
by the conduction electrons at the Fermi surface is 


2 ,[" 
Bx (f Vae (1)80r0 a (1) 
Ark? NJo 


1 
xiu s (2) indre) D 
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Fermi surface 


where r, is the radius of the Wigner, Seitz sphere= 3.21 
Qo, ky is the Fermi momentum=0.59788/a0, and 
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6x, being the angle between kr and T, and Y the nor- 
malization factor, 44(1+2.79333ke") and for abbrevia- 
tion we denote Pi(cos&i,.) by Piom) where Ir is 
the angle between kr and r. On expanding e'r and 


N, and keeping terms up to kr’, y;, can be written as 
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where fn(kr,r) is the spherical Bessel function of order 
n. Using Eqs. (A3) and (A4), Eq. (A1) takes the form 


1 1 
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zı representing the angle between kp and rı and 675 the 
angle between kp and rs. We perform the integration 
over the spherical Fermi surface first, and then inte- 
grate out the angular parts of dr, and dz2. 

Since the directions of r, and re are arbitrary, let rı + 
be directed along k, for converience. Using the addition 
theorem for spherical harmonics 
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and rə. 


one can evaluate the following integrals over the Fermi 


surface: 
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Equation (A5) then reduces to 
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In order to perform the angular integration involved 
in Eq. (A9) the following relationships will be used: 
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where (A11) again follows from the spherical harmouic 
addition theorem. Thus, 
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The final expression for E,y ? is then 
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which involves only radial integrations over 7; and 72. 
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The pure rotational spectra of the alkali chlorides were investigated in the 0.96- to 3-mm range of the 
microwave region with the molecular-beam spectrometer earlier developed at Duke University. Introduction 
of Teflon microwave lenses and high-pass microwave filters improved this spectrometer so that measure- 
ments into the submillimeter region were possible, to an accuracy of better than one part in 10*. Dunham's 
solution for the diatomic molecule was applied in interpretation of data. Improved values for Be, ae, and ye 
were obtained for most molecules studied. The centrifugal distortion constants D, and B, were obtained 
from the rotational spectra for the first time for all molecules measured. From the latter two constants, 
accurate values of e, and wex, were derived. Other derived quantities are: potential coefficients, isotopic 
mass ratios, moments of inertia, and internuclear distances. For most of these quantities, the accuracies 


obtained surpass those from previous measurements. 


INTRODUCTION 


AS earlier designed, high-temperature, molecular- 

beam, microwave spectrometer? for operation 
in the shorter millimeter-wave range of the spectrum 
has been improved and extended in frequency coverage 
and has been used for extensive measurements of the 
alkali chlorides. Unlike the rf molecular-beam spectrom- 
eters,- which detect deflected molecules, this spectrom- 
eter detects the loss in radiation which results from the 
absorption of radiation by a beam of molecules simply 
sprayed across the radiation path. Since no deflection 
of the beam is necessary, this spectrometer can be used 
at the very high millimeter- or submillimeter-wave 
frequencies, at which molecules generally become in- 
sensitive to the deflection fields required for operation 
of the molecular-beam electric-resonance spectrometers. 
It has the advantage over the latter of employing a 
microwave detector which does not depend on the type 
of molecules being investigated. Because of the weak 
absorption of molecules in the centimeter or longer wave 
rf region, the present instrument, which we shall desig- 
nate as a molecular-beam absorption spectrometer, is 
not generally applicable at these frequencies. It becomes 
applicable and effective in the shorter millimeter- and 
submillimeter-wave regions, where the absorption coef- 
ficients of most molecules become large. Because mole- 
cules generally become less sensitive to electric and 
magnetic fields with increase of rotational frequency, 
the molecular-beam resonance methods become ineí- 
fective in the millimeter-wave region, where the present 
spectrometer operates best. Thus, the molecular-beam 


i J. S. Ai ffice of 
* This study was supported by the U. S. Air Force O 
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vee Garrison and W. Gordy, Phys. Rev. 108, 899 (1957). 
2 J. R. Rusk and W. Gordy, Phys. Rev. 127, 817 (1962). 


3 "Rev. 72, 614 (1947). a 
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Phys. Rev. 91, 1395 (1953). ; 
eB. Cd a Carlson, C. A. Lee, and I. I. Rabi, 
Phys. Rev. 91, 1403 (1953). 


absorption spectrometer is complementary to the elec- 
tric- and magnetic-resonance spectrometers. 

Before development of our molecular-beam spec- 
trometer, the microwave group at Columbia University 
designed a hot-cell microwave spectrometer^" which, 
like ours and like the conventional microwave spec- 
trometer for gases, employs a microwave radiation 
detector. With it, they measured rotational transitions 
in the centimeter wave region for most of the alkali 
halides. More recently, Lide, Cahill, and Gold? have 
improved the hot-cell spectrometer and have extended 
its range to 45 kMc/sec for the measurement of the 
J=0— 1 transition of LiCl. The principal disadvan- 
tage of the hot-cell spectrometer is the pronounced 
pressure and Doppler broadening of the spectral lines 
in a gas heated to high temperature. Also, the micro- 
wave-transmission losses in a hot cell are large for the 
high millimeter-wave frequencies. The molecular-beam 
spectrometer used here eliminates most of the pressure 
and Doppler broadening, and its microwave-transmis- 
sion losses are sufficiently low that it can be used in the 
submillimeter-wave region. Introduction of dielectric 
lenses for the focusing of the millimeter waves through 
the absorption cell has significantly reduced its trans- 
mission losses over those of the earlier design. 

As was demonstrated in the study of alkali bromides 
and iodides? and as is evident from‘this study of the 
alkali chlorides, a precise measurement of the milli- 
meter-wave rotational spectra allows a more complete 
and more accurate evaluation of the molecular constants 
of these molecules than is possible from either infrared 
or centimeter-wave measurements. The millimeter-wave 
measurements are needed because the effects of centri- 
fugal distortion on the molecular structure can be signi- 
ficant and because these effects of distortion may be 


* M. L. Stitch, A. Honig, and C. H. Townes, Rev. Sci. Instr. 
25, 759 (1954). 2 

* A. Honig, M. Mandel, M. L. Stitch; and C. H. Townes, 
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Fic. 1. Schematic diagram of molecular beam absorption cell. 


accurately evaluated only by measurements at the higher 
microwave frequencies, at which frequencies this dis- 
tortion significantly alters the rotational spectra. 


EXPERIMENTAL CONSIDERATIONS 


In its systems for microwave-power generation, 
detection, control, and measurement, the spectrometer 
used in this study is similar to most of those employed 
for video detection in the Duke microwave laboratory. 
The principal difference between this spectrometer and 
conventional microwave spectrometers is in the absorp- 
tion cell itself, which is specialized for the generation 
and utilization of high-temperature molecular beams. 

Since the molecular-beam spectrometer has been 
described in earlier papers of this series,! only the pres- 
ent modifications of it will be described here. The ab- 
sorption cell is shown in schematic arrangement in Fig. 
1. It consists of a 60-cm section of S-band waveguide 
(3- X 13-in. cross section), into which the output of a 
harmonic generator operating from a 32-39 kMc/sec 
reflex klystron is matched by means of a 30-cm G- to 
S-band transition section.? The power is matched to the 
G-band waveguide of the detector by an identical S- to 
G-band transition section. A 6-in. slit is cut in the bot- 
tom of the S-band waveguide which is the absorption 
cell so that a beam of molecules can pass into it from a 
separate oven located just below the slit. The spectrom- 
eter has been modified from the earlier design? by the 
addition of high-pass microwave filters which reduce 
detector-crystal noise in order to give higher harmonics 
and the addition of Teflon lenses which increase the 
efficiency of transmission through the slotted waveguide 
section. These modifications have made possible detec- 
tion into the submillimeter region. The filters, which 
operate by means of a short constriction in the E-field 
direction in an otherwise straight section of rectangular 
waveguide, are located at the output guide of the har- 

monic generator. Three high-pass microwave filters 
were made with cutoff wavelengths corresponding to 


Gordy, W. V. Smith, and R. F. Trambarulo, Microwan 
scopy (John Wiley and Sons, New York, New York, 1953), 
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the fourth, fifth, and sixth harmonics of the klystron 
frequency at the low end of its frequency band. These 
cutoff wavelengths were 1.87 mm, 1.56 mm, and 1.17 
mm, respectively. 

The lenses were placed at the large ends of the G- to 
S-band transition sections, as shown in Fig. 1. They are 
identical, planoconvex types designed upon assump- 
tion of a point source of radiation about one centimeter 
from the apex of the input transition section. Y. Y. Hu” 
shows that for sectoral £- or H-plane horns of less than 
a five-degree half-angle, the formula d/\=5 gives the 
location of the phase center or “equivalent point 
source” of the horn for the design of parabolic reflectors. 
That the theory should follow for transmission lenses 
is evident. The distance d into the horn from its apex 
for the wavelength for which the lenses were designed 
(A= 2 mm) was approximately a centimeter and was not 
at all critical. The “equivalent point source” appears to 
be actually a sphere with a radius of about 1 cm for 
the small-angle horn used here, as is noted in Hu's 
work. However, with the lens 29 cm from the equivalent 
source, the point-source approximation is a good one. 

By use of the equivalent point source, two lenses were 
designed by means of the thin-lens equation normally 
employed in the design of optical lenses. Since the 
wavelength of 2 mm is small as compared with the 76- 
38-mm lenses, the thin-lens equation is a reasonable 
approximation from consideration of diffraction effects. 
Teflon was used for fabrication of the lenses because 
of its high softening temperature. Its index of refraction 
was taken to be 1.35. A focal length of 29 cm causes the 
lens to collimate the output beam of the horn. This 
collimated beam of microwave power is refocused by its 
mate at the receiving horn so that it matches this power 
into the G-band waveguide of the detector. Brown’s 
monograph on microwave lenses! offers further infor- 
mation on this subject. 

The errors in measurement with the present spec- 
trometer compare favorably with those obtained with 
the methods of molecular-beam resonance. The accuracy 
of measurement is limited only by the observer’s ability 
to ascertain the center of an absorption line. Six meas- 
urements were made on each line, with directions of the 
electronic sweep of the klystron alternated so that time 
delays in the amplifier are canceled. This gives three 
measurements for which the dispersion usually lies 
between 1 and 5 parts in 107. 

No very difficult problems were encountered in the 
measurement of the rotational transitions in the more 
abundant chlorides of Na, K, Rb, and Cs. Difficulties 
with line strength in most of the less abundant chlorine 
and rubidium isotopes precluded measurements over 
the three vibrational states necessary for resolution of 
B, into the three structural constants Yor, ae, and Ye: 


? Y. Y. Hu, J. Franklin Inst. 271, 31 (1961). 
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The spectra of LiCl were not observed because of the 
high degree of dimerization in the molecular beam. 
Lide, Cahill, and Gold* have reported measurements 
of the /=0— 1 transition of LiCl made with a heated 
Stark modulation cell. 

Cesium chloride, the heaviest molecule investigated 
in this study has significant populations in the first 
four vibrational states. This division of population 
among vibrational states causes the rotational absorp- 
tion lines for the ground vibrational state of this mole- 
cule to be weaker than those of the lighter chlorides, 
but allows measurement of rotational transitions in 
four of the vibrational states for CsCl**. 

All the isotopes of rubidium chloride studied have 
significant nuclear quadrupole couplings and at the 
lower J values show an absorption line splitting of 
several megacycles per second. Nevertheless, for the 
high J transitions measured in this work, the nuclear 
quadrupole splitting is negligible. For example, the line 
splitting is less than a tenth of a megacycle at 190 
kMc/sec and is thus within the experimental error of 
our measurements. The rotational absorption lines in 
the third vibrational state were very weak; only one 
was measurable. 

The spectral frequencies of NaCl? were measured 
over a wide range extending into the submillimeter- 
wavelength region. A cathode-ray oscilloscopic display 
of one of the submillimeter lines obtained for NaCl? 
is shown in Fig. 2. The signal is weak because it is near 
the upper frequency end of the operating range of the 
present spectrometer. Much stronger signals were, of 
course, observed in the one- to three-mm range. 

Approximate oven temperatures at which the spectral 
lines appeared for each molecule are shown in Table I. 


TABLE I. Oven temperatures. 


Molecule Temperature (°K) 
NaCl 1022 
KCl 1010 
RbCl 970 
CsCl 889 


No attempt was made to measure these temperatures 
with strict accuracy. Care was taken only to obtain 
reproducible temperature measurements, since these 
measurements were used to predict the temperature at 
which spectral lines should appear at subsequent 


observations. 
THEORETICAL CONSIDERATIONS 


Dunham’s theory? for the diatomic molecule has 
been applied in the analysis of the data obtained for 
the alkali halides. The part of this treatment which is 
required to make the derived results meaningful is 
outlined below. 
5T. L. Dunham, Phys. Rev. 41, 713 (1932); 41, 721 (1932). 


Fic, 2. Cathode- 
ray oscilloscopic dis- 
play of the J=23 — 
24, v=0, transition 
of NaCl occurring 
at 312109,88 Mc/ 
sec (wavelength 0,97 
mm). The width of 
the line is approxi- 
mately 100 kc/sec. 


The time-dependent, Schródinger equation for the 
diatomic vibrating rotor may be written as 


dy/d£-r Qur2/It) 
XL[G—-U)—J(U-r1)/0-F£*y—0, (1) 


where ¿= (r—r,)/r., where E is the energy value, and J 
is the rotational quantum number. The term J(J-+1)/ 
(1+ £)? may be considered as a “centrifugal potential” 
and y as a reduced mass in the equivalent, one-dimen- 
sional problem. The term U is dependent on the separa- 
tion distance of the nuclei and represents the interatomic 
potential caused by the nuclear and electronic charges. 
For solution of this equation, the form of the potential 
function U must be known. 

Many potential functions of the closed formula type 
have been examined by Varshni"? and by other investi- 
gators.4-15* Probably the most successfully and widely 
used of these functions is the Morse potential. How- 
ever, the best fit to the microwave data is obtained by 
use of a potential function expressed as a power series 
in £, valid only near a minimum and used first bv Dun- 
ham.? 'This potential is 


U(£) - Icaé&-Fait--as?--as4----), (2) 


where the a’s are constant coefficients. 

Dunham applied this power series expansion of the 
potential in Eq. (1), expanded the centrifugal term as a 
power series, and solved the resulting equation about a 
minimum by the Wentzel-Kramers-Brillouin method.” 
The formula which he thus obtained for the energy 
levels is 


F,4—5, Yu(v- 1/2) (3-1), (3) 
lj 


where the Y, are the spectral constants to be obtained 
from the frequency measurements, J is the rotational 
quantum number, and v is the vibrational quantum 
number. If the units of the F, are defined as Mc/sec, 
F,,; becomes a power series fo. the energy levels in the 
same units. f 
Dunham’s approach has been reviewed by Sandeman!* 
and has been extended slightly to include those Fz 


3 Y, P. Varshni, Rev. Mod. Phys. 29, 664 (1957). 

X P, M. Morse, Phys. Rev. 34, 57 (1929). 

15], Sandeman, Proc. Roy. Soc. (Edinburgh) 60, 210 (1940). 
15 J. E. Kilpatrick, J. Chem. Phys. 30, 801 (1959). 
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expressions which would be needed if measurements of 
the present accuracy could be made on vibrational 
transitions as well. Kilpatrick,’ using a perturbation 
approach different from that of Dunham, has arrived 
at the same results as Sandeman and Dunham, in dif- 
ferent form, except for his equation for Yoo, which in- 
cludes D, a term for the dissociation energy. The term 
for the dissociation energy does not appear in the Dun- 
ham solution because this solution considers only the 
form of the potential in a restricted region near the 
minimum of the potential function. Sandeman has 
pointed out that the Dunham potential can be valid 
only over a limited region because of difficulty with 
convergence of the series potential for large £. Dunham's 
formulation, or Kilpatrick's, seems to be the only one, 
however, which can be used with the precise spectral 
data obtainable in the microwave region. It is satis- 
factory for the microwave measurements probably be- 
cause the energy levels accessible to our microwave 
spectrometer lie sufficiently near the minimum of the 
potential curve for the formulation to apply. This theory 
was applied by Honig eż al.’ to their centimeter-wave 
measurements of the alkali halides made with their 
hot-cell spectrometer and by Rusk and Gordy? to their 
measurements of the alkali bromides and iodides 
made with their high-temperature molecular-beam 
spectrometer. 
By applying the Bohr frequency condition, 


v= (W3—W3)/h, 


and the pure rotational absorption selection rules, 
AJ=+1, Av=0, to the energy-level formula, one ob- 
tains the equation relating frequency », quantum 
numbers v, J, and structural constants Y ;;. The equation 
given below expresses this frequency in terms of those 
Dunham constants which have a measurable effect on 
the energy levels. The rotational quantum number for 
the lower level of the transition is represented by J. 


y-2Ya(J4-1)d-2Yn(v-3) U4- 1) 
-F2Y s (04-3) 04-1) -4Y o (74-1 
TAY (v2) (--1y4-Yo(U--19(--25. (4) 


The relation of Yos to other well-known structural 
constants is given by 


Y= (2D2/302) (12B2—a..) . (5) 


Subtraction of this term, in frequency units, from each 
measured line frequency v, gives a corrected frequency 
v’. The corrected frequeticies v; are used in the calcula- 
tion of the five remaining constants with a least-square 
technique. The derived constants are calculated from 
the Y values with the following relations given by 
Dunham?: 3 


Yn =W= 2 (Bë/. Yo)! 12 ? (6) 
as o4 /4Be , (7) 
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ai Ynod/6B2—1, (8) 
a= Yo /48B--2.315--2.25a) 1.1250, (9) 


a—e2Ya/30Bé—1—2ark6ast2.0m8s —— 
iG) (10) 


Ya —wete= —1.5Be(a2— 1.2507) , (11) 
Y. Be(3a2—1.750:2)/8 ; (2) 
Y 16B5(3--a)/od, (13) 
Vor= — (644/059) (134-92; —a:-2.25a7) , (14) 


Bu= 75+ 7a4— 4.5a34- 1.5a3— 11.5a1a2 
+5.25(a2+a1*), (15) 


B, Yu/ (1d- BéBu/o?) - (16) 
I,-h/8:2B,, (47) 

u- MjMs/ (My -M3), (18) 
ra= Gin, de 


where B, is set as equal to Yo on the right-hand side 
of all equations except Eq. (16). 

The probable error limits in the evaluated Y values 
were obtained by use of a statistical method? designed 
to produce a 99% confidence in the stated errors. The 
limits of error for the derived constants are based on 
these errors and have a 99% confidence level or better. 
The bulk of the data analysis was performed on the 
IBM-7072 digital computor available in the Duke 
University Digital Computing Laboratory. 

Except for Yo, the observed Dunham constants Y 
are related to the more familiar spectral constants for 
diatomic molecules, as follows: 


YoaczB, Yu=Ye Vi2=—Be (20) 
Yu=—a, VYo=—D. Yw-H,. 
Similarly, 
Vio=we Voo=—werte- (21) 


The first of these relations, Y'u£zB,, is only approxi- 
mate, but the other relationships hold with sufficient 
accuracy for the results described here. For a further 
description of these spectral constants, see Herzberg's 
monograph.” 

> Because of the high J values involved in the transi- 
tions observed in the present work, no nuclear hyperfine 
structure was resolved. A negligible broadening of some 
of the lines by nuclear quadrupole interactions were the 
only hyperfine effects which could be observed. Thus 
we shall not outline the theory of nuclear interactions 
here. A treatment of these effects, which are resolvable 
at lower frequencies, is given by Honig eż al." 


%G. Herzberg, Molecular Structure and Molecular Spectra 


(D. Van Nostrand C i ^ 
dnd ed, Vol T p. repay) Inc., Princeton, New Jersey, 1959), 
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Pane IT. Observed and calculated microwave frequencies of the alkali halides. 


ee 3 Measured Calculated Measured Calculated 
Transition (Mc/sec) (Mc/sec) Transition (Mc/sec) (Mc/sec) 
NaCl Tem s 
Rb* Cl 
9-0, J=7 —8 104 189.74+0.10 — 104 189.66 2=0, J—48—49 25612042--0.10 256 120.44 
v=0, J-11— 12 156 248.60+0.10 156 248.63 v=1, J-20— 21 109 440.74+0.10 109 440.77 
v=0, J—13— 14 182262.89:-0.10 182 262.86 v=1, J-—33—34 177045492:0.10 177 045.50 
»-0, J-14— 15 195 265.39+0.10 195 265.38 o=1, J=34—35 182238.38--0.10 182. 238.39 
»—0, JT 221 260.15+0.10 221 260.11 2-1, J-41—42 218 551.58+0.10 218 551.62 
v=0, J-17— 18 23425183-£0.10 234 251.87 v-1, J—48— 49 254791364-0.10 —— 254 791.47 
v=0, J-20—21 27320201--0.15 273 202.04 v-2, J=33>34 176126.302-0.10 176 126.30 
v=0, J-23—24 3121090882020 312109.84 v=2, J-34—35 181292.0240.10 181 292.14 
v=1, J-13— 14 180907.78-£0.10 ^ 180907.81 v=3, 23334 175209.82-L0.10 175 209.90 
v=1, J—14— 15 193813.63--0.10 193 $13.59 v-3, J-34— 35 180348.70--0.10 180 348.78 
v=1, J—17—18 23250995--0.10 232 509.89 i e i 
v=1, J—23—24 309 787.8240.10 309 787.79 RICS 
7—2, J=14>15 192 371.0540.10 192 371.02 v=0, J=29—30 156025063-0.10 156 025.05 
me v—0, J=33—>34 1761714620.10 —— 176 777.45 
EET. WERE ORI v=0, J-40—41 213046832-0.10 213 046.82 
v=0, 7—14— 15 191091.622-010 191 091.57 uda me E o o LLL 
v=0, 7-16 17 21653144-0.10 ^ 21653148 sete co 4 
v=1, J =e = 7 113 858.03+0.10 113 857.97 EKOE 
7-1, = — 51 157. : 51 783. x = “a i 
p=, J-14—15 189686134010 189 686-11 050, Jai 3> OL eae 
cot : v-0, J—35—36 181 216.3040.10 181 216.30 
KCI v=0, 7-4 —42 2131117010 — 211311.15 
v—0, J-12—13 99929.542-0.10 99929.52 onl, Jedt-r35 SUUM 
»—0, J—19— 20 1536714820.10 ^ 15367746 v—L J-35—26 E NR 
v=0, J=24>25 192023493-0.10 192 023.48 o=2, J=34 35 RA Se 
v=0, J—29— 30 230320.560.10 ^ 230 320.56 
v—0, J—34—35 268558.98--0.15 268 558.94 CsCl 
v=1, J=12— 13 99 316.44+0.10 99 316.40 27-0, J=35 — 36 155 062.73+0.10 155 062.72 
g=1, J=24— 25 190 844.49+0.10 190 844.53 2-0, J—4t> 45 193 699.772-0.10 193 699.74 
v=1, J=29— 30 228 905.95+0.10 228 905.90 2-0, J=53 — 54 232 250.96+0.10 232 250.92 
v—2, J—24— 25 189 670.524-0.10 189 670.51 GM dios hss 192 790.44+0.10 192 790.44 
g=1, J-53— 54 231159534010 231 159.46 
KCI? v=1, J=62—63 260425242-0.15 269425.25 
=0, J=21— 22 164 220.00+0.10 164 220.01 7—2, J=44— 45 191 883.04+0.10 191 883.03 
v=0, J=23— 24 179 121.98+0.10 179 121.92 9—2, J—53— 54 230 070.34+0.10 230 070.29 
y=0, J=35— 36 268 363.84-£0.15 268 363.80 t=3, J=44—45 190 977.56+0.10 190 977.53 
v—1, J-18— 19 140997.102-0.10 ^ 140997.03 
g=1, J-223—24 178038.1840.10 178 038.10 CsCI7 
v=0, J—42—43 177224.9040.10 177 224.89 
RbSsCEs v-0, J-59— 60 246912.7140.10 — 246912.69 
v—0, J-33—34 1779671.572-0.10 177 967.52 v-1, J242—43 176411.22-0.10 176 411.12 
v—-0, J=34—35 183 187.4840.10 183 187.53 v-1, J=46—47 192760.59--0.10 192 760.52 
v-0, J-41—42 219690.65--0.10 ^ 219 690.65 v=1, J—59- 60 245716.6012-0.10 —— 245 776.61 


ROTATIONAL CONSTANTS 


The measured frequencies for each of the alkali 
chlorides studied are listed in Table II. In the same 
table are given for comparison the theoretical values of 
the frequencies as calculated with the spectral constants 
given in Table III. 

The six constants (Yo, a, Ye, De, Be, and He) were 
calculated directly from the measurements for the iso- 
topic species CsCl**, RbCI**, KCI?, NaCl, and Rb* CI". 
For the species CsCl’, KCI”, NaCl’, and Rb*Cl*, 
these measurements yielded only four constants (Bo, 
Bı, De, Be) because observations of rotational transi- 
tions for only two vibrational states were made. For 
the latter, Yo, and o, were derived from Bo and Bi by 
use of a value of y, calculated with the isotopic mass 
ratio formulas. These six structural constants for each 
molecule are shown in Table IIT. 

Comparison of the RbCl and KCl data with previous 


results! obtained with molecular-beam electric reso- 
nance shows that the three constants making up 
B,(¥01,%e;¥e) have comparably small errors for the two 
types of measurement and that they agree well within 
these errors. This experiment shows no improvement in 
B, over that obtained from low J transitions accurately 
measured with molecular-beam, electric-resonance tech- 
niques because the fractional error in the measured 
frequencies for both experiments is:comparable. 

A comparison of the present data for CsCl and NaCl 
with those of the previously made hot-cell measure- 
ments? shows a significant improvement im accuracy 
in the values for the three constants making up B,. 
The greatest improvement of the present study over 
the previous results arises in the nieasurement of the 
centrifugal distortion constants D, and 6, and in the 
evaluation of other constants which are influenced by 
these constants. The earlier microwave measurements, 
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[4 
"TABLE III. Observed rotational constants. | 
Qe Ye D. B. d. 
Yo —Vu Yn = Yoz Ke Vie A sx 1073 
| Molecule Mc/sec Mc/sec Mc/sec Kc/sec E c/sec p 
i NaCl 6537.367-0.006 48.711-0.009 0.154-+0.008 9.3540+0.0050 —0.0250+0.0050 — 1,020 +0.010 
NaCl" 6397.287 0.006 47.1512:0.009 0.147+0.008 8.96302-0.0060 —0.0190-£0.0060 —0.950 0.010 | 
4 KCI 3856.373=-0.009 23.681+0.013 0.0492-0.004 3.26002-0.0030 — 0.0025 -1:0.0003 — 0.623 +0.008 m 
| KC 3746,578-:0.009 22.6783-0.013 0.0472:0.004 3.07802-0.0040 —0,0041-+0.0040 —0.571 0.008 Í 
i RbCl 2627.393 0.004 13.599+0.004 0.0212-0.001 1.4830+0.0010 0.0007+0.0004 — 0.248 =-0.002 | 
i Rb*'ClI55 2609.774-:0.004 13.4584:0.006 0.020-0.001 1.4610-0.0010 0.00192-0.0010 — 0.243 +0.002 
| RbS5 CP 2526.8562:0.004 12.8264-0.005 0.0194-0.002 1.37102:0.0010 0.00032-0.0010 —0.221 +0.002 t+ 
| CsCl 2161.2462-0.002 10.1192-0.002 - 0.011-+0.002 0.9791+0.0004 0.00154-0.0002 — 0.14132-0.0006 
i CsCl*? 2068.8132:0.002 9.4764-0.003 0.0102-0.002 0.897442-0.0005 0.0014.2-0.0005 — 0.12394-0.0007 
i which were performed at lower frequencies, did not Fo. The equilibrium moment of inertia and equilibrium | 
] yield these constants. nuclear distance, Je and fe, were derived from B, by 
1 The slight effects of Yos or He on the observed fre- use of Eqs. (17) and (19), respectively. These values are 
i quencies were obtained from values of the other struc- listed with two-figure errors to allow for adequate 
1 tural constants with the equation comparison between isotopes. If, at a later date, a more 
ji accurate value of Planck’s constant / should become 
or Yw- H,-2D,(12B2—o.)/3o7). (22) available, these values may be calculated from the data 
Er with greater absolute accuracy. Values for 7, and re 
ji Only one pass through the computor was required for both agree within their experimental errors with previ- 
ol reduction of the measured frequencies to structural 
4 constants when this calculated value was used. The 1 j ! 
values of H, as calculated from previous constants and Tase IV. Derived values of Yoo and Vos. 
as calculated from the constants derived from the first 
passage through the computor agreed within the limits Yoo E 
of the effect of the H.’s on the measured frequencies. Molecule (kMc/sec) Yos 
Values for He, listed in Table IIT, show about two more Naci’ 2.30+0.20 —0.003 0.002 
significant figures than would have been obtained if the es ec RIS E00 
value for He had been extraced directly from the meas- KCI 1.40 £0.40 —0.0006 — 0.0007 
ured frequencies. eus pee aoe — 0.00033 — 0.00005 
The measured constants under consideration have RDC] d E0 20 TOMO EIND 
been found to be consistent under the mass ratio trans- CsCl 0.40+0.04 —0.00020=-0.00002 
formation within the experimental errors. The values CsCl 0.403-0.10 — 0.00017--0.00003 
for Yo» and Fos (Table IV) have a number of significant 
figures and thus provide some check for the Dunham | 
» 1 e. A 
theory. Derived values for Yo» and Fo, are consistent ous measurements because the five-figure accuracy of Y 


for different isotopic species of the same molecule, as 
are all the derived constants presented in the following 
sections. : 

Equation (16) was used for derivation of B, from 


h masks the differences between these and the hot-cell 
measurements. All conclusions reached by Honig et al.’ 
regarding 7, and re remain unaffected. Values for these ^ 
constants are listed in Table V. 


"TABLE V. Molecular structural constants. 


Molecule B. (Mc/sec) Bu Iè (amu-À) ze (BS) 
F NaCl 6537.4062-0.006 —174-2 71.33162-0.0030 
2d LE eas = 1543 79.0311-£0.0031 2300898 0.000040 
KCl? 2530 185 E000 —15+1 131.10402-0.0053 2.6667722-0.000054 
ET 3746. 924 009 1847 134.9459-+0.0054 2.666771 +-0.000054 
E. RbICH 7600 EUG ee GARE 2.786865 4-0.000055 
A . : = .12822-0.0077 i 
Rb"CI? —  — 2526860--0.004 —12:c5 200.08522-0.0079 2:T86861::0 000055 
Cic z 2161-247 30.002 meal 233.9332-0.0091 2.906411 0,000057 
o A m o 0 no 2:906408--0.000057 i 
i 1.05443 +0.00004) X107?! erg-sec, 1 g = (6.02486 +0.000 
ndon and H Odishaw (Met raw-Hill Boe Companys Tee New Work, 1988), We ate d and E. R. Cohen, in Handbook of Physics, edited by | 
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VIBRATIONAL CONSTANTS 


The vibrational constants w, and w,«, were derived 
from the experimentally observed Y's by use of the 
Dunham theory, Eqs. (6)-(20). Values thus obtained 
are compared in Table VI with the values obtained from 


TABLE VI. Derived vibrational constants of alkali chlorides. 


Ee We Qe, Wek, 
Microwaye® Infrared’ Microwave* Infrared 
Molecule cm cm cm! cm 


17552-0.080 — 2.05 


NaCl 364.60+0.10 — 3662-4 

NaCl 360.603-0.10 1.7402-0.030 

KCB 279.80+0.10 28146 1.1672-0.005 1.30 
KCI’ 275.80+0.20 1.1134-0.005 
RbS5CP5 233.34+0.08 — 228-56 0.856--0.008 0.92 
RbCl 232.67+0.08 0.870+0.020 
Rb55C]? 228.88+0.08 0.8102-0.040 

CsCl 214.222-0.05 209-6 0.7402-0.004 0.75 
CsCl? 209.55+0.06 0.710+0.020 


a From present data calculated with Dunham's theory. 
b Infrared results of Rice and Klemperer (Ref. 18). 


infrared measurements by Rice and Klemperer.!5 The 
present values of we agree within experimental error 
with those from their infrared data, but are an order 
of magnitude more accurate if one can trust the Dun- 
ham theory to hold within the limits of our experi- 
mental error. It is worthy of note that the we values of 
Rice and Klemperer are in much better agreement with 
our we values, and also with those of Rusk and Gordy 
on the alkali bromides and iodides than are any other 
previous values from infrared or optical spectroscopy. 
Through an oversight their excellent infrared values 
were not listed for comparison in the earlier paper.? 
There is an interesting regularity evident in the com- 
parison of the theoretical values of wese from the Dun- 
ham equations, from a formula derived by Pekeris,? 
and the experimental values of Rice and Klemperer.’ 
Pekeris’ formula, shown below, gives better agreement 
with the cx, values of Rice and Klemperer than does 
the Dunham expression for the lighter molecules, 
particularly NaCl (see Table VII). The agreement of 


Taste VII. Comparison of microwave and infrared 
wexe’s for alkali chlorides and bromides. 


QeXe WeXe CET 
Microwave? Infrared? Microwave* 

Molecule cm! cm cm 
NaCl 1.755 2.05 2.0640 
NaBr”? 1.1604 1.50 . 1.40404 
KCI 1.167 1.30 1.3390 
NaI 0.9654 1.08 1.07202 
RbS55CBS 0.856 0.92 0.9530 
KBr? 0.7584 0.80 0.85344 
CsCliss 0.740 0.75 0.7940 


a Calculated with Dunham's Theory. 

b Infrared measurements, Rice and Klemperer (Ref. 18). 
* Calculated with Pekeris's theory (Morse potential). 

4 Rusk and Gordy (Ref. 2). 


"158. A-Rice and W. Klemperer, J. Chem. Phys. 27, 573 (1957). 
1C, L. Pekeris, Phys. Rev. 45, 98 (1934). 


infrared data with the Dunham or Pekeris formula 
shows a regular variation with increasing molecular 
mass. 

The Pekeris formula for wex, follows from one for 
a, which was derived from a perturbation treatment 
with the Morse potential! as a basis. 


a, (2o, x, Bil o.) [CB mus.) ?— 3 (Bo ux.) |. (23) 


This equation may be solved for «x, to give 


wte= Bl (auo, / 6B2)4- 1T, (24) 
which in terms of Dunham's constant a, is 
Wele = Bar € (25) 


Formula (25) may also be obtained from Dunham's 
relations if Yo is equated to zero in Eq. (12) and the 
expression thus obtained for a; is substituted into Dun- 
ham's Eq. (11) for Yeo. By Eq. (11), Yoo equals ~ wte. 
The constant Vo is invariant between isotopic species 
and is independent of the nuclear mass of a given 
molecule. Therefore, Yo, must be dependent on the 
nature of the potential function. It cannot arbitrarily 
be set to zero because its value is determined by the 
potential coefficients a; and a», whose values depend on 
the measured rotational constants. Table IV shows that 
the value for Yo obtained from the rotational constants 
is definitely not zero. 

The only spectral quantity which shows any possibly 
significant discrepancy within the Dunham theory is 
the anharmonicity constant, e,x,. This possible dis- 
crepancy should be examined further. Although Rice 
and Klemperer did not give estimates of the limits of 
error for their w,«, values, the good agreement between 
their e, values and those of the present study gives 
credence to their wete values. The Dunham theory is 
based upon an expansion of the potential function in a 
power series around the equilibrium position. The Morse 
potential function upon which the Pekeris formula is 
based should give a better representation than does the 
Dunham potential for regions not so near the equi- 
librium position. This might explain the better agree- 
ment of the Pekeris formula with the wx, values from 
the infrared measurement of the lighter molecules with 
their higher vibrational energies. Varshni and Sukla? 
compare these and other observed constants of the 
alkali halides with values calculated from a number of 
different potential functions. It is of interest tó compare 
their different values for w.x. (see their Table IV) with 
those obtained here. : 


MASS RAÍIOS 


The mass ratios between CI? and CI? alsó those be- 
tween Rb® and Rb, were calculated for all the mole- 
cules measured. All mass ratios are shown in Table 
VIII, along with some previously óbtained values for 


20 y. P. Varshni and R. C. Shukla, J. Chem. Phys. 35, 582 
(1961). 
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TABLE VIII. Isotopic mass ratios. 


Chlorine (CI5/CI7) 


Rubidium (Rb®7/Rb*) 


Molecule Method Ratio Molecule Method Ratio 
NaCl Present 0.94597804-0.0000045 RbCF* Present oal í OO EO 
KCl Present 0.9459783-1:0.0000089 RbI Present (:9770163:£0.0000045 
RbCI Present 0.9459768-1-0.0000047 RbCI MBER* 9770148 0.0000052 
CsCl Present 0.9459785--0.0000028 RbF MBER 09770177 £0.0000045 
CsCl Hot Cell* 0.9459781+0.0000030 RbI Hot ce - 0:9770146.:0.0000055 
KCl MBER» * 0.9459803--0.0000015 RbBr Hot cell! E 0.977019 1 4.0.0000022 
Mass spectroscopy? 09459759 --0.0000035 RbBr Mass spectroscopy . : 
a See Ref. 7 


Inc., New York, 1958), pp. 9-55. 
4 See Ref. 2. 
* See Ref. 4. 


comparison. These quantities were calculated from the 
ratios B,/B.’, as described by Honig eż al.’ 


j POTENTIAL COEFFICIENTS 


The coefficients in the Dunham potential series, 
Eq. (2) up to and including as, are listed in Table IX 


TABLE IX. Potential coefficients. 


ao 

Molecule em ai a: a1 

NaCl 152 400 +90 —3.076+0.001 647+0.09 —11.0+1.0 
NaCh? 1523202100  —3.076--0.001 6.39+0.10 —11.0+1.0 
KCI 152100+100  —3.226+0.002 6.96+0.04 — —12.0-E0.8 
KCI" 152 1004+200  —3.3262-0.003 7.104030 —13.0+3.0 
Rb5 CBS 1553004100 —3.297-40.001 7.08+0.08 —11.5+0.8 
Rbs7Cl5  155500--100  —3.297--0.002 6.90+0.20 —10.0+2.0 
RbCl?  155400--100 —3.297+0.002 7.14+0.20 —12.0+2.0 
CsCI5 159 1003-70 —3.317+0.002 6.92+0.04 —10.0 +0.8 
CsCl? 159 090 2-90 —3.318+0.001 6.89+0.20 —10.0 +2.0 


for each individual molecule. They were obtained with 
Eqs. (6)-(20). The potential coefficients between iso- 
topic species are equal (within the stated errors) as 
should be expected. 

Table X, which includes results for the bromides 
and iodides by Rusk and Gordy,? compares values of 
09, 41, G2, and a5 for three different halides of the same 
alkali metals. The coefficients a; are shown to be ap- 
proximately the same between the iodide, the bromide, 
and the chloride for a given alkali metal, except for 
cesium, where the relationship is one of slightly increas- 
ing absolute value of a, with decreasing mass. The coef- 
ficients ao also show a regularity in that ap increases 
smoothly with decreasing halide mass for the same 
alkali metal. These regularities were noted in the results 
on the iodides and bromides previously reported from 
this laboratory.” This chloride work extends and confirms 
them, They had not been shown before the development 


rA 


a ~ 


bL. R. Maxwell, S. B. Hendricks, and V. M. Mosley, Phys. Rev. 52, 968 (1937). n 
* W. H. Johnson, K. S. (ORI CORE and A. O. Nier, Handbook of Physics, edited by E. 


U. Condon and H. Odishaw (McGraw-Hill Book Company, 


of our millimeter-wave, molecular-beam spectrometer, 
apparently because values of D, and f, were not avail- 
able to sufficient accuracy for precise calculation of the 
potential coefficients. For comparison, we have given 
in parentheses the potential coefficients earlier obtained 
by Honig eż al.” It will be noted that in many instances 
there are rather wide discrepancies between our values 
and theirs. 


TABLE X. Comparison of potential coefficients. 


Go 
Molecule cm 1X 1075 


01 a2 a3 
LiBr 1.42 (1.02) —2.71 (—2.45) 6.8 (5.5)  —19 (—14) 
Lil 1.35 (1.14) —2.70 (—2.56) 5.0 (6.0) —8 (—15) 
NaCl 1.52 (1.66) —3.07 (—3.14) 6.4 (9.2) | —11 (—31) 
NaBr 1.48 (1.64) —3.05 (—3.16) 6.5 (7.4) —12 (—16) 
Nal 1.42 (1.74) —3.02 (—3.23) 5.9 (8.8) —8 (—26) 
KCl 1.52 (1.81) —3.22 (—3.43) 7.0 (10.0) —12 (—32) 
KBr 1.48 (1.63) —3.24(—3.35) 6.9 (9.1)  —11 (—26) 
KI 1.43 (1.64) —3.25 (—3.41) 6.9 (9.1) —11 (—25) 
RbCl 1.55 (2.08) —3.30 (—3.66) 7.0 (11.8) —11 (—42) 
RbBr 1.53 (1.73) —3.33 (—3.49) 7.3 (10.3) —13 (—33) 
RbI 1.44 (1.64) —3.34 (—3.49) 7.2 (10.2) —11 (—32) 
CsCl 1.59 (2.00) —3.32 (—3.59) 7.0 (10.6) —10 (—35) 
CsBr 1.55 (2.03) —3.38 (—3.72) 7.6 (11.7) —14 (—41) 
CsI 1.50 (1.53) —3.43 (—3.45) 7.6 (9.2) —13 (—26) 


* Data for bromides and iodides come from Rusk and Gordy (Ref. 2), 
those for chlorides from the present study. Values given in parentheses 
are those of Honig c! al. (Ref. 7). 


FUTURE WORK 


Modifications of the oven are being made which will 
permit measurements to be made on substances which 
vaporize at higher temperatures than do the alkali 
chlorides. Millimeter-wave rotational transitions of 
NaF in different vibrational states have already been 
detected,» and it is hoped that a series similar to these 
on bromides, iodides, and chlorides can soon be reported 
from this laboratory on the alkali fluorides. 


^: S. E. Veazey and W. Gordy (to be published). 
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Precision Evaluation of the High-Frequency Limit of the Continuous 
X-Ray Spectrum Using a Gas Target X-Ray Tube*t 


Jon J. SPrjkERMANÍ AND J. A. BEARDEN 
Department of Physics, The Johns Hopkins University, Baltimore, Maryland 
(Received 30 December 1963) 


The major uncertainty in previous measurements of the high-frequency limit of the continuous x-ray 
spectrum was due to the fine structure associated with the observed cutoff. The development of a high- 
power mercury vapor target x-ray tube permitted observations to be made under idealized thin-target condi- 
tions. The x-ray intensity near the high-frequency limit was comparable to that of a solid target x-ray 
tube. The x-ray intensity was measured as a function of the voltage applied across the tube, with a double- 
crystal monochromator adjusted for the 1.537400 kxu wavelength of the Cu Ka; line. The absence of fine 
structure in the observed isochromat, in contrast to most solid targets, greatly increased the accuracy in the 
determination of the voltage cutoff. Accurate measurements of this voltage yielded a voltage-wavelength 
conversion factor VÀ,— (12372.264-23 ppm) xu-volts. Using h/e from non-x-ray data, the x-ray wavelength 
conversion factor from x units to cm (or angstroms) can be calculated, or conversely h/e evaluated. The 
effects of a revision in the Cu Ke: wavelength, which is now in publication by one of the authors, are also 


discussed. 


INTRODUCTION 


LEMENTARY quantum theory considerations 
give the short wavelength limit of the continuous 
x-ray spectrum in terms of the voltage applied to the 
x-ray tube. This relation is Ve=hv, which may be 
rewritten as 


h/e=V), (A/c), (1) 


where %/e (the ratio of Planck’s constant to the charge 
on the electron) is expressed in erg-sec/esu, V in volts, 
A, in kxu, and c, the velocity of light, in cm/sec. 
A=),/\;, the x-ray wavelength conversion factor, 
represents the ratio between wavelength on the 
Siegbahn scale and wavelength on the absolute scale, 
i.e., A/kxu. (1 kxu= 10? xu.) 

This relationship has traditionally been used as a 
means of measuring Z/e, but of course the x-ray 
measurement only evaluates Và.. The three’ most 
precise evaluations of VA, and the calculated values 
of h/e have yielded values which were considerably 
lower than those obtained through non-x-ray measure- 
ments. The discrepancy was so great that DuMond 
and Cohen? gave such values a low weight in their 
evaluation of atomic constants while Bearden and 
Thomsen! excluded these measurements entirely. This 
discrepancy was largely attributed to the fine structure 


* Research supported by U. S. Atomic Energy Commission 
t No. AT (30-1)-2543. t 

UP in part k a deus (J. J. S.) submitted to the Department 
of Physics, The Johns Hopkins University, 1962. t 

I Present address: National Bureau of Standards, Section 2.4, 
Washington, D. C. 20234. * 

i ns Berden and G. Schwarz, Phys. Rev. 79, 674 (1950). 
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81, 70 (1951). À 3 d RA 
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observed at the high-frequency limit of the continuous 
spectrum. Detailed measurements of the structure as 
a function of atomic number and voltage applied to the 
x-ray tube have been made by Johnson? and Per 
Johansson.’ Its origin has been interpreted by Ulmer? 
as due to solid-state band structure. The initial state of 
the electron is determined by the cathode of the X-ray 
tube while the final state is characterized by the target. 
The distribution of those unoccupied energy levels to 
which a significant number of transitions are allowed 
then determines the shape of the continuous spectrum 
near cutoff. In principle this result holds only for a “thin 
target”; however since characteristic energy losses? 
occur in discrete steps of the order of 10 eV, it will also 
hold for a thick target within a few electron volts of 
the cutoff point. 

In order to eliminate the fine structure, due to the 
solid state, a gas target x-ray tube” has been developed 
in which an electron beam of 20-50 mA and 8-20 kV 
is focused to a mercury jet as target. The intensity 
observed near the high-frequency limit is comparable 
to that of a solid target. This effectively thin target 
satisfies the assumptions of the Sommerfeld! theory 
and hence greatly simplifies the interpretation of the 
measurements which can then be used to obtain a 
refined value of ///e. On the other hand, if h/e is known 
with sufficiently high accuracy from non-x-ray. sources, 
it gives a means of determining A, as is' evident from 
Eq. (1). At the present moment, the relative uncer- 
tainty in the value of A appears somewhat greater than 
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* K- Ulmer, Z. Physik 159, 443 (1960); 162, 254 (1961); Phys. 
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Fic. 1. Three-dimensional view of the gas target x-ray tube. 


that of Z/e, although both are of the same order of 
magnitude. 

In interpreting the results, certain small corrections 
to the measured potential are essential, including those 
due to space charge and to work functions of the cathode 
and anode.” These are obtained, in part by a measure- 
ment of the ionization potential of the gas. 


GAS TARGET X-RAY TUBE 


Two gas target x-ray tubes have been used in the 
present work. The basic principles of both tubes are 
shown schematically in Fig. 1. The x rays were observed 
along an axis which was perpendicular to both the 
electron beam and the mercury jet. 

The required area density of the jet can be estimated 
as follows: From solid target measurements? and 
electron energy losses in thin foils? it can be shown that 
the radiation near the high-frequency limit is produced 
by a solid target thickness of the order of 500 À. Thus, 
to obtain sufficient x-ray intensity for precision meas- 
urements, the total atomic cross section for the gas 
target should be equivalent to a 500 A thickness of a 
solid. This was achieved by using a vapor jet formed 
by a nozzle, as shown in Fig. 1. The diameter of the 
mercury beam as indicated by the focal spot was 
approximately 5 mm. The vapor was formed by an 
electrically heated Hg boiler, and the jetstream directed 
into a dry-ice cold trap. The freezing out of the Hg 
vapor greatly reduced the pumping speed required to 
maintain a pressure of 10 mm of Hg in the target 
chamber. A 500 literx’sec mercury vapor diffusion 
pump and liquid nitrogen traps were used to evacuate 
the electron gun and high voltage section of the x-ray 
tube. 

The electron beam was formed by a two-element 


| W. M. DuMond and V. L. Bollman, Phys. Rev. 51, 412 


D. L. Webster, Phys. Rev. 9, 220 (1917). 
*W. W. Nicholas, J. Res. Natl. Bur. Std. 2, 837 (1929). 
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Pierce electron gun,” with a unipotential dispenser 
cathode.!® This gun operated very satisfactorily, and 
at 8 kV produced a maximum beam current of 50 mA 
with a beam diameter of about 3 mm. The accelerating 
anode of the Pierce gun, the electron collector (beam 
stop), the nozzle, and the condenser were all electrically 
connected and maintained at ground potential, except 
for possible small effects such as contact and thermal 
emf's. The presence of Hg atoms in the tube largely 
neutralized the space charge!” and collimated the elec- 
tron beam through gas focusing. However, the ion 
bombardment of the cathode reduced its life to approxi- 
mately 40 h. 

In order to obtain effective work function corrections, 
as described in the section on corrections and probable 
errors, the ionization potential of Hg was measured by 
the usual method. This required a shielding grid at the 
potential of the target chamber and an ion collecting 
plate at a small negative potential. (These auxiliary 
electrodes are not shown in Fig. 1.) In the first tube a 
small grid and plate were inserted through the port 
where the x-ray beam was normally observed. In the 
second tube, both the stainless steel grid and plate were 
cylindrical and concentric with the mercury jet, with 
openings to permit passage of the electron and x-ray 
beams; they remained in place throughout the x-ray 
measurements, permitting an ionization curve to be 
run after each isochromat. 

The first tube!? was essentially two intersecting stain- 
less steel cylinders each 2 in. in internal diameter and 
4.5 in. long. The mercury jet was directly vertically 
upwards and an arrangement provided for condensing 
the mercury and returning it to the boiler. The second 
tube was designed on a larger scale, the actual geometry 
being very similar to the schematic shown in Fig. 1. The 
enclosing steel cylinder was 11 in. i.d. and 3.5 in. in 
height, with end plates of 0.5 in. thickness. The mercury 
jet was directly vertically downwards; no return was 
provided, since less than 300 cc of mercury was required 
for a 12-h run. 


POWER SUPPLY AND MONOCHROMATOR 


Power supply. The high-voltage regulated power 
supply and the high-voltage measurement circuit were 
basically the same as reported by Bearden and Schwarz," 
with minor modifications. The regulating amplifier was 
replaced by a commercial high-gain operational ampli- 
fier, and a phase-shift-free voltage divider? was designed 
for the feedback loop. This circuit reduced the ripple 
voltage to 0.05 V peak to peak. 

The high voltage Vm (see Table I) was measured with 


15 J. R. Pierce, Theory and Design of El 
Nestea Company, Inc., New eae tos Ne pe 
s E: Levi, J. Appl. Phys. 26, 639 (1955). 
" : D de J. Electron. Control 6, 307 (1959). 
J. J. man, dissertation. i iversi 
Baltimore, Maryland, 1962, p- 12 ED M ids 
C. L. Conner, Electronics 35, 52 (1962). 
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TABLE T. Measurements for the voltage-wavelength conversion factor. 


Run 
1 2 3 4 
High-voltage measurement Se 
E (St. cell) 1.018 414 1.018 414 1.018 414 1.018 414 
AE (K —3 pot.) 0.018 256 0.018 191 0.018 231 0.018 278 
E—AE 1.000 158 1.000 223 1.000 183 1.000 136 
Vin 8048.96 8049.48 8049.16 8048.78 
Corrections 
Vi 10.434 10.434 10.434 10.434 
Vmi 10.32 10.57 10.57 10.05 
VM 0.90 0.92 ; 0.83 0.83 
Vp 0.07 0.07 0.07 0.07 
V 8048.10 8048.35 8048.12 8048.26 
Wavelength 
(xu) 1537.400 1537.370 1537.395 1537.403 
Voltage-wavelength conversion factor 
VAS 12 373.14 12 373.29 12 373.13 12 373.41 
Weight 1 2 2 2 


a Weighted average VA, = (12 373.26 2-23 ppm) xu- V. 


a precision voltage divider and a potentiometer. The 
precision voltage divider, consisting of an 8526-0 
protective resistor in series with two 1-MQ standard 
resistors (each with 10 subdivisions) and two standard 
resistors of 100 2, was so arranged that the smaller 
voltage was almost exactly equal to that of a standard 
cell for the average value of high voltage used in the 
measurements. The precision potentiometer then was 
used only to measure the small incremental difference 
between the voltage drop in the two 100-2 standards of 
the divider and that of the standard cell. Hence the 
high voltage Vm was determined from the difference 
between the emf of the NBS 968 saturated standard cell 
and the potentiometer reading AZ, multiplied by the 
voltage divider ratio of 8047.688. The standard 
resistors and standard cells used have been calibrated 
by the National Bureau of Standards a number of 
times, including one check during the present experi- 
ment. 

Monochromator. A double-crystal spectrometer,” with 
quartz crystals cut parallel to the 1011 plane, was used 
for a monochromator with a resolution of approximately 
13 000 as determined from the parallel (1, —1) position 
rocking curve. The monochromator was adjusted for 
the wavelength of the Cu Ka: line, which is one of the 
best known standard x-ray wavelengths. The mono- 
chromator was aligned by replacing the vapor target 
with a Cu target, and accurately calibrated against the 
Ko; line, assuming a peak wavelength of 1.537400 
kxu.?! In the different experimental runs the exact wave- 
length setting was determined with respect to this peak 
angle position. The x rays were detected by a scintilla- 


?! A. H. Compton and S. K. Allison, X-Rays im Theory and 
Zxperiment (D. Van Nostrand Company, Inc., New York, 1954), 


2nd ed., po. 709-740. 2 
E Di A FBearden and C. H. Shaw, Phys. Rev. 48, 18 (1935). 


tion counter with a 25-mil-thick NaI (TI) crystal placed 
on a selected photomultiplier tube and shielded by a 
low radioactive iron housing. For a pulse-height 
discriminator setting which gave 80% detection effi- 
ciency the background was 0.3 counts/min. 


ISOCHROMATS 


In principle the high-frequency limit of the con- 
tinuous spectrum can be found most simply by holding a 
fixed voltage on the x-ray tube and determining the 
shortest wavelength emitted. In practice it is usually 
more convenient to adjust the monochromator for a 
fxed wavelength and determine the variation of 
intensity as the voltage is changed in small steps. The 
resulting curve is called an isochromat. 

Four averaged isochromats were used in the four 
runs shown in Table I. These were obtained as follows: 
Each averaged isochromat is the composite of ten 
individual isochromats. Seventeen voltage points in the 
vicinity of cutoff were selected and the x-ray intensity 
recorded for a period of 200 sec at each point. In order 
to minimize instrumental drift, these measurements 
were first recorded for increasing voltage and then 
repeated for decreasing voltage for each individual 
isochromat. After initial adjustment of the apparatus 
was completed and definitive recording was begun, no 
individual isochromat was rejected for any reason. One 
of the averaged isochromats is indicated by the data 
points in Fig. 2, where the normalized intensity is 
plotted against voltage change. Measurements up to 
500 V above the short wavelength limit indicated no 
further appreciable increase in intensity. 

Since the profile near the high-frequency limit is of 
primary interest, both for an 4/e determination (or VÀ) 
and from a theoretical standpoint, the observed 
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Fic. 2. Experimental window (solid line) and theoretical window 
(dashed line) of the double-crystal monochromator. 


isochromat was corrected for instrumental smearing, 
which is primarily due to the width of the monochro- 
mator window. This window, shown in Fig. 3, was 
obtained from the parallel position (1,—1) rocking 
curve using a copper target. The double crystal diffrac- 
tion pattern was also calculated from the Darwin-Prins- 
Ewald theory of single crystal diffraction.” The greater 
width of the observed pattern must be attributed to 
slight imperfections in the quartz crystals. A study of 
the crystals? showed that they were not perfect speci- 
mens. Since a 3 cm width of the crystal was used for 
both copper and vapor targets (due to the extended 
x-ray source), the crystal irregularities broadened the 
diffraction pattern. 

The measured window was used to correct the ob- 
served isochromat, but the extended wings made an 
unfolding procedure inaccurate. This difficulty was 
avoided by using an IBM-7094 computer to fold 
assumed isochromat profiles with the observed window 
and finding the best fit for the observed data. The 
assumed profiles were based on the Sommerfeld theory,” 
which gives the x-ray intensity as a constant inde- 
pendent of the frequency for hy<eV, and zero for 
hv? eV. However, near cutoff the shape of the profile is 
determined by the distribution of final states available 
to the electron. If one assumes no electric or magnetic 
field at the target, the density of final states is propor- 
tional to (AE)!2, where AE=e(V—Vo) or the energy 
difference between the incoming electron and the 
emitted photon.” 

Consequently the x-ray intensity was represented by 
the empirical relation 


T=1{1—exp[—a(V—Vo)*?},, (2) 


where a.is an adjustable parameter. Equation (2) 


23 The authors are indebted to J. G. Marzolf, S. J. for performing 
these calculations. See J. G. Marzolf, S. J., Bull. Am. Phys. Soc. 
8, 313 (1963). g 
( 1555) Henins and J. A. Bearden, Bull. Am. Phys. Soc. 8, 313 

24 A. Sommerfeld, Proc. Natl. Acad. Sci. (U. S.) 15, 393 (1929). 

35 E. P. Wigner, Phys. Rev. 73, 1002 (1948). 
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approximates the Sommerfeld profile, except near the 
high-frequency limit, and becomes proportional to the 
density of the final states as this limit is approached. 
The observed x-ray intensity should be given by folding 
Eq. (2) with the observed window of Fig. oe 

The parameters Vo and œ were varied stepwise; for 
each pair of values an isochromat was computed, and 
compared with the observed curve. The best fits were 
obtained with values of a between 1 and 2 V~; outside 
this range the predicted isochromat varied significantly 
from the observed one. The isochromats for these two 
values of a are shown in Fig. 2. For the best fit the 
value of Vo differed only about 5 ppm from the voltage 
at the half-intensity point; hence, the latter was used 
as a sufficiently accurate criterion for the high-frequency 
cutoff. 


CORRECTIONS AND PROBABLE ERRORS 
Work Function Correction 


The voltage measured, Vm, represents the potential 
across the x-ray tube and not the electron energy, since 
contact potentials and the initial thermal electron 
energy must be taken into account. In the solid target 
h/e experiments, the applied potential must be corrected 
for the work function of the cathode only; the work 
function of the anode is not involved, since the electron 
comes to rest near the Fermi level of the anode. For the 
gas target tube, however, both initial and final states 
are those of a free electron, and the difference between 
the work function of the cathode e. and the anode ga 
plus the thermal energy kT must be added to the meas- 
ured potential. 

The retardation of the electrons due to the space 
charge at the cathode has the effect of increasing the 
cathode work function. This space charge potential V; 


NORMALIZED INTENSITY 


VOLTS FROM H.F.L. 


Fic. 3. (a) Assumed isochromat, Eq. (2), wi zw 
assumed isochromat with a=2 V=; (6) UR edi 2 
tained by folding either assumed curve with the experimental 
window of Fig. 2; the predicted curves based on (a) and (b) 
showed no significant difference. Experimental data points are 
shown as dots, with bars indicating the statistical errors. 
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Fic. 4. Experimentally determined probability of ionization by 
electron impact in Hg gas. Cathode temperature 1493°K. 


can be calculated?5 from the cathode work function ge 
the critical cathode temperature 0 (where the space 
charge minimum disappears), and operating tempera- 


ture T, by 
T—0 kT I 
= ef )- In—. (3) 
0 € Io 


In this expression J represents the cathode current at 
the critical temperature 0, and J that at operating 
temperature T, while & and e are the Boltzmann con- 
stant and the electronic charge. With the above- 
mentioned corrections to the measured voltage Vm, the 
effective difference V becomes 


V-Va-- Vir (ec— Ga) +Vs. (4) 


'To determine these correction terms experimentally, 
a measurement was made of the ionization potential?” 
of Hg (for which a value of 10.434 V is known from 
spectroscopic data). The observed probability of 
ionization by electron impact for Hg vapor is indicated 
in Fig. 4. The structure in the curve is due to auto- 
ionization 2° corresponding to the *P;' state of mercury 
0.6 eV above the ground state, which competes with the 
lonization process. 

For the ionization potential of mercury at the same 
cathode temperature, the relation corresponding to 
Eq. (4) is : 
Vi War Vert (a= Pa) t Vs: , (5) 

` 


where the subscript 7 denotes quantities involved in the 
ionization potential measurement. Subtracting Eq. (5) 
from Eq. (4) and making use of Eq. (3) now yields 


kT I 


V= Vmt P= m= In— D (6) 
@ ls 


26 Wayne B. Nottingham, in Encyclopedia of Physics, edited by 
S. Flügge (Springer-Verlag, Berlin, 1957), Vol. 21, p. 42. — 

21 S. C. Brown, Basic Data of Plasma Physics (John Wiley & 
Sons, Inc,, New York, 1959), p. 14. 

35 W. M. Hickam, Phys. Rev. 95, 704 (1954). 
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the last term being simply V ,— V,;. Numerical values of 
these correction terms are shown in Table I. 


Potential Depression 


Since there is no external electric field acting on the 
clectron beam in the target chamber, the space charge 
effects could produce a potential variation across the 
electron beam, which might become significant at 
moderate current densities.” The space charge effect 
would lower the electron energy. It should also produce 
an energy spread of several electron volts in the cutoff 
of the isochromat, unless it is largely neutralized by 
the presence of positive ions, which are trapped by the 
electron beam. Linder and Hernqvist™ have studied the 
degree of neutralization of space charge by positive ions, 
and derived an expression for the potential depression 
as a function of the gas pressure. 

For an operating pressure of 10-? mm Hg in the 
described apparatus, the calculated voltage depression 
V p is 0.07 V (Table I). This could be in error by 0.07 V, 
since complete neutralization would take place at 
2X10- mm Hg and the pressure measurement could 
be in error by an order of magnitude. 


Errors 


The probable errors, in parts per million (ppm) of 
the final result are estimated as follows: 


1. Determination of the voltage at the isochromat cutoff: 


Potentiometer measurements 3.5 ppm 
Precision voltage divider 2.0 
Isochromat cutoff 12 
Work function correction 13 
Voltage depression 9 

Total (rms) 21 ppm 


2. Spectrometer calibration 10 ppm 


Total probable error in the voltage-wavelength con- 
version factor is thus 23 ppm; the error in the conversion 
factor from NBS to absolute volts (5 ppm) does not 
significantly increase this figure. 

'The four separate runs were assigned weights based 
on the statistics of the isochromat recording. 


RESULTS 


A summary of the experimental results is given in 
Table I. The total recording time for each run was 
slightly less than 19 h. Runs 1 and 2 were made with 
the tube briefly described in the gas target x-ray tube 
section and reported in detail in the thesis of one of the 
authors (J. J. S.). Runs 3 and 4 were made with the 
tube design shown in Fig. 1. The excellent agreement 
of the final values lends support to the methods and 
experimental measurements involved in applying the 
necessary corrections. 


? 0. Klemperer, Electron Optics (Cambridge University Press, 
London, 1953), p. 210. : NE < 

9 E. G. Linder and K. G. Hernqvist, J. Appl. Phys. 21, 1088 
(1950). 
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The voltage of standard cell E (actually three were 
used and frequently intercompared) has been corrected 
for the difference between the National Bureau of 
Standards volt and the absolute volt, using the con- 
version factor given by Driscoll and Cutkosky* 


1 NBS volt=1.000 010 absolute voltsd-5 ppm. 


AE is the incremental voltage «measured by the K3 
potentiometer. 

As shown in Table I the weighted average of the 
above measurements gives 


VA,— (12 373.2623 ppm) xu-volts. 


Combining this with the commonly accepted value of 
A=1.002 022-20 ppm and c— 299 792.5 km/sec,” one 
obtains 


h/e= (1.379 494-30 ppm) X 107? erg  sec/esu. 
The non-x-ray value? is 
h/e= (1.379 472-7 ppm) X 10" erg: sec/esu. 


'The probable errors overlap, and for the first time the 
x-ray and non-x-ray values of ///e are in good agreement. 
However, the probable error of the x-ray value is more 
than four times that of the non-x-ray one. Approxi- 
mately half of this is due to the uncertainty in the 
value of A. 

DuMond and Cohen? have pointed out that the 
optical calibration lines used by Tyrén?! in his concave 
grating measurement of A require correction for the 
Lamb shift. This might increase his reported value of 
A=1.001 99 by the order of 50 ppm (due to loss of data 
and plates it is impossible to determine the exact 
correction). DuMond and Kirkpatrick in a recent report 
to the National Science Foundation on a new concave 
grating experiment give a provisional value 


A=1.001 8010 ppm. 


This is about 200 ppm lower than Tyrén’s value or the 
plane grating values. Thus the true value of A may be 
considerably outside the 1.00203--20 ppm range 
calculated by Birge, based principally on measure- 
ments with plane gratings. 

From these considerations the measured value of VA, 


3 R. L. Driscoll and R. D. Cutkosky, J. Res. Natl. Bur. Std. 
60, 297 (1958). 

$ Natl. Bur. Std. Tech. News Bull. 47, 175 (1963). 
( o3 W. M. DuMond and E. R. Cohen, Phys. Rev. 103, 1583 
1 : 

31 Folke Tyrén, dissertation, Uppsala, 1940 (unpublished). 

35 R. T. Birge, Am. J. Phys. 13, 63 (1945). 
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could best be used to evaluate A. With h/e= (1.37947 
+7 ppm) X 1077 erg-sec/esu™ the value is 


A=1.002 0052-24 ppm. 


This value is based on Cu Ko; — 1537.400 xu. How- 
ever, a report by one of the authors (J. A. B.) on a 
serious wavelength discrepancy of approximately 20 
ppm between the Mo Ka; and the Cu Ka; xu standards 
has been confirmed and is in preparation for publication. 
On the basis of Mo Ka; as 707.831 xu (which is taken 
as a provisional definition of the xu in this report?) the 
wavelength of the Cu Ka; is 1537.370 xu. With this 
revision the experimental value becomes 


VA,— (12 373.022-23 ppm) xu: V 
and 
A — 1.002 0244-24 ppm. 


(This revision will not significantly alter the value of 
h/e calculated above, since the change in Cu Koi wave- 
length also implies a compensating change in the ruled 
grating values of A.) 


CONCLUSION 


The absence of structure and the sharpness of the 
high-frequency limit of the continuous x-ray spectrum, 
together with a measurement of the work function 
correction made possible a precision determination of 
VA,. The lack of structure in the gas target isochromat 
tends to support Ulmer's theory as to the origin of the 
structure. The profile of the isochromat gave the shape 
of the continuous spectrum produced by single electron 
bombardment. The measured intensity was constant 
from a few eV of cutoff to 500 eV above in accordance 
with the Sommerfeld theory. 

With the use of the non-x-ray value of //e, a precision 
determination of A has been obtained. The accuracy of 
this value appears comparable to those obtained by 
either ruled-grating or crystal measurements. 
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The inelastic scattering of electrons by Hat is investigated by means of the first Born approximation. 
Using the exact electronic wave functions, the integral that must be evaluated to find the differential cross 
section for fixed internuclear separation is studied for an electronic excitation from the ground state to any 
discrete excited state. Values related to this integral are tabulated for the processes 1s0,-2pou, 1505-2 fru, 
1525-25e; at the equilibrium internuclear distance, while the first case is studied at two additional inter- 
nuclear separations. The corresponding total cross sections are calculated for incident energies up to 400 eV. 
Assuming that rotational and vibrational levels of the final electronic state cannot be resolved, it is known 
that an observed cross section necessarily depends on the initial vibrational state of the molecular ion. The 
effect of two different initial vibrational states (v=0,3) is investigated for the 1sc,-2po. case and it is ob- 
served that the vibrational state has a marked influence on the total cross section. 


INTRODUCTION 


HE problem of inelastic electron scattering by 

molecules has a complication in addition to those 
found when considering scattering by atomic systems 
in that the internal degrees of freedom of the molecule 
must be taken into consideration. However, in the 
range of incident energies where the first Born approxi- 
mation can be expected to be valid, experimentally 
determined cross sections which resolve rotational 
effects on electronic transitions have yet to be published, 
and it appears that the resolution of vibrational struc- 
ture is just now becoming an experimental possibility. 
This being the case, it seems highly plausible to treat 
the internal degrees of freedom so that specific vibra- 
tional and rotational excitations are ignored while 
considering a given electronic process. A detailed 
analysis of the first Born approximation to scattering 


by molecules, where the above treatment of the internal 


modes was employed, has been given by Lassettre! 
and by Craggs and Massey.? Following their treatment, 
the differential cross section, where the excitation is 
n—n' for the electronic system, L,M — L',M' for 
rotation and v — v’ for vibration, can be written 


I(nLMvn' L'M'v' ; 0) 


D I I €(K,8,£,R) 
0 0 0 


- Ozyr(9,£) 8 Lae* (9,) X, (R)X * (R) RC 


= (4k w/R nK‘) 


xsinàdédtdR| . (1) 


* This work performed under the auspices of the U. S. Atomic 
Energy Commission. ede f 
! E. N. Lassettre, Ohio State University Research Foundation, 
Columbus, Ohio, 1957, R. F. Project 464, Report No. 1 (un- 
$T b ties and H. S. W. Massey, in H andbuch der Physik, 
fed by S. Flügge (Springer-Verlag, Berlin, 1959), Vol. 37, Part 
» P. - 


O, X are rotation and vibration functions, respectively, 
where it has been assumed that these functions do not 
mutually interact in addition to the usual assumption 
that the nuclear and electronic coordinates are sepa- 
rable. This expression and all remaining equations are 
given in atomic units unless stated otherwise. The 
quantity e is defined for a molecule with j electrons as 
(neglecting spin variables) 


i 
KAER) = [inttr ems uz exp(iK-n) | 
i=] 


XW,.(r- rj; R)dry--+-dr;, (2) 


where the states n,n’ cannot be the same. The quantities 
K, 0, £, R refer to the parameters that must be fixed 
during the integration over the electronic coordinates. 
The symbol K, the magnitude of K, defines the mo- 
mentum change of an incident electron scattered 
through an angle 0; K?^— &;?-- 5, — 2b, k,, cos, where 
kn, kw are the magnitudes of the momentum for the 
incident electron before and after scattering. The 
angles ô, & fix the orientation of the molecule in space 
and R is the internuclear distance. Lassettre! has shown 
that the differential cross section becomes 


I (nv,n'v'; 0) = (Aha: /baK*)| (1/4) 


| 


xf [if «totos tox, toram] 
0 0 0 t 


y Xsinadade| (3) 


when one sums over all final rotational states and 
averages over the degenerate levels of the initial state. 
Equation (3) will be true if the rotation of the molecule 
can be described by a symmetrical top and if the 
dependence of kw, K on L'M' is ignored.! It has been 
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assumed that the final vibrational levels cannot be 
resolved, so a sum over all y’ can be made, which gives 


I(nv,n’ 3 6) = (hw tak} (1/47) 


xj i i | e(K,6,£) |2|X,(R) E sind 
0 0 0 


x asagar) (4) 


if the dependence of kn, K on y’ is neglected. This is 
then the first Born approximation for a given electronic 
excitation of a molecule when the final rotational and 
vibrational states of the system are unresolved. 

The simplest molecular system available for study 
is the hydrogen molecule ion. Previous calculations on 
electron scattering by this system have been reported 
by Ivash* and Kerner.‘ Ivash treats the electronic 
excitation 1sc,-2pc. (in the notation of Bates, Ledsham 
and Stewart), where he approximates to the ground 
vibrational state by fixing R—R,, the equilibrium 
internuclear distance, and then evaluates Eq. (4) by 
ignoring the integration over R. In evaluating Eq. (2), 
he also treats the possibility of electron exchange. 
Kerner does not use Eq. (4), but evaluates Eq. (1) by 
considering an excitation to a specific rotational and 
vibrational state while treating the 1sc,-2p0. electronic 
transition. In each of these studies, the electronic wave 
functions that appear in Eq. (2) were replaced by linear 
combinations of 1s atomic orbitals. Since the exact 
wave functions for Hz* (assuming fixed nuclei) are 
available? and the use of exact wave functions for the 
scatterer is assumed iu the derivation of the first Born 
approximation, it would seem that electron scattering 
by Hj* should be reexamined using these wave func- 
tions. This will make it possible to give a quantitative 
estimate of the error introduced into the first Born 
approximation when approximate electronic wave 
functions are used to evaluate Eq. (2), at least for this 
relatively simple case. In addition, there are a number 
of electronic processes which can be expected to con- 
tribute to the inelastic scattering by H5* and it would 
be interesting to obtain quantitative information about 
some of these electronic transitions. 

The integral defined in Eq. (2) is studied for tran- 
sitions from the ground state to any discrete excited 
electronic state. It is found that this integral can be 
evaluated analytically ‘yith respect to all coordinates 
except one. The behavior of this integral as a function 
of K and the orientation parameters is explored and a 

quantitative example is given for one transition. A 


5 


3E. V. Ivash, Phys. Rev. 112, 155 (1958). 

+E. H. Kerner, Phys. Rev. 92, 1441 (1953). 

5D. R. Bates, Kathleen Ledsham, and A. L. Stewart, Phil. 
Trans, Roy. Soc. London A246, 215 (1953). 
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table of this integral, after the appropriate integration 
over nuclear orientation has been performed, is given 
for the transitions 1sa9-2pcu, 1so,-2pru, 1505-25c,. All 
transitions are studied at the equilibrium internuclear 
distance while the 1sc,-2pou case is evaluated at two 
additional internuclear separations. The total cross 
section for incident energies up to 400 eV and fixed R 
is then calculated for these various cases. The effect of 
two different vibrational states for the ground state, 
y=0, 3, is also studied for the 1sc,-2pou transition and 
the corresponding total cross sections are given. 


GENERAL CONSIDERATIONS 


The total cross section corresponding to Eq. (4) is 


Q,— fro ;6)dQ— (8x/k.2) (1/47) 


hehe pe pir ez (e(K,5,5, R)|* 
SEU E m 
kn—En^ 0 0 0 E 


XR? sinôdôdtdRdK, (5) 


X,(R)|? 


where dQ is the solid-angle volume element. Carrying 
out the integration over the orientation angles,} 


|e(K,R) |= (1/4) | | | eG SR)|tsinadbdt (6) 


is found. Defining 


Enc En? 
Q(R)= (8x/k.?) [le(K,R)|2?/K*]dK (7 


kn—kn’ 


and interchanging the order of integration, the total 
cross section becomes 


Qc Í O(R)|X,(R)| RAR. (8) 


In order to obtain Q(R) the integrations defined by 
Eqs. (2), (6), (7) must be carried out for a fixed R. 
Then Q(R) must be found as a function of R to effect 
the evaluation of Eq. (8). 


For the hydrogen molecule ion, Eq. (2) becomes 


«(K,8,R)= I 2 Í "ü f À exp( (iKR/2)Du cosò 


+[0Q?—1)(1—p?) }? sing sind ]) 
XVn Yn” (R/2 O —)dudodh, (9) 
where the coordinate system is shown in Fig. 1. It 
should be noted that for any linear scattering system 


£ can be suppressed: The wave functions to be used in 
evaluating Eq. (9) are given by Bates, Ledsham, and 
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z where 

y= (KR/2) (X — sin)", (13) 

cosy =) cosó/ (X?— sin?5)? , (14) 


Fic. 1. The coordi- 
nate system for H.* 
and its orientation 
with respect to K. 


Stewart? and are of the form 


A—1\i 
Y. | 0*0 nete 2 (=) ] 
T NAH 


XD SPm” lu) etr. (10) 


The primed sum means that only even or odd values 
of t are to be used. The quantities o, p, m are eigen- 
parameters for the nth state and P," is an associated 
Legendre polynomial of the first kind. The product 
Y, Y ,,* appearing in Eq. (9) can be written in the form 


pee Vy *=p2 f(r) exp (ime) 27 Pirm” (n) 
t 


= f(d) exp(imé) 27 Pas), — (09) 


since the product of any two Legendre polynomials 
can be written as a sum of Legendre polynomials. [See 
Ref. 6 for a general formula applicable to the case 
m=0.] The ground state involves only terms of the 
type P2,; hence, the sum in Eq. (11) will have values 
of m and t-+-m identical to those representing the excited 
state. 
In this notation Eq. (9) becomes 


e(K,6,R) — 4x (R/2)* | 2 1"05 
X I : XQ) jac Q)P ntm™(cosy) dd 


Yrs. f i NOE , 02 


atson, A Course of Modern 


6 ittak .N.W K 
E. T Whittaker and G Cambridge, 1958), p. 331. 


Analysis (Cambridge University Press, 
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and j is a spherical Bessel function. The details of this 
derivation are sketched in the Appendix. 


NUCLEAR ORIENTATION 


The qualitative behavior of e(K,ô,R) as a function 
of ô for various final states can easily be determined 
from Eq. (9) or Eqs. (12)-(14). Expanding the expo- 
nential of Eq. (9) in a power series of the exponent and 
then noticing the symmetry of the integral with respect 
to u or $, a large number of terms will be found to 
vanish. In addition, each nonvanishing term in this 
series will have a very restricted dependence on ô. 
Listing the results for the cases studied here, it is 
found all nonvanishing terms involve cos(2k+1)6 for 
final upc, states, sin(2k+1)é for final mpm, states, and 
cos2k6 for final sso, states, where n, k are positive 
integers. These same conclusions can be reached by 
using Eqs. (12)-(14). It is also known that e(K,5,R) 
must be either symmetric or antisymmetric about 
6=0, 7/2, =, since |e(,5,R)|? must be symmetric 
about these points. This follows immediately from the 
symmetry of H; and implies that we need e(K,0,R) 
only over the range 00/2. 

From the above analysis or directly from Eqs. 
(12)-(14), it can be shown that for certain transitions 
e(K,6,R) will vanish identically for a specific orien- 
tation. It is apparent that e(K,0,R) is zero when 0—7/2 
for npc. final states and when 6=0 for sz, final states. 
e(K,6,R) is never identically zero for final states of the 
type sog. This type of orientation selection rule has 
been noticed by other authors and the results here are 
consistent with the conclusions of Dunn’ for the homo- 
nuclear diatomic molecule. 

The quantitative behavior of e(K,ô,R) as a function 
of K and ô is shown in Fig. 2 for the case 1sa,-2pou, 
R=2.0. In the limit of small K, e(K,6,R) approaches 


Fic. 2. The be- 
havior of ¢«(K,6,R) 
shown as a function 
of à for the 1so,-2p0% 
(R=2.0) case and 
several values of K. 


7G. H. Dunn, Phys. Rev. Letters 8, 62 (1962). 
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TABLE I. Values of | e(K,R) |*/K? [sce Eq. (6) ] for the electronic transitions Iso 9-2 pou (R=1.4, 2.0, 3.2), 
lsog-2pru (R=2.0), 155-2so5 (R=2.0) as a function of K. 


1soy-2pou (R=2.0) 1s0-2pou (R— 1.4) 


Lsog-2pou (R=3.2) 


e(K,R) |?/K? 
| 1sog-2so, (R — 2.0) 


1se5-2pr. (R— 2.0) 
sar Rak (AE=0.74176) 


(AE=0.67385) 


K (AE=0.43509) (AE=0.67219) (AE=0.18102) 
0.0 0.3676 0.2296 0.7656 0.3414 0.0 
0.05 0.3666 0.2291 0.7632 0.3403 
0.1 0.3641 0.2277 0.7559 0.3369 0.001091 
0.2 0.3539 5 0.2221 0.7275 0.3237 0.004143 
0.3 0.3378 0.2130 0.6826 0.3031 0.008633 
0.4 0.3166 0.2011 0.6247 0.2765 0.01382 
0.5 0.2915 0.1869 0.5579 0.2461 0.01890 
0.6 0.2638 0.1711 0.4867 0.2140 0.02320 
0.7 0.2349 0.1543 0.4145 0.1818 0.02622 
0.8 0.2058 0.1373 0.3453 0.1513 0.02771 
0.9 0.1777 0.1205 0.2816 0.1233 0.02785 
1.0 0.1512 0.1045 0.2250 0.0987 0.02667 
14 0.1270 0.0896 0.1762 0.0777 0.02456 
1.2 0.1053 0.0760 0.1355 0.0602 0.02185 
1.3 0.0863 0.0639 0.1024 0.0460 0.01886 
1.4 0.0700 0.0532 0.0762 0.0347 0.01585 
1.5 0.0562 0.0439 0.0558 0.0259 0.01301 
1.6 0.0447 0.0360 0.0404 0.0191 0.01047 
1.7 0.0353 0.0293 0.0289 0.0140 0.00827 
1.8 0.0277 0.0237 0.0205 0.0102 0.00643 
1.9 0.0215 0.0191 0.0145 0.0073 0.00493 
2.0 0.0166 0.0153 0.0053 0.00374 
2.1 0.00281 
22 0.0098 


cosine behavior; hence, the differential cross section 
will have a cosine-squared dependence on à. As K in- 
creases, the behavior changes from a cosine curve to 
one quite eccentric, although still possessing cosine 
symmetry. Note that the maximum in ¢(K,é,R) moves 
away from 8—0 and approaches 6=7/2. From the form 
of Eq. (9), when the exponential is expanded in a power 
series, it is possible to say something about the behavior 
of this eccentricity as the internuclear separation is 
altered. As R— 0, e(K,8,R) becomes much less de- 
pendent on K while, as R is allowed to become large, 
its behavior becomes much more dependent on K and 
quite complex in general. This behavior is observed in 
the case where several internuclear separations are 
studied. These general arguments apply to other tran- 
sitions but both the symmetry and quantitative 
behavior can be expected to be different. 

These observations about the dependence of e(K,6,R) 
are useful when the integration of Eq. (6) is performed. 
€(K,5,R) need be evaluated only over the range 0<6 
<r/2, as pointed out above. Since this must be done 
numerically, a considerable saving in labor ensues. In 
addition, since the form of the trigonometric behavior 
is well specified, this gives a strong hint as to the 
method of numerical integration that should be used. 
If one evaluates e(K,ô,R) at a series of equally spaced 
values of à for a given K and then fits these points with 


- a trigonometric interpolation formula,* a simple series 


results which contains only terms like cos(224-1)8 for 


Lanczos, Applied Analysis (Prentice Hall, Inc., Englewood 
S N. J., 1956), p. 229. 


the 1sc,-2po. case. Similar behavior results for the 
other cases. It is a simple matter to then square this 
series and carry out the indicated integration. In 
practice, «(K,6,R) was evaluated for increments of 10 
deg in 6; hence, the series consists of nine terms. This 
procedure is exact if the expansion of the exponential 
in Eq. (9) can be truncated to the first nine nonvanish- 
ing terms. Obviously, as K and R increase, this expan- 
sion becomes less valid. However, for the cases studied, 
a sufficient range of K could be covered so that all 
significant contributions to the total cross section were 
obtained. The resulting values of | ((&,R) |?/K? for the 
various cases are given in Table I. An accurate error 
analysis seems out of the question but it is estimated 
that any error should be confined to the last figure 
quoted. 

One check of the values given in Table I can be made 
through the relationship 


limf(K,R) — f(R) (15) 
K-—0, i 
where 
eK, R)|? 
f(K,R)- nv EE à (16) 


$ 


f(R) is the optical oscillator strength and f(K,R) is 
the generalized optical oscillator strength as defined 
by Bethe? Values of f(R) for the 1so,-2po, case are 


*N. F. Mott and H. S. W. Massey, The Theory of Atomic 
Collisions (Oxford University Press, Oxford, 1952), n 
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available” and are 0.309 (R=1.4), 0.319 (R=2.0), 
0.281 (R=3.2) which are to be compared with 0.3087, 
0.3198, 0.2772, the values of f(R) obtained from Eq. 
(15). The optical oscillator strength for the 1sog-2pru 
transition” is 0.460 which is to be compared to 0.4601. 
Obviously, f(R) vanishes for the forbidden 1sa,-2se, 
transition, as does the limit of Eq. (15) for this case. 
[ Nonvanishing limits were found directly from Eq. (9) 
rather than using Eq. (12).] Agreement is quite good 
for all cases except 1so,-2po, (R=3.2) and the value 
obtained here is within the limits of accuracy (3%) 
claimed by Bates.! 


TOTAL CROSS SECTIONS 


The total cross sections Q(R), defined by Eq. (7), 
were found from the values listed in Table I in the 
following way: For small values of K(K<0.4) Table I 
was interpolated graphically to intervals in K of 0.01. 
Then the integrand of Eq. (7) was evaluated for avail- 
able values of | e(&,R)|?/K? and the integration was 


I sog- 2pTy 
0.5- 
“p o3- 
E 
c L 
O.lr- | 
1 i fi = 
[9] 100 200 300 400 


INCIDENT ENERGY (eV) 


Fic. 3. The total cross section Q(R) for the 1sog-2pru 
(R=2.0) case. 


carried out using Simpson’s rule. In general, Table I 
does not contain values of |e(K,R)|?/K? for K large 
enough to include the upper limit of the integral in 
Eq. (7). However, the contributions to the total cross 
section from this range are quite small and in practice 
this remainder was estimated by fitting the integrand 
with a function depending on K~. j 

The cross section Q(2.0) for the 1sc,-2p-. case is 
shown in Fig. 3. (The equilibrium internuclear distance 
of Het is Re= 2.0 ao.) It is apparent that this transition 
contributes quite heavily to scattering by H+ and 
must be taken into account when considering processes 
where this final state is possible. The 157,259, R= 2.0 
case is shown in Fig. 4. This corresponds to an optically 
forbidden transition and the total cross section 15 


"D, . Chem. Phys. 19, 1122 (1951). 
aL D. R. ue L S Daring, S.C. Hawe, and A. L. Stewart, 


Proc. Phys. Soc. (London) A66, 1124 (1953). 
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Fic. 4. The total cross section Q(R) for the 1sa,-2s0, 
(R — 2.0) case. 


relatively small, as expected. The 1ss,-2pe, case is 
shown in Fig. 5 for three internuclear separations. The 
curve for R—2.0 corresponds to the equilibrium inter- 
nuclear separation and is, therefore, an approximation 
to the Born cross section when the effects of the inte- 
gration in Eq. (8) are ignored. This is, of course, most 
accurate when H;* is in the v=0 vibrational state. The 
other two curves are for internuclear separations that 
correspond roughly to the turning points of the vibra- 
tional state v=3. It is apparent that the cross section 
for this case is strongly dependent on internuclear 
distance for intermediate incident energies. In the limit 
of infinite incident energies, these cross sections ap- 
proach constant ratios of each other; specifically, 
Q(3.2)/Q(2.0) 22.1 and Q(2.0)/Q(1.4) — 1.6. The effect 
of this strong dependence of Q(R) on R is investigated 
below where Eq. (8) is evaluated for two different 
vibrational states. 

The curve for 1sa,-2po. (R=2.0) can be compared 
with the results, ignoring exchange, given by Ivash* 


em Iso -2 poy 
V R 23.2 
\ 
M " 


INCIDENT ENERGY (eV) 


Fic. 5. The total cross section Q(R) for the 1. 
2.0, 3.2) case. The dashed line is from a 
(Refs. 3, 12). : 
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which are shown as the dashed curve” in Fig. 5. To 
obtain this curve, he used wave functions of the form 
W=ua+us, where u is a ls screened hydrogen wave 
function. His results are uniformly above the present 
calculation of Q(2.0) by 20% or more in this energy 
range and in the high-energy limit will be greater by 
17%. The difference between the two calculations is 
not large but the present treatment has the advantage 
that all aspects of the derivation of the first Born 
approximation remain valid as a result of using the 
exact Hst wave functions. A direct comparison with 
Kerner’s calculation does not seem justified since he 
treats a specific rotational and vibrational excitation 
in conjunction with the 15e,-25c, electronic transition. 
However, it is interesting to note that his cross section 
is roughly an order of magnitude less than Q(2.0). 

To study the effect of vibration on the total cross 
section, it is necessary to evaluate Eq. (8). If a diatomic 
molecule is in its lowest vibrational state (v—0) and 
it is assumed that Q(R) varies slowly with R, it is a 
good approximation? to replace Qo with Q(R.), since 
|Xo(R)|2 will be strongly peaked at the equilibrium 
internuclear distance Re For the transition under 
consideration, Q(R) is seen to be rather strongly de- 
pendent on R, so it appears that Qo must be evaluated 
without using the above approximation. In addition, 
it is known that Hot is often observed in excited vibra- 
tional states which have a rather long lifetime. Hence, 
Q, will be evaluated using Eq. (8) for the vibrational 
state y=3. This state is chosen because it is a highly 
probable vibrational state for Hs* if this ion results 
from the ionization of H» in its ground electronic and 
vibrational state. 

The integral defined by Eq. (8) was estimated in the 
following way: Q(R) for a given incident energy was 
fit by a polynomial in R. Since Q(R) is known for only 
three points this polynomial was necessarily a quadratic. 
Then, using the vibrational functions given by Cohen, 
dd and Riddell,’ Q, was evaluated by Simpson’s 

e 


The results for Qo, Qs are shown in Table II along 


TABLE II. The total cross sections Q(R) and Q, (in units of 
mao?) for the 155-252, transition tabulated as a function of the 
incident energy. 


Q(1.4) QQ.0  Q(32 ^ Qe Os 


50 0.4200 0.977 3.25 1.09 1.92 
100 0.304 0.635 1.92 0.698 1.17 
200 0.197 - 0.396 1.11 0.431 0.691 
300 0.148 LER 0.795 0.320 0.502 
400 0.120 0.230 0.612 0.248 0.383 


12 Tyvash's calculation has been extended to include the energy 
range of interest in this investigation. 

13 S, Cohen, J. R. Hiskes, and R. J. Riddell, Jr., University of 
California Radiation Laboratory Report No. UCRL-8871, 1959 
(unpublished); S. Cohen, J. R. Hiskes, and R. J. Riddell, Jr., 
Phys. Rev. 119, 1025 (1960). 
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with Q(1.4), Q(2.0), Q(3.2). It is interesting to note 
that Qo is about 10% higher than Q(2.0) and that Q; 
is almost 70% higher at 400 eV with the deviation 
increasing as the incident energy decreases. Hence, to 
predict the inelastic electron scattering into the 2c, 
state, it is necessary to know precisely what vibrational 
levels are populated, and to what degree. It does not 
follow that scattering into other final electronic states 
will show such marked dependence on », but points 
out the necessity for caution before neglecting the 
initial vibrational state. Relationships equivalent to 
Eq. (8) can easily be derived for the differential cross 
section and the generalized oscillator strength. The 
effects of v on the differential and total cross sections 
can be expected to be quite similar. However, using 
the information in Table I, it can be seen that f(K,R) 
has a maximum near Re for the 1sc,-2pc, case; hence, 
the generalized oscillator strength will be much less 
dependent on v. 
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APPENDIX 


Substituting Eq. (11) into Eq. (9), integrals of the 
type 


nr= J [ I ^ fO) expt @KR/2) 


X Du cosó4- (42— 1)!2 (1 — 1)? sing sinô] 
—imd}Prim™(u)dbdudd (A1) 


are obtained. Utilizing the integral 
ata 
Jj ei(nd—Z sins)diy — 25], (Z) , (A2) 


where J, is a Bessel function of the first kind, 


ne | I f(A) exp (KR/2)Au cosô)} 


XI [SK R(2—1)2(1—pu2)!? sinô] 
XP" (u)dudN (A3) 


is found. Gegenbauer’s finite integral!® can be rewritten 


( di S. a Watson, A Treatise on the Theory of Bessel Functions 
mbridge University Press, Cambridge, 195 . 20. 
15 Ref. 14, p. 378. apnee ee 


"EE 
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in the forn 


1 
J exp(iyu cosy) J »[y(1— 42)! ? siny Pep m™ (udu 

es 
= 2i'Pirm™(cosp)jrrm(y), (A4) 


where j,(x) is a spherical Bessel function defined by 


jn (x) oa (1/2)? I , yo (xr) . (A5) 
Let 
y= (KR/2) (9 —sin?*6)? (A6) 
cosy =À cosó/ (X— sin?6)?; (A7) 
then 
sinj.— (\2— 1)'? sin8/ [A^ —sin?ó |. (A8) 


Substituting Eqs. (46)-(A8) into Eq. (A4), we find an 
integral of the same form that appears in Eq. (A3), 


A883 


and it follows that 


niei | fO)jrem(y)Perm™(cosp)dr. (A9) 
H 


Using these results, Eqs. (12)-(14) are easily verified. 

Assuming that Eq. (A9) cannot be evaluated analyti- 
cally, the following numerical scheme was used for this 
purpose. A program for a digital computer, the Control 
Data Corporation's 1604, was written which carried 
out the integration of Eq. (A9) for a given choice of 
K, à, R by means of Weddle's quadrature formula. 
With these values it is then possible to evaluate Eq. 
(12). The sum in Eq. (12) was usually truncated to 
three terms, although four terms were used in some 
cases. The same program was used to evaluate e(K,6,R) 
and carry out the integration of Eq. (6). 
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AS=-+1 Magnetic Multipole Radiative Transitions* 


Masataka MIZUSHIMA 
Department of Physics and Astrophysics, University of Colorado, Boulder, Colorado 
(Received 20 December 1963) 


The magnetic multipole transition probability is calculated in terms of the matrix elements of the mag- 
netic multipole. The magnetic jm moment Qj; (79 is defined as 


(e/a) Etr Qj-1) 1? Z (v rd Ys) E G-- 0l si, 


where e and y are electron charge and electron mass, r;, l;, and s; are the coordinate, orbital angular momen- 
tum, and spin-angular momentum of the ith electron and Y jm is the spherical harmonic. Magnetic quadrupole 
and octupole moments are explicitly given. It is shown that for the 3Z,* <> 12,7 transition of the hydrogen 
molecule, the magnetic quadrupole transition is more important than the conventional spin-orbit electric 
dipole transition. The magnetic octupole transition has the same order of magnitude as the spin-orbit mag- 


netic dipole transition. 


INTRODUCTION 


HE transition between states with different multi- 

plicity is a very weak one. The corresponding 
emission line of some ions were first found in the spectra 
of some nebulas! and the theory has been given by 
Condon and other people.? This type of transition has 
been investigated for many ions since then, a brief 
review can be found in Garstang’s article. In all these 
theories the mixing of different multiplicity states 
through the spin-orbit interaction was assumed to be 
important. The 1D2 state of O III, for example, has 


* Supported by National Science Foundation. 
1T. S. Bowen, Astrophys. J. 67, 1 (1928). pee 
2See C. U. Condon and G. H. Shortley, The Theory of Atomic 


Spectra (Cambridge University Press, New York, 1935), pp. 282- 
283 


3R. H. Garstang, Atomic and Molecular Processes, edited by 
D. R. Bates (euet Press Inc., New York, 1962), Part 1. 


0.0074 of 3P; wave function mixed through the spin- 
orbit interaction, and the transition to a triplet state can 
occur through this small part of the wave function. 
Most of the phosphorescences of molecules are inter- 
preted as due to the singlet-triplet transition.* McClure 
frst suggested the transition through the spin-orbit 
interaction for these molecules." The present author and 
Koide formed a theory starting with the Dirac equation 
of electron and calculated the singlet-triplet transition 


probability of the benzene molecüle.5 The result with se | 


some approximation gave abot 1 sec for the lifetime of 
the lowest 3B», state of this molecule. Hameka and 
Oosterhoff/ refined the calculation and showed the 
lifetime of the same state to be 190 sec. Calculations for 


* M. Kascha, Chem. Rev. 41, 401 (1947). 
5 D. C. McClure, J. Chem. Phys. 17, 665 (1949) 


* M. Mizushima and S. Koide, J. Chem. Phys. 20, 765 (1952). 
1H. F. Hameka and L. J. Oosterhoft, Mol. Phys. 1, 358 (1958). 
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other molecules have also been done following the same 
idea, and they are briefly reviewed by Garstang.’ 

In this paper a completely different mechanism, 
namely, the magnetic multipole transition is proposed. 
If the probability of the electric dipole transition is 
P,©, the conventional theory of AS==+1 transitions 
gives the probability of about (fw/ uc? P, for the case 
of spin-orbit electric dipole transition, since the spin- 
orbit coupling is a first-order relativistic correction to 
the Coulomb energy which is about w. fiw is the photon 
energy which appears or disappears in our transition 
and uc? is the rest-mass energy of the electron. It will be 
shown that the magnetic quadrupole transition has the 
probability of the same order of magnitude, namely, 
(o/c?) Py. In the same way one can show that the 
spin-orbit magnetic dipole transition has the same order 
of magnitude as the magnetic octupole transition. 


MASATAKA MIZUSHIMA 


MAGNETIC MULTIPOLE TRANSITION PROBABILITY 


'The theory of the spherical photon is given in 
Akhiezer and Berestetsky's book.® They showed that 
the electric field for a photon which is the eigenstate of 
the total angular momentum with eigenvalues j and m, 
and the eigenstate of the energy with eigenvalue w and 
at the same time the eigenstate of the parity operator 


with eigenvalue (— 1)7*!, is 
E,;5— £j Soin"; (1) 
£u jn (fro? 1615 e907) Pe tg (or/c)Y;;m(r/r), (2) 
where e is the permittivity of the vacuum and 
gj(wor/c)= (Qr)! js or/c)/(or/c) ^, — (3) 


with the Bessel function Jj41/2, and Yjjm(r/r) is the 
spherical vector given by the spherical harmonics 


Y jm(x/7) as 


COH (j—m--1) ]4Y mit EG m) Go md- 1) JY jms 
Y; (r/r) i (j4-m)(j— m4-1) PPV 5, a— iL Gm) GA ma 1) ^Y jn [45(74-1) 1^. (4) 


2mY jn 


There exists another state with the same w, j, and m 
eigenvalues, but with the parity of (— 1)7. Such state is 
called an electric state while our state is called a mag- 
netic state. In this paper we are interested in the 
magnetic states only. 

If we take the Coulomb gauge we see from (2) that 
the vector potential A..;n is 


Åojm= Asm tF Aum > (5) 
Aajm= d (fto 1673622) 2e- tg (or/c)Y;;.(r/r). (6) 
Note that the normalization is such that 
JES Earm dT = zho l(w— ow) jimm , 
(7) 
[5 Beyatira0= | Baie Sy jm . 
Thus, if we expand a given field as 
E=F | F(ojm)8,;,do/c^ 
jm 
TO F* (jm) &ujm*dw/ol?, (8) 
HS 


we see the field energy U is 


im j'm' 
FF! jm) 85jm* wj v *dodo'dr/c 
[ror (wjm)F*(wjm)do , (9) 


which indicates that F(wjm) and F*(wjm) are annihi- 
lation and creation operators for the (wjm), photon 
respectively, 
(vojm| F*(@jm) | vojm— 1) 9 (Veojm/dw)'? 
= (Yajm—1 | F (ce jm) | vam) (10) 
where vojm is the number of the (wjm) photon. 
The probability of transition 771 €» move 1s? 

P (nv © nove) 

= (1/7) | i| H i| 2v2)|°6(@rv2—@ nin), (11) 
where nı and n: are molecular states, v; and v» are 


photon numbers, 
Gay (Ea— Ey)/fh (12) 


H— (e/u)3, [pz Ai—si- (v xA], (13) 


and 


where p; and s; are the momentum and spin of ith 
electron, respectively. A; is the vector potential at the 
position of zth electron. e and y are the electron charge 
and mass, respectively. In H; the contributions of nuclei 
are neglected. 

In the same way as in (8) we set 


A; F (o jm)-A5; s (r;)do/ c? 
im 
XX | F*@jm)Aaim*(ti)do/o? (14) 
im 


*A. I. Akhiezer and V. B. Berestetsky, Quantum Electrody- 
ren Fiz. Mat. Lit, Moscow, 1953) (English transl.: AEC- 
5 CRTA ample, L. os and E, B. Wilson, Jr., Introduction 

7 echanics (McGraw-Hi 
Mock. 1935), See 43. Taw Book Company, Inc., New 
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in (13). If 
yj—y—1, V27 V (15) 
in (11), we have 


P(niy —1 e nov) 


= (n/W)(vosjm "9 /c) | (m| (e/u)27 L— pi anim 


Fsi (VXAdajn) || n2? ; (16) 
where 

0077 Onin (17) 
and veim™ is the number of the (magnetic jm) 
photons with frequency wy~wotdw. 

From (6) we see 
(Jio / 1613369)! e 59^ !g (wor s/c) 
Xri?(nxYj;s.)l, (18) 

where l; is the orbital angular momentum of ith 


particle. 
Using formulas 


Di Augjm= 


gri (tix Yjjm)=— vaYLiG-0]^ (19) 


and 


VEIY ;jn[ (j- 0/j]^, (20) 
which can be shown from (4) we have 


—pivoym Sit (V Xam) 
= (ficoo/ 16236269) e o! V g nV jm 
EGG D)72L4-CG4-0/3Y^s;].. (20) 


One can define the (magnetic jm) moment as 
Qj, C9 = (e/u)[ 2.j4- D/4v ^ Q j— 1) Gov c) * 
XE (wei me [GHD I s:] 


v XgiY;n- 


&(e/u)[4v/ Qj4-1) ] ^ 
XX (Vr: Y jm): [G4-1)!L-s;], (22) 


where the last expression is obtained when 


wors/6K1 " (23) 
so that 
gioor/c) an Gowr/)/ Q.j4-1)t- (24) 
Note that - 
Quo? = (e/u)2. (ese 252) E (25) 


which is the familiar magnetic dipole moment. 
From (21) and (22) we see 


D» [—piAaoim+Si° (v XAusin) J(e/ 2) 


=[G+1)/7@77 1)1^[Qj— iis 
xe i! (ioo/c) Qiu "9 / 2a. (26) 


MAGNETIC MULTIPOLE 
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Thus, from (16) the probability of the (magnetic jm) 
moment transition is 


18 (mg) (m mo na) 
= (j+ DCI (j+ D (27-1) }? vam og 
X (arei heo) | (1110m? |n))]*. (27) 


ELECTRIC MULTIPOLE TRANSITION 


The electric multipole transition can be discussed in 
the similar way. Since it is not the main purpose of the 
present paper, we will just show the result. 

The only difference is to start with a photon wave 
function which is the eigenfunction of the total angular 
momentum and energy with eigenvalues j, m, and hw, 
while the parity is (— 1)7, instead of (— 1). 

The (electric jm) moment is defined as 


Oim= Ds e[in/Qj-1)]^réY;., (28) 


where we should include both electrons and nuclei in 
the summation, and e; is the charge of ith particle. This 
definition gives 


Ow? — ^; e, (29) 


which is the familiar electric dipole moment. 
The transition probability is 


P jm? (n € 12) 

= jG--OEQj4- D j— D) T venm Mea 

X (4x) | (| Qs "P n2) |?. (30) 

Since the matrix element of Q jm(™® is about (e/u)ro? h 
as seen from (22), while that of Q;4*P is about ero? as 
seen from (28), taking ro for the atomic dimension, we 
obtain the order of magnitude of P;,"9 by means of 
Ped as 


Pint = (uoi h/f Papin ™® [vagi P) PP. (31) 


SPIN-ORBIT ELECTRIC OR MAGNETIC 
DIPOLE TRANSITION 


The conventional theory of AS=-£1 transition is to 
take into account the effect of the spin-orbit interaction 


H= —i(eh/2,/ €)? s: (Ex v); (32) 


Since this interaction is similar to Qz4, (79, which is. 


given by (33), it can connect a inpet to a triplet. Xem. — 


Jet IL (e1Ms| Has 000) Es 


c, Ms 


EJ|dMs) 


er (400| E. | 1M s)(Es—Ea)*|d00) i 


CREDIS. 
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thus from (30) 
Pimso = (vain Vw / 6rcl?co) 


X| X; (a00| Hso|c1M s) 4 — E) 


c, Ma 


X (c|Qi G9 |b)-+ Z (d00| Heo| 01M s) 


X (Es— E (a| Qin? |d)|?. (34) 


Since the order of magnitude of the electric field E is 
(electric energy)/ro or uworo, we see the order of magni- 
tude of the matrix element of Heo is 7teo/uc? as expected 
from the fact that the spin-orbit interaction appears as a 
relativistic correction. Thus, the transition probability 
due to this mechanism is 


Pi so (9D E (Zicoo/ uc?* P CP $ (35) 


From (31) we see that the spontaneous emission 
probability through the magnetic quadrupole transi- 
tion, which is P¿™® with v— 1, gives exactly the same 
expression as (35). 

In some cases the electric dipole transition is still 
forbidden and only the magnetic dipole transition can 
appear with the spin-orbit coupling. The transition 
probability for such case has the order of magnitude 
given by 

Py oD (fies/uc? P.C . (36) 


From (31) we see 


Pm) = looro / cy p Ong) 
= (tooucooro"/ uc?) P, C09 
= (fioo/ uc? P3079 , (87) 


which shows that the magnetic octupole transition is as 
important as the spin-orbit magnetic dipole transition. 

Somehow the magnetic multipole transition has never 
been discussed, but it should be as important as the 
spin-orbit electric or magnetic dipole transition for 
AS==1 transitions. 


MAGNETIC QUADRUPOLE MOMENT 


From (22) with ;—2 we obtain the five components 
of the magnetic quadrupole moment. They are 


Qon 099 = Dom (79 Qo, m9. (38) 
Qim 9 = (ef HL Lgs? "5i tiga (Siz tts iy) 
B & 
rati Gala (39) 


while Qomi™® is obtained by replacing s by 1/3 in (33). 
Coefficients g’s are tabulated in Table I. 

Let us assume a two-electron system. Such system can 
be either a triplet or a singlet, if we neglect the spin- 
orbit interaction: The wave function of each state is a 

- product of the orbital part and the spin part which can 


r 
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TABLE I. Coefficients q?". 


m qz" qam Ge 
—2 0 0 — 6! (x —iy) 
-1 (Q2)-i») 9 cipe 

0 2z A —4€—21y 

1 3/2) * (4-3) " 

2 SEA 0 61? (x -2y) 0 


be designated as | ¿SM s). It is easy to see that 


(a00 | em enr go2”"522| 510) — (n/2) (a | Gea ga?"| b) J 


(a0 | qrsa tH gaso] bie 1) on 
= (n/2) (a | q427— q24?"| b) , 


(a00 |q 2”sı +92 ”s2]011)= (1/2)(a| n" gs" |b), 


are the only finite matrix elements. We see that these 
matrix elements are not generally zero because the 
orbital part of the wave function |a) and |b) are sym- 
metric and antisymmetric, respectively, under the ex- 
change of two electrons. Since, as shown by Mizushima 
and Koide,’ the corresponding matrix elements of all the 
electric multipole moments and the magnetic dipole 
moment are zero, the magnetic quadrupole moment is 
the lowest moment which has finite matrix elements 
between states with different total spin. 
If we assume that 


|a) 2 N[u(1)»(2)4-(1)« (2) ], 


|5)- N Ew Q)v Q) — 0) 2); 4 
where 
SP 14 | [xa 5 
: (42) 
ya-di- [[«eveoan] | > 
we obtain 
(a|gı—g2|b) 


= 2NN'L(o| 0’) Qui] g |i) — (ulw) o] a] ^) 
+(u|v')@|q|u’)— olw) ulgi]. (43) 
MAGNETIC OCTUPOLE MOMENT 


The magnetic octupole moment Qsm™® can also be 
divided into two parts: 


Q3, 008 = Qomi TDF Q5, 9 , (44) 

Qams™® = (e/ 2x [qi"5;- 30437 (si2--dsiy) 
+3q:2"(siz—tsiy)], (45) 

while Q3mi°"® is obtained by replacing s by 1/4 in (45). 


Coefficients g’s are tabulated in Table IT, 


For the matrix elements we see that formulas (40) and 
(43) holds for g”. 
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TABLE II. Coefficients œ”. 


m q?” q?” q^ 
—3 0 0 — (45/4) (x — iy) 
—2  (15/8)/P(a—iy)? 0 —30'/22(x—iy) 


—1  12¥(x—iy) (45/4)\2(e—iy)?  — 318218) 


0 (92?—3r?)/2 3z(x—iy) —3z(x-Fiy) 

1 12/(x-Fiy) 312(372— 72) — (45/4)? (x-Fiy) 
2  (15/89)À(x--iy? | 30U*z(x--iy) 0 

3 0 (45/4)? (xiy)? 0 


HYDROGEN MOLECULE 


The lowest triplet state of the hydrogen molecule is a 
dissociative state 35,;+. Since it is a dissociative state the 
lifetime of this state due to the spontaneous emission 
cannot be measured. The calculation in this section is 
just to show that the magnetic quadrupole transition 
can be much greater than the spin-orbit electric dipole 
transition. 

The Heitler-London wave function? is well known to 
give a good approximation for the ground state 'Z,* and 
the lowest triplet state *2,*. A large number of better 
wave functions are proposed for these states? but for the 
present purpose of rough estimation the Heitler-London 
wave function will be good enough. 

'The Heitler London wave functions are obtained by 
identifying 


u(i)=w (i)=Yis(rai) , 
v(i)=v (t)=Yis(r0i), 


where fai and rẹ; are distances between the ith electron 
and the protons a and b, respectively, and y; is the 
hydrogen wave function. From Table I it is easy to see 
that the only finite matrix elements of the magnetic 
quadrupole moment are 


(46) 


(12,*00|Q»9 |32,*10) = 4(e21/24u) N.N rao, 
(12,*00|Q;, C9 |32,1— 1) = 24/2 (eh/2u) NN ras, (47) 
Q2,*11|Q; , 09 [12,*11) = 247 (eh/20) NN roo, 
where rap is the internuclear distance. With the ap- 
proximation of V.,=N_=27” we obtain 
Pome) (xp Ky 3,10) 
= (2/15) ve0° (Arche) (eh/ 2u raw. , 


Po (I5, * 3p 2-1) 
= Spes (Arche) eh] Durus, 


(48) 


which shows that the magnetic quadrupole transition is 
more effective as the internuclear distance increases, as 
far as aora» « c. s 

For fhe spin-orbit interaction one can divide the 
electric field E into two parts: The part due to the 
nucleus to which the electron under consideration 1s 
attached and the part due to the other atom. Assuming 
that the charge distribution is spherical around the 
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other atom we have, 
E= E+ Ej (d&b,/dr ai) fai/Tai- (dby/dr,;) Fpi/T5i (49) 


where Y, and $, are potentials due to each part men- 
tioned above. Now 


rai X V= (ifh)lai, reix V= (i/HlaitHreax V, (50) 


where la; is the orbitat angular momentum of the ith 
electron with respect to the nucleus a. 
Using (46) we have 


lau(i)=0, (51) 


V 


while 
(1s| ry, x V | 2p2)= (16/227) (ros /r)i, 


(1s| roa x V | 2py)= — (16/227) (rsa/rB)i, 
where 1 and j are unit vectors along the x and y axes, 
respectively, and rg is the Bohr radius. 

If the orbital part of the wave function of the 

! TIL, (15257) state is 

|11,)— [Vas (1)254 (2) Jas (2) 5 (1) ]/27? , (53) 
we find 
(HI EL HS 32,1251) 

= (4/27) (eh / 22) (d®,/drpjry (roa/TB) (h/2) 

= (Z/2T xe) (eh/2uc)*(ravre) , (54) 
where Z is effective charge of the second atom for the 


electron in the first atom. Z is less than one and goes to 
Zero as fap increases. Since 


(Mw | Quai [12 5°) = (25/359)r pe, (55) 
we see from (34) that 
fixi s = (27/107 )vw e'h’ (utc e) (Ara?)*, (56) 


(52) 


where 
A= EH) ECZ). (57) 


In obtaining this formula we assumed that the main 
contributions come from the mixing of the 'II.,(1s2p7) 
state to the ?E,* state. Note that formula (56) shows 
that the transition probability in this case decreases as 
rap increases as (Z/ra:7)°. 

By comparing (56) to (48) we see 


Pm) / Py sale? 
= (10/Z2) iwo/ Reh)? (A/Rch)?(ras/ra)®, (58) 


where R is the Rydberg constant. In our case 


Z?~1/10, 
7to/ Rch— 172, 
A/Rch~ 1/4, i 
rap/TBg— 1.4, 


at the equilibrium distance of the ground state. Thus 
the above ratio is about 10 in favor of the magnetic 
quadrupole transition over the conventional spin-orbit 
electric dipole transition. When the internuclear dis- 
tance rap increases the ratio will increase very rapidly. 
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SELECTION RULES 


The two comparative mechanisms, the magnetic 
quadrupole transition and the spin-orbit electric dipole 
transition have slightly different selection rules. For the 
case of AS— 2-1 transitions which we are interested in 
here the selection rules for the magnetic quadrupole 
transition are just like those of the electric dipole 
transition, or ; 


AL=0, +1, but L=0 0; parity change, (60) 


while those for the spin-orbit electric dipole transition 
are 
AL=0, +1, +2; parity change, (61) 


in the case of atoms with Z-S coupling. Thus, for ex- 
ample, 1$ 3$ and 5S e» ‘D transitions can only be 
explained by the spin-orbit electric dipole transition. 

Transitions with parity no change cannot be ex- 
plained by either of them, and we have to go to the 
magnetic octupole transition or the spin-orbit magnetic 
dipole transition. Their selection rules for AS=+1 
transitions are 


AL=0, +1, +2, but L=0 0; 
parity no change for magnetic octupole, (62) 
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AL=0, +1, +2; parity no change for 
spin-orbit magnetic dipole. (63) 

Selection rules for molecules can be found by finding 
those for the electric dipole and the electric quadrupole 
transitions. The selection rules for the magnetic quad- 
rupole and the magnetic octupole transitions are the 
same as those respectively. For the benzene molecule, 
for example, the lowest triplet state 3 B4, is known to be 
able to go to the ground state 14,, by the spin-orbit 
electric dipole transition*:? but we see that the magnetic 
quadrupole transition is forbidden for this transition. In 
the case of the naphthalene and anthracene molecules, 
on the other hand, the lowest triplet state is assumed” 
to be 3B»,, and the magnetic quadrupole transition is 
allowed from this state to the ground state 1A 1g. The 
lowest triplet state of the benzene molecule is observed"! 
to have the lifetime of more than 300 sec, while in most 
aromatic molecules the triplet lifetime is accepted to be 
about 1 sec. This may be explained by the above con- 
clusion, since our analysis on the hydrogen molecule 
showed that the magnetic quadrupole transition must 
be more important than the spin-orbit electric dipole 
transition in the case of large molecules. 

10 R, Pariser, J. Chem. Phys. 24, 250 (1956). 

1 D. P. Craig, J. M. Hollas, and G. W. King, J. Chem. Phys. 29, 
974 (1958). 
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Absolute Electron Excitation Cross Sections of Helium* 


Rozert M. Sr. Jonn, FRANK L. MILLER,t AND Cuun C. Lint 
Deparlment of Physics, University of Oklahoma, Norman, Oklahoma 
(Received 16 December 1963) 


The absolute apparent electron excitation functions of helium have been measured for the 3 1S, 41, 5 1S 
61S, 3 1P, 41P, 31D, 41D, 51D, 61D, 34S, 43S, 53S, 33P, 33D, 43D, 54D, and 64D states at pressures 
sufficiently low so that the effects of radiation imprisonment and collisional excitation transfer can be 


neglected. Corrections due to polarization of the rad 


iation and the cascading from the upper excited states 


have been applied to the experimental data and the true excitation functions obtained. The peak values 
and the shape of these excitation functions are compared with the results reported from other laboratories 
Generally good agreement is found with the previous works where the collisional excitation transfer TeS 
properly reduced and allowance was made for the cascading effect. The experimental cross sections show 
satisfactory agreement with the theoretical values calculated by the Born approximation for the 1S and !P 


states at high-electron energies, but are about four 


times larger than the theoretical values for 1D states. 


In the case of the triplet series, the experimental cross sections ex 
: xceed the calculated values by factors of ten 
to one hundred or more. It is concluded that the population of the triplet states is produced mus by proc- 


esses other than direct excitation. 


I INTRODUCTION 


JD ECENT studies of electron excitation of helium 
atoms indicate that some of the atomic states may 

be populated to a large extent by collisional excitation 
transfer and cascading in addition to the usual electron- 


-— —* Supported by the U. S. Air Force Office of Scientific Research, 
srant AF-AFOSR-252-63. 

a Present address: Fort Lewis A. & M. College, Durango, 
olor 


o. 
1 : Alfr ed P. Sloan Foundation Fellow. 


impact excitation process.!? The excitation transfer and l 
its concommitant effect on cascading may be reduced l 
and even eliminated if measurements of the excitation i 
are made with the helium gas at low pressure, i.e., about | 
1 or less. Furthermore the populations of the !P states | 
are affected by imprisonment of resonance radiation at l 
pressures above a few tenths of a micron and therefore l 


R. M. St. John and R. G. Fowler, Phys. Rev. 122, 1813 (1961). 
C. C. Lin and R. M. St. John, Phys. Rev. 128, 1749 (1962). 
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measurements on them should be made below that onset 
pressure in order that the imprisonment mechanism not 
obscure the electron-impact excitation process. Cas- 
cading effects cannot be entirely removed by reduction 
of the gas pressure as excitation of the upper levels goes 
on at all pressures. Thus, in the determination of the 
electron impact excitation cross section, considerable 
care must be exercised in subtracting from the total ob- 
served (apparent) cross section the contributions from 
the various other excitation mechanisms. Indeed, the 
effects of collisional transfer'-? and imprisonment are 
partly responsible for the disagreement of the electron- 
excitation cross sections reported in the early literature 
as well as the discrepancy between them and theoretical 
cross sections. 

With the recent improved experimental techniques 
it is possible to work at pressures low enough to suppress 
the effect of excitation transfer when observing all but 
the weakest transitions. In this paper we shall present 
the apparent excitation functions of 18 excited states of 
helium measured at low pressure. The cross sections for 
direct electron excitation are then obtained after the 
application of a series of corrections. The results of 
several experimental investigations will be compared. 
Additionally comparison of these cross sections with 
the theoretical values will be made and discussed. 


II. EXPERIMENTAL METHOD 


The apparatus for measuring the relative excitation 
functions has been described in a previous paper.* Abso- 
lute values of the excitation functions were obtained by 
calibration against a standard lamp as previously de- 
scribed.*5 The lamp was a tungsten ribbon filament 
pyrometer supplied and standardized by the General 
Electric Company. The lamp was operated at 3 tem- 
peratures in the range from 1400 to 1700°K and its 
emission calculated within a small wavelength interval 
for the wavelength of each transition observed. Emis- 
sivity tables for tungsten determined by Larrabee were 
used. The sensitivity of the detection system was de- 
termined with the three-lamp temperatures. The maxi- 
mum deviation of the sensitivity from the mean value 
at a given wavelength was generally less than 5%. 

The solid angle viewed by the monochromator was 
determined by the use of a small variable size circular 
diaphragm which acted as the aperture stop. Transition 
probabilities used in this paper for evaluating the 
density of a given excited state by means of observation 
of the intensity of light emitted from it are those tabu- 
lated by Gabriel and Heddle.’ 


3 C. C. Li d R. G. Fowler, Ann. Phys. (N. Y.) 15, 461 (1961). 

AR. Se SETHE, C. C Lin: R. L. Stanton, H. D. West, J. P. 
Sweeney, and E. A. Rinehart, Rev. Sci. Instr. 33, 1089 (1962). 

5R. M. St. John, C. J. Bronco, and R. G- Fowler, J. Opt. Soc. 
Am. 50, 28 (1960). 

RD Lawabes J. Opt. Soc. Am. 49, 619 (1959). 

? A. H. Gabriel and D. W. O. Heddle, Proc. Roy. Soc. (London) 


A258, 124 (1960). 


III. ANALYSIS 


The equation relating population gain and loss rates 
per unit volume of the jth state of an atomic system is 


ODN (g)/eS H-22; N (iA (15) + transfer gain 


Electron impact Cascade 
gain gain 
=N (j)A (j)+ transfer loss, (1) 
Cascade 
loss 


where Q( j) is the cross section for excitation by electron 
impact from the ground state to the jth state, /, js the 
electron-beam current passing through the collision 
tube, S is the cross-sectional area of the electron beam, 
and e the electronic charge. The symbols N (g) and N ( j) 
stand for the densities of the ground and jth states; 
A (ij) is the probability of transition from the ith state 
to the jth state, while A(j) is the total probability of 
transition from the jth state to lower states. In the 
above equation we have neglected the effect of im- 
prisonment of radiation. This follows from the assum- 
tion that observations of !P functions, which are directly 
affected by imprisonment, are to be made at pressures 
low enough that imprisonment is negligible. 

At low pressure the transfer excitation terms in 
Eq. (1) can be neglected. When this is done, one obtains 


ODUN (g/eSH-3; N@AG)=NGQ)AG)- Q) 
It is customary to introduce the branching ratio B, 
B(jk)=A(j)/AGR) , (3) 
so that Eq. (2) can be rewritten as 


ODUN (g)/eSJ+ Xi N G)A Gj) = BGR)N G)A Gh) . 
(4) 


Here N(j) A(jk) is the rate at which energy is 
emitted in connection with transition j—>k and is 
determined experimentally from the photomultiplier 
current /p(0,j) measured at an angle 9 relative to the 
electron beam. Since the radiation generally is not iso- 
tropic, a polarization correction factor fp(9 7), is intro- 
duced so that /P(0j)fP(8 j) is proportional to the 
angular average of the light intensity and thus allows 
the determination of the entire light flux emitted from 
the collision chamber. Accordingly we have 


NG) GE) HCL ONLI » (5) 


where Ci(j) is a proportionality constant determined 
from the calibration proceedure using the standard 
lamp. For the purpose of comparison with the previous 
data, we define Q’(j) and Q” (j) as 


^ 


I.N( ) LAC ) bs Ne 
vG) = =0(;) — +E NOAG), (6) 
0" (6,0 Gf») 2 : 


=eSC(jM p0, BOIN (g). (T) 
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A890 ST. JOHN, 
All the factors making up Q" (6,7) are obtained by ex- 
perimental observation except B (j) which is determined 
from the theoretical transition probabilities.’ In fact, 
Q"(0,7) is the cross section which has been reported 
traditionally by observers of excitation functions. We 
shall call it the cross section uncorrected for polariza- 
tion, cascade, and transfer effects; or simply the 
apparent cross section. Likewise, Q'(j) can be regarded 
as the observed cross section which has been corrected 
for polarization but not for cascading and transfer 
eflects, and is obtained from the experimental Q” (j) 
_ cross section. : 
The polarization correction factor? is 


fr(0,j)=3{[300—P(j)1/[100— P(j) cos? , (8) 


where P(j) is the percentage polarization. 


IV. EXPERIMENTAL RESULTS 


'The apparent excitation functions of 18 states as 
measured directly from the automatic processing appa- 
ratus are shown in Fig. 1. The graphs represent relative 
values. After the calibration procedure described earlier 
was followed, the absolute values of the apparent cross 
sections were obtained; the peak values are listed in 
Table I. 

The apparent excitation function of the 3'P level 
displayed in Fig. 1 was recorded at a pressure of 5.9 y. 
'The shape of this curve was quite independent of gas 
pressure. Data obtained at pressures as low as 0.1 u 
were used to evaluate the peak apparent cross section 
of 350X10” cm? listed in Table I. 

The excitation function obtained at 4025-6 A repre- 
sents the sum of two-line functions. These are 4026 A 
(53D— 23P) and 4025 A (71S—21P). One can 


TABLE I. Maximum values of the apparent excitation cross 
sections (low-pressure values). 


Transition Maximum 
observed cross section Ql 

Level (A) (107? cm?) 
31S (7281) 49 
41S " (5047) 24 
S (4438) 9.2 
61S (4170) 4.8 
31P (5016) 350 
41P (3965) 159 
31D (6678) 42 
41D (4922) 17.6 
51D (4387) 9.0 
61D : (4144) 4.7 
38S) s (7065) 107 
435 * (4713) 35 
SBS (4121) 12.3 
QJ? (3889) 97 
33D (5876) 31 
43D (4471) 12.0 
53D (4026) 6.2 
63D 1 (3821) 3.9 


8T, C. iasciseil and M. J. Seaton, Phil. Trans. Roy. Soc. 
(London) A251, 113 (1958). 
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obtain an excellent approximation to the excitation of 
the 7 1S level by extrapolation from the 3, 4, 5, and 6 
levels of that family; application of the appropriate 
pranching ratio yields the function of the 4025-A line, 
with a maximum value of 1.1 10 ?? cm’. This, when 
subtracted from the function representing the sum of 
the two excitation processes, yields the 4026-A function 
which, in turn, yields the 53D apparent excitation 
function. 


V. PREVIOUS WORKS 


The apparent electron excitation functions of helium 
have been measured by Lees? Thieme,” Yakhontova," 
McFarland and Soltysik,” and Heddle and Lucas.? Of 
these Lees, Thieme, and Yakhontova determined abso- 
lute cross sections. Additionally, Stewart and 
Gabathuler have determined the peak values of the 
cross sections of several helium levels. Gabriel and 
Heddle? determined the cross sections for 108-eV elec- 
trons for a number of helium levels. 

The observations by Thieme, Yakhontova, Heddle, 
and Lucas, Gabriel and Heddle, and Stewart and 
Gabathuler were at pressures low enough to greatly 
reduce transfer effects. Of these, Thieme and Stewart 
and Gabathuler, however, did not work at pressures low 
enough to eliminate imprisonment of resonance radia- 
tion, and thus their absolute determinations of the 3 P 
cross sections are abnormally high. All observers worked 
at pressures low enough that light emission from 55, 39 
and 3P states was linear with pressure. All observers 
cited, including the authors of this paper, observed the 
excitation chamber in a direction normal to the electron 
beams. Thus all apparent cross sections determined at 
low pressures are subject to the same corrections re- 
quired for polarization and cascade. 

Table II shows the peak absolute values of the ap- 
parent cross sections of helium levels as measured by 
the authors of the paper, Yakhontova, Stewart and 
Gabathuler, Lees, and Thieme. 5 It also shows our 
values and those of Gabriel and Heddle at an electron 
energy of 108 eV. 

? J. H. Lees, Proc. Roy. Soc. (London) A137, 173 (1932). 

Y O. Thieme, Z. Physik 78, 412 (1932). 

uV, E. Yakhontova, Vestn. Lening. Univ., Ser. Fiz. i Khim. 
14, 27 (1959). (Translation 951 by Atomic Energy Research 
Establishment, Harwell, Berkshire, England.) 

2R. H. McFarland and E. A. Soltysik, Phys. Rev. 127, 2090 
(1962); 128, 1758 (1962). 

A271, 129 oa and C. B. Lucas, Proc. Roy. Soc. (London) 

1 7 
n A ED S and E. Gabathuler, Proc. Phys. Soc. (London) 
. 5 Some of these experimenters published values for cross sec- 
tions for excitation of a given line transmitted from a given level. 
By use of the branching factor one can readily obtain the cross 
section for excitation to the upper level involved. We used the 
transition probabilities of Gabriel and Heddle for determining the 
branching factors and thus obtained level cross sections as dis- 
played in Table IJ. Stewart and Gabathuler listed both line and 
level cross sections. Due to the fact that they used branching 
factors differing from those produced by the Gabriel and Heddle 
transition probabilities, their level cross sections show some slight 
variation from those we list for them. 
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Fic. 1. Apparent excita- 
tion functions of 18 helium 
levels. Cross sections are 
relative; electron energy 
varies from 0 to 500 eV. 


The measurements of Yakhontova are consistent 
with those reported herein. The ratio of Yakhontova's 
peak cross sections to ours averages 0.99 and each lies 
within the range of 0.83 to 1.16 with the exception of the 
49S and 53D cross sections. In those cases the factors 
were 0.77 and 1.37. This rather good agreement 
throughout the levels including 1D, and 3D verifies that 
the pressures used in both investigations were below 
those which cause transfer effects." 


[oe š 7 f the peak cross 
16 Yakhontova determined the absolute value of the 
section of lines from the 3 and 41P levels by operating at a gas 


4025À | 
7T's-2!p| 


Stewart and Gabathuler do not quote directly the 
pressures used in their determinations of the apparent 
cross sections, but they were welt aware of the pressure 


pressure high enough to cause complete imprisonment (which 
would yield a value near that of the level cross sectiot) and then 
determined the line cross sections (5016 and 3965 A) by dividing 
by the appropriate branching factor. Due to a spreading of the 
beam with increasing imprisonment which is concommitant with 
increasing pressure, the measured line intensity will not increase 
as much as the branching ratio when imprisonment changes from 
a nil to its full value. Thus, the line cross sections given By 
Yakhontova were not converted back to level cross sections and 
hence are not tabulated in Table II. 
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A892 ST. JOHN, MILLER, AND LIN 
"TABLE II. Comparison of absolute values of apparent cross sections of helium obtained by several experimenters. 
‘All cross sections are expressed in units of 107? cm?. 
Cross section at peak Cross section at p: a 
Stewart = abrie 
This and e This and 
Level paper Yakhontova Gabathuler Lees Thieme paper Heddle 
31 36 
1 49 
i S 24 20 27.5 28 15 16.5 
bl SY 9.2 74 10.0 5.9 16.5 6.1 7.0 
i 61S 4.8 6.1 2.8 9.5 3.0 4.0 
i 31P 350 4130 4360 3660 350 457 
i 41P 159 950 1270 1080 158 210 
i 31D 42 24 25 
41D 17.6 17.8 24 15.1 32 12 12 
51D 9.0 8.5 12.2 9.3 18.6 6.2 74 
61D 4.7 5.0 6.9 9.9 3 3.0 
33S 107 10.2 15 
43S 35 25 37 36 64 eu 44 
53S 12.3 12.7 17.2 8.6 40 1.17 1.44 
33P 97 83 105 80 1890 15.3 11 
33D 31 36 45 42 4.3 25 
43D 12.0 12.4 18 15.2 23 1.64 4.6 
53D 6.2 8.5 7.1 0.89 3.0 
63D 3.9 4.3 0.53 155 


effects, as they gave intensity versus pressure Curves. 
Some of that data was obtained in the 2-4 range, and we 
can assume the sensitivity of their equipment allowed 
them to obtain the maxima of the apparent cross sec- 
tions at about that pressure. Their relative consistancy 
with our data for 1D and 3D indicates that they were 
able to minimize the transfer of excitation effects. Their 
values for 3 1P and 41P are very large and indicates an 
imprisonment effect as one expects as a pressure of 2 u. 
Disregarding the +P data, the ratio of their cross sections 
to ours averaged a 1.26 and varied within the limits of 
1.06 to 1.50. 

The data of Lees is in serious disagreement with our 
results for the 3 !P, due to his use of pressures in the 
40-~ range. His peak apparent cross sections for the 
1S, 1D, 3S, *D, and ?P levels compared to our values by 
a ratio averaging 0.86. 

The data of Thieme compared to ours shows quite a 
variety of cross-section ratios ; his extremely high value 
for excitation of the 3 1P level is at least in part due to 
imprisonment effects. His very large value for the 3 *P 
level defies explanation other than through erroneous 
calibration procedures. Lees’ and Thieme’s data were 
obtained by photographic techniques and hence more 
subject to errors than the photoelectric data of the 
other investigators cited. The magnitudes of Thieme’s 
data will be given no further consideration. 

Gabriel and Heddle measured absolute values of the 

. cross section for 17 helium levels at pressures apparently 
low for all except the ?D levels. 'The ratios of their 

= apparent cross sections to those presented by the 

- present authors averaged 1.12 for 1S, !P, !D, *S, and *P 
levels, and lay within the range of 0.76 to 1.37. The *D 

. ratio varied up to 3.4 indicating a transfer fillin for the 

ss sections of Gabriel and Heddle. 

mparison of the shapes of the excitation functions 
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is presented in Table III. Each excitation function has 
been normalized to unity for an electron energy of 100 
eV. Included are the data of Yakhontova, Lees, Thieme, 
McFarland and Solysik, Heddle, and Lucas, and that 
presented in this paper. It can be seen that there is 
quite good agreement between the shape data presented 
herein and that of Yakhontova. This is the result of 
both sets of data being obtained under low-pressure 
conditions. The shapes of the !D and *D curves of Lees 
are modified by the !P fillin due to his use of a relatively 
high gas pressure. The curves of Thieme are quite simi- 
lar to those presented herein, a fact derived from the 
low-helium pressures used by him. McFarland and 
Soltysik obtained their functions with a gas pressure of 
10 to 20 u, and thus their curves show some transfer 
effects. The curves of Heddle and Lucas, while ob- 
tained at a pressure of about 1 u, do not show very good 
agreement with the curves of the other experimenters. 

Another factor influencing the shape of an experi- 
mentally determined excitation function is that of the 
shape of the electron beam. If it should undergo any : 
change with electron energy, the light detection system 
being used could change in sensitivity due to (a) a 
variation in the light-gathering ability of the optical 
system with change in the position of the light source 
and (b) a lack of homogeneity of the response of in the 
cathode of the photomultiplier. Care was exercised by 
the authors of this paper to maintain a cylindrical elec- 
tron beam of constant diameter at all electron energies. 


VI. POLARIZATION AND CASCADE CORRECTIONS 


Values of the percentage polarization, P(j), have 
been measured by McFarland and Soltysik? for varying 
pressure and electron energy for several of the tran- 
sitions reported in this paper, and are used for making 
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the polarization corrections reported herein. Since the 
amount of polarization correction is generally rather 
small, the values of P(j) given in Ref. 12 were used 
whenever available, and extrapolations of these values 
were used for the transitions which were not reported. 
To obtain the true cross section of electron excitation, 
Q’ must be corrected for the cascading effect. With the 
aid of Eqs. (5) and (7), Eq. (6) can be rewritten as 


Q)-0'()—-27:0' WA G3)/A Q) . 


The true excitation cross section of the jth state can 
then be determined from a knowledge of Q’ of the jth 
state and of those states that cascade into it. 

For our analysis we had available the values of Q" (7) 
for those states included in Fig. 1 and Table I. These 
were corrected for polarization effects by the use of the 
polarization data of McFarland and Soltysik and values 
of Q'(1) obtained. The shapes of high z values excitation 
functions corrected for polarization was taken to be the 
same as that of the low » functions having the same 
orbital quantum number L. The magnitudes of the high- 
level Q’ (i) functions were assumed to vary as n~% where 
a is an adjustable parameter. Theoretical transition 
probabilities have been compiled for 7 <8;’ an extrapo- 
lation procedure again was used to find the values for 
higher states. 

The excitation functions corrected for polarization 
and cascading effects of the levels of helium are shown 
in Fig. 2. The actual percentage of correction applied 
to an excitation function for each of the polarization and 
cascading effects varied with electron energy due to the 
differences between the energy dependence of the polar- 
ization factor, shape of the excitation function of the 
cascading levels, and the shape of the excitation func- 
tion being corrected. The variation in the cascading 
correction was not so prominent for the triplet functions 
due to the similarity among triplet apparent excitation 
functions. The maximum percentage by which Q’ varied 
from Q" was 0 for !S levels, 4 to 7% for !P levels, 13 
to 17% for 1D levels, 0 for *S levels, 5% for the 33P 
level, and 4 to 5% for the ?D levels. The higher levels 
showed less polarization than the low levels and thus 
received smaller corrections. This situation thus mini- 
mized the errors involved in the extrapolation procedure 
used in procuring the Q'(j) curve from the Q” (j) curve. 
The cascade contributions were then subtracted from 
the polarization corrected curves. The maximum cas- 
cade contribution to the 3!S level was 15% and 
occurred at an electron energy of 300 eV. Cascade cor- 
rections of a similar percentage were obtained for 
higher iS levels. The 3 P level sustained a 14% cor- 
rection at 35 eV and a 4% correction at 100 eV. The 
41P level received somewhat smaller percentage Cor- 
rections and distinctly less absolute corrections. AR He 
1D functions suffered maximum corrections at 450 eV; 
they were in the range of 5 to 87. 

The cascade corrections sustained by 


(9) 


triplet functions 


Taste III, Shape comparison of experimental apparent excitation functions of helium. 
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Frost and Phelps? made determinations of cascade 
contributions. They estimated that no correction was 
necessary for the 15 and 1D curves. They obtained a 10% 
correction for the 3!P and 41P levels at 100 eV. The 
triplet curves, at the peak of the excitation function, 
were populated by cascade as follows: 4*S, 495;3?P, 
49%; 33D, 4*D, 5*D all 10%. 

Gabriel and Heddle? determined cascade corrections 
for electron energies of 108 eV. The 4 1S level population 
was 2% due to cascading. Higher 4S level populations 
were regarded as entirely caused by electron impact. 
Singlet P populations were assumed free of cascade 
effects. The 31D level was found to have 4% of its 
population due to cascade, but higher 1D unaffected by 
cascade. The triplets were populated by cascade as 
follows: 439, 23%; 53S, 7%; 33P, 34%; 3°D, 4%; 
43D, 17%; 53D, 17%. 


VII. COMPARISON WITH THEORY 


The theory of excitation of helium atoms by electron 
impact has been treated rather extensively in the litera- 
ture. Massey and Mohr? calculated the excitation cross 
sections for the 21S, 315, 21P 31P 41P, 51P, 31D, 
41D 5!D and 4!F states by the Born approximation 
and for the 23S, 3?P, 43P, and 37D states using the 
Born-Oppenheimer approximation for incident elec- 
trons of various energies (60, 100, 200, and 400 eV). 
Curves for the theoretical excitation functions for the 
319, 3 1P, 41P, 41D, 51D and 23S states were also in- 
cluded in this work. Bates et al” have given a critical 
analysis of the Born and Born-Oppenheimer approxi- 
mation along with additional results of cross sections 
for some of the low- states of He. As an improvement 
over the Born-Oppenheimer approximation, cross sec- 
tions for the 119 > 219 11S — 23S, and 11$ — 2?P 
transitions have been calculated by Massey and 


"TABLE IV. Comparison of observed and calculated cross sections 
for electrons of various incident energies. Observed cross sections 
are corrected for polarization and cascading. All cross sections are 
expressed in units of 107 cm’. 


60 eV 100 eV 200 eV 
Level Obs. Calc. Obs. Calc Obs. Calc. 
31S 38 26 28 15 18.5 11 


31P 260 340 
GO WO 140 152 110 136 76 


31D 31 21 44 11.1 2.5 
41D 147 3.7 11.6 215 88.1. 55:6 1.3 
33S 21.5 1.3» 7.6 0.605 41 0.13» 
3*P — 42 5.5 13.3 1.1 4.1 0.15 
33D 9.0 0.13 3.5 0.03 2.6 0.004 


a The 3 3S calculated values are extrapolated from 2 4S calculations. 


18L. S. Frost and A. V. Phelps, Westinghouse (Pittsburgh) 
Research Report 6-94439-6-R3, 1957 (unpublished). 
(Lo H. S. W. Massey and C. B. O. Mohr, Proc. Roy. Soc. 

ondon) A140, 613 (1932). 

2D. R. Bates, A. Fundaminsky, J. W. Leech, and H. S. W. 
Massey, Phil. Trans. Roy. Soc. (London) A243, 93 (1950). 
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Fic. 3. Experimental and theoretical excitation functions of the 
31S level. The experimental curve is corrected for polarization 
and cascade effects. 


Moiseiwitsch” using the method of exchange distorted 
waves (EDW). Unpublished calculations by Fox on 
the excitation cross sections of several !S states were 
quoted in the paper of Gabriel and Heddle’? and by 
Seaton.” In Table IV are listed observed and calculated 
(Massey and Mohr) excitation cross sections at elec- 
tron energies of 60, 100, and 200 eV. 

The theoretical and observed cross sections of the 
31S level show good agreement. The calculated cross 
sections for the !S states are subject to a higher degree 
of uncertainty than those of the other states, because 
the wave functions of the 21S states were gotten by 
orthogonalization of the Slater-like orbitals. A small 
change of the effective nuclear charge, for instance, may 
result in a significant variation of the cross sections. 
Thus Massey and Mohr gave 15X10-? cm? for the 
31S cross section at 100 eV while the same cross 
section, according to the work of Fox,” is 46K 107? cm’. 
In the light of this, the experimental cross section may 
be considered to lie within the limits of the theoretical 
calculations. Figure 3 shows the comparison of the ex- 
perimental and theoretical excitation functions of 3 *S. 

Figure 4 shows the comparison between the theoreti- 
cal and experimental excitation functions of 3 1P and 
41P. The discrepancy at voltages below 100 eV is to be 
expected since the theoretical values were calculated by 
the use of the Born approximation. Analyses by 
Altshuler and by Miller and Platzman” indicate that 
the use of approximate helium wave functions may lead 
to an error in the cross section as large as a factor of two 
or so. Thus the over-all agreement between the ob- 


2i H. S. W. Massey and B. L. Moiseiwitsch, Proc. Roy. Soc. 
(London) A227, 38 (1954) ; A258, 147 (1960). 

2: M. J. Seaton, in Atomic and Molecular Process," edited by 
D. R. Bates (Academic Press Inc., New York, 1962). 

23 Fox’s cross section of 345 were given for electron energy at 
108 eV as 43X10-2 cm?. The value at 100 eV is obtained by 
assuming the cross section to be inversely-proportional to the 
energy of the colliding electron over this small range of energy. 

*1 S. Altshuler, Phys. Rev. 87, 992 (1952); 89,-1093 (1953). - 

*5 W. F. Miller and R. L. Platzman, Proc. Phys. Soc. (London) 
AT0, 229 (1957). 
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Fic. 4. Experimental and theoretical excitation functions of the 
31P and 41P levels. The experimental curves are corrected for 
polarization and cascade effects. 


served and calculated excitation cross sections for 3 1P 
and 4!P should be considered as good as can be ex- 
pected at this time. 

The evaluation of the absolute cross sections Q from 
the experimental data for the !D states is somewhat 
complicated by the fact that the amount of cascading 
from the !F to the !D states are not known, since no 
excitation measurements are available for the 'F states. 
We shall use the calculated value? of Q(4!F) along 
with an n~ dependence for the higher !P states. The 
cascading from G to !F will be neglected, since the 
populations of the G states are expected to be very 
small on account of the rapid decrease of the direct 
excitation cross sections with increasing values of L. 
Under these assumptions we have calculated N (nF) 
and found that the !F — 1D cascading contributes less 
than 1% of the total population of the 'D state. This 
result remains essentially the same when the singlet- 
triplet mixing of the F states are taken into considera- 
tion. The absolute cross sections of 3, 4, 5, and 6 !D are 
then evaluated from the experimental data on Q" (see 
Table I and Fig. 1). They are about four times larger 
than the theoretical values. At first, one might ascribe 
this discrepancy to the inaccuracy of our estimated 
populations of the F states. If we were to assume that 
F—'D cascading is entirely responsible for the devia- 
tion of the observed Q from the theoretical values, it 
would be necessary to increase the concentration of the 
atoms in the F states. This, in turn, would require cross 
sections for electron excitation of the !F states to be 200 
to 300 times as large as the theoretical values. The value 
of Q(43F), for exar;ple, would have to be about 
1210-2° cm?, a seemingly unreasonable value. Our 
experimental results for the excitation cross sections of 
the 1D states therefore could not be brought into com- 
plete agreement with theory in a consistent manner. 

Calculation by Fox as quoted by Seaton? gives 
Q(31D)—7X107* cm? at 108 eV as compared to 


4x10? cm? calculated by Massey and Mohr? at 100 


V. The difference between these two values can be 


JOHN, MILLER, 


AND LIN 


taken as an indication of the degree of variation of the 
theoretical cross section (Born approximation) which 
can be expected from the use of different approximate 
helium wave functions. Seaton has pointed out the cal- 
culations by B II approximation show that the coupling 
between 3!D and 3!P is not important for the cross 
section of the 3 1D state. It is very unlikely then that 
the imperfection of the theory could account for the 
disagreement between the observed and calculated cross 
sections for 31D and 4!D. Nor is it conceivable that 
errors in the experimental procedure could have caused 
the observed values to show this mismatch with the 
calculated cross sections. The measured apparent cross 
sections are proportional to the output current of the 
photomultiplier tube; thus, the relative values of the 
apparent excitation cross sections should be quite accu- 
rate. Of course the absolute experimental cross sections 
are subject to the errors of the light-intensity calibration 
as well as the cascade corrections. The amount of cas- 
cading populations in the 'P and 1D are quite small, i.e., 
less than 14% of the total populations. Errors in the 
calibration of the photomultiplier output should affect 
all the cross sections by a constant factor. This leads us 
to conclude that one cannot have simultaneous agree- 
ment between theory and experiment for the 3 1P 41P, 
31D, and 4 !D states. 

For the 33P and 33D states the calculated cross 
sections are consistently smaller than the observed 
values. In fact, the discrepancy is too large to be as- 
cribed to the errors in the theory or experiment or com- 
bination of both. In the case of the *D states one is again 
faced with the problem of having to estimate the 
F— 53D cascading. Calculations show that the contri- 
butions of F—>*D cascading to the population of the 
18D states are less than 1%, i.e., completely negligible. 

Finally for the 4S series, direct comparison between 
theory and experiment is more difficult since theoretical 
cross sections are available only for 2 4S, while our ex- 
perimental results cover 3 35, 4 35, and 53S. In order to 
make any comparison it is necessary to extrapolate the 
experimental data with the assumptions that (i) the 
shape of excitation functions for all the 7» 3S states is 
identical, and (ii) the peak values of these functions 
follow the relation z-* or (n*) as was assumed by 
Phelps and Frost!’ and by Gabriel and Heddle.’ Here, 
n* is the effective quantum number such that the energy 
of the state is equal to a negative constant divided by 
(n*)2, The extrapolation procedure yielded experi- 
mental values of Q(2 8S) equal to 180, 69, and 39X10™ 
at 60, 100, 200 eV, respectively, for the n~“ relations, 
where a=4.0. Values of 140, 51, and 30 10? were ob- 
tained for the (5*)-3 relationship where n*=n—0.30. 
These are to be compared with the theoretical values of 
5, 2.4, and 0.5X10-° cm? calculated by the Born- 
Oppenheimer approximation. Massey and Moisei- 
witsch?! have shown that when the distortion of the 
plane wave is taken into consideration, the calculated 
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Cross sections become much smaller at low voltages. 
Thus it I$ scen that the experimental cross sections for 
the *5 series are about 30 times the theoretical values. 
A comparison of the mismatch of theory and experiment 
of the *S family with that of the *P and *D can best be 
made at the »=3 level. Since no theoretical cross sec- 
tions are available for the 3*5 state they were extrapo- 
lated from n=2 to 1—3 by use of the (1*)-? relation- 
ship. These figures are to be found in Table IV. They 
are used only to provide an estimate of the theoretical 
values and to draw conclusions of qualitative nature. 
Àn examination of Table IV reveals that the fraction 
of the observed cross section accounted for by theory 
is much smaller for the 3 ?D state than for the 3*5 and 
3*P states. However, a more important consideration 
would seem to be that of the magnitude of the un- 
accounted for cross sections, i.e., Q(obs) — Q(theory). 
This value is of the same order of magnitude at a given 
electron energy for the three triplet states under com- 
parison. Thus, the mechanism producing the excessive 
population in the triplet states does not preferentially 
populate any one state any more than the others. 


VIII. CONCLUSIONS 


The electron excitation cross sections for 18 excited 
states of helium have been determined by measuring 
the intensities of the radiation originated from these 
excited states at low pressure. Corrections have been 
made to allow for the cascading and polarization effects. 
The agreement between the observed and theoretical 
cross sections for the 3 ! P, 4 1P, and 3 !S states is satis- 
factory. The experimental cross sections for 31D and 
41D are about four times larger than the theoretical 
values. For the *S %P and 5D states the experimental 
results far exceeds (a factor of 10 or more) the calcu- 
lated cross sections. 

The discrepancy in the cross sections of the triplet 
series with the calculated values is far too large to be 
explained by experimental uncertainty. Nor can one 
reasonably expect the Born-Oppenheimer approxima- 
tion to produce cross sections which are ten to a hundred 
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times too small at electron energies above 60 or 100 eV. 
This is particularly true in view of the fact that the’? 
and 15 cross sections calculated by the Born approxima- 
tion do agree with the experimental values. Further- 
more, calculations by Massey and Moseiwitsch?! show 
that the excitation cross sections obtained from the 
Born-Oppenheimer approximation and from the EDW 
method approach each other at electron voltages above 
50 eV. One does not expect the use of more refined 
method of calculation will change the theoretical cross 
sections (above 50 eV) significantly. In the case of 
3!D and 4!D although the disagreement between 
theory and experiment is less severe, it is very unlikely 
that the cumulative errors of the theoretical calculation 
and the experimental work including the associated 
analysis of the excitation data could account for the 
deviation of a factor of four. 

The explanation which seems most plausible to us 
is that the observed population of the triplet states 
(and possibly, to some extent, of the 1D states) is pro- 
duced mainly by processes other than direct excitation. 
Results of the measurements of the lifetime of the 
triplet states by Holzberlein and Fowler?® also point 
toward the same conclusion. Of course, the mechanism 
or mechanisms with which the triplet states are popu- 
lated must have the correct linear behavior with respect 
to pressure and electron beam current. Efforts are being 
made to perform additional experiments to provide 
further evidence of this “anomalous” behavior of the 
triplet states as well as to search for the mechanisms 
which are responsible for the observed populations of 
these states. 
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A numerical program has been developed for the calculation of atomic photoelectric differential and total 
cross sections, including all polarization correlations. The program 1s designed to calculate relativistic 
Coulomb wave functions in a screened central potential; the outgomg continuum wave function is obtained 


in a partial- 


zave series. Results are presented here for K-shell differential and total cross sections in point 


Coulomb potentials (i.e., unscreened) ranging from charge Z=13to Z=92, and covering the range of incident 
photon energies from 200 keV to 2 MeV. Enough data are presented to permit interpolation throughout 
these ranges. The total cross sections above 1 MeV are found to be significantly lower than previously 
accepted values. Further, the angular distributions from heavy elements deviate greatly from the commonly 
used Sauter distribution. These features are discussed and compared with existing experimental and theoreti- 


cal work. 


I. INTRODUCTION 


n predictions of the cross sections for the 
atomic photoelectric effect have in general been 
unavailable. Except for special limiting cases results 
must be obtained by numerical methods, even when the 
electron wave functions are assumed to be hydrogen-like 
and so available in analytic form. The complexity of 
these procedures has encouraged the use of extrapolation 
formulas based on various analytic approximations and 
indeed, at least for the total cross sections, moderate 
agreement with experiment has been obtained over a 
wide energy range.! The availability of accurate experi- 
mental total cross sections, the increasing need for 
accurate predictions of the differential cross sections, 
and the recent interest in the polarization properties of 
these reactions, have encouraged us to attempt their 
calculation by numerical means.’ 

We calculate in the central-field approximation, that 
is, we assume any atomic electron, whether bound or 
continuum, interacts only with a scalar spherically sym- 


+ This study was supported in part by the U. S. Air Force Office 
of Scientific Research Grant AF-AFOSR-62-452, and in part by 
the U. S. Atomic Energy Commission. 

* Address beginning August 1964: Department of Physics, 
University of Pittsburgh, Pittsburgh, Pennsylvania. 

t Present address: Physics International, Berkeley, California. 

1 We will make some references to the previous work in this field 
in succeeding sections. The status of the subject up to 1954 is 
summarized for instance in W. Heitler, Quantum Theory of Radia- 
tion (Oxford University Press, New York, 1954), 3rd ed. We, in 
general, follow the notaticy of this book, but throughout we shall 
use the units #=c=m-=1. Thus, distances are measured in units 
of the electron compton wave length, etc. We will often use az Ze. 

2 While this work was in progress, an independent numerical 
calculation of K-shell photoelectric cross sections was reported by 
S. Hultberg, B. Nagel, and P. Olsson, Arkiv Fysik 20, 555 (1961), 
hereafter referred to as HNO. This calculation, although using a 
method which cannot be applied to screened potentials and not 
obtaining results above 662 keV, is very valuable both for the 

information it provided on photoeffect at the lower energies and 
because it provides checking points for subsequent calculations, 
such as the present work. 


metric potential. Further, we neglect the effects of 
finite nuclear size, so that for sufficiently small distances 
this potential must simply reduce to the pure Coulomb 
Ze/r potential. For most applications these assumptions 
are appropriate. 

Under these circumstances the theory of the atomic 
photoelectric effect is a simple application of first-order 
radiation theory. We require a fully relativistic treat- 
ment, since even at low energies, relativistic effects can 
be significant in a high-Z element. Then the differential 
cross section for the photoeffect is! 


do/dQ= (2r)*pe| H |? (1.1) 


subject to energy conservation, where 
H- -eQs[E) | arbetaree Win: (1.2) 


The absorbed radiation is described by its momentum k 
and polarization e. Pin is a solution of the Dirac equation 
in a central potential corresponding to an initial bound 
state and Win is a solution corresponding to an outgoing 
electron of definite momentum p and energy € (“plane 
wave” plus igoing spherical waves). 

Given the central potential, the entire problem is to 
solve the Dirac equation for the desired wave functions 
and then to integrate to obtain the matrix elements H. 
We have constructed a code for the IBM-7090 which 
does this. 

In the present paper we present results for photoeffect 
from the K shell of an atom, obtained using wave func- 
tions in the pure point Coulomb field. The range of 
elements from aluminum (Z=13) to uranium (Z=92) 
was investigated for photon energies k from 200 keV to 
2 MeV. Enough data were taken to permit interpolation 
throughout these ranges. For each choice of Z and k we 
obtained (1) the total cross section, (2) the differential 
cross section, and (3) all polarization correlations be- 
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tween incident photon and ejected electron. Results for 
differential and total cross sections are given here; the 
results obtained for the polarization correlations will be 
reported separately. Most of the information is pre- 
sented in tabular form. Modifications caused by screen- 
ing,* extensions to lower and higher energies, and photo- 
effect from higher shells will be discussed in subsequent 
work. 

The organization of the present paper is then as 
follows. In Sec. II we discuss the general properties of 
the photoeffect, as revealed by simple analytic approxi- 
mations, and then develop the complete mathematical 
formalism for the process. Section III is devoted to a 
short discussion of the numerical methods for calculat- 
ing wave functions, some features of which have not 
previously been published. The accuracy of the numeri- 
cal calculations is discussed in Sec. IV. The remainder 
of the paper presents our results and compares them 
with experiment and previous theory. This is done for 
the total cross section in Sec. V and for the differential 
Cross sections in Sec. VI. 


Il. GENERAL PROPERTIES AND FORMALISM 


The main purpose of this section is to develop the 
formalism, based on Eqs. (1.1) and (1.2), needed for 
the calculation of the photoeffect. The wave function 
Vin for the outgoing electron must be written as a sum 
over partial waves with appropriate asymptotic proper- 
ties. Then for each partial wave the integrations over 
angles of Eq. (1.2) may be performed analytically, leav- 
ing a small number of radial integrals to be done 
numerically. The cross sections and the polarization 
correlations will be completely specified by these 
quantities. 

It is useful, however, to first have a qualitative under- 
standing of the general features of the process. For this 
purpose we replace Eq. (1.2) with the corresponding 
nonrelativistic expression 


H= e [ stpse Yin (2.1) 


We assume a pure Coulomb potential, so that for photo- 
effect from the K shell Yin= (a/r) Pet, where a= Ze’. 
We also make the Born approximation and replace Yin 
with the plane wave e'"*. Then the integral (2.1) is 
easily done, and the differential cross section for photo 
effect from both K-shell electrons is given as 


do P sin*ó 
— oia —— - (2.2) 
4ed (eer jun (1—8 cosó)* 


3R. H. Pratt, R. D. Levee, R. L. Pexton, and W. Aron, Phys. 
Rev. 134, A916 (1964) (following paper). R 
* For K-shell ee oe it is believed that screening effects are 
quite small, typically (in heavy elements) of the order of h. 
or all other shells, screening effects are large and the use o 
ydrogen-like wave functions in a numerical calculation is not 
justified. 
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where 0 is the angle between k and p, and e; is the 
component of the photon polarization vector in the 
scattering plane which k and p define. 

This cross section vanishes both in the forward and 
backward directions. For low energies (61) it has a 
broad maximum centered at §=90°; as the energy 
increases the maximum shifts toward smaller angles and 
its width decreases. Equation (2.2) is independent of 
any circular polarization of the incident photons, but it 
is sensitive to linear polarization: If the beam is linearly 
polarized in the direction e, e;—cose, where e is the 
angle between the plane of k and p and the plane of k 
and e; there is no emission perpendicular to the direction 
of polarization. Integrating over angles, the total cross 
section for unpolarized photons of low energy (but far 
above threshold) reduces to 


o— 322a! / 3, (2.3) 


We note that this (1) varies as the fifth power of Z and 
(2) decreases rapidly with increasing energy. 

These results must be modified both for low and for 
high energies. Near threshold the Born approximation 
expansion in a@/@ is invalid; when the exact non- 
relativistic Coulomb wave functions are used it is found! 
that the total cross section (2.3) must be multiplied by 


I 1/2 g—45 cot-13 
f=2e(-) — —, px E Qe ee 


1—6?77 


where J is the ionization energy. This causes an appre- 
ciable reduction from the Born approximation predic- 
tion, which is reached only slowly as the energy in- 
creases, in the entire low-energy region; at the K-shell 
threshold f— 0.12.5 

For higher energies a relativistic treatment based on 
Eq. (1.2) is necessary. Sauter found that, to lowest 
order in Z, the differential cross section from linearly 
polarized photons is given by? 


1—(1—58)'* sinh cos?e 

(1—8 cos) 2(1—8) (1—8 cos)? 
1—(1—58»T 29 

Cee ae |. (2.5) 
4(1—8:)9?  (1—8 cosh)? 


This again vanishes in the forward and backward 
directions; the maximum narrows and moves toward the 
forward direction as the energy incréases. There is now 
some emission perpendicular to tie direction of polariza- 


—- eas 


do p [ sin" cos*g 
dQ bie! 


5 However, in heavy elements relativistic effects are important 
at threshold. Part of this is simply due to the relativistic shift of 
the threshold energy. The relativistic A-shell photoeffect at 
threshold has been discussed by B. Nagel and P. Olsson, Arkiv 
Fysik 18, 29 (1960), in the approximation of an unscreened point 
Coulomb potential. However, as the authors note, screening 
effects are expected to be significant in the threshold region even 
for K-shell photoeffect. 

6 F, Sauter, Ann. Physik 11, 454 (1931). 
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tion. For very high energies the total cross section from 


| unpolarized photons becomes 
(2.6) 


and so still varies with the fifth power of Z but de- 
j creases less rapidly with increasing energy. 

For high-Z elements the corrections to this lowest 
order theory can be very large. T he high-energy be- 
havior of the total cross section has been obtained 
exactly,’ with numerical methods, and for Z—82 (lead) 
the prediction of Eq. (2.6) is too large by a factor of 
five, The expression í 


o=[4rea/k] 7*(1— 4ra/15), 


derived analytically, is in fair agreement with these 
results, and shows the nature of the suppression of the 
Z5 dependence. It has also been shown that the cross 
section does not vanish in the forward direction when 
terms of relative order a? are considered, and such 
effects have been observed experimentally." 

ni In relativistic photoeffect there will also be inter- 
actions with the electron spin, and hence many more 
types of polarization correlations are possible. To 
describe these it is convenient to introduce the usual 
polarization parameters. We describe photon. polariza- 
tion with the quantities 


o=4re'a*/k, 


(2.7) 


f= €1€1— ees, E= eses“ + e221" 7 


2.8 
£y— (ees — e221") , ( 


and we describe the polarization of the ejected electron 
by the direction € of its spin in its rest system. {3 is 
taken along p, £1 in the scattering plane. Then if we sum 
or average over the initial polarization states of the 
bound electron, the differential cross section for photo- 


effect must be of the form 


[2 = ESC], (2.9) 


do (=) 
dQ dQ unpol 1,7—0 
where £o—£o— Coo— 1, and (do/d®)unpor is the differ- 
ential cross section from unpolarized photons, summed 
over final electron spins. The correlation Cio connects 
linearly polarized photons and unpolarized electrons, 
and as already noted, occurs in the lowest order cross 
section. Also occurring in lowest order are C33, connect- 
ing longitudinally polarized electrons and transversely 
polarized photons, and Cu. The correlations Coz, C12, 
Co, and C»; appear in relative order a. The remaining 
^ correlations are forbidGen by invariance considerations. 
"We can now develop the formalism needed for a 
numericakcalculation of these photoeffect cross sections. 
"To begin with, we need wave functions for the electrons. 
A bound state will be specified by capital letters: total 
momentum J, orbital angular momentum L, 
4), the azimuthal quantum number M, and 


att, Phys. Rev. 117, 1017 (1960). 
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total energy (including rest mass energy) Æ. Thus, for 
Win in Eq. (1.2) we write 


; G(r) Quru) 
i-( SUN ) : (2.10) 
iF (7) Qr (f) 
where L’ and « are defined by 
TAD, 
x7 (J-4-3) as JL. (orJ-L 55), (2.11) 


and the spherical spinors Oz are defined in terms of 
spherical harmonics® by 


Crum? Yr 
Susu-( n , (2.12) 
JLM YLM 
where 
J=L+3 J=L—} 
[| [LM 1/2 
Cr =| ; 
2L--1 2L--1 
(2.13) 


T LED Ea 
Silom Nite ent 


The radial functions G, and F, are to be obtained as 
solutions of the coupled equations 


[E--14- e ]F.— [ 4G. dr-- (12-9 (G7) ]0, (2.14) 
[E—1-- e]G.--[dF / drd- (179) (,/7)]- 0. 7 


The potential o must be specified (for the pure Coulomb 
potential g=a/r) and the energy E is then determined 
from the eigenvalue problem. The wave function will be 
normalized by requiring 


i Pdr (F)?+(G)?J=1. (2.15) 


The wave function V, of the outgoing continuum 
electron is written as a sum over partial-wave solutions 
with appropriate asymptotic properties: 


Vtin 3; 4v (Uim (P)U a)i? 


ilm 


( g Qj lm (f) 
tf Q jUm (f) 


Equations (2.10)- (2.14) are understood to remain valid 
with the substitution of the small letters 7, l, l, m, K, © 
etc., and of course p+ 1— e, where p is the momentum 
of the outgoing particle. The two-spinor Us specifies the 
polarization properties of the electron in its rest frame. 
Equation (2.15), however, is not appropriate for 
continuum functions, and is replaced by the requirement 
that the functions are normalized such that at large 


) > ONG) 


3 We use the phase conventions of A. R. Edmonds, Angular 
Mi omentum in Quantum Mechanics (Princeton University Press, 
Princeton, New Jersey, 1957). 


QU NU bu S 


r 
5; 
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distances 


e+ 1\ !? 1 
ck (=) — sin(pr—}lr+6,) , 
TONLE pr 


e—1\ "21 
jh; (5 ) — cos(pr—4$lr+6,). 
Te € pr 


Equation (2.17) also defines the phase shifts 6,. [For 
the pure Coulomb potentials 6, in Eq. (2.17) but mot Eq. 
(2.16) should be replaced by ô+ (ae/p) In2pr—this 
feature arises from the long-range character of the 
potential. ] Note that the sum over j and / subject to 
j=l} in Eq. (2.16) is equivalent to a single sum over 
x, where x ranges over all positive and negative integers. 
Substituting the wave functions (2.10) and (2.16) 
back into Eq. (1.2), H can be written as a summation of 
terms, each corresponding to a given x and m in the 
series for the continuum wave function. It is convenient 
to remove some constants and write this in the form 


H=—e(2n/k)"? > Kr, (2.18) 


(2.17) 


where each 3€, represents a sum over all m values 
consistent with that x. 

Now we choose a z axis along the photon direction k 
and start performing the angular integrations. In the ¢ 
integration nonvanishing contributions come only for 
m=M-+1, and thus the summation over m implied in 
5C, reduces to two terms. This is most conveniently 
written in terms of the circular polarization coefficients 
£47 642 ie, for then 


TE- = de®] (U 4*9; 1M+1 (B))e_R,* (M) 
+ (Užima) eR (M1) J, (2.19) 


where the R’s, a set of numbers resulting from @ and r 
integrations contain all the remaining information of the 
problem. (R’s of +M are easily related.) To proceed 
further for an arbitrary bound state one expands the 
plane wave e* of Eq. (1.2) in spherical harmonics 
and then integrates over products of three spherical 
harmonics. 2 7 

For the purpose of this paper we restrict our attention 
to bound 5 states (J=4, L=0) and the results are quite 
simple. All the information of the problem is contained 
in two sets of numbers, R,*, defined by 


— 
2I-+-1 
X[jciEr)- jux(r)], 


R= | varlen O 


Rec [rius 


(2.20) 


Pe Nua (e «| 
x[ iocos) T jua(&r 241 


-cofuC;-i QI 1) ru (kr) | 2, 
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where 7,— -E1 according as 7=/-F}. [These R’s corre- 
spond to R*(M — --3) of Eq. (2.19).] Then for M=+4 


3C,— «mre | (U 4*0; ap) )e-Re* 


HU) eR], (2.21) 
and for M=— 4 
$7 Arne (QU 4*9 a) eR 

TQ ŽU) eR]. (2.22) 


We shall sum over the two electrons of a bound S state. 
Then the differential cross section for a photon with 
polarization parameter £; [as in Eq. (2.8)] to eject an 
electron with spin direction ¢ (in its rest system, with 
fa chosen along the electron, and ¢ in the scattering 
plane) into the solid angle d2 may be written 


do 3 
—=5A Y BUB, (2.23) 
dQ i,j=0 
where 
fo—(o—1, A=16me7pe/k. (2.24) 
The only nonvanishing B’s are 
Bw-[|J-|*-- | K-]- I2 | Ks/71, 
Bo-— 2 Im[J_*K_+J,*K; ] ; 
Byo= 2 Re[J_*J,+K*K, | 7 
By=2 Im[J_*K,+J,*K_], 
Boy= —2 cos Im[ J_*K,+K*J, ] 
+2 sing Im[J_*J,+K,*K_], (2.25) 
Bo3= —2 cos Im[J_*J,+K.*K_] 
—2 sin Im[J_*K.+K_*J,], 
B31—2 cos Re[J-*K .—J,*K..] 
—sin[|J-|*— | K-|*4- | Ks |? [J41], 
Bs3— cost[ |J. |*— | K- |*- | Ky]? [7+1] 
+2 sing Re[J-*K .—J.,*K.], 
where 
[k—1] 712 
J_=— (4r)? » vel — R.+ P (cosb), 
oT Lidel 
K_= (tr) exile? — 1] PREP? (cost), 
x (2.26) 


J= (4r) E qe? |x| RP (cosh), 


K= (4r) 2 D e'[ |x| JYPRAPP(cos8). 

In the sums of Eq. (2.26) terms for which the P;’s do not 
exist are to be omitted; @ is the angle between photon 
direction k and electron direction p. [The last four B’s 
of Eq. (2.25) take their form when a rotation is made so 
that ¢ refers to a spin along the electron direction. | To 
put our result in the form of Eq. (2.9) we finally define 
the polarization correlations by A 7 


Cis Bi/ Bo. (2.27) 
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The differential cross section from unpolarized photons, 
summed over electron spins is then 


n do 
i (=) =AB 00 
i dQ unpol 


and the total cross section 


(2.28) 


o=2rA J " Bosinódó- AY: [RAR]. (2.29) 
0 K 


Thus, when the quantities R= (and the phase shifts ôx) 

have been calculated, all properties of the process may 

be predicted. And the main work in calculating these 
quantities lies in the prior calculation of the electron 
n wave functions. 


J II. METHOD OF SOLUTION OF THE 
Mn WAVE EQUATIONS 


A. The Bound-State Wave Equations 


The bound-state wave equations 
(E--14- ¢)F.—[dG,/dr+ (1+) (G./7) ]- 0, 
(E—1+ e)G.-- [IF ,/ dr4- (1—«)(F,/r)]=0, 
constitute an eigenvalue problem for the energy eigen- 
value E and the bound-state wave functions PF, Gx. 
Since F, and G, behave like 7*7! near 7—0 for the 


Coulomb potential? and are therefore singular for y « 1, 
it is convenient to make the substitution 


(3.1) 


G,—Gori, F=F gr. 


Equations (3.1) then become 


? 


_ dG, (y+x)_ 
Gait or] : 115, 
LY 72 


(3.2) 


- (ie Cane 
(i= tae D — £ 5. |-o, 
ir 


22 


where F, and G, are finite at 7—0. 

Although the following analysis for the solution of 
Eqs. (3.2) applies to a potential e of any form, we shall 
in at present paper specify the Coulomb potential 
g=a/r. 

The boundary conditions associated with (3.2) with 
y=a/r are determined at the origin by multiplying 
through by 7 and letting 7—0. We have then 


aP, (0) * (y4-:9G., (0) =0 ? 


a0,(0)-- (y—9P.(0)—-0. Ce) 


Since the equations are linear and homogeneous, we may 
choose G,(0) arbitrarily equal to one. /,(0) is then 


D L. I. Schiff, Quanium Mechanics (McGraw-Hill Book Com- 
_ pany, Inc., New York, 1955), 2nd ed., Sec. 44, p. 336. 
Bi 
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determined by either of the Eqs. (3.3), and y is deter- 
mined by the requirement that 


a  —(v*9 


y—K a 


or 

y= (Pa), (3.4) 
where the positive sign is taken to obtain a physically 
acceptable wave function. 

In order to determine E we need to satisfy the bound- 
ary condition at r —o ; that is we must approach the 
asymptotic solution of (3.2). This solution is determined 
by letting r=% in (3.2), leading to 


(E--1)P,—G',—0, 
(E—1)G.--F",—0, 
which have the solution 
a —Ar 
me (3.5) 
F K= C ge ) 
where A= (1—E?)!? with E<1. Increasing solutions are 
ruled out since the wave function must be bounded as 
r —. [In the point Coulomb potential, these forms 
must be multiplied by 7?-!*, where 7 is the principal 
quantum number; this has no effect for the K shell.] 
In order to numerically integrate (3.2), we must re- 
place them by suitable difference equations. In the 
difference equations we have chosen, we evaluate the 
wave functions F and G (for simplification we have 
dropped the subscript x) at the discrete points labeled 
by integral values of j running from zero at r—0 to J 
at a point ry. The coordinate ry is that point where the 
asymptotic boundary conditions (3.5) are satisfied in 
appropriate difference form to within some tolerance T3. 
The conditions (3.2) are applied at j+4 and the deriva- 
tives are therefore evaluated as the slope of the chords 
joining values at j and j4-1. Quantities needed at 7 +5 
are evaluated by averaging. The resulting finite differ- 
ence equations are 


Part F; 
ee ee 
Gim—G; (vx). i 
-J= (6,63 |=0, 
miT; Prin 
é rugs RF (3.6) 
(E—14- pe ee 


+ 


THA fi 


PaF; (y—x) = a 
| Fatt |0, 
Tip 
with 0x jJ —1. 
'The basis of the method is the simultaneous solution 


| 


| 


| 
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of the complete set of difference equations and boundary 
conditions. This method has been described for a two- 
point boundary value problem by Henyey et al. The 
suggestion that the method could also be used for eigen- 
value problems was made to one of us by Henyey!! 
many years ago and has in fact been used by Levee to 
solve many problems. = 

The Eqs. (3.6), although linear in the functions F 
and G, are nonlinear in the unknowns because of the 
appearance of the eigenvalue E. They must therefore be 


s solved by an iterative technique such as the Newton- 
Raphson method described in Henyey et al.! 
Formally we may write (3.6) as 
i Qa (Fist GE) =0 ? (3.7) 
Qi (Pil ,Gi1,G;,E)=0, 
0<j<J-1. 
The Newton-Raphson method when applied to a system 
fu (a1,%2,--+,4x)=0, k=1,2,---,K (3.8) 
corrects an approximate set of values xi?, 2”, ---, £g? 
by variations 6x1”, 6x2”, ---, dx? to give improved 
values x,?*1, xP, ---, æg?! where p is the iteration 
number. The variations are determined from the 
equations 
K Of: 3 
fet uim). dil eecyJ& (3.9) 
t=1 Ox, 
and the iteration is continued until the Eqs. (3.8) are 
satisfied to within some desired tolerance. 
If we define the following two vectors 
" ae (3 
+i = » Gir 25] 
à Qaa ôG; 
we may write, in accordance with (3.9), the variational 
equations derived from (3.6) as 
T AmgimtBingt CET Qja-0, 0SjLJ—1, (3.10) 
where 
JH aQ! Iia Ona PQr! 
Fy ôG B oF; 9G; 
Å= H7 5 aj? 
E 00543" COREG 4 90; 90; 
OF 41 OG 531 oF; 9G; 
90; 
ðE 
Cj= a 
90a 
ðE 


? L, G. Henyey, L. Wilets, K. H. Bohm, R. LeLevier, and R. D. 
Levee, Astrophys. J. 129, 628 (1959). — DEN 
u L. G. Henyey (private communication to b 
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Since the matrix of the coefficients of Eqs. (3.10) is of 
the form 


; do. Qr 102 Ga Ua EE PA 
JE:0 oT x 
1 qe X 
2 2 $ c 
UE BR YA re) | 
[ 


we have J equations in J+2 unknowns. The two addi- 

tional conditions which are necessary to solve the equa- 

tions are given by the boundary condition at j=0 and 

the asymptotic condition at jJ. | 
At j—0, Fy is given by either of (3.3) when we have | 

chosen G,— 1 arbitrarily. Then go=0 and the first equa- 

tion of (3.10) gives qı in terms of à£.. Using this relation 

we eliminate q1 from the second of (3.10) to give qz in 

terms of ôE. The elimination process is continued to the 

final equation which gives qy in terms of ôE. These 

equations will be of the form 


qj— ej0ET fi, 
qo=0. 


0<j<J, | 
(3.11) 


Substituting the first of (3.11) into (3.10) we find 
recursion relations for the two-vectors e; and f; as 


i 
e= — Any (Byres tC), . (3.12) | 
fii Asses (Biri fta). | 
Since qo—0, eo= fo=0 and we may determine all the £j, | 
f; from (3.12) for 0x ; € J. 

At j—J we write the asymptotic solution (3.5) in 
difference form as 


GG, eoa, (3.13) 


Taking the variation of (3.13) we have 


ôG y= ers y 
—(rs—1rs)Gs1(d/dE)6E], (3.14) 


ee ee 


where 


dA/dE-— — E/ (1 — Ez)? (3.15) 


from the asymptotic solution. , 
Now from (3.11) and our definition of g; we have 


0G 1— 65-10 E+ f i, * (3.16) 


and 
6G, —e; P E+ fj, d 


where the superscript “2” refers to the lower element of 
the e, f two-vectors. Substituting (3.16) into (3.14) and 


. ma 
eee ee... 


pm 


y 4 J—5 
EV H(ndr=> Cz Hi) Hr 
PAS ; j-5 
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solving for E, we arrive at 


erst) f .,49— f; 9 
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(3.17) 


SE ; 
ey®+[Gs1(rg— rs) (DV dE) — RON 


The iteration process may be summarized as follows: 
(1) From the pth approximation to F;, G;, and E, com- 
pute the elements of Aja, Bj, C; and Qua for 
0<j<J—1. (2) Using (3.12) compute ei, €», ** ; €75 
fn fec fx using e= fo=0. (3) Compute òE by 
(3.17). (4) Compute the corrections dF), 62, ++, Fs, 
5G1, 6Gs, «++, 6Gy from the first of (3.11). (5) Compute 
the p+ 1st approximations by 


Erti=E?-+5E" 
ppe-PFor Plo 


a = - jJ. 
Gpus G+ 6G? «jJ 


This process is repeated until all the unknowns satisfy 
the inequality 
örr /ar t| <T, k=1, D, 2993 K 


and |8E/E| « T1 for x> Ts. For x< T3 we do not make 


a test. 
In addition to satisfying the above tolerance tests, we 


must also have chosen 7; such that the asymptotic 
solution is indeed valid. To check this we compute 
iuam Pye) 


G Gy 36^ 07717072 
and require that 
Ia Fu/Foal «T 


and that 
| Gy1—Gs-)/Gra | «Ti. 


Tf these inequalities are satisfied we are finished. If they 

are not, we advance J by 10 and resolve the equations. 

This procedure i$ repeated until all tests are satisfied. 
In order to normalize our solutions we require that 


mf pro (Fa Gydr- iex] H (r)dr . 
0 0 


Se "This quadrature was performed by Bessell's formula 


2 H(r)dr 


j-0 Bs 


re the C; take on the values 
Co'= 0.32986, Cr=1.32083, C. —0.76667 , 
RE os C4 1.10139, C#=0.98125. 


PM 
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The final values of the wave functions for the bound 


state are given by 
END 


: Jos;s7. 
Gj-— Nrr316G; 


In the results reported here we have taken 73— 1075, 
T2=10-5, T4— 1075. The actual iteration error is much 
smaller since the tests involve comparison with values 
from the preceding iteration. In fact, since Newton's 
method doubles the number of significant figures when 
within the linear range, iteration errors are probably 
less than 107%. 


B. The Continuum-State Wave Equations 
The continuum-state wave equations 


(HH) f-L(dge/ ar) AHOA] ig 
(<—14 ee tL(dfe/dr)+ 0904/0190. 


constitute an initial value problem since e, the energy of 
the ejected electron, is determined by the sum of the 
incident photon energy and the energy of the bound 
state determined in the previous section. 

As in the case of the bound state, it is convenient to 
make a change of variables here also. 


fie, be ke: 
Equations (3.18) then become 


(e+1+- e)f«— Ldi./ drA- L(y-- 9/7 2 J=9 (3 19) 
(1t geet [dfe/dr+Lly—0)/rel=0 


where f, and £, are finite at r=0. 
In a manner similar to the bound state we determine 
at r=0 for a Coulomb potential a/7 


af.(0)— (y+) g.(0) (3.202) 


or 

Thus y2=«?—a? and since Eqs. (3.20) are linear we 
may again choose g,(0) arbitrarily equal to 1. We then 
may determine f,(0) from either of (3.20). In practice 
we use (3.20a) for x» 0 and (3.20b) for «<0 to insure 
the greatest accuracy. 

In order to normalize the continuum solution prop- 
erly, it is necessary to find the asymptotic solution. This 
is done by matching the solutions of (3.18) with ¢ not 
zero to general solutions of (3.18) with e equal to zero; 


Pra 
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and evaluating the asymptotic behavior from the Zero-q 
solutions beyond the range of the potential. 
Let the matching of solutions be done at a point ro. 

Let ??— &— 1. Then 
Jr) =(= 0)/pro]? 

XLA apro) - AJ o (pro) ] 
gro) — [Ge 1)/2ro]? 

XL44J aa(pro) — A-J- (pro) ]- 
These two equations serve to determine the constants 
A. and 4... The asymptotic behavior of f, and g, is then 


determined by the asymptotic form of the Bessel 
functions, 


2(e—1)N!? 1 zy 
so~ ) Ja cos( r=) 
T pr 2 
TK T 
+A_ cos( p+) | j 
Xe 1) 1 Pu 
ZONI » ) "E (p) 
_ mK 
m 


The normalization factor A and the phase shift 6, are 
given by 


(3.21) 


A — (A4?-- A2), 
c0s0,— A4/ A , 
—sinô = (—1)*4 /A. 


(3.23) 


The asymptotic forms of f, and g, are then 


e€—1) I? A KT 
mare 


3.24 
2(e+1) 1/2 4 ‘ ( a) ( ) 
ZON ) P pr "uu 


T 


In using this with a true Coulomb potential, the 
phase shifts 5, will not become constant with increasing 
7, but rather will continue to increase, behaving like 
(ae/p) In2pr+constant. This must be taken into ac- 
count when checking whether the asymptotic form is 
attained. For potentials that die out more rapidly than 
the Coulomb potential, the 5,’s should become constants 
in rz. i 

The numerical solution of either (3.18) or (3.19) is 
straightforward except near 7=0. Because o Ls 
singularity exhibited in (3.18) at r=0 we use (3.19), but 
even here we have a starting problem. : 

Let us write Eqs. (3.19) in the vector notation 


(3.25) 


O= Aw E 
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where c, has components f,, Z and where 

y—k a 

oque) 
r r. ' 
A= "a (3.26) 

a 
(ep m 
E r 


Consider now the finite difference representation of 
(3.25) based on the Euler approximation 


(vj11—07)/ h— A jw, j20, 


(3.27) 
where //—7;,1—7j-a constant, and where we have for 
simplicity dropped the subscript x. Equation (3.27) is 
restricted to 7>0 since the elements of A; are singular 
for 7=0. In practice v; would be determined from wg by 
a series expansion. Equations (3.27) may be written 
Oj = (I-+NA j)o;. 
The first-order error equation will then be 
wjr = (I+hA;)ôwj, (3.28) 


where we have neglected errors in A;. Equation (3.28) 
may be simplified to 


wj = Bjóo;, (3.29) 
with 
h 1 a 
te) (1) 
jn J^" 
B;= (3.30) 
a hr) 
io LET) — 
jh jh 


and where we have replaced r by ro+-jh= jh. 

If individual errors in w; are to remain bounded, we 
must require that the eigenvalue of the matrix B; be less 
than or equal to 1 in absolute value. The eigenvalues of 
(3.30) are determined from 


(y—k) a 
1— : —X (18) 
: 4 


J 
a (y+«) 
ee, 
J 
The argument holds in the limit of 4 — 0 which leads to 
[1— (v-9/j—X][1 — Q(-9/7;—H-2/;—0. (3.31) 


Remembering that y?—4?— a? we solve (3.31) to give 
\=1, 1— (2y/j). Since we require |X| <1° we have 
—1<1—(2y/j)<1. Now y and j are always greater 
than zero and therefore the right-hand inequality is 
always satished. The left-hand inequality will be 
satisfied if jy. The conclusion is that if we want 
errors to decrease in the solution of (3.19) by numerical 
integration, we must begin the integration at a value of 


io 1 
jh 
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j2. y. The maximum value of y in our calculations was 
20 corresponding to k= 4-20 and it is therefore necessary 
to use the starting series described below for the first 20 
steps. Although the above analysis is applied to the 
simple Euler method it leads to the same result where 
applied to the Runge-Kutta method. In fact, it was 
using the Runge-Kutta method for starting that led to 
the observations of the error growth and the analysis." 
It has been pointed out to the authors by J. Scofield? 
that the modified Euler scheme, eua [ZFA z) 
(T—}hA 54) ]o; has eigenvalues \=1, [1—7/ G--3)Y 
[12-y/ (j9-3) ] in the limit as 4 — 0. Therefore errors in 
this scheme will be bounded since y> 0. 

'The power-series solution to (3.19) around the point 
7, is of the form 


"OL cil — Ta)’, i=0, 1, 2, a i 
E= E bilr ra), a= fl; 2, on 


Substituting into (3.19) we arrive at the recursion 
relations 


ralit 2) bier (i+ T)biad- (y -x)bsa 
= Llet Irat a]ci+ı— (e+ 1)c;— 0 P 


ra 2)ciqa-- (i+ Tea (y—x)ci+ı 
+ (e—1)rate loins t (e— 1)5,—0, 


where co=f(rn), bo g(r»). These relations allow the 
determination of the c;, b; for i0 for successive expan- 
sions about rn=0, ra— I, T= 2h, ***, n= jh. Since we 
are always expanding about the previous point, 
r—r,7—h. 

The series solutions (3.32) and similar equations for 
the derivatives of f and g were computed out to 7 =JJ. 
The series at each j were terminated when the percent- 
age error in f’ and g’ was less than one part in 105. Using 
the functions and derivatives thus determined, the 
numerical solution of (3.32) was then continued by the 
Runge-Kutta method. In the case of a general potential 
where the series solution is not easily arrived at, we plan 
using the modified Euler scheme for starting the 
integration. 

At this point in j we switched back to using f and g 
instead of f and £ so that our numbers would not become 
too small. In addition it was necessary to add a scale 
factor, SF, in both f and g in certain cases to keep 
numbers within machine limits. Using the values 


frs= fas(SF rss) 
gra Ka (SF y)? 
far SISE rI) 7H Q0— Dfaz/rzz 
gaz — gy (SF-rgs) + Q— T)grs/rrz 


(3.32) 


the numerical integration of (3.32) was continued by the 
standard Runge-Kutta method. 


2 This difficulty had been observed and resolved previously by 


‘Levee in a problem in controlled thermonuclear reactions. 
13 J, Scofield (private communication). 
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Although our analysis in this section indicates that 
we should take JJ=20, we actually used JJ — 10. This 
allowed small errors to appear in the normalization 
factors and wave functions for K> 10. These errors were 
of the order of 4% in the normalization factor at x— 16. 
However, the high values of x contributed very little to 
the cross sections and the cross sections show no error 
due to using JJ — 10 instead of 20. 

We require f and g to be normalizéd such that 


—1\ 1 lr 
j= (=) = cos{ pr—— + ;) 
2e pr 2 


Se ( um 
el ——\) Stel | i : 
à i pr NS 2 


From (3.23) and (3.33) we then have the following rela- 
tion for the normalization factor Ni: Ny=2A (ev)? 
where A is determined from (3.21) and the first of (3.23). 

The phase 6 must now be determined. For a potential 
which falls off more rapidly than 1/r the phase will be 
given by (3.23) as s=arctan|A_/A,|, but for the pure 
Coulomb potential s=arctan| A_/A+|— (ae/p) In2pr. 

In the program reported here we have computed A 
at each step of the integration and required that 


(3.33) 


| Ajz10—Aj 
2 

Aj Aj 
It turned out that for the very small T4 chosen, the 
integration was always carried out to the predetermined 


maximum at r—120. At this point the normalizations 
and phases were determined. 


lar. 


IV. ACCURACY OF THE NUMERICAL CALCULATION 


We have seen that we will have all the information 
needed to specify the properties of S-state photoeffect 
when we have calculated the quantities R,* given by 
Eq. (2.20). We have not discussed how many of these 
quantities we need to calculate, i.e., the convergence 
properties of the series in Eqs. (2.24) and (2.27). We 
follow usual practice, and will judge the convergence by 
the size of successive R=. Practical considerations force 
us to choose a limit for |x| in advance, since the methods 
employed for calculating functions and organizing the 
numerical program for machine purposes depend on 
this choice; in the present paper we consider |x| <20. 

Then first of all the calculation of a given R, requires 
bound-state radial functions F,, G,, continuum radial 
functions f, gx, and spherical Bessel functions jalkr). 
Also, in calculating angular distributions we will need 
the associated Legendre functions P;"(cos0), for m=0, 
1, 2. The accuracy of the calculations of these components 
of the general program are discussed in separate sub- 
sections. We can then estimate the accuracy with which 
each R, has been computed, and the accuracy this 


implies for the cross sections. And we must also judge 


i 


i 
1 
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the error which has been made due to the restriction 
|x| <20. 

Checks for the component programs will be discussed 
separately. A number of checks are available for the final 
results: We can compare with analytic results for low Z 
and we can compare with numerical results which have 
previously been obtained for photoelectric cross sections. 
These comparisons are made in the subsequent sections 
where we present our results. In all cases the proper 
agreement was obtained, so that we have full confidence 
in our numerical program. It should perhaps also be 
mentioned that with the present program some ten 
minutes of IBM-7090 time were required to obtain the 
differential and total cross sections and polarization 
correlations for a given choice of element and energy; it 
is expected that further improvements in the program 
will appreciably reduce this figure. 


A. Component Programs 
i. Bound-State Wave Functions 


'The accuracy of the bound-state calculation can be 
tested for the pure Coulomb potential by comparison 
with analytic results. It is particularly simple to check 
the eigenvalue and the normalization of a calculated 
wave function. The energy levels are obtained with 
considerable accuracy. However, the normalization 
coefficient, i.e., the constant which multiplies the known 
small r dependence, probably gives a better indication 
of the over-all accuracy of the wave function. For given 
hzri-—ri,theerror in mormalization increases with Z. 
With h=0.05, the smallest value used, the error is 
negligible for Z=26 or 50 but 0.4% for Z=82 and 
0.6% for Z=92. To obtain 1% accuracy in the cross 
sections for heavy elements we have taken results for 
h=0.1, 0.075, and 0.05 and extrapolated to h=0. [ Note: 
It has since been established that these small errors 
result from a minor error in programming; once this is 
corrected, the error in bound-state normalizations be- 
comes completely negligible. ] 


di. Continuum-State Wave Functions 


The two parameters of a continuum-state solution 
which are easily compared with the analytical results of 
the pure Coulomb potential are normalization and 


Taste I. Accuracy of continuum normalizations. 


E +1 +5 +9 +15 
EVY w o Co GO 

0.01 0.02 0.4 40 

^k TR 0.01 0.03 04 A 
i 0.5 0.7 1.0 5 
aoe ae 0.1 0.2 0.5, 45 
0.600 0.07 0.10 0.45 44 

1.131 0.03 0.05 04 42 

15 0.02 0.03 0.3 40 

2.0 0.2 0.2 04 42 


Taste IT. Accuracy of phase shifts; errors in radians and percent. 


Energy x 
(MeV). + +5 +9 +15 


Low Z 0.354 0.000412) 0.000410.1 55) 0.0005(0.1 05) 0.0000 (0.1 7) 
1.131 — 0.0015(10^7) 0.0016(0.547) 0.0014(0.3 5) 0.001440.2 55) 


High Z 0.200 9,0035() ^ 0.0051(0.37; 0.0001(0.0%) 0.0021 (0,1 9%) 
0.554 — 0.0030(1.5^7) 0.0005(0.047) 0.601910.1 Z) 0.0034/0.2 %) 
0.600 — 0.0007(0.2*7; 0.0001(0.0^;; 0.0006(0.1 7; 0.0012(0.1 65) 
1.131  0.0011(0.3 56), 0.0009 (0.175) 0.0011(0.1 57) 0.0014(0.1 Fo) 


phase. The accuracy of the normalizations calculated 
with h=0.05 for some typical partial waves in low-Z 
(26) and high-Z (82 and 84) elements are shown for 
various energies in Table I. The errors in the higher 
partial waves are nearly independent of energy (away 
from threshold) and Z, but increase rapidly with x. 
These arise from the use of the same switching point 
JJ —10 from power series to numerical integration for 
all «. Except at the highest energy (2 MeV), the errors 
are not sensitive to choices of + in the range from 0.05 
to 0.1. The error in normalization also increases rapidly. 
as the threshold energy is approached. This type of 
error is insensitive to 4 and may be connected with the 
very long periods of low-energy continuum states. In 
Table II we give the analogous errors in phase shifts, 
expressed both in radians and in percent. 


id. Spherical Bessel Functions 


The spherical Bessel functions j;, 0</<20, were 
computed by the method of Corbató and Uretsky™ and 
were checked against the NBS Tables! for selected 
values of J. The errors were less than one part in 105. 


iv. Legendre Polynomials 


The Legendre and associated Legendre functions 
were computed in double precision arithmetic directly 
from the polynomials given by Tallquist.1 These were 
compared with double precision values computed from 
the recursion relations and found to agree to at least one 
part in 10. Rough comparisons can be made to the 
tables of Mursi” and NBS.!5 


B. The Main Program 
i. Partial-Wave Integrals R, 


The errors in an R, are due first to the errors of the 
component programs and second to the errors of the 


M F, J. Corbató and J. L. Uretsky, J. Assoc. Comp. Mach. 6, 366 
(1959). 

15 National Bureau of Standards, Tables of Spherical Bessel 
Functions (Columbia University Press, New York, 1947), Vols. I 
and II. 

16 H, J. Tallquist, Acta Soc. Sci. Fennicze, Nova Series A, Tome 
II, No. 4 (1936) ; Tome II, No. 11 (1938). - 

uZ. Mursi, Tables of Legendre Associated Functions (E. and 
R. Schindler, Cairo, 1941). 

18 National Bureau of Standards, Tables of Associated Legendre 
Functions (Columbia University Press, New York, 1945). 
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TABLE III. Regions contributin 


g to partial-wave integrals. 


Energy \« 

Z (MeV) +1 +5 +9 +15 

26 0.354 84 80 108 (does not contribute) 
1.131 58.5 63 18.5 89.5 

50 0.354 41 45 50 (does not contribute) 

92 0.208 19 28.5 35 (does not contribute) 

S4 0.354 20 26 315 (does not contribute) 
1.131 22 25 27 $ 

82 1.1368 Dum. YX) 21 28.5 
2 ds A 26 27 
2.154 DOS EEDAS 25 25.5 


integration procedures. The errors in integration arise 
from the choice of /; and rmax, the upper limit of the 
integration. 

Table III summarizes, for representative parameters, 
the distance in r which it was necessary to integrate to 
hopefully reduce the residual contribution to less than 
one part in 106. The needed distance increases slowly 
with increasing x; in heavy elements it is nearly inde- 
pendent of energy and is determined by the decreasing 
exponential of the bound-state wave function. Since this 
exponential scales as Z the required distance greatly 
increases as Z decreases. In light elements this has the 
consequence that an energy dependence persists in the 
energy range of concern, and the rapid oscillations of a 
high-energy continuum-state work to reduce the needed 
distance.! Since such a stringent criterion was applied in 
the table, it is possible to obtain satisfactory results from 
the present calculation for a Z as low as 10 or 13, even 
with the restriction 7max<120 required in the machine 
program. 

The error in integration arising from the choice of Z 
can be estimated by noting the change in the values of 
R, resulting from changes in /: and subtracting out the 
portion of this which is due to change in component 
programs with %4. As already noted, up to 1.5 MeV, the 
only component program which is sensitive to choices 
of hin the range 0.05-0.10 is the bound-state wave func- 


Taste IV. Number of x's needed for accuracy to one part in 10”. 


E\n 1 2 3 4 5 6 
0.140 2 3 4 4 5 5 
0.200 3 4 5 6 7 8 
0.279 3 4 6 7 8 9 
0.354 4 5 7 © © wD 
0.400 Ax. 18 8 $ d. K 
0.662 5 3 1» "9 d" 

1.131 1 1⁄2 08 
1.1368 8 13 16 

2.0 x 11000 17 
2,754 14 


Tt should be noted that the distance required for very high 
curacy has a different dependence on parameters from the dis- 
hich characterizes the main region of the integrand. It is 
distance which is discussed in the various analytic 
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tion. When the estimated error from this source is 
subtracted, the residual error to be attributed to error 
from / in integration is generally small and, with one 
exception," completely independent of x. The typical 
error begins to increase at higher energies: the difference 
between R, of h=0.05 and 0.10, still negligible at 662 
keV, is about 0.4% at 1.131 MeV, both for Z=26 and 
82, and 1.5% at 1.5 MeV. 


ii. Number of x’s Contributing to Cross Seclion 


The number of «’s needed to determine the total cross 
section to any desired accuracy increases with energy, 
but it is almost independent of Z. One or two more x's 
are needed to obtain the same accuracy for Z— 26 as 
for Z=84. In Table IV we summarize as a function of 
energy (for heavy elements) the number of x’s needed to 
obtain an accuracy of one part in 10". The restriction 
to |x| <20 results ina limitation to energies below about 
2 MeV if it is desired to obtain the total cross section 


to 1%. 


TABLE V. Estimates of total error in numerical calculation. 


Error in ctotal 
Energy o. 
0.354 0.2 
1.131 0.6 


Low Z 


High Z 0.200 1.1 
0.354 0.5 
0.600 0.5 
1.131 0.8 
1.368 0.8 
2.00 2.0 


iii. Estimate of Total Error in Cross Sections 


We assume that a fit is made to the results for differ- 
ent % and extrapolated to /,—0, and we assume that, in 
agreement with our analysis, this renders errors associ- 
ated with % (bound-state normalization, integration, 
etc.) negligible, i.e. 0.1%. Then the error in R, is 
dominated by the error in the continuum wave function. 
From this estimate of the error in each R,, and from the 
information just presented on the contribution of each k 
to the cross section, we can get an estimate of the total 
error of the cross section. Some rough estimates of this 
kind are summarized in Table V. The main conclusion 
is that the present calculation obtains total cross sec- 
tions accurate to 0.8% in the energy range from 300 to 
1400 keV; the accuracy decreases rapidly toward 
threshold and at high energies. 


V. TOTAL CROSS SECTIONS 


Theoretical predictions for the total K-shell photo- 
electric-effect cross section have been based on four 


2% The exception is for x=1, for which R, ch s 
0.5% between k=0.05 and 0.10. which Re chanics ee 
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TABLE VI. Total cross sections for Z= 13, 26, 50, 84, in barns. 


Z 13 26 50 84 


DN This This Hulme This Hulme This Hulme 
(MeV)\. work work etal. work etal. work etal. 


0.3543 0.01543 0.3885 0.39 7.06 7.1 60.6 60.2 
0.662 0.00289 0.0754 --- 1455 = 13.87 --- 
1.131 — 0.00091 0.0237 0.023 0.462 0.46 4.54 4.61 
[5 0.00054 0.0140 --- 0.271 --- 2.65 --: 
2.0 0.00033 0.0086 --- 0.164 --- 1.57 


main calculations: (1) The exact nonrelativistic solution 
of the problem, usually associated with Stobbe,” 
(2) Sauter’s relativistic calculation,® valid to lowest 
order in Za/B, (3) numerical calculations of Hulme 
et al.” for a few selected energies and elements, (4) the 
high-energy limit of the relativistic problem for arbi- 
trary Z as given by Hall.” Extrapolation procedures, 
based on these four results, can be used to estimate the 
cross section for arbitrary Z and energy; tables of these 
predictions have been given by Grodstein*% (NBS 
tables). Experimental results have generally yielded 
satisfactory agreement with these tables. 

Since 1955 there has been substantial improvement in 
all four of these calculations, and one of them has 
actually been found to contain an error. Namely, 
although Hall's expression for the high-energy limit in 
the form of a double integral is correct, the analytic 
formula with which he approximated the integral is not, 
and overestimates the cross section by a factor of two in 
heavy elements. The exact high-energy limit was ob- 
tained numerically by Pratt’? and later confirmed by 
Hall; as already noted, Eq. (2.7) gives a fairly good 


TABLE VII. Total cross sections for Z —82, 92, in barns. 


2-82 Z=92 


E — This E This ] 
(MeV) work HNO NBS (MeV) wok HNO NBS 


0.200 239 241 236 0.132 1026 — 1100 
0.300 84.0 836 802 0.140 --- 887 942 
0.400 40.7 40.5 404 0208 319 324 33 
0.500 23.7 237 231 0.279 155 154 155 
0.600 156 155 153 0412 599 595 598 
0.662 12.5 124 0.662 204 202 207 
0.900 — 649 668 0900 107 --- 112 
1.31 — 4.10 437 1131 678 -- 7.38 
1.332 2.99 324 1332 493 --- 5.5 
1.5 2.39 265 5 mes er 45 
2.0 142 ig 9 —— 285 2.9 


2 M. Stobbe, Ann. Physik 7, 661 (1930). i 
2H. R. Halme jk McDougall, R. A. Buckingham, and R. H. 
Fowler, Proc. Roy. Soc. (London) A149, 131 (1935). 

2 H. Hall, Rev. Mod. Phys. 8, 358 (1936). edes 
*iG. W. Grodstein, U. S. Department of Commerce, M al son 
Bureau of Standards Circular 583 (U. S- Coveney Punt ing 
Office, Washington, D. Cs DAE also R. T. McGi 3 
NBS Supplement to Circular 5 : nies : 
= H. Mall, University of California, Lawrence Radiation Labo 


ratory Report UCRL 5947-T (unpublished). 
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„ Taste VIIT. Comparison of total cross sections (in barns) for 
£ —82, from the NBS tables, an extrapolation of Pratt, and the 
present calculation. 


fh vE: Extrapo- Present 
(MeV) NBS lation calculation 
1.131 437. PY . 4j 
1.332 3.24 3.03 2.99 
1.5 2.65 2.49 2.39 
2.0 1.8 1.56 1.42 


analytic representation. It is then at first hard to under- 
stand why the extrapolation between Sauter and Hall is 
in good agreement with Hulme. The work of Gavrila? 
and Nagel," who extended Sauter’s result to next order 
in (Za/8), explains the puzzle. Hall’s error essentially 
involved omission of the factor (—4za/15) in EquQ- zy 
but the energy dependence of this term is such that it 
is large only at very high energies. [ Recently, a compli- 
cated analytical expression for still another order in 
Za/B has been given by Gorshkov and Mikhailov.5] 

K-shell photoeffect at threshold has been calculated 
for the relativistic problem, both analytically and 
numerically, by Nagel and Olsson.’ In the low-energy 
region screening effects are expected to be important, 
however, and this has recently been examined in the 
nonrelativistic problem by Cooper.” Since, as already 
noted, the numerical techniques of the present paper 
require modification at low energies, we will not discuss 
this energy region further. 

The numerical calculations of Hulme et al. provided 
the reference points which Sauter-Stobbe and Sauter- 
Hall extrapolations are forced to fit. The accuracy of 
Grodstein’s predictions for intermediate energies rests 
almost entirely on the accuracy of these numerical re- 
sults obtained for 0.3543 and 1.131 MeV, estimated at 
4% for heavy elements and 8% for lighter elements. 
Since this is the main energy region of experimental 
interest, it Is important to verify and extend these 


TABLE IX. Comparison with experimental results (see Refs. 
32-34) for K-shell total cross sections (in barns). 


Energy Experimental Present 


Experimenter ^ Element (keV) result theory 
Seeman Pb 511 23.4 +0.7 22.5 
Missoni Au 662 10.2 «0.3 10.5 
Hultberg and Ü 1173 7.2 30.5 6.32 

Stockendal 
Hultberg and U 1332. 54 +0.3 4.93 

Stockendal i 
Bleeker, Goudsmit Pb 1332 3.24+0.13 2.99 

and De Vries 


* 


26 M. Gavrila, Phys. Rev. 113, 514 (1959). 

27 B. Nagel, Arkiv Fysik 18, 1 (1960). . 

233 V. G. Gorshkov and A. T. Mikhailov, Zh. Eksperim. i Teor. Fiz. 
43, 991 (1962) [English transl.: Soviet Phys.—JETP 16, 701 
(1963) ]. 

2 J. W. Cooper, Phys. Rev. 128, 681 (1962). 
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"TABLE X. K-shell angular distributions for Z=1 
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3, 26, 50, 84 in barns/steradian (when multiplied by indicated scale factor and by 1490). 


Z 13 26 50 84 
E(kV) 354  — 602 354 662 354 662 1131 354 662 1131 
O\Scale 10° 10° 109 10° 107 10-3 1074 Wir ho. — dr) 
0 0.0028 0.0155 0.0033 04 0.79 0.96 
5 1.0 0.5 0.19 14 0280 0.22 2.0 014 191 198 
10 2.8 14 0.62 3.6 0.92 0.59 4.3 043 3.93 340 
15 4.1 1.7 * 104 44 1.58 0.79 4.3 078 520 3.40 
20 4.9 1.6 1.27 4.1 2.00 0.76 2.8 106 550 247 
25 5.3 1.3 1.28 33 2.10 0.598 1.6 123 4.99 1.62 
30 4.9 0.9 1.15 2.3 1.95 0435 — 102 126 398 107 
35 3.9 0.58 0.95 1.50 1.67 0.306 0.67 120 298 0.71 
40 2.9 0.37 0.746 — .1.00 1.36 0211 0.42 108 222 OAT 
45 2.2 0.26 0.567 071 1.07 0.144 — 027 0.93 164 0.33 
50 1.65 0.17 0422 0.48 0820 0.098 0.18 078 119 0.23 
55 120 0.11 0.311 0.30 0.621 0.069 0.13 0.64 0.89 0.168 
60 0.86 0.08 0.228 — 021 0468 0.049 0.005 0.31 0.66 0.125 
65 0.63 0.06 0.168 0.16 0.351 0.035 0.069 041 049 0.007 
70 046 0.04 0.123 — 0.116 0264 0.026 0.051 0.33 0.37 0.076 
75 0.34 0.028 0.091 0.077 0.199 0.019 0.039 0.26 0.29 0.058 
80 0.06 0.020 0.067 0.056 0.150 0.0142 0.031 021 023 0.046 
85 0.19 0.016 0.050 0.113 0.0100 — 0.024 017 0.17 0.088 
90 0.140 — 0.012 0.037 0.086 0.0083 0.019 013 O14 003 
95 0.101 0.028 0.066 0.0064 
100 0.076 0.021 0.051 0.0050 
105 0.057 0.0158 0.039 0.0040 
110 0.042 0.0120 0.030 0.0032 
115 0.033 0.0091 0.024 0.0026 
120 0.027 0.0069 0.019 0.0020 
125 0.020 0.0052 0.015 0.0017 
Z 13 26 50 84 
E(keV) 1131 1500 — 2000 1131 1500 2000 1500 — 2000 1500 2000 
QN Scale dio — ro - ee 10- 10% 10° 107 — 10-4 103 —— 10 
0 0.06 0.05 0.034 0.4 0.36 1.0 1.0 
5 042 04 0.3 1.0 1.0 0.9 2.0 1.8 2.0 1.9 
10 095 0.79 0.6 24 2.0 1.6 3.7 3.0 3.0 2.6 
15 090 0.63 0.43 236 1.60 1.04 2.9 1.87 243 16 
20 0.58 0.31 0.16 150 080 0.41 1.5 0.76 141 — 072 
25 0.33 0.15 0.08 083 039 0.19 07320998037, 0.81 0.39 
30 0.195 0.09 0.05 0.50 0.24 0.126 049 0.24 0.51 0.24 
35 0.119 0.061 0.030 033 0.17 0.079 0325015 0.32 0.146 
40 0.075 0.034 0.015 0.205 0.091 0.040 0.19 0.086 0.21 0.099 
45 0.048 0.019 0.100 0.122 0.053 — 0.024 0.119 — 0.056 015 0071 
50 0.031 0.014 0.008 0.081 0.014 0.085 — 0.043 0.105 — 0.047 
55 0.000 0.011 0.060 0.064 0.030 0.075 0.034 
60 0.015 0.007 0.045 — 0.021 0.058 0.030 
GS 0012 0.032 0.015 0.047 0.023 
70 0.008 0.055 0.012 0.036 — 0.016 


calculated points, and this is the purpose of the present 
section. 

While the present work was in progress a new 
numerical calculation was reported by Hultberg, Nagel, 
and Olsson (HNO).? Like the work of Hulme eż al., the 
method used applies only to the pure Coulomb poten- 
tial; HNO treated two heavy elements in the energy 
range 0.120-0.662 MeV. Moderately good agreement 
was obtained with the values derived from the NBS 
tables, confirming the low-energy high-Z point of 
Hulme et al. 


—— : 
3 Tt should also be noted that in principle, another series of 
hotoeffect total cross sections is available in the numerical calcu- 
tions on the coherent scattering of photons from K-shell electrons 
ried out by Brown and collaborators: G. E. Brown, R. E. 
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The K -shell total cross sections obtained in the 
present numerical calculations are summarized in Tables 
VI and VII. The Z’s of Table VI corresponds to those of 


Peierls, and J. B. Woodward, Proc. Roy. Soc. (London A227, 51 
(1954); S. Brenner, G. E. Brown, and TB. UE Ms A227, 
59 (1954); G. E. Brown and D. F. Mayers, tbid. A234, 387 (1955); 
A242, 89 (1957). As the authors note, the photoeffect cross sections 
can be obtained from the imaginary part of the scattering ampli- 
tude in the forward direction for the coherent process, and results 
are presented for Hg at photon energies of 0.32, 0.64, 1.28, and 
2.56 (allin units of mc). A similar calculation was later reported by 
H. Cornille and M. Chapdelaine, Nuovo Cimento 14, 1386 (1959) 
for photons of energy 5.12 mc? on Hg. Unfortunately, results of 
these papers pertaining to the photoeffect are not presented as 
cross sections. We do not wish to now enter into a detailed dis- 
cussion, but if our interpretation of these results is correct they are 
in fair accord with our numerical work and the highest energy 
results show the suppression from previous values reported here. 
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TABLE XI. K-shell angular distributions for Z=82, 92 in barns/steradian (when multiplied by indicated scale factor and by 1490). 


Z 
Se (keV) 200 z e: 
us ) 300 400 500 600 662 208 279 412 662 
cale 107 10 107? 107 102 107? 107? 107 107? 10? 
0 0.0006 0.0289 0.0414 — 0.523 0.577 0.00030 
5 0.132 0.2 0.135 0.143 1.51 1.55 0.122 039. os 0:228 
10 0.499 0.5 0.397 0.376 3.63 3.56 0.47 0.479 0484 0.489 
15 1.02 0.8 0.688 0.600 5.33 4.97 0.98 ° 0.94 0.85 0.690 
20 1.61 11 0.900 0.717 5.82 5.16 1.57 142 1.13 0.740 
25 2.16 1.36 0.987 0.717 5.34 4.50 2.16 1.83 127 0.673 | 
30 2.59 1.53 0.964 0.642 443 3.58 2.69 2.10 128 0.577 
35 2.88 1.54 0.871 0.536 347 2.71 3.10 2.23 119 0,437 | 
40 3.02 144 0.746 0.428 2.63 2.00 3.37 2.23 105 0,332 | 
45 3.02 1.28 0.616 0.333 1.96 146 3.50 2.12 089 0250 
50 2.91 1.10 0.497 0.255 145 1.06 3.50 1.96 0.4 0.187 | 
55 2.72 0.93 0.395 0.194 1.07 0.78 3.40 1.76 0.60 0.140 
60 2.48 0.78 0.311 0.148 0.80 0.576 3.22 1.54 049 04106 
65 2.22 0.64 0.244 0.112 0.60 0.431 2.98 1.33 0.39 0.080 | 
70 1.96 0.53 0.190 0.086 0455 0.327 2.72 1.14 0.310 — 0.062 
75 171 0.43 0.149 0.066 0.348 — 0.250 2.45 0.96 0.247 — 0.048 
80 1.47 0.35 0.117 0.051 0.269 — 0.193 2.18 0.81 0.197 — 0.038 | 
85 1.26 0.28 0.092 0.040 0.200 0.151 1.93 0.68 0.158 0.030 | 
90 1.07 0.23 0.073 0.031 0.166 — 0.120 1.69 0.57 0.127 — 0.024 | 
95 0.91 0.183 0.058 0.025 0.133 — 0.007 147 0.47 0.103 — 0.0196 
100 0.766 0.148 — 0.047 0.0000 0.108 0.079 1.27 0.39 0.083 0.0161 
105 0.644 0.121 0.038 0.0163 0.088 0.064 1.10 0.33 0.068 0.0134 
110 0.54 0.101 0.031 0.0134 0.073 — 0.054 0.94 0.27 0.057 0.0113 
115 0.45 0.083 0.026 0.0111 0.062 0.046 0.81 0.230 0.043 0.0098 
120 0.38 0.068 0.021 0.0005 — 0.053 0.039 0.60 0.195 0.041 0.0085 | 
125 0.31 0.056 0.018 0.0082 0.046 0.034 0.59 0.166 0.036 0.0075 
Z 82 92 | 
NE 900 1131 1332 1500 2000 900 1131 1332 1500 2000 | 
6\ Scale 10-3 10-3 10-3 10-3 10-3 10-3 10-3 10-2 10-3 10-3 
0 0.75 0.84 0.90 0.90 0.87 1.37 1.56 1.68 1.7 1.68 
5 171 1.79 1.84 1.8 1.75 2.62 2.82 2.9 2.9 2.9 
10 3.34 3.12 2.94 2.76 23 4.81 4.63 442 42 3.6 
15 3.89 3.11 2.57 2.22 14 5.68 4.67 3.93 34 2.25 
20 3.32 2.24 1.63 1.27 0.65 5.04 3.51 2.59 2.04 1.06 
25 2.46 1.46 0.98 0.73 0.35 3.88 2.39 1.63 1.22 0.59 
30 1.73 0:96 0.62 0.46 0.22 2.83 1.61 1.05 0.77 0.36 
35 1.20 0.64 0402 0.290 0.131 2.01 1.08 0.68 0.49 0.217 
40 0.82 0.423 0.264 0.189 0.088 142 0.74 0.46 0.33 0.153 
45 0.57 0.291 0.183 0.133 0.063 1.01 0.52 0.33 0.280 0111 
50 0.41 0.207 0.129 0.093 0.042 0.73 0.37 0.231 0166 0.072 
55 0.294 0.148 0.092 0.066 0.030 0.54 0.27 0.166 0.118 0.053 
60 0.216 0.110 0.070 0.051 0.026 0.40 0.202 0.129 0.093 0.048 
65 0164 0.085 0.056 0.042 0.020 0.31 0.100 0.104 0.077 0.037 
70 0126 0.066 0.042 0.031 0.014 0.24 0.125 0.080 0.059 0.025 
75 0.096 0.051 0.185 0.096 0.061 
or 0.075 0.040 0.146 0.077 0.051 
Re 0.061 0.033 0.119 0.065 0.044 
53 0.050 0.028 0.098 0.054 0.035 


Hulme et al. and those of Table VII to HNO; in each 
case the appropriate comparisons with earlier work are 
made. The first striking feature is the excellent agree- 
ment, far better than the accuracy which had been 
claimed, with the calculations of Hulme ef al. This means 
that interpolations, such as Grodstein’s, based on the 
Hulme values would be expected to be good and need 
no major revision in this energy range (however, see 
below). The errors which remain are of the same mag- 
nitude as screening effects; such effects must be included 
before the change in the cross section from the Hulme 
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values is significant. Agreement with the HNO results 
is also excellent, and within the estimated error of the 
present calculation. The first result of this section is 
then the establishment of the accuracy of the values of 
Hulme eż al. 

In Table VII we have also given the corresponding 
cross section as obtained in the NBS tables. The com- 


31 Since the NBS values are for total absorption cross sections, 
the contribution from higher shells is removed by dividing o the 

simple multiplicative factor which Grodstein used to put Bm 
Another smaller correction is also needed to obtain a K-shell c 
section without screening. 
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¥ic. 1. K-shell angular distributions for U in barns/steradian. 
The small cross sections in the shaded region of the graph are of 
low accuracy. 


parison is rather surprising; even in the 1-MeV region, 
the present results are 5-10% lower than corresponding 
interpolations from the NBS tables. It is hard to under- 
stand this, since the present results agree with Hulme 
et al. and the NBS tables are based on Hulme et al. By 
2 MeV, the difference has reached 25%. Further proof 
that this difference is real is provided by a new calcula- 
tion? which HNO have undertaken to check our results; 
it appears that they have obtained complete agreement 
with our cross section for Z—92 at 1.332 MeV. 

A comparison with the extrapolation formula of 
Pratt? is given in Table VIII for Z=82. This formula, 
which is exact at very high energies and exact for light 
elements at all energies, is in good agreement with the 
Hulme values and so also with the present calculation in 
the 1-MeV range. However Table VIII suggests that 
the very close agreement at 1 MeV is misleading, since 
for the higher energy of 2 MeV the two differ by 10%. 

The recent experimental measurements? of the 
total cross section from the K shell are given in 
Table IX. These resu:ts were all reported as in good 


2S, Hultberg (private communication). We should like to 

thank Dr. Kultberg for undertaking this calculation and for 
communicating its results prior to publication. 

aK. W. Seeman, Bull. Am. Phys. Soc. 1, 198 (1956). 
31S. Hultberg and R. Stockendal, Arkiv Fysik 15, 355 (1959). 
/35 E. J. Bleeker, P. F, A. Goudsmit, and C. DeVries, Nucl. Phys. 
2 (1962). 
Missoni (to be published). 

should also note the measurement by Bleeker, Goudsmit, 
eVries of the ratio c (2.154) /o (1.368) in Pb. A similar ratio 
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accord with the NBS tables, which again says that the 
experimental results above 1 MeV are higher than the 
theoretical values of the present calculation. 

The second result of this section is then the finding of 
significantly lower cross sections in the energy region 
above 1 MeV than are given in the NBS tables or in two 
recent experiments. We have previously outlined the 
evidence for the accuracy of the present calculations. If 
these arguments are correct we see no theoretical 
explanation for the discrepancy. It is true that screening 
effects will modify the pure Coulomb calculations, but 
for the K shell such corrections are expected to be only 
1-295, and in the wrong direction. 


TABLE XII. Comparison of angular distributions with HNO 
results, normalized to agree with this work at the underlined 
angles. 


E (keV) 208 279 412 662 
This This This This 
0 work HNO work HNO work HNO work HNO 
0 0.01 0.05 0.05 0.10 0.13 
5 012 0.13 0.14 0.17 0.18 0.18 0.23 0.27 
10 0.47 0.53 0.48 0.51 0.48 0.48 0.49 0.49 
15 0.98 0.97 0.94 0.94 0.85 0.77 0.69 0.68 
w aa a A 1:381 — 1:19 01:14 0.74 0.74 
25 DG OM WEB SS. EXE ICH 0.67 0.67 
30. 2.69 2.73 2410 2.10 1.28 1.28 0.56 0.56 
33 30 G93 Bes 22 19 1.18 0.44 0.44 
4p aw SO 929 9? 1.05 1.04 0.33 0.33 
AR 350 ac 240 9050 OS) OS Os 0.25 
50 3:50) 3.50 1-96 195 0.74 0.73 0.19 0.19 
55 340 3.43 1.76 1.76 0.60 0.61 0.14 0.14 
60 3.22 323 1.54 1.53 0.49 049 0.11 0.11 
65 2.98 3.00 1.33 1.34 0.39 0.39 0.08 0.08 
Hn» 240 P? 1.14 1.13 031 0.30 0.06 0.06 
75 245 247 0.96 0.96 0.25 0.24 0.05 0.05 
80 2.18 2.20 0.81 0.81 0.20 0.20 0.04 0.04 
85 1.93 1.93 0.68 0.68 0.16 0.16 0.03 0.03 
90 1.69 1.70 0.57 0.57 0.13 0.12 
95 147 147 0.47 047 0.10 0.10 
100 1.27 1.27 0.39 0.40 0.08 0.09 
105 1.10 1.10 0.33 0.34 0.07 0.07 
110 0.94 0.93 0.27 0.28 0.06 0.06 
115 0.81 0.80 0.23 0.23 0.05 0.05 
120 0.69 0.67 0.20 0.19 
125 0.59 0.50 0.17 0.17 


was obtained earlier by G. R. Bishop, C. H. Collie, H. Halban, 
A. Hedgran, K. Siegbahn, S. duToit and R. Wilson, Phys. Rev. 80, 
211 (1950); an early result for c(2.62) in Pb is given by G. D. 
Latyshev, Rev. Mod. Phys. 19, 132 (1947). See also the total 
photoelectric absorption cross-section measurements of W. F. 
Titus, Phys. Rev. 115, 351 (1959), at 662 keV in a series of 
elements, and of B. I. Deutch and F. R. Metzger, ibid. 122, 848 
(1961) at 279 keV in TI. Particularly interesting for this paper are 
total absorption measurements at 2.62 MeV in various elements 
by I. E. Dayton, Phys. Rev. 89, 544 (1953) and W. F. Titus (to be 
published), which lie appreciably below the earlier theories. 
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TaRLe XII. Ratio of experiment to theory for angular dis- 
tributions in U, both distributions normalized to unity at the 
underlined angles. 


Experimenter —Sujkowski Hultberg 

Angle\ Energy (keV) 279 412 662 1332 
0° 6.2 2.2 0.89 0.71 

15° 0.95 0.84 1.00 1.00 

30° 1.00 1.00 1.04 1.28 

45? 1.05 1.10 1.32 0.87 

60° 0.94 1.33 146 0.91 

1S 0.92 1.58 1.68 0.67 

90° 0.98 1.90 1.69 0.00 
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VI. ANGULAR DISTRIBUTIONS 


Until quite recently the only theoretical result for the 
angular distribution of relativistic K-shell photoeffect 
was that of Sauter, Eq. (2.5), and there was no experi- 
mental information. When corrections of relative O(a) 
were computed by Banerjee,** Gavrila,” and Nagel? it 
was found that they did not significantly change the 
predicted angular distributions, including the prediction 
of a vanishing cross section in the forward direction. On 
the other hand, the experiments of Hultberg?-* indi- 
cated some striking deviations from the Sauter distribu- 
tion: A nonvanishing forward cross section, a shifting 
of the maximum in the cross section toward larger 
angles, and a tendency of the cross section to hold up at 
large angles. We shall return to the theoretical discus- 
sion of these matters following the presentation of our 
numerical calculations and their comparison with HNO 
and experiment. 

The K-shell angular distributions obtained in the 
present numerical calculations are summarized in 
Tables X and XI. We present these results in some 
detail, despite their limited accuracy, since they do dis- 
play the significant deviations from the Sauter distribu- 
tion and so provide more nearly correct predictions. The 
very small large-angle cross sections are not tabulated. 
For qualitative purposes we also show in Fig. 1 the 
Z-— 92 distributions for several energies. Forward scat- 
tering increases with increasing energy, the maximum 
moves in, and backward scattering remains finite but 
decreases with increasing energy. 

For the lower energies we may again compare these 
results with those of HNO which become available 
(although they have not been published) in the course 
of the present calculation. The comparison for U is 
given in Table XII, where the HNO values, which are 
normalized to unity at the angle of maximum emussion, 
are read from their graphs and normalized to agree with 


38 H. Banerjee, Nuovo Cimento 10, 863 (1958). . 

3 A. Hedgran and S. Hultberg, Phys. Rev. 94, 498 (1954). 
“S, Hultberg, Arkiv Fysik 9, 245 (1955). 

aS. Hultberg, Arkiv Fysik 15, 307 (1959). 
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Fic. 2. Comparison with the Sauter angular distribution from 
Z=% for a series of energies. The ratio R of the present distribu- 
tion to the Sauter distribution is plotted, where both distributions 
have been normalized to their respective total cross sections. The 
approximate angles of maximum emission are marked. 


our absolute results at the underlined angles. The two 
calculations are in good agreement. 

Experimental results for K-shell angular distributions 
have been obtained for uranium by Hultberg? (412, 662, 
and 1332 keV) and by Sujkowski'? (279 keV); no other 
true K-shell distributions are known to us, excepting the 
early work of Hultberg.” In Table XIII we give, for 
a few selected angles, the ratio of experiment to theory. 
The experimental results are available at quite fine 
angular intervals, and the 279- and 662-keV cases have 
elsewhere?” been compared with the HNO predictions 
in graphical form. Except at forward angles the 279-keV 
result of Sujkowski seems to be in good agreement with 
theory, and it should be noted that at this low energy 
the forward cross section is very small. The higher 
energy data of Hultberg shows more marked deviations 
from theory ; now that fairly good theoretical estimates 
are available, further experimental data in this energy 
region would be desirable. Note that at higher energies 
theory predicts a /arger forward cross section than was 
observed. [Angular distributions for Au at 412 keV in 
the range 207-90? have now been obtained by Bergkrist 
and Hultberg” in excellent agreement with theory. ] 

Let us next compare these new theoretical results 
with the much-used Sauter distribution. This has been 
done for a series of energies in Fig. 2. What :s plotted is 
the ratio R of the present distribution to the Sauter 
distribution, with both distributions normalized to their 
respective total cross section [(de/d9)/e]. Thus, the 


27. Sujkowski, Arkiv Fysik 20, 269 (1961). 


5 K. Bergkvist and S. Hultberg, Arkiv Fysik (to be published). - 
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Fic. 3. High-energy behavior of angular distribution. F(x) 
= (4c'a*e)-1(da/dQ) is plotted against x— A0 for Z=82 at three 
energies and compared with its form in the high-energy limit. 


straight line R—1 corresponds to the Sauter distribu- 
tion; an angular region in which R= constant is a region 
in which the distribution locally has the Sauter shape. 
The only case shown is for Z—84. For Z— 26 the curves 
essentially follow the straight lines R—1 and even for 
Z=50 the deviations are barely significant within the 
errors of the calculation This was why, as was eluci- 
dated by later theoretical work, it was possible to get 
along with a Sauter distribution for many years—the 
shape was much better than the normalization. Roughly 
speaking, the error in the Sauter total cross section is 
(aa) (from omission of the characteristic e~** factor) 
but the significant errors in the Sauter distribution are 
only 0(a?). But Fig. 2 does show that in the heaviest 
elements the deviations from the Sauter distribution be- 
come significant. The rise in R at small angles occurs 
because the Sauter distribution, unlike an exact calcula- 
tion, vanishes in the forward direction. If this were the 
only region which deviated from Sauter form the curve 
would elsewhere behave as R=constant. Instead, at 
larger angles it falls off more slowly than the Sauter 
form, and indeed there is no sizeable angular region of 
Sauter shape. These qualitative features were first re- 
marked in the experiments. Finally, the angle @max of 
maximum emission, roughly indicated on the graphs, 
occurs in the region in which R is rising, so the cross 
section is staying up in comparison to Sauter, and so 
max is being shifted toward larger angles in comparison 
to the Sauter distribution. This too was first noted in 


the experiments. 
The relative 0(a) corrections, mentioned earlier, do 
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not shed much light on these properties—they appear 
to be O(a?) effects. This arises, as was noted by Nagel,” 
both because at intermediate energies the O(a) term is 
not large and because its distribution is similar to the 
Sauter form. There is a tendency for a maximum at 
larger angles and for a larger cross section at back angles, 
but especially for the latter effect the increase is in- 
sufficient. Nagel also pointed out that the reason the 
O(a) distribution vanishes in the forward direction is 
that it arises as a cross term with the Sauter matrix 
element. The lowest order nonvanishing cross section in 
the forward direction Z (0) is relative 0(a?), and this Na- 
gel obtained. It has been conventional to express this 
in terms of the ratio x&ZI(0)/Z (Omax). From Table XI 
we would estimate that at 1332 keV, this ratio is 0.30 
for Pb and 0.38 for U. These compare with the theo- 
retical values 0.27 obtained for Pb by Sauter and 
Wiister in a numerical calculation and 0.42 obtained 
by Nagel for Pb from the relative 0(a?) term above. 
Hultberg’s experimental value for « in U at 1332 keV 
was 0.245, which again reflects the fact that the experi- 
mental forward cross section at the higher energies lies 
below theory. (Some new measurements of x have 
recently been reported by Rimskii-Korsakov ef al.*°) 

It is finally of some interest to discuss the behavior of 
the K-shell angular distribution in the high-energy limit. 
At high energies, the significant angles are 0(1/k), and 
in analytic work it is appropriate to expand in 1/k and 
0, but for arbitrary x-k6. Neglecting 0(a?), Nagel?" 
writes the distribution in this limit as 


x? T 1 
~ [1-a ] ; (6.1) 
(14-22)? 2 (14-32)? 


including the 0(a) correction to the Sauter distribution 
which has its maximum at +=1/v2. Nagel (and also 
Mork and Olsen‘) find that in this limit forward 
scattering is characterized by x—5.85a? to lowest non- 
vanishing order in a. Mork and Olsen also obtained an 
exact expression for the forward scattering in the high- 


43a This result was also independently obtained by Arne Reitan, 
Physica Norvegica 1, 113 (1961). 

4F, Sauter and H. O. Wiister, Z. Physik 141, 83 (1955). 
Actually these authors obtained (0°) numerically and took 
T(@max) from the usual Sauter distribution, a procedure which 
certainly becomes invalid at high energies as will be seen shortly. 

45 Nagel (Ref. 27) compares the Sauter-Wiister 0.27 value with 
Hultberg’s 0.245, calling the agreement rather good, apparently 
assuming that for large Z, x does not change appreciably with Z. 
The present results do not support this assumption, and we con- 
clude there is a significant difference between theory and 
experiment. 

K.K. Aglinstev, V. V. Mittrofanov, A. A. Rimskii-Korsakov, 
and V. V. Smirnov, Bull. Acad. Sci. USSR 25, 1146 (1962); A. A. 
Rimski-Korsakoy and V. V. Smirnov, Zh. Eksperim. i Teor. Fiz. 42, 
67 (1962) [English transl.: Soviet Phys.—]ETP 15, 47 (1962)]; 
and Bull. Acad. Sci. USSR 26, 1180 (1963). 

41 K. Mork and H. Olsen, Proceedings of the Physics Seminar in 
Trondheim, No. 5, 1960 (unpublished). 
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energy limit; such an expression was also later obtained 
by Weber and Mullin, who graph F(0) against Z, 
where 

do 


4e?a5e dQ 


F(6)= (6.2) 


in dimensionless units. In the high-energy limit, F is 
independent of energy. For Pb we would compute 
x—2.1, an absurd result («x:1) which could be inter- 
preted as a gross failure of the expansion of F(0) in a. 
In fact, the error lies in the supposition that the position 
of the maximum is given by the Sauter maximum, 
whereas numerical calculations of Boyer? and Nagel? 
show that in heavy elements the maximum occurs for 
forward emission and the distribution decreases mono- 
tonically. Using Nagel's calculation? we find that the 


58 T, A. Weber and C. J. Mullin, Phys. Rev. 126, 615 (1962). 

49R. H. Boyer, Ph.D. thesis, Oxford University, 1957 (un- 
published). 

99? B, Nagel, Arkiv Fysik 24, 151 (1963). 


ratio of forward emission to emission at the Sauter 
angle is 1.6. In Fig. 3 we plot F(z) for Z=82 for three 
of the energies of our numerical calculation and also 
show the high-energy limit from Nagel’s work. This 
again makes very clear how far removed 2 MeV is from 
the asymptotic energy region. Even the manner in 
which the minimum at forward angles gets filled remains 
to be calculated by higher energy work. (The high- 
energy limit has now also been discussed by Gorshkov 
and Mikhailov.*!) 
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The seven possible polarization correlations between incident photon 
| photoeffect have been obtained with numerical methods for the range of elements 
the range of energies from 200 keV to 2 MeV. The photoeffect can serve as a polarizer of electrons, a trans- 
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and ejected electron in K-shell 
from Z=13 to Z=92 and 


J mitter of polarization from photons to electrons, or an analyzer of polarized radiation. In heavy elements at 
\ suitable angles of emission all these correlations can be large. Previous work on the correlations is discussed. 


i4 I. INTRODUCTION 


Hh HE atomic photoelectric effect can serve as an 
analyzer of polarized radiation, à transmitter of 
polarization from photons to electrons, or a polarizer 
of electrons. The preferential ejection of photoelectrons 
in the plane defined by the momentum k and polariza- 
tion vector e of linearly polarized photons has long been 
known.! In recent years the high degree of correlation 
between circularly polarized photons and longitudinally 
polarized electrons has been discussed-* The photo- 
effect can also serve as a polarizer, producing trans- 
versely polarized electrons from unpolarized radiation.* 
In practice not too much use has been made of the 
photoeffect for these purposes, due both to experimental 
problems and to the lack of precise knowledge of the 
expected correlations." 

We have recently completed a numerical calculation 
of K-shell photoelectric cross sections, covering the 
range of elements from Z=13 to Z=92 and the range 
of energies from 200 keV to 2 MeV, neglecting electron 
screening. Our numerical methods and the resulting 
differential and total cross sections have been reported 
elsewhere?; the purpose of this paper is to present and 
discuss the results which we have obtained for all the 
possible polarization correlations between incident pho- 
ton and ejected electron. For a qualitative understand- 
ing of the nature of the correlations we give a set of 


* This study was supported in part by the United States Air 
Force through Air Force Office of Scientific Research Grant 
AF-AFOSR-62-452, and in part by the United States Atomic 
Energy Commission. 

t Address after August 1964: Department of Physics, Univer- 
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f Present address: Physiss International, Berkeley, California. 
_1W. Heitler, Quantum Theory of Radiation (Oxford University 
Press, New York, 1954), 3rd ed. 

(1938) Olsen, Kgl. Norske Videnskab. Selskabs. Forh. 31, No. 11 

3H. Banerjee, Nuovo Cimento 11, 220 (1959). 

_ 4U, Fano, K. W. McVoy, and J. R. Albers, Phys. Rev. 116, 
1147 (1959); 116, 1159 (1959). 

— 5B. Nagel, Arkiv Fysik 18, 1 (1960). 

— 6R. H. Pratt, Phys. Rev. 123, 1508 (1961). 

j _. W. Fagg and S. S. Hanna, Rev. Mod. Phys. 31, 711 (1959). 

8R. H. Pratt, R. D. Levee, R. L. Pexton, and W. Aron, pre- 

eding paper, Phys. Rev. 134, A898 (1964). 
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figures showing the energy dependence of the correla- 
tions in a high-Z element; these are found in the sepa- 
rate sections in which we discuss each correlation. We 
have appended to this paper a representative set of 
tables for the element Z=84; a complete set of tables 
is also available? 

To define these correlations we need to introduce the 
usual polarization parameters. We describe photon 
polarization with the quantities 


Eee ee, fot be, 


(1) 


t= i (eie — e21“). 


Here e, is the component of the photon polarization vec- 
tor e which lies in the plane of the incoming photon k 
and outgoing electron p, and e; is the component perpen- 
dicular to the plane (along kX p), so that (¢1,¢2,k) form a 
righthanded set. Photons linearly polarized parallel or 
perpendicular to the production plane are characterized 
by a nonvanishing £1, circularly polarized photons by 
nonvanishing £s. Similarly, we describe the polarization 
state of the ejected electron in terms of the direction ¢ 
of its spin in its rest system; ¢s is taken along the electron 
direction p, tı in the scattering plane, and $» perpen- 
dicular to the scattering plane (along kXp), so that 
(catat) again form a right-handed set. Longitudinally 
polarized electrons are characterized by nonvanishing 
£s. It should be noted that the coordinate systems used 
for the description of photon and electron polarizations 
are different. 

In terms of these parameters the differential cross 
section for K-shell photoeffect, summed over both K- 
shell electrons, is of the form 


do do 3 
SEES 3 56715 
= Gee X 5664] ( 


i,j—0 


° These tables are available in limited supply from R. H. Pratt 
as Stanford University Institute of Theoretical Physics Report 
ITP-99 (unpublished). This report has also been deposited as 
Document No. 7829 with the ADI Auxiliary Publications 
Project, Photoduplication Service, Library of Congress, Wash- 
ington, D. C. A copy may be secured by citing the Document 
number and by remitting $7.50 for photoprints, or $2.75 for 35 mm 
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where £o (9 Coy— 1,and (do/d2) snpor.is the differential 
cross section from unpolarized photons, summed over 
final electron spins. The quantities C;; satisfying 
|Cij| <1, are the polarization correlations with which 
this paper is concerned. 

It would appear from Eq. (2) that, not counting 
Coo=1, there are fifteen polarization correlations. But 
in fact invariance considerations permit only seven 
nonzero correlations. This follows most directly from 
the fact that the matrix element 


H= —e(27/k)"? Í d*rjsis foi ee™ Yin (3) 


is invariant under the substitutions 

(ee) > (—en, e). (4) 
For we may absorb these substitutions into the polari- 
zation parameters by 

(Ey Es £:) — (£1, — £s — &), 


(61,5263) > C7£»t(s —&), (3) 


y cw, 


and similarly for the bound-state polarization param- 
eters which are summed out. If Eq. (2) is to be invariant 
under the substitutions (5), then we conclude that, not 
counting Cos 1, the only nonzero correlations are 


Cos, Cio, Cis Co, Cz, Car, C33. (6) 


These seven correlations are indeed all nonzero, and 
they will be discussed in the subsequent sections. Co» 
produces transversely polarized electrons from un- 
polarized photons, Cio is the well-known correlation 
which analyzes linearly polarized photons, and C33 
converts circularly polarized photons to longitudinally 
polarized electrons. 

We have written down in Eqs. (2.25)- (2.27) of Ref. 8 
expressions for the Cy in terms of reduced partial-wave 
matrix elements R, and continuum wave-function phase 
shifts ô; it is from these equations that the results 
presented in this paper were calculated. Some informa- 
tion which can be deduced from these equations con- 
cerning the behavior of the correlations at small angles 
should be mentioned. First of all, since the angular dis- 
tribution (de/d9)uspoi. does not vanish for forward (and 
backward) angles (unlike the Sauter distribution;-? 
valid only to lowest order in az Zo), one sees that all 
correlations except C33 vanish in the forward and back- 
ward directions, and Css(0^)— r1, C33(180°)  — 1. 
Further, for small angles the correlations Co», C12, C21, 
and Cs; go as 6, while Cio and Css go as ®. And finally, 
for small angles one has the identity C= —Czi. How- 
ever, one can see from the figures and tables that small 
angles, at these energies, means much less than 5?, and 


is required. Make checks or money 


mi Adv: "ment S : 
Soal A hotoduplication Service, Library of 


E payable to: Chief, P 
Ongress. E 
1? F, Sauter, Ann. Physik 11, 454 (1931). 


indeed all the correlations undergo very rapid and vio- 
lent changes over this region of very small angles, in 
contrast with their comparatively slow variations 
through the main angular regions. Indeed, in the figures, 
we have not attempted to draw in the variation of the 
correlations between 0 and 5°. 

Before proceeding to the specific correlations we may 
also discuss their behavior at low and at high energies 
and as a function of Z. In the nonrelativistic theory, 
a-e in Eq. (3) is replaced by p-e, and all dependence on 
electron spin disappears. This leads to a differential 
cross section proportional to (1+£;), so that Cio= 1 for 
all angles and all other correlations vanish. Of course 
even at threshold there are relativistic corrections of 
relative order a? [which for example cause C15(0?) — 0, 
and away from threshold there are relativistic correc- 
tions of relative O(1), as given by Sauter, which in 
fact for the larger angles cause the preferred direction 
of emission to be perpendicular to rather than in the 
plane of k and e. In the relativistic theory correlations 
with the electron spin can occur. In addition to Cio 
the correlations C3, and Cs; are also O(1), while the 
remaining correlations are O(a) and so mainly important 
only in heavy elements. At very high energies one can 
show! that for angles of significant electron emission 
the only nonvanishing correlations are Cy, Co, and 
C33, and further C;5— +1 and C= — Ca. 


Il. THE CORRELATION C;; 


The correlation Coz permits the production of trans- 
versely polarized electrons from unpolarized photons; 
the resulting electron spin is perpendicular to the pro- 
duciion plane. The correlation vanishes both in the 
nonrelativistic and extreme relativistic limits and is 
O(a). Figure 1 shows Coz for Z=92 at some representa- 
tive energies. Like the more familiar correlation C1, Co» 
exhibits a “crossover” feature: For energies above some 
minimum energy the spin direction kX p is favored at 
smaller angles and —kXp at larger angles. Previous 
work on this correlation is due to Kolbenstvedt and 
Olsen? and Nagel? who obtained the analytic expres- 
sion for the correlation to lowest nonvanishing order in 
a, and show curves which are qualitatively similar to 
ours. [These curves fail near the forward direction 
where the angular distribution itself is O(a*) rather 
than O(1); the same comment will apply to all the 
analytical calculations subsequently mentioned.] Nu- 
merical results at threshold have been reported by 
Nagel and Olsson.? In the report on their numerical 
calculation of K-shell cross sections Hultberg, Nagel, 
and Olsson! show one representative curve for Cos 


11 B, Nagel, Arkiv Fysik 24, 151 (1963). 

12. Kolbenstvedt and H. Olsen, Proceedings of the Physics 
Seminar in Trondheim, 1960 (unpublished). 

13 B. Nagel and P. Olsson, Arkiv Fysik 18, 29 (1960). ` 

4S, Hultberg, B. Nagel, and P. Olsson, Arkiv Fysik 20, 555 
(1961). : 
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Fic. 1. Correlation Co» for the production of transversely polar- 
ized electrons from unpolarized photons, as a function of angle for 
Z=92 at several energies. 


(P; in their notation), and in unpublished work? they 
have given curves for Z=82 and Z— 92 over a range of 
energies up to 662 keV, which are in good agreement with 
the corresponding results of our tables. As has been 
noted,” the electron polarization which Co» implies 
is analogous to that produced in relativistic Coulomb 
scattering with unpolarized electrons, and can be 
analyzed in the same way; this correlation has the 
advantage? of not vanishing as rapidly at high energies. 


II. THE CORRELATION Cio 


The correlation Cio, as already noted, reflects the 
fact that the photoeffect can serve as an analyzer for 
photons linearly polarized parallel or perpendicular to 
the production plane. In the nonrelativistic theory! 
C1o— 4-1, which says no electrons are emitted perpen- 
dicular to the polarization direction of the photons. Of 
course, even at thteshold there are relativistic correc- 
tions of O(a), which, for example, cause Cy(0°)=0. 
The calculation of Cio to lowest order in a in the rela- 
tivistic theory was made by Sauter,!° and like Co it 
exhibits a crossover feature: Above a minimum energy, 
emission in the plane of k and e (as predicted by the 
nonrelativistic theory) 1s favored for smaller angles of 
emission, whereas emission perpendicular to the plane 
is favored at larger angles. As discussed by Olsen; in 
lowest order in a the minimum electron energy for the 
crossover feature is e=% (in units Z—c— m, -— 1), i.e., 
22340 keV for the photon, and at higher energies the 


15S. Hultberg, B. Nagel, and P. Olsson (unpublished). (Dr. 
Nagel now informs me that they have made calculations for all 


seven polarization correlations.) 
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crossover angle 6, is determined from 
k(k+1)(1—B cos&;) -2, (7) 


where k is the photon energy in the same units. Olsen 
also noted that in the same approximation, at the cross- 
over angle 6, such that C1o(0.)—O0, electrons from 
circularly polarized photons are completely polarized 
along the photon direction. We shall return to this feature 
in our discussion of Ca: and C33. We should also note 
that the analytical corrections to Cio of order a were 
obtained_by Gavrila!® and Nagel.’ 

Results of our numerical calculation of Cy are given 
in the tables appended to this paper and in Fig. 2. We 
also give in Table I some information on the crossover 
angle 6. as a function of k and Z. At low energies, the 
correlation is quite close to its nonrelativistic value 
except at large angles (and extremely close to the for- 
ward direction). As the energy increases the correlation 
appears to decrease monotonically for the angles of sig- 
nificant electron emission, and it will vanish in the high- 
energy limit. However, Cio can still be quite sizeable at 
2 MeV. Table I shows that in heavy elements the cross- 
over angle 0, begins to vary significantly from the lowest 
order prediction Eq. (7); the crossover occurs signifi- 
cantly /ater. 

Among the seven photoeflect polarization correla- 
tions, only Cio appears to have so far been the subject 
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Fic. 2. Correlation Cio for the analysis of linearly polarized pho- 
tons, as a function of angle for Z —92 at several energies. 


16M, Gavrila, Phys. Rev. 113, 514 (1959). 
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of experimental investigations. In the energy range of 
interest there are experiments by Hereford and co- 
workers! and by Brini e/ al.! In both cases the objective 
was to test the predicted crossover feature of Sauter’s 
relativistic theory; the first group appeared to see such 
a feature while the second group did not. Now that a 
more precise theory is available, and especially in view 
of the shift of the predicted 0, as shown in Table I, 
further experiments on this question would seem 
desirable. 


IV. THE CORRELATION Ci; 


The correlation Ci; alters the production of trans- 
versely polarized electrons from that given by Co 
in the case that the initial photons are linearly polarized 
parallel or perpendicular to the production plane. To 
determine C» will require also determining Coz and Co, 
or at least designing a set of experiments which subtract 
out their effect; C1» is the only one of the seven correla- 
tions which cannot in this sense be determined inde- 
pendently. Like Coz, the correlation Cy: vanishes in 
the nonrelativistic limit and is O(a); unlike Coo it re- 
mains finite in the extreme relativistic limit. Connec- 
tions between Ci: and C»; have already been mentioned: 
C15— — C for very small angles and also for all angles 
of significant electron emission (which are also small 
angles, but not as small) in the extreme relativistic 
limit.®1 There does not appear to be any discussion of 
the general analytic form of C15, although an expression 
to lowest nonvanishing order in a could be extracted 
from Nagel's work.* An expression for the lowest order 
result valid in the extreme relativistic limit was ob- 
tained by Banerjee.? The complete analytical expression 
for the relativistic limit has now been given by Nagel," 
and he has evaluated it numerically for three choices of 
Z. Results of our calculation of C1» are detailed in the 
Appendix and in Fig. 3. Since the correlation is O(a) 
it is most important in heavy elements, but it can become 
quite sizeable for intermediate Z at back angles (and 
Nagel’s work" indicates that in the extreme relativistic 
case it is greater for intermediate than for large Z). 
C1; is always positive in the regions considered here, but 


TABLE I. Crossover angle 6; for which Cio(@-) =0 as a function 
of charge Z and photon energy (MeV). 


B AX 13 26 50 84 
0.354 22134? z:132* 221332 £575 
0.662 =74° =68° 70° 80° 
1.131 42° 41° A 56° 
1.5 =35° 735° 36 47° 
2.0 2225? 2224? SI 40 


PUISS dE: ford, Phys. Rev. 81, 482 (1951); 81, 627 (1951) ; 
We IL, Hereford and If P. Keuper, ibid. 90, 1043 (1953); W. H. 
MeMaster and F. L. Hereford, ibid. 95, 123 (1954). Geert 

15D. Brini, L. Pelli, O. Rimondi, and P. Veronesi, Nuovo 


Cimento 6, 98 (1957). 
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Fic. 3. Correlation Ci: for the production of transversely polar- 


ized electrons from linearly polarized photons, as a function of 
angle for Z —92 at several energies. 


Nagel’s work" indicates that for very heavy Z and high 
energies there will be a crossover. Finally, for heavy Z 
and intermediate energies Cy, is large, especially at the 
more backward angles, and so produces major effects 
in the production of transversely polarized electrons 
from linearly polarized photons. 


V. THE CORRELATION C; 


The correlation C21, like C1», produces transversely 
polarized electrons from linearly polarized photons. 
However this correlation, also O(a), is between electrons 
polarized în the production plane and photons polarized 
at 45? to the production plane. As we have said before, 
C12— — C» for very small angles and also in the extreme 
relativistic limit. The remarks on previous work also 
apply to both processes: from Nagel's work* one can 
extract an analytic expression for the lowest nonvanish- 
ing order in a and from his more recent work!! one of 
course has the extreme relativistic limit (Cza = — Css, 
note there is a difference in siga and in his choice of 
coordinate system). Results for C2; are in the Appendix 
and in Fig. 4. Remarks on magnitudes apply to both 
correlations and need not be repeated here. However, 
the correlation C2: exhibits, even at the lowest energies, 
a crossover feature; the crossover angle increases with 
energy, unlike Co» or Cio. Since C3; isdarge at intermedi- 
ate energies for back angles the photoeffect provides, a 
fairly efficient mechanism for the transfer of these 
polarizations. 
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_ Fic. 4. Correlation Ca: for the production of transversely polar- 
ized electrons from linearly polarized photons, as a function o 
angle for Z=92 at several energies. 


VI. THE CORRELATION C» 


The correlation Css, which is O(a), produces longi- 
tudinally polarized electrons from linearly polarized 
photons. The photon polarization should again be at 
45? to the production plane. There does not appear to 
have ever been any analytical discussion of this correla- 
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tion, although the result to lowest order in a can again 
be extracted from Nagel’s work.® Results for C»; are 
given in the Appendix and in Fig. 5. In the range con- 
sidered here the correlation is always negative. And 
although C23 is O(a) and vanishes in the nonrelativistic 
and extreme relativistic limits, for intermediate and 
large Z at intermediate energies it becomes quite sizeable 
at large angles and so provides a fairly efficient mecha- 
nism for the transfer of these polarizations. 


VII. THE CORRELATION Ca 


The correlation Cz: produces electrons transversely 
polarized in the production plane from circularly polar- 
ized photons. The correlation vanishes in both the non- 
relativistic and extreme relativistic limits, but it is 
O(1) and very large for intermediate energies and 
angles. Figure 6 shows Cy: for Z=92 at some representa- 
tive energies; further results are given in the Appendix. 
At very forward angles the correlation is positive, but 
it soon crosses over and remains negative through most 
of the angular range, with a broad minimum at inter- 
mediate angles. (The curves suggest that there could 
actually be an energy, near 600 keV, and angle, near 
80°, for which Csi=—1.) Analytical expressions for 
Cn to lowest order in a were obtained by Olsen? and 
Banerjee’; Nagel? also obtained terms of relative order 
a. Numerical results obtained by Hultberg, Nagel, and 
Olsson!!5 (in their notation Csi=—Ps) are in good 
accord with the corresponding values of our tables. 
Numerical results at threshold were given by Nagel 
and Olsson." 
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TABLE II. D(0.) as defined by Eq. (8) for Z=84 as a function 
of photon energy & (in MeV). 


k 0.662 1.131 1.5 2.0 
D(0.) 0.963 0972 0.965 0.961 


It was noted by Olsen? that, to lowest order in a, 
at the crossover angle 6, for which C19(0.) — 0, electrons 
from circularly polarized photons are completely polar- 
ized along the photon direction, i.e., 


D (0.) S C3 (6.) cos&,— C31 (0e) sin8,—1. (8) 


For this to be true requires that, in the language of 
Eqs. (2.25)- (2.27) of Ref. 8, 

|K_|?=|J,|?=0 when Re[J_*J,+K_*K,]=0; (9) 
it does not seem that this can in general be true. We 
show, however, in Table II the calculation of D(@,) 
for Z=84 from our results; the deviations from Olsen's 


prediction are very small, and almost within the errors 
of the calculation. 


VIII. THE CORRELATION C;; 


The correlation Cz; is a transmitter of helicity, pro- 
ducing longitudinally polarized electrons from circu- 
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larly polarized photons. The only one of the seven 
correlations which does not vanish in forward and back- 
ward directions [C33(0°) = —C5,(180?) — 17], it vanishes 
in the nonrelativistic limit, but becomes 1 for all angles 
of significant electron emission in the extreme relati- 
vistic limit.^ We show results for Cs; in the tables ap- 
pended to the paper and in Fig. 7. Even at intermediate 
energies the correlation is large; it appears to approach 
one at high energies somewhat more rapidly in the lighter 
elements. Since C5; ranges from +1 to —1 it exhibits a 
crossover, which, however, at high energies moves to 
very backward angles. Analytical expressions for C3; 
to lowest order in a were obtained by Olsen; by Baner- 
jee, and by Fano, McVoy, and Albers*; Nagel? also 
obtained terms of relative order a. The extreme rela- 
tivistic limit of Cz, was discussed by Pratt." Like Coz 
and C31, numerical results obtained for C3, by Hultberg, 
Nagel, and Olsson!*/!? are in good accord with the cor- 
responding values of our tables. Nagel and Olsson! 
have given numerical results at threshold. The connec- 
tion between the crossover angle for C15 and production 
of completely polarized electrons from circularly polar- 
ized photons has been discussed in the previous section. 


ACKNOWLEDGMENTS 


This work was suggested and encouraged by Dr. 
Harvey Hall, to whom all the authors are much in- 
debted. Thanks are also due to Dr. Sólve Hultberg, for 
helpful discussions, correspondence, and prepublication 
results. 

APPENDIX: TABLES III-VII 


TABLE III. Polarization correlations for Z=84 at 354 
keV as a function of angle. 


(deg Cos € 16 C 12 Cx Ca Cy Css 
0 0.000 0.000 0.000 90.000 0.000 0.000 1.000 
5 026 077 04 .—04 =00 0.3 0.5 
10 0.16 0.88 0.24 —02  —00 0.0 04 
15 012 090 018 -—016 -—00  —00 0.4 
20 0.10 0.90 0.15 —0.12 -00  —015 04 
25 0.10 0.90 013 —0.00 —0.04 -—022 0.38 
30 0.00 0.88 0.12 -—0.07 —0.05 -—029 0.36 
35 0.10 0.87 0.12 —0.05 —0.06  —036 0.34 
40 0.10 085 0.12 —0.03 —0.07 -042 031 
45 0.10 0.83 013 -—001 -—008 -—048 0.28 
30 010 O81 013 0.01 —010 —053 0.25 
55 Oii 073 0.14 003 —O12 —O058 . 091 
60 011 0.75 016 006 —014 -—063 0.18 
65 O11 0.72 O18 009 —0.16 -—O67 0.13 
70 0.12 068 020 013 018 —0.71 0.08 
75 0.12 0.65 0.22 0:18 2021 —074 0.03 
80 012 0.61 0.26 0.22 —0.24 -—076 -—0.02 
$5 0.12 056 030 0.28 -—027 —0.78 —0.08 
90 0.12 051 035 034 —030 —0.73  —0.15 
95 012 046 040 O41 —0.33 -—0378 —0.22 
100 012 O41 046 048 -—035 —0.76  —0.30 
105 012 036 0.53 0.56 -038 —0.73 —0.38 
110 011 031 0.60 O64 —040 -069 —046 
115 0.11 026 068 072 —041 -—063  —0.54 
120 010 021 0.75 0.79 -—041 —0.56 —0.62 
125 040 016 O81 086 -—041 -—049 —0.70 
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TABLE IV. Polarization correlations for Z —84 at 662 keV TABLE VI. Polarization correlations for Z=84 at 1500 keV 
as a function of angle. as a function of angle. 
0 Cos Cio Cis Cn Cas Cu Cas 0 4 
; (deg) Coz Cio Cis C5 Cos Car Ci 
0 0.000 0.000 0.000 0.000 0,000 0.000 1,000 
5 0.218 0.57 040 .-—0.38 —0.04 0.294 0.74 0 0.000 0.000 0.000 0.000 0.000 0.000 1.000 
10 0177 0.78 0.30 —027  —0.04 0.058 0.62 5 0.15 0.45 0.43 —0.43 —0.01 0.22 0.85 
15 0.161 081 0.264 —0231 —0.03 —0.102 0.56 10 0.14 0.61 035 —0.34 —0.03  —0.05 0.79 1 
90 0.151 0.80 0,252 —0.221 —0.05 —0.222 0.54 15 0.14 0.58 0.32 —0.29  —0.00 —0.29 0.76 x 
25 0.144 0.7 0241 —0.198 —0.07 —0.34 0.53 20 0.18 049 032 —0.26 —0.09  —046 0.73 
30 0.159 0.73 0.236 —0.157 —0.09 —0.46 0.50 25 0.21 040 0.35 —0.28 —0.12 —0.56 0.72 
35 0.190 0.69 0.250 —0.14 —0.10 —0.56 0.46 30 0.22 030 040  —0.33 —0.16  —0.61 0.72 
40 0.195 0.63 0.276 —0.16  —0.3 —0.64 0.44 35 0.21 020 042 —0.32 .-—023 —0.65 0.72 l 
45 04174 0.56 0.292 —0.16 —0.18 —0.71 0.42 40 0.22 011 038 —0.24 —0.28 —0.67 0.71 0 
50 0.184 049 0.298 —0.1 —0.22 —0.78 0.38 45 0.23 0.03 0.4 —0.2 —0.2 —0.64 0.73 
55 0.225 042 0316 —01 —0.24 —0.82 0.36 50 0.19 —0.05 0.4 —0.33  —0.62 0.74 
60 021 0.336 0.352 —0.1 —0.30 —0.87 0.34 55 0.1 —0.1 0.4 —0.41  —0.62 0.73 
65 0.14 0.233 0.386 —0.1 —0.36 —0.92 0.30 60 0.1 —0.1 0.4 —0.4 —0.59 0.73 
70 0.13 0.160 0.410 —0.1 —0.41 —0.94 0.26 65 —0.2 0.3 —0.4 —0.53 0.76 
75 0436 0.109 0.42 0.1 —0.5 —0.93 0.24 70 —0.2 0.3 —0.4 —0.49 0.76 
80 0.2 0.001 0.46 0.1 —0.51 —0.94 0.22 
85 —0.01 —0.140 0.52 0.1 —0.56 —0.95 2415 
90 —0.06 —0.196 0.54 0.2 —0.60 —0.92 0.08 
95 —0.05 —0.18 0.55 0.3 —0.65 —0.86 0.07 
100 —0.12 —0.28 0.59 0.3 —0.69 —0.82 0.04 
1 105 —0.27 —0.4 0.66 0.4 —0.7 —0.76 —0.08 Taste VII. Polarization correlations for Z —84 at 2000 keV 
i 110 —0.34 —04 0.69 0.5 —0.7 —0.69 —0.19 as a function of angle. 
f 115 —04  —04 0.68 0.5 —0.7 —0.63 —0.2 
te 120 —04  —04 0.68 0.5 —0.7 —0.54 —0.25 
125 —05 -—04 0.7 0.6 —0.6  —043 —0.4 


6 
(deg) (deg) Cos Cio Cie Cn Cs Ca Css 


o 0.000 0.000 0.000 0.000 0.000 0.000 1.000 
5 0.14 0.42 0.46 —0.6 —0.01 0.17 0.88 
10 0.12 0.53 0.37 —0.36 —0.04  —0.11 0.83 


'TABLE V. Polarization correlations for Z=84 at 1131 keV 15 0.13 0.45 0.32 —0.29 -—008 Á—0.36 0.82 

as a function of angle. 20 0.18 0.32 029 —0.23 —0.11  —0.51 0.80 

25 022 4023 035 -—03  -—013  —0.54 0.80 

30 0.20 014 0.46 —0.4 -0.19 —0.55 0.80 

35 02 05 04 —027  —0.57 0.80 

(deg) Coo Cio C2 Cn Ca — Cn Ca — 340 02  —000 03 —027; —053 0.81 

0 0000 0.000 0.000 0.000 0.000 0.000 1000 45 02  —0.07 03 —027 —0.47 0.84 

5 018 048 042 —041 —0.019 0263 0.82 50 01  —01 —03  -—047 083 

10 015 0.67 0.34 0.33 —0.035 0.007 0.79 sS Qu Wl —0.4 —0.5 08 

15 0415 0.68 0.30 —0.27 —0.051 —0.210 0.70 60 01  —01 —04  —04 08 

20 017 0.63 0.30 —025  —0.072 —0.379 0.67 65 —0.2 —04  —05 0.8 

25 019 0.56 0.32 —0.26 —0.098 —0.507 0.65 70 —0.2 —04 0.8 
30 021 O48 0.35 -—027 —0.136 —0.602 0.63 
3s 023 038 0.37 -—027 -—018 —0.675 0.62 
40 0.23 028 038 —0.24 —023 —0.73 0.61 
45 022 019 038 —0.22 —027 —0.75 0.61 
50 0.22 0.10 040 —022 —0.32 —0.76 0.61 
55 020 0.01 04  —022 —039 —0.76 0.61 

60 016 —0.06 04  —018 -—043 075 0.60 í 
a au =018 A oe 5045090741061 
70 007 —019 04 —01  —049 —0.70 0.61 
75 001 —024 04  —041  —055 —0.66 0.61 
80 —0.06 —0.28 04  —0.1  —0:.58 —0.64 0.59 r 

85 —013 —032 04 Q0  —0:58 —0.62 0.57 
90 —0.19 —0.36 0.4 —0.59 —0.57 0.56 
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The wave function for the electrons is investigated when a set of narrow bands (valence states) has its 
energies within a wide band (conduction states). The valence states are linear combinations of localized 
states which are attached to cach lattice site. The intra-atomic Coulomb and exchange integrals for the 
localized states are much larger than the bandwidths of the valence states, Some of the narrow bands are 
neither completely empty nor completely filled. The wave function is therefore expected to be correlated 
because it is disadvantageous for the electrons to crowd into the same lattice site, or take up some con- 
figuration contrary to Hund's rule. This correlation is important in transition metals, where it is considered 
to be the cause of ferromagnetism. The correlated wave function is obtained by applying to the uncor- 
related antisymmetrized product of Bloch functions an operator which provides each configuration of 
localized valence states with an appropriate amplitude and phase factor. The procedure is worked out in 
detailfor the case of few particles (electrons or holes) in the narrow bands with the help of a diagram analysis. 
The localized orbits of different lattice sites do not have to be orthogonal to on_ another, and the computa- 
tional rules are actually simplified thereby. The example of a twofold degenerate band such as the upper 
part of the 3d band in Ni is treated, and the conditions for the occurrence of ferromagnetism are stated in the 


case of few 3d holes per lattice site. 


INTRODUCTION 


HE present work studies the correlation of elec- 
trons in a situation which is considered as typical 
for the 3d electrons in a transition metal. The character- 
istic features of the valence and the conduction electrons 
in the transition metals are not obtained from first 
principles, but they are assumed according to a number 
of experimental facts and their intuitive interpretation 
in order to yield a model Hamiltonian. The aim is 
therefore to find an approximate ground-state wave 
function for the model Hamiltonian, and to investigate 
the mechanisms which tend to favor a ferromagnetic 
ground state. 

In accord with other theories for the transition 
metals! it is assumed that there is a fairly clear cut 
distinction between three types of electronic states: the 
core states representing the atomic 15, 2s, 2p, 3s, and 3p 
orbits, the valence states corresponding to the atomic 
3d orbits, and the conduction states corresponding to 
the atomic 4s orbits. Each set of states is orthogonal 
to the two others, and there will be no correlation 
between two orbits belonging to different sets. 

This is the most obvious assumption about the elec- 
trons in a transition metal, and it is certainly not 
definitive. But it is felt that this complete separation 
between the 3d and the 4s electrons can be lifted in a 
later, refined model without basically changing the 
present results. 

There is some striking experimental evidence in favor 
of the independence and completely different orbital 
character of the valence electrons on one hand and of 
the conduction electrons on the other. Neutron diffrac- 


* Present address: IBM Watson Laboratory, Columbia Univer- 


sity, New York, New York. d 

1 Cf. in particular the review article of J. H. van Vleck, Rev. 
Mod. Phys. 25, 220 (1953) which gives a qualitative description 
of the problems discussed in the present work. 


tion in iron? has shown that the magnetization density 
in the lattice is mostly concentrated very tightly around 
the lattice sites, whereas a small fraction of the total 
magnetization is distributed uniformly throughout the 
lattice. The simple linear dependence of magnetization 
on the alloy composition, e.g. in the Ni-Cu system,? 
seems to indicate that the Bloch states which carry 
most of the magnetization are being filled quite inde- 
pendently from the presence of conduction electrons. 
The conduction electrons are formally included in the 
present investigation. Unlike the valence states which 
are defined in terms of tightly localized orbits, the con- 
duction states may best be thought of as plane waves 
which were orthogonalized to the core and valence 
states. The lowest conduction state is assumed to lie 
well below all the valence states, and the highest con- 
duction state well above. The noncrossing rule makes 
it inevitable to introduce bands which are conduction 
like in part of their Brillouin zone and valence like in 
the remaining part. In the transition region the Bloch 
states are a mixture of the two kinds, but all these 
transition regions together will be assumed to occupy 
a volume in reciprocal space which is small compared to 
the volume of one Brillouin zone. With this hypothesis 
all the actual computations will bear out the kinematic 
and dynamic independence of the conduction states 
from the valence states. The presence of the conduction 
electrons in the crystal manifests itself then mainly 
through the seemingly arbitrary number of valence 
electrons even in a "simple" metal, such as pure Fe, Co, 
and Ni. Other effects of the conduction electrons are 
harder to evaluate and will be either implicitly con- 
tained or completely neglected. The screening of the 


2C. G. Shull and Y. Yamada, Proceedings of the International 
Conference on. Magnetism and Crystallography, Vol. TIL, 1 (1961); 
J. Phys. Soc. Japan 17, Suppl. B (1962). Y 

3 H. C. van Elst, B. Lubach, and G. J. van den Berg, Physica 
28, 1297 (1962). 
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nuclear charge by the conduction electrons is implicitly 
taken into account through the effective crystal poten- 
tial, but the coupling of the spins of 3d electrons 
through the conduction electrons is not mentioned at 
all since its treatment belongs to a more refined model. 

The 3d electrons are characterized by the localized 
(at each lattice site) states which they occupy. No 
precise description of these atomic states is needed 
besides their transformation properties under the sym- 
metry operations of the lattice, and the crucial assump- 
tion that two such localized orbits at different lattice 
sites have a small overlap integral. The smallness of 
this overlap is not a consequence of a clever choice, such 
as in a Wannier function, but a genuine expression for 
the tightness of the 3d orbits around their lattice site. 
Therefore, the matrix elements for a transition of a 
localized 3d electron to a neighboring lattice site due to 
kinetic energy and the crystal potential, are always 
smaller (by a factor of the order of the overlap integral) 
than the matrix elements for a transition within the 
same lattice site. The transitions between lattice sites 
due to the Coulomb repulsion or exchange between 
electrons are smaller by the square of the overlap 
integral than the transition within one lattice site. The 
only transitions between lattice sites which are admitted 
by the model Hamiltonian are therefore the ones due 
to the kinetic energy and the crystal potential. Those 
terms determine the band structure of the 3d electrons. 
The bandwidth is then smaller than the intra-atomic 
Coulomb and exchange integrals. 

There are three terms in the model Hamiltonian: 
(i) kinetic energy and crystal potential terms, (ii) 
Coulomb repulsion terms at the same lattice site, (iii) 
exchange terms at the same lattice site. Terms involving 
the integral over four orbits at the same lattice site, 
but not of the ordinary Coulomb repulsion or the 
ordinary exchange type, are neglected. They are not 
necessarily small, but it is believed that the important 
ordering of atomic levels at one particular lattice site 
can already be understood on the basis of the Coulomb 
repulsion and exchange integrals. In particular Hund's 
rule for an atomic system follows already from such a 
reduced Hamiltonian. 

Since the two particle terms (ii) and (iii) are larger 
than the one particle terms (i), the resulting problem is 
one of strong interaction. It is, however, complicated 
by the nonvanishing overlap between orbits on different 
lattice sites. The usual formulation in terms of creation 
and annihilation operators for the localized states has 
to be modified. The aestruction operator for a given 
state is not exactly the Hermitian conjugate of its 
creation operator. Although this complication does not 
arise in the usual formulation, it does not really make 
the computations more awkward. It turns out that the 

various expectation values for the correlated wave func- 
tion can be evaluated only in the limit of few particles 
(electrons or holes) compared to the number of lattice 
sites. The nonvanishing overlap integrals help to 
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simplify the general rules in the diagrammatic analysis, 
although the number of diagrams is increased. 

The present investigation amplifies considerably a 
short account published earlier.* The main idea is still 
the same. The uncorrelated antisymmetrized product of 
Bloch states is modified by applying operators which 


essentially project the uncorrelated wave function into ` 


a subspace of configurations. with the appropriate 
correlations. The resulting correlated wave function is 
believed to be a good approximation to the ground state 
for all densities of electrons or holes. Practical results, 
however, ie., the calculation of various expectation 
values, have been possible only in the limit of low- 
particle densities (with the exception of one statement 
concerning the occupation probability for valence states 
inside the Fermi surface in the case of a nondegenerate 
band, cf. Ref. 4). In the earlier account there was neither 
the possibility of nonvanishing overlap, nor of de- 
generate bands, nor of conduction states, nor of com- 
bining configurations according to Hund's rule. All 
these new features are brought out in the example of a 
twofold degenerate band, where ferromagnetism is 
favored by both terms, (ii) and (ii). 

For the nonferromagnetic state the intra-atomic 
Coulomb-repulsion terms (ii) are more effective in 
scattering the particles into Bloch states of high energy, 
because two uncorrelated particles crowd more often 
into the same lattice site if their spins are antiparallel. 
Although this effect is probably not serious enough to 
induce ferromagnetism, it may come close to canceling 
the loss in band energy which particles of the same spin 
experience due to the exclusion principle. The depend- 
ence of this mechanism on the density of states was 
shown in the earlier account, and is again found in the 
example at the end of this report. But now, there is also 
the effect of the intra-atomic exchange which had been 
pointed out by Slater? and which is due to the terms 
(iii). Whereas one may hope to show some day that the 
terms (ii) can never induce ferromagnetism at all by 
themselves, it would still seem highly arbitrary to 
neglect them altogether in the discussion of ferro- 
magnetism and consider only the terms (iii). 

A brief account of the following sections will now be 
given. The first- and second-order density functions are 
defined in Sec. 1 together with the relevant overlap 
integrals. The creation and destruction operators are 
introduced in Sec. 2 and the important projection 
operators for the localized valence states are studied. 
The basic idea for the definition of a correlated wave 
function is explained in Sec. 3 by studying the correla- 
tion energy due to intra-atomic Coulomb repulsion in 
the simplest possible case of all Coulomb integrals being 
equal. The correlated wave function is defined in Sec. 4 
quite generally by applying a suitable set of operators 
(projections operators for the various states of the whole 


1 Martin C. Gutzwiller, Phys. Rev. Letters 10, 159 (1963). 


(te Be Slater, H. Statz, and G. F. Koster, Phys. Rev. 91, 1323 
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ion of each lattice site) to the uncorrelated antisym- 
metrized product of Bloch states. The same idea is 
realized in Sec. 5 in a slightly different manner, which 
will then help to expand all expectation values in 
powers of the particle density. 

The computation of expectation values for the corre- 


lated wave function is outlined in Sec. 6; it leads to 


evaluating the expectation value of a product of creation 
and destruction operators for the uncorrelated wave 
function. The latter expectation values are represented 
in Sec. 7 by diagrams which contain vertices, solid lines 
for particle propagators, and dotted lines for overlap 
integrals. The usual breaking up of a diagram into 
closed linked diagrams is performed in Sec. 8; also it is 
shown how to obtain diagrams with external points 
from closed linked diagrams. 

The first-order density function is studied in Sec. 9 
in order, first to eliminate a number of simple diagrams 
which lead to a renormalization of the initial uncorre- 
lated wave function, and second, to obtain all the terms 
up to second order in the particle density. The second- 
order density function is similarly treated in Sec. 10. 
'The main correlation parameters occur at most quad- 
ratically and can therefore easily be determined by 
minimizing the expectation value for the total energv. 
This procedure is applied in Sec. 11 to the twofold- 
degenerate band characteristic of the upper end of the 
3d band. The conditions for ferromagnetism are then 
stated. The density of states per unit interval of energy 
at the upper end of the valence band has to be larger 
than the average density in the limit of very strong 
localized Coulomb repulsion. The degeneracy of the 
valence band is important in the more realistic case 
where the Coulomb repulsion is not infinitely larger than 
the bandwidth so that two holes can occasionally crowd 
into the same lattice site. 

The results of the last section are derived for a small 
density of valence holes and are based on the assumed 
correlated wave function. This wave function consti- 
tutes a guess which may not be particularly good for 
low densities, because the correlation may spread over 
many neighbors for low-particle densities as opposed to 
densities of about 1 per atom. Since the low-density case 
can be treated directly with the help of the ladder 
approximation, the present investigation may be viewed 
as an attempt to propose a wave function for all 
densities and to treat it in the limit where exact results 


can be obtained. 


1. THE SINGLE-ELECTRON WAVE FUNCTIONS 


The lattice is made up of L identical atoms in some 
elementary arrangement, say face-centered cubic. The 
lattice sites are labeled by the letters f, g, or h. To each 
Site belongs a set of single-electron wave functions, 
Which we shall divide into three categories: core states 
(corresponding to 1s, 2s, 25, 3s, 3p levels); valence 
States (corresponding to 3d levels); and conduction 
States (corresponding to 4s levels). 
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The core states are assumed orthogonal to the valence 
and conduction states of all atomic sites. Also, the core 
states are uncorrelated, so that they contribute only 
through the first-order density function pi(z,y), namely 


ns) e v f dev - fes 


KW (2,32, - 7 + xw )W" (y, - - - xw) 
=> x(x)x (y)--contribution from 
. valence states and conduction states, (1) 


where the first term on the right is from the core states. 

In this formula the core states x may be either 
localized or Bloch-like. The contribution of the valence 
and conduction states to p; is complicated because of 
correlation. The contribution of the core states to the 
second-order density function p»(£,y;m,y) is then 
already contained in 


py (55 y) -4N(N—1) J dxs- -dey 


XW (Ex, Xa - + oy UY Gy Xa e tN) 
-$[e(£mpeoy) (5 y)ei m) J] 

--Fproperly correlated part arising 

from valence and conduction states. (2) 


Since the effects of the core states are completely 
described by (1) and (2) together with their orthog- 
onality to the valence and conduction states, it is quite 
feasible not to mention the core states any more in 
working out a correlated wave function for the valence 
and conduction states. Therefore, the first term in (1) 
will be mentioned again only when it comes to com- 
puting the total energy of the crystal. In keeping with 
this procedure we shall now call V the number of 
electrons in valence and conduction states only, and the 
total wave function V will have only as many variables 
x as there are electrons in valence and conduction 
states. Formulas (1) and (2) are interpreted henceforth 
without the first term on the right-hand side of (1). 
The eventual inclusion of core states is easily performed. 

The localized valence and conduction states are 
c3, (x), where g is the label of the atomic site, 8 is the 
band index (including spin index), and x is the com- 
bination of spatial and spin variable. The band index 
is sometimes labeled a or y. Bloch states are constructed 
by 

Va (x) = (1/ £17) x V cz.Ba@a0(%)- (3) 
I 


k isa wave vector of the first Brillouin zone and x is the 
index of the band. The coefficient V2.3 depends on the 
lattice site g through a factor exp(i£g), but the depend- 
ence on the indices x and 8 may be quite complicated. 
In particular, V, may contain, for certain combinations 
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of x and &, contributions from both valence and con- 
duction states. The exact values of V8; will result 
from some secular problem, exactly as the band struc- 
ture is obtained in the tight-binding approximation. 
The variety of Bloch states Vx depends directly on the 
initial choice of localized valence and conduction 
states Pgo- 
Formula (3) can be inverted by writing 


eg (3) = (1/ LP) » U go a (2) » (4) 


where one has the relations 


(0172/59. U af kV kk, yh Say5shs 
xk 
(1/ D) E VatgoU go xx — 9s - (5) 
Bo 


The coefficients Us)... depend on g through the factor 
exp(— ikg). 

The Bloch functions V,.(x) will be assumed ortho- 
normal to one another, i.e., 


| he OW: OSD One (6) 


But the localized orbitals g(x) will not be assumed 
orthonormalized, and this makes it necessary to 
introduce their overlap integrals 


Reus | PA OVONEME 
= (1/L) 2 aaia QU) 


The only restriction which may be imposed for con- 
venience is 


Re@fyf)=ser, (8) 


so that the localized orbits belonging to the same lattice 
site are orthonormalized. Also, the values of R(æf,yh) 
differ from zero only when the sites of f and 7; are close 
to each other, and then they are small compared to 1 in 
all cases except when the indices œ and y refer both to 
conduction states. 

Two remarks are in order at this point. First, it is 
convenient to introduce the nonorthogonality integrals 
R, even if they should be ultimately assumed to vanish. 
Indeed, the results of the diagrammatic analysis of 
Sec. 7 are more easily expressed if the values of R(a.fjy/) 
are not immediately simplified to 6.75sx. Second, it may 
be that certain simple types of correlations, special 
cases of the ones to be introduced in Sec. 4, do not 
necessitate the assumption of R(@f,yi) being different 
from 04585. Indeed the nonorthogonality of the valence 
states may be taken fully into account by a proper 
choice of the correlation parameters. An example is the 
Heitler-London theory of the hydrogen molecule. But 
no such equivalence between correlation parameters 


E |. and nonorthogonality integrals has been found in the 
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more general case to be discussed in the following 
sections for the Ni-Cu alloys. 


2. SECOND QUANTIZED FORMALISM 


The many-electron wave functions may be treated 
most conveniently in the formalism of second quantiza- 
tion. Therefore, we introduce creation operators cx; and 
annihilation operators cx corresponding to the Bloch 
functions V.a(x) of (3). These operators satisfy the 
anticommutation relations 


cter Oc ek= Dask, 
CrO tF OnICKK=O , (9) 
Cue ad chacti— 0. 
The localized states ego (x) are created by 


35 Ugo xke ck, (10) 


Li? xk 


a'gg— 
and they are destroyed by the operators 


p V kk: BogCkk - (11) 


beg= 


It is important to notice that btagxa'g, because the 
localized orbits are not orthonormalized. However, 
one has 

A asdynt bou! as=Sar5sn 


ai ral y al ua as = 0 ; (12) 
Dasbynt byrbas= 0. 


These relations follow from (9) and (5). Other anti- 
commutators between a’s and b’s can be expressed in 
terms of the overlap integral R of (7). 

A one-particle operator Q:(x,y) with matrix elements 


(QN) | NOUS | POS 


is given in second quantized formalism by 


@ı= »» ca ck Qilar 5 (14) 


xE,Al 


In terms of the matrix elements 
(fle: il Cas dx f onos) (15) 


the operator Q: becomes in the second quantized 
formalism 


= E efl Oi oibus, (16) 


which is obtained directly from (13) and (14) bY 
inserting (3) and (11). 
In the same manner, a two-particle operator 
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Q2(E,x; 1,7) with matrix elements 


(asfeafal Ql rubo f t [t [m fay 


x (DPA (é) © anf2(E) 


E ei (x) © asfs (x) 


O2(E,%5 ny) 


€ 1) RADI (17) 


€yh (y) Erzh (9)! 


is given in second quantized form by the formula 


Q:= 2. Dr eSI 


(a1/1,a2/2) (yihiy2h2) 
X (ox fios f2| Qe [vtvysis)ba s bains. (18) 


The summation goes only over different pairs (o f1,o f2) 
and (y1/1,y2/t2). The formulas (16) and (18) have been 
derived because in this work the assumption of ortho- 
normality is not made for the localized states, so that 
there remains the question whether the a or the b 
operators are to be used in representing Q, and Q». 

The operator 2!5,08,— Pg, destroys the electron in 
the localized state ea,(x) and then creates right back 
again. This operator acting on an arbitrary many elec- 
tron state isolates that part which has an electron in 
the localized state cg, (x). In view of 


P?5,— ai5,3,0! 5,5, — a! 5,085 — Ppa, (19) 


the operator Ps, is indeed a projection operator. How- 
ever, it is not a Hermitian operator, and has to be 
handled with some care. One way of examining it in 
more detail is to express a'g, in terms of b'g,. With the 
help of (5), (10), and (11) one finds 


d'5,—2, b. R(af gg), =O po 2 b'asR(afBg). (20) 
af af#g 


The last step is a consequence of the orthonormalization 
(8) for the localized states belonging to the same lattice 
site. If the index f refers to a valence state, the second 
term on the right hand side of (20) is small to the extent 
that the overlaps between a localized valence state and 
all localized states on neighbors are small. The overlap 
between a valence and a conduction state is small 
because of the different angular symmetry, and the 
overlap between two valence states is small because of 
their tight localization. In this sense the operator Ps; is 
approximately Hermitian for the localized valence 
states. ih 
Since the projection operator Pg, is not Hermitian 
We cannot associate a measurable quantity with it. As 
long as Pg, is only used to construct some correlated 
wave function for the whole crystal, the lack of 
Hermiticity is not a drawback. But Pg, cannot be used 
to answer the question as to the probability of finding 
the localized valence state yay occupied. Such detailed 
knowledge may be of no interest. However, one may 
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find it useful to answer the more general question as to 
the average number of electrons in localized valence 
states. A Hermitian operator whose expectation value 
answers the last question can be constructed under 
certain additional assumptions which do not seem un- 
duly restrictive. For instance, the conduction states 
may be defined as plane waves which were orthogonal- 
ized not only to the core states but also to the valence 
states. An operator can then be constructed whose 
expectation value gives the average number of electrons 
in conduction states, and therefore the difference 
between the total number of electrons and the number 
of electrons in conduction states gives the number of 
electrons in valence states. We shall use such a number 
in the future whenever this seems convenient. 


3. ABOUT THE CORRELATION IN 
THE VALENCE STATES 


With the help of the projection operator Pg, the 
contribution of the localized valence state c, to the 
wave function Y of the whole crystal can be investi- 
gated. More generally, one can speak about the con- 
figurations # which contribute to Y. A configuration is 
a set of localized valence states, and it can be described 
by enumerating the index pairs 8g which make up the 
configuration. To each configuration ® belongs a 
projection operator which is just the product of Pg, for 
the indices belonging to the configurations $, and of 
(1— Pa,) for the indices of valence states not belonging 
to the configuration d. 

The total wave function ' can now be written as a 
sum over all possible configurations $ of localized 
valence states in the crystal. The terms in this sum are 
not mutually orthogonal, but for the discussion of that 
part of the Hamiltonian which refers to the valence 
states of one particular atom only, the lack of orthog- 
onality may be temporarily forgotten. 

The largest contributions to these intra-atomic energy 
terms, besides the kinetic energy and the potential 
energy in the field of the nucleus and the core electrons, 
come from the mutual Coulomb repulsion of the valence 
states (since the core states are always occupied, their 
repulsion provides merely a constant shift). Indeed all 
the Coulomb repulsion integrals between valence states 
contain one term which is the largest and identical in 
all of them.® It is the Coulomb repulsion between two 
spherically symmetric charge distributions, each with 
total charge equal to one electronic charge, and a radial 
dependence equal to that of the valence states. If v of 
the valence states at a particular atom are occupied, 
this term in the Coulomb repulsion becomes simply 
Cv(v—1)/2. 

For some arbitrary wave function Y, the number v of 
occupied valence states at some particular atom is, in 
general not fixed. Y may contain configurations d 


6 John C. Slater, Quantum Theory of Atomic Structure (McGraw- - 
Hill Book Company, Inc., New York, 1960), Vol. I, pp. 311, 490. 
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which have different numbers of occupied valence states 
at the particular atom under examination. To the 
approximation where the configurations ® are con- 
sidered mutually orthogonal, a probability p, can be 
defined for Y, to have exactly v occupied valence states 
at the particular atom. p, is simply the sum of the 
squares of all those configurations ® which occur in the 
expansion of Y and have exactly v occupied valence 
states at the particular atom. 

Since the Coulomb repulsion C»(»—1)/2 is the 
largest among the many two-particle terms, it is 
important for the wave function Y to minimize the 


quantity 
3 2 ro- 1)p,=q, (21) 


with the summation going over the possible occupation 
numbers of the valence states, e.g., v goes from 0 to 10 
for the entire set of 3d states. There are two obvious 
conditions to be satisfied for the set of probabilities $». 
The normalization condition is 


> p=l, (22) 


and the total number of valence electrons is given by, 
say, ñ per atom on the average, so that 


È vp,—n. (23) 


The solution to this simple minimization problem is 
found most easily in the following way. 7 is assumed 
to lie between two integers » and p+1, i.e., 


p<n<p+l. (24) 
q can then be expressed with the help of (22) and (23) as 
q= (u/2) Qi—4—1)2-3 X (u—»)(u--1—»)5,. (25) 


The coefficients of ?, and $,,! vanish, whereas the 
coefficients of all the other ?'s are positive. Therefore, 
q reaches its minimum for the values 


$ouki—-h, Pfaui-h—n, 
$,-0 for v¥p and »vzéui. (26) 


The values of py and p,41 are obtained from (22) and 
(23) after all the other p,’s have been set equal to zero. 
The statement expressed in (26) has some striking 
consequences. For example, in the case of iron we have 
1— 1, and it follows that any reasonable wave function 
W for the whole iron crystal should contain only con- 
figurations which have either 7 or 8 occupied valence 
states on each atom. Similarly, in Ni we have u—9 and 
there ought not to be any configuration with less than 9 
valence (3d) electrons at any lattice site. 

'The requirement (26) is generally violated by the 
itinerant electron model. Let us consider for example 
the Ni-Cu alloy with 100? atomic percent Cu substi- 
tuted for Ni. We shall assume that 0.6— holes per 


. atom have to be distributed equally over two degenerate 
valence bands in the ferromagnetic ground state, and 


over four degenerate valence bands in the nonferro- 
agnetic ground state. In terms of the numbers v of 


T 
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holes we then have 
0.6— 


2 0.6— 2" Nec . 
m (7) (4+—) ferromagnetic, 

y 2 2 

4 0.6—p\” 
«Queso 

y 4 


The second term in (25) becomes equal to (0.6— p)?/4 
for the ferromagnetic and 3(0.6— p)?/8 for the non- 
ferromagnetic case. 

The Coulomb repulsion C can be estimated as follows. 
The outermost 3d electrons see at least 1.6 unscreened 
nuclear charges. But since there are at least two of these 
outermost 3d orbits, the average unscreened nuclear 
charge in these orbits must be at least 2 so that such an 
outermost 3d orbit can be thought of as a hydrogenic 3d 
orbit in the field of an He nucleus. If we think of one of 
the two mutually repelling electronic charge distribu- 
tions as completely concentrated at the nucleus (this 
tends to overestimate the value of C, but the hydrogenic 
3d orbit has been chosen rather conservatively), C 
becomes equal to 4/9 Ry. The Coulomb repulsion 
energy is, therefore, (0.6— 5)/9 Ry per hole for the 
ferromagnetic and (0.6—p)/6 Ry per hole for the 
nonferromagnetic case. 

'The surplus of Coulomb-repulsion energy for pure Ni, 
i.e., p=0, corresponds, according to our rough estimate, 
to about 10 000°K in the ferromagnetic and to about 
150009K. in the nonferromagnetic itinerant model, 
whereas the Curie temperature in Ni is found to be 
650°K. It seems futile to discuss the itinerant model 
further without a serious attempt to find a modified 
wave function which is not beset by the large surplus 
of Coulomb repulsion in either the ferromagnetic or the 
nonferromagnetic state. 

Since there are many configurations ® with not more 
than u+1 and not less than u occupied valence states 
on any lattice site, a good wave function V for the whole 
crystal is expected to be a linear combination of all such 
configurations. Indeed any configuration 9 by itself 
does not give a particularly low expectation value for 
the kinetic energy and crystal potential terms in the 
Hamiltonian which correspond to a single electron 
moving in an effective potential of the lattice symmetry- 
Exactly these last terms were minimized by the 
itinerant model. They constitute the first order if the 
terms in the Hamiltonian are arranged in ascending 
powers of the overlap R between valence orbits On 
neighboring sites. The Coulomb repulsion C belongs to 
the zero order in such an expansion. 

The basic idea is then to enforce the result (26) upo? 
the itinerant model wave function by eliminating those 
configurations which violate the requirement (26) at 
some particular lattice site. Such a correlation corre- 
sponds exactly to the correlation in the Heitler-London 
wave function for the hydrogen molecule. But since we 
have now to do with 3d orbits the requirement (26) still 


0:6—p\ = 
—} nonferromagnetic. 
4 


|] 
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leaves many configurations at each lattice site, so that 
some additional correlation beyond the Heitler-London 
type (26) seems indicated. In particular Hund's rule 
should now be enforced at each lattice site. 


4. A CORRELATED WAVE FUNCTION FOR THE 
VALENCE ELECTRONS 


The ground state Y in the itinerant electron model 
is obtained from the vacuum state o by creating JV 
electrons in Bloch states V4 (x). Therefore, we have 


Y= II C ao 7 (27) 


(xk) 


where the set (xk) of occupied Bloch states is chosen in 
some manner, e.g., such as to minimize the expectation 
value of the total energy. The particular choice of the 
Bloch functions Y, and the set (kk) is not under dis- 
cussion right now, and we shall assume that both the 
functions V,;(x) and the set (xk) have been determined 
once for all. 

Wo is now expanded into configurations ® of localized 
valence states by the method described at the beginning 
of the last section. The new (correlated) wave function 
W is then obtained by transforming each configuration 
(a1 f1,°-+,efar) in the expansion of Yo into a linear 
combination of related configurations (yn, - - - yyarfar)- 
Most generally this transformation is described by an 
operator whose matrix elements give the coefficient of 
the configuration (yn, - - - yia) in the transforma- 
tion of the configuration (o fi: - joa fa). 

A more specific assumption has to be made. As the 
most stringent assumption we will assume that the 
above operator O does not transfer electrons from one 
lattice site to another. Then it is helpful to simplify 
matters by assuming O to effect transfers of electrons 
within one atom independently of the transfers of elec- 
trons in neighboring atoms. We have, therefore, a set of 
operators O9, 0, QO, --- applicable to lattice sites 
which are occupied by 0, 1, 2, --- electrons in the 
valence shell. The determination of these operators O is 
essentially equivalent to bringing about the necessary 
correlation of the valence states. In terms of the creation 
and annihilation operators we can write 


O= > E (n+ |O™ lar: a) 


i779) (Ga 7 797) 
Xa, Y a bas i Ser Eds ay (tp , (28) 


to be summed over all different groups of » indices 
(a1-+-a,) and. (yı: - y»). In the case of 3d bands there 
is a maximum of 10 different indices. 

For example the requirement (26) is enforced by 
setting all Q(2 —0, except QG =O — 1. This consti- 
tutes the generalization of the Heitler-London model 
for the hydrogen molecule. A somewhat weaker corre- 
lation is brought about by allowing the O™ to deviate 
from the above value, but keeping each O® a multiple 
of the unit matrix. 
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It is now evident how Hund’s rule can be favored. 
The configuration at each lattice site can be expanded 
into the states corresponding to the various spectral 
levels. The operators O can now be chosen so as to 
eliminate certain unfavorable spectral levels, or at least 
to reduce their contributions, whereas the levels which 
are lowest according to Hund’s rule can be left un- 
reduced. Therefore, the operators O may have off- 
diagonal elements. The values of the matrix elements 
of O can be determined variationally. 


5. ALTERNATIVE FORMULATION FOR THE 
CORRELATED WAVE FUNCTION 


If one is interested in a metal with only few electrons 
in the valence band, it is natural to expand all the 
results in powers of the density of valence electrons in 
the lattice. The operator O of the previous section is 
then best replaced by a slightly different expression 
which will be defined in this section. 

'The operator O will be expressed in terms of matrix 
elements 25— 1, 21(y,o), Us (yvvs,002), etc. These matrix 
elements are numbers out of which (yi---y,]O?] 
01:*:0,) can be computed by taking all kinds of com- 
bination of ds, vi, d», --- and summing them. 

First there is a summation over v from 0 to 10 and for 
each v a summation over all different v-tuples (a1---a,). 
Second, for each v there is a summation over all parti- 
tions P? of the integer v. Third, for each partition 
there is a summation over all possible groupings of the 
v-tuple (ær * +œ») according to P, and over all permuta- 
tions of (y1- +y»). A particular grouping of (a1: +æ») is 
written as (ol---o7) where o! - - -a^ are associated with 
the 1’s of the partition P, (a#™a#t*?), (arrar), «+. 
with the 2’s, etc. The permutation of (y::-y,) is 
written (7!---7’). With these preliminaries one can 
now write 


1 10 
0== 2 yl Ey 2 


@ 7=0 (ai-::ay) (yi---3,) P") groupings permutations 
of (ag*++ay) of Cris v0 


Xtra dlre) An “Bg (zar yan argat?) UTE 
Xa, z aba " bar. (29) 


The term v—0 is put equal to 1. 

The factor 1/c expresses the fact that the same wave 
function Y. results, if the left-hand and the right-hand 
sides of (29) are just multiples of each other with the 
same factor for all operators. 

Since the operators O(? have an intuitive interpreta- 
tion according to the preceding section, the problem 
arises of finding the matrix elements 9 from known 
operators O by solving Eq. (29). The special case of the 
O's being chosen according to the requirement (26) is 
treated in the Appendix A. We shall assume that o=1 


from now on. This implies that O9 —1, or in other — 


words, that a configuration will not be discriminated 


against for having no electrons at all at some particular — 


lattice site. Such an assumption is only reasonable 


Ce a ee amat. 
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Fic. 1. Two equivalent vertices corresponding to s. 


there are fewer valence electrons on the average in the 
lattice than lattice sites. à 

The theory has been phrased in terms of electrons, 
but in the case of almost-filled bands it is more appro- 
priate to consider the holes in the valence bands. This 
is done simply by switching the positions of at with 
respect to b in (28) and (29). The projection operator 
1—P-1i—a!b—ba! appearing in (28) then goes into 
a'b— P, which tests the absence of a hole exactly as 
1—P tested the absence of an electron. Again, an 
equivalence between (28) and (29) can be established 
only in the cases where there is no physical reason to 
discriminate against a configuration for having no holes 
at all at some particular lattice site. Since the present 
investigation is directed mostly toward the understand- 
ing of the Ni-Cu alloys, all the following calculations will 
be done for the case of an almost-full valence band. 

A picture can be associated with any particular term 
in (29), if each factor d; is represented by a heavy dot, 
called a vertex, with v directed lines across. The end- 
points of the lines are labeled on the incoming end by a, 
and on the outgoing end by y. A combination of such 
representations of #’s is called a picture. Two pictures 
are called equivalent if they can be brought to congru- 
ence including the labeling of lines, by shifting their 
parts. Figure 1 shows two equivalent vertices. 

Conversely, given a picture, i.e., a combination of 
vertices with labeled ingoing and outgoing lines, one 
associates a product of 's and operators at, b with it. 
To each vertex corresponds a factor v!#, with the 
arguments occurring in the picture. Formula (29) can 
then be expressed as a sum over all pictures to be made 
with two sets of labels (o1---o10) and (yi: - 19). 


6. CALCULATION OF EXPECTATION VALUES 


The correlated wave function V is now given by 


T= 0M, (30) 


where O, is the operator (29) belonging to the lattice 
site g. Since Yo represents almost filled valence bands, 
at is replaced formally by b everywhere, and vice versa, 
everything else in (29) remaining at its place. The first 
thing to be done is the computation of the absolute 
value (V |) of the correlated wave function Y. 

Formula (29) is inserted in (30) by attaching an 
additional index g to the 9,, br, and aa. The product 
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over the lattice sites is worked out for both Y and Y^, 
the scalar product is taken term by term, and the results 
are added up. Each term in this last sum can be repre- 
sented by a picture in the following way: 

To each 2,, or 2",, associate a heavy dot called a 
vertex, to be labeled by the lattice site g. Each operator 
aag associated with @,, is represented by a solid line 
with an arrow going into the vertex of 2,,, and each 
operator Gag associated with à*,, is represented by a 
solid line with an arrow coming out of the vertex of 
3^ „o: Each operator byg associated with ,, is represented 
by a dotted line leaving the vertex of Pus, and the 
operator 0*4; associated with 2",, is represented by a 
dotted line going into the vertex of S'po The lines are 
labeled exactly as the operators they represent. Figure 2 
is an example of such a picture representing a term in 
(V |). Two pictures like Fig. 2 are called identical if 
they can be brought to congruence including labels and 
directions by merely shifting the parts of the picture 
without ever detaching a line from its vertex. If one 
permutes the arguments yi: * "Yu ind, the same picture 
arises according to this definition. 

Conversely, given a picture like Fig. 2, it is im- 
mediately possible to write down the corresponding 
term in the evaluation of (Y |V). For each vertex with 
y. incoming solid lines one writes an operator 


pug (Yar Yue" 7 Oty) Dra" i ina i ia 


with the corresponding labels, and analogously for each 
vertex with y outgoing solid lines. All operators belong- 
ing to ad" are written to the left of all operators belong- 
ing to ad. The order of operators belonging to different 
lattice sites is immaterial. Operators belonging to the 
same lattice site have to be pulled together in the order 
suggested by (29). A particular label Bg on at can then 
not occur more than once, and similarly any particular 
label on b, bt, or a cannot occur more than once. The 
freedom left in ordering the operators belonging to 
different lattice sites cannot change the expectation 
value of the whole operator product because of the 
simple anticommutators (12) for at and 5. 


1 

j aj! a, 
“boat XM l ! 
5 VA SS rr i = 
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Fic. 2. Example of a picture for a term in (30). 
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The absolute value ('V|W) becomes now a sum over 
all different pictures like Fig. 2, where one associates an 
operator with each picture according to the rules of the 
previous paragraph, and then computes the expectation 
values of that operator with the wave function Wo. 

The computation of an expectation value like 
(V|b.b,,|W) which arises from the one particle 
operator (16) is done in almost the same way. bfas is 
represented by a small cross labeled af with an ingoing 
dotted line, and bya is represented by a small cross 
labeled yh with an outgoing dotted line, in addition to 
all the features in the pictures for (¥|W). Everything 
stays the same as with (V|W) except the operators 
associated with ð" have to be written to the left of 
bt aby, and the operators associated with 3 stand to 
the right. The small crosses associated with bfas and bx 
are called external points to distinguish them from the 


vertices (heavy dots). 
7. EVALUATION IN TERMS OF DIAGRAMS 


The expectation value of the operator associated with 
a picture like Fig. 2 can be evaluated for the uncorre- 
lated wave function Y, with the help of Wick's theorem. 
According to this theorem one has to run through all 
complete systems of contractions, that is through all 
ways of grouping the annihilation and creation operators 
into pairs. The contribution of such a complete system 
of contractions is given by the product of the expecta- 
tion values of each pair in the contraction for the wave 
function Yo. The order of the operators in each pair has 
to be the same as in the original operator, and there is 
a sign to be determined from the number of permuta- 
tions necessary in order to bring the partners of each 
pair together. 

In the case of the uncorrelated wave function Yo only 
the pairs consisting of one annihilation and one creation 
operator give a nonvanishing result. There are four 
types of such pairs and their expectation values are 
easily computed with the help of the definitions (10) 
and (11), as well as the relations (5). They are 


(Yol tajli yh | Wo) =W (a. f,yh) s 
(Yol basa’ | Wo) = 2 S(af Bg) W (Bg,Yyh) , 


(Wo| aa. ynl Vo) — 2 W (a f,8g)S (Bg,vP) > (31) 


(Wo] 5,55, | Wo) 5 raf) 
— X S(vhB"g")W(8"g^,82)5(8 go), 


B' g' B'* g'* 
where W and S are defined by 


1 
W (a f, yh) —— = Omen UP EE 
L (xk) 


1 
S (af,yh) T 2 V aka V xk vh . (32) 
xk 


TRANSITION 


A931 


METALS 


Fic. 3. Simplest diagrams for py. 


The summation over (xk) means that only those indices 
kk are included in the summation which correspond to 
empty Bloch states of Yo. It is easily checked that the 
matrix S(of;yk) is just the inverse of the overlap 
matrix R(a f yl) defined in (7). 

It is immediately evident from writing down the 
operator corresponding to a picture like Fig. 2 that 
a and at, as well as bf and b, have always to be paired 
in the sequence in which they appear in the first and 
last formula of (31). However, the pairs made up of 5 
and at, or of a and bt, may occur in either sequence, 
except if the two partners of such a pair belong to the 
same lattice site. In that case b always precedes a’, and 
a precedes bt, exactly as in the second and third formula 
of (31). If b and at, or a and bf do not belong to the same 
lattice site, the commutation relations (12) tell us that 
only the sign of the expectation value changes when the 
order of the partners in the pair is changed. But this 
sign change is accounted for when the over-all sign to 
be attached to the product of the quantities (31) is 
determined. 

A complete system of contractions can be represented 
with the help of pictures like Fig. 2 in the following 
way. A pair of the type aat is represented by simply 
connecting the two solid lines representing a and af, 
the arrow points from a to at. Similarly a pair of type 
bat is represented by tying the dotted line of 5 to the 
solid line of at, inserting a short cross line such as in 
Fig. 3 at the point where the dotted and the solid lines 
meet. This short cross line marks the end of the dotted 
and the beginning of the solid line, and corresponds to 
the summation over 8g which appears in the formula 
(31) for (Wo|basatyn| Vo). A similar treatment is given 
to a pair abt. Finally, since a pair bfb gives rise to two 
terms as shown in (31), it will be represented in two 
ways as shown in Fig. 3, first by simply connecting the 
two dotted lines of bt and b, and second by joining the 
dotted line of 5f with the dotted line of b through the 
intermediary of a directed solid line. The arrows go 
from b to bt which is reflected in the inverted arguments 
of the last formula in (31) as compared to the preceding 
formulas in (31). The minus sign appearing in the last 
part of (Wo|d'asbys| Yo) will be absorbed in the general 
rules determining the overall sign. This pictorial 
representation of a particular complete system of 
contractions by connecting the appropriate lines in a 
picture like Fig. 2 is called the diagram associated with 
that system of contractions. Two diagrams are identical 
if they can be brought into complete congruence (in- 

cluding labels and arrows) by shifting the lines in the 
drawing without ever breaking any of them. 

In the present formulation the particle lines (solid or 
dotted) run from one vertex to another, but they do not 
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run through a vertex. None of the solid lines at some 
vertex is in a particular relationship with any given 
dotted line belonging to the same vertex, and vice 
versa. However, if one starts from a given diagram in 
order to write its contribution to (Y|W), the result 
depends on the order in which the arguments o and y 
are listed in the # corresponding to some particular 
vertex. In establishing the over-all sign to be attached 
to the contribution, it is convenient to adopt the 
following rule: The particle line ending in o; at some 
vertex 2, continues on the other side with the particle 
line labeled y;, and the particle line ending in y; in 3,” 
continues with oj. In this manner all particle lines are 
joined two by two to form so-called loops. A closed loop 
closes itself in the diagram, an open loop ends at external 
points, which are the ones represented by small crosses. 
The over-all sign to be associated with a particular 
diagram is obtained in the following manner. The solid 
and dotted lines are organized into loops according to 
the rules of the previous paragraphs. A factor (—1) is 
now inserted for each segment of solid line in the 
diagram and for each closed loop. 

In the original picture, e.g., in Fig. 2, the labels 
associated with outgoing solid lines were all different 
from one another; similarly, the labels of the incoming 
solid lines were all different; and similarly for the dotted 
lines, excepting the labels of dotted lines attached to 
external points. The expectation value associated with 
a picture like Fig. 2 is the sum over the contributions 
from all different diagrams to be drawn from the 
picture. The sum of these contributions is easily found 
to be an antisymmetric function of the labels associated 
with solid outgoing lines, of the labels associated with 
solid incoming lines, etc. As long as the over-all sign 
and the individual factors associated with a diagram 
are determined by the above rules, using in particular 
only the pair expectation values (31), it is formally 
possible to let labels of outgoing solid lines coincide, or 
to let labels of incoming solid lines coincide, etc. The 
sum over all diagrams associated with a picture con- 
taining identical labels on outgoing solid lines, or on 
incoming solid lines, etc., will give a vanishing result. 

"Therefore, it is now possible to let all the labels in a 
diagram run through all admissible values, independ- 
ently of one another. The restriction to admissible 
values refers to the fact that only localized valence 
states are involved, but no conduction states. The over- 
counting resulting from this mutual independence 
makes it necessary tc.divide by 


pilu!” P O (1 (pita (2) t2. a0 


where unu» :-- is the number of 21,95,: ` :, and v1,v2,--- 
is the number of 371,3"2,--- in the diagram. 

In summary, the expectation values of Y can be 
written as a sum over all different diagrams. A diagram 
consists of vertices, solid lines, dotted lines, and possibly 
external points. There are two kinds of vertices, with 


rS outgoing solid (incoming dotted) lines and with in- 
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coming solid (outgoing dotted) lines. There are two 
kinds of external points, with one incoming dotted line, 
and with one outgoing dotted line. Each line going out 
of one vertex (or external point) has to be tied to a line 
going into some vertex (or external point). A solid line 
is tied directly to another solid line; a solid line is tied to 
a dotted line, and vice versa, by inserting a short cross 
bar at the joining point; a dotted line can be tied either 
directly to another dotted line, or indirectly through 
the intermediary of a solid line segment, as shown in 
Fig. 3. Each vertex is labeled as a lattice site, and the 
lines going into and coming out of a vertex carry labels 
of band indices for localized valence states. Each 
external point carries the label of a lattice site with band 
index. The lattice site labels on the vertices and the 
external points are all different from one another. 
Similarly the band index labels on any particular vertex 
are different from one another. Two diagrams are 
different, if they cannot be brought to coincidence in- 
cluding the direction of lines and the labels on them by 
simply shifting the vertices on the paper without 
breaking any lines. 

The contribution of a diagram is obtained as follows: 


(1) To each vertex associate a factor à, or 9^, where 
p. is the number of incoming or outgoing solid lines; 

(2) Attach a different lable 8g to each short cross bar 
in the diagram; 

(3) With each solid line from a f to yh associate a 
factor W (af,yh); 

(4) With each dotted line from af to yi associate a 
factor S@fyyi) ; 

(5) Insert a factor (—1) for each solid line and each 
closed loop in the diagram ; 

(6) Insert a factor 


1/py!po!- ++ pylpgl--- (A!) ras (21) zin Uy 


where u is the number of 21,45» the number of 45, etc., 
y; the number of #1, etc. in the diagram ; 

(7) Sum independently over all labels excepting 
naturally the labels of external points, the labels 
associated with vertices run only over the band indices 
of localized valence states, whereas the labels associated 
with short cross bars run over all band indices (valence 
and conduction). 


A. special rule applies to the diagrams with more than 
one pair of external points, such as occur in the com- 
putation of Q; in (18) which involves the operators 
Db sub afibrihbyana: Lf the open loops go from yl to 
a»f» (rather than o4fi) and from y2hz to aif; (rather 
than af.) an extra factor (— 1) has to be inserted. 

In the following discussion of particular diagrams it 
is convenient to distinguish various types of labels 
which appear in the diagrams by the following rules: 
The double label consisting of a small Greek letter for 
the band index and a small Latin letter for the lattice 
site is replaced by one capital letter. An external point 
with outgoing dotted line is labeled I’, and one with 
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incoming dotted line is labeled T”. An outgoing dotted 
line on a vertex is labeled Æ’, and an incoming dotted 
line on a vertex is labeled Z”. A short cross bar is 
labeled G^ when it joins an incoming solid line to an 
outgoing dotted line, and it is labeled G" when it joins 
an outgoing solid line to an incoming dotted line. 
Finally, an incoming solid line on a vertex is labeled F’ 
and an outgoing solid line on a vertex is labeled F”. 
Clearly, the labels I’ and T” are fixed, the labels F’, F”, 
H', and H" run through all localized valence states in 
the lattice, whereas the labels G’ and G" run through 
all localized states (valence and conduction) in the 
lattice. Also the labels I" and I" may stand for valence 
as well as for conduction states. 


8. LINKED DIAGRAMS 


Any part of a diagram which can be completely 
separated from the remaining diagram without breaking 
any line, is called an unlinked part. Once a diagram is 
divided up completely into unlinked parts, its total 
contribution can easily be obtained from calculating the 
separate contributions for each unlinked part, the total 
being equal to the product of its parts. 

Unlinked parts without external points are called 
closed linked diagrams, if they cannot be broken up 
further into unlinked parts. The contributions of all 
closed linked diagrams appear simply as a factor 


expl 2 A] (33) 


closed linked 


to the diagrams with external lines. A stands for the 
number calculated, according to the rules of the previous 
section, for a particular closed linked diagram. 

Since (33) is just the normalization integral (VV), 
this factor divides out if p; and p» are computed with a 
normalized correlated wave function Y. Therefore, we 
can write with the help of (16) and (18) 


pey)= E er(a)e"r-() 
xm 


open linked with 
2 external points 


p(&; my) =401(E,n) pr (x,y) mi (E,9) 01 (2,2) 


ATT”), (34) 


5 em(E er4(x)| 
onrada D op (P) grala) 
eran en) 
eran en") 


x SD, NOME IGE (6S), 
open linked with 
4 external points - 

The interest of the last formula lies in the net 
separation of the "properly correlated” part of p» by 
the diagram analysis, i.e., the part of p» which cannot 
be simply understood as the effect of many-particle 
statistics and the exclusion principle. 
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According to the last two formulas it is only necessary 
to obtain the contributions of all open linked diagrams 
with two or four external points. These diagrams can 
be obtained in a systematic fashion from the closed 
linked diagrams, of which there are fewer, of course. 

A closed linked diagram can be associated to each 
open linked diagram by simply joining the end points of 
each open loop, i.e., joining I" and F”, T^; and I^, 
I" and I". In doing this, a diagram may arise which is 
not allowed under previously established rules. Indeed, 
the external points are always reached by a dotted line, 
but these dotted lines may not be attached at their 
other ends to a point H’ of H”. Instead, they may both 
be attached to a point G' or G". The closing of such an 
open loop would then lead to a sequence solid-dotted- 
solid line where neither end of the dotted line is attached 
to a point //' or H”, contrary to our rules. However, in 
view of (5) and (32), such an illegal sequence may be 
contracted into one solid line, and in this way a legiti- 
mate closed linked diagram is obtained. 

Conversely, it is evident that we obtain all open 
linked diagrams if we open in turn every solid or dotted 
line, or every couple of solid and/or dotted lines in all 
closed linked diagrams, provided the diagram does not 
decay into two unlinked parts. The rule for opening 
solid or dotted lines is simple. First write the expression 
for the closed linked diagram. Second, in order to 
obtain the contribution to p; make in turn the following 
replacements. 


W(HF)— Y; W(HG')S(G's")S(1"G")W (GF), 


GGG!" 
(36) 
S(HF) > —S(HY")S(I"F), 

for every factor W and S. Third, in order to obtain the 
contribution to the properly correlated part of p; make 
the replacements (36) in turn on every couple of factors 
with the pairs of external points (P, ^1) and (T^5,T^;). 
The over-all sign remains the same as in the closed 

linked diagram. 


9. THE FIRST-ORDER DENSITY FUNCTION 
The original expression for pı contains two terms 
which cannot be associated with a closed linked dia- 
gram. Their diagrams are given in Fig. 3, and their 
contributions to p; follow directly from the last formula 
(31) together with (3) and (11) 
D Va(2 a), (37) 


[<k] ^ 


where [x] is the set of Bloch states which are occupied 
by electrons. The complementary set, occupied by holes, 
is designated by (xk). This is just the pı from the un- 
correlated wave function Yo which forms the base for 


the itinerant electron model. 
It is of interest to notice that the. total number of 


electrons in the lattice 


v= Í dips ay) (38) 
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Fic. 4. Examples of bubbles. 


is correctly given by (37). The correction terms which 
come from more complicated diagrams have to cancel 
one another in the integral (38). This fact is easily seen 
from (36) if we note that the integral (38) for any open 
linked diagram with two external points is equal to 
(2:1) times the contribution of the corresponding 
closed linked diagram; +1, if the open diagram resulted 
from opening a solid line; —1, if the open diagram 
resulted from opening a dotted line. A contribution to 
N from opening a solid line in a closed linked diagram 
is exactly canceled by the contribution to NV from 
opening a dotted line in the same closed linked diagram. 

This last property of the diagrams enables us to order 
the contributions to p; according to powers in the 
density m of holes in the valence band. m is defined as 
the ratio M/L where M is the average number of holes 

in localized valence states for the whole lattice. The 
order of some diagram is simply the power of m which 
occurs in the contribution of the associated closed 
linked diagram. 

There are more terms of order one than (37). They 
arise from the closed linked diagrams which contain 
only ð, but no 25, 95, etc. Due to the simple structure 
of these diagrams they can all be evaluated. Their 
contribution together with (37) leads to substituting 
each occupied state V,i(x) by a state V (x). 

It is shown in Appendix B, how taking account of à; 
throughout all possible diagrams is tantamount to 
redefining W in terms of V; rather than v, and to a 
proper modification of the operators 2», 9s, --- 

Another class of simple diagrams for pı is provided 
by the opening of the dotted and the solid lines in one 
of the two partial diagrams in Fig. 4. The vertices to 
which these “bubbles” are attached have other ingoing 
and outgoing lines, but the additions implied in all 
those other lines can be performed even before opening 
the bubbles. The result appears to the bubbles exactly 
as, would have appeared to them. Quite generally, 
any part of a diagram which can be reduced to 2; by 
taking loops off a vertex, appears to that diagram like 
a ð insertion. Therefore, the effect of all such 
“dangling” parts in a diagram can be treated exactly 
as the effect of 24 contributions was treated in 
Appendix B. 

The following result emerges: The treatment of vı 
insertions is extended to include the ‘‘dangling” parts 
of a diagram. All that appears in the redefinition of 

Appendix B is the sum of # and all possible dangling 
parts. Since #1, 0», ds, - -- are essentially determined by 
the assumptions concerning 09, Q9, O® --- it seems 


GUTZWILLER 


that one is not free to choose 2; so as to cancel all the 
dangling parts. But the freedom to choose Wo still 
remains, and one might determine vı, 95, 95, --- varia- 
tionally in order to determine O€?, 0, O9, --- after- 
wards. In that case it is just as well to take 2,—0 and 
neglect all diagrams with dangling parts. We shall 
adopt this last viewpoint in the future development. 

Up to the second order in the density of holes there is 
only one closed linked diagram out of which contribu- 
tions to p; arise. This diagram is given in Fig. 5. It is of 
second order in 9». With assumed values for the con- 
stants determining the band structure, and for the 
overlap integrals, the band-structure term in the 
expectation value of the energy becomes, therefore, 
quadratic in 9». The Coulomb-repulsion term related 
to these second-order terms in p; is not only quartic in 
2», but also in the density of holes, so that it can be 
neglected with respect to the band-structure term 
arising from p1. 

The explicit calculation of p; is done most easily and 
most usefully in terms of the Fourier transforms with 
respect to the lattice sites. Since 2» differs from zero 
only when all of its arguments are at the same lattice 
site, the Fourier transform does not affect à» which 
simply becomes independent of the momenta exchanged 
between its in- and outgoing dotted and solid lines. 
Since the band indices are all carried along explicitly, 
the wave vectors & are always to be reduced to the first 
Brilouin zone. This implies conservation of wave 
vectors at each vertex up to a vector of the reciprocal 
lattice. 

The terms in pı corresponding to the diagrams of 
Fig. 5 can easily be written down if the overlap R(af, yh) 
is simplified to 35,4875,. The localized orbits are then 
effectively Wannier states, and the two vertices of the 
diagrams in Fig. 5 have to be at the same lattice site. 
The particle propagator W(F’’F’) has to be evaluated 
at the same lattice site f'— f". It is shown in Appendix 
C that these assumptions have the following conse- 
quences. W differs from zero only if the orbits with 
band indices o" and œ” belong to the same irreducible 
representation of the crystal symmetry operations 
around the fixed lattice site f’=f”. For subsets (o) 
and {a’’} of the same irreducible representation, W is a 
multiple of the unit matrix. 


NA B 
NES ec x^ \ 
XX 


Fic. 5. Only closed linked diagram contributing to p; in the second 
order of density, and the two pı diagrams arising from it. 
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As long as the spin variable is specifically excluded 
from transformation under the crystal symmetry 
operations, W may have nonvanishing elements be- 
tween orbits of the same irreducible representation, 
but different spin directions. Such a situation is typical 
of an uncorrelated wave function Wo which describes an 
antiferromagnetic metal or one with some kind of spiral 
structure. Although such cases are not hard to handle, 
we shall simplify matters by writing down our formulas 
in their absence. It follows then that 


Wel fro! f)=Mba' a" , (39) 


where Ma is the probability of finding the orbit 
a=a’=a" at f occupied by a hole. 
The Fourier transforms are written in terms of 


ug, (R) — e Us, xk. (40) 
According to (4) this quantity indicates the contribu- 
tion of V,,(x) to the localized states of index 8. There is 
no simple relationship between « and 8. In the most 
general case the index « simply numbers the consecutive 
Bloch functions belonging to the wave vector k. How- 
ever, if the picture of a conduction band as somehow 
distinguishable from the valence bands is at all signifi- 
cant, the following assumptions about eg,(E) can be 
made. For given £ and « the quantity wg,(E) varies with 
the wave vector & fairly abruptly in such a manner that 
in most regions of the Brillouin zone V4(«) can be called 
either valence-like or conduction-like. Since ug«(k) is 
assumed to be a continuous function of £, the transition 
regions have a nonvanishing but small volume in the 
reciprocal space. 

The propagation function W(ef,yh) of (32) is now 
written as 


1 
Wefyh) == 35 w^ 4 (E)us(E)e** 9-9 . (41) 


L (xk) 


According to the above assumptions concerning 25,(£) 
the propagation of a hole from af to y is done mainly 
by those terms xk in the sum which have the same 
character as o and y. In particular, if a and y have 
different character W(ofyy/) is small, and if œ and y 
refer both to valence states W(a,f,yk) does not exceed 
the density m of holes in valence states. It is this last 
statement which allows us to classify diagrams accord- 
ing to the number of solid lines in them. 

The contribution of the diagrams in Fig. 5 to pi 


Fic. 6. Schematic dis- 
tribution of holes in 
Teciprocal space with 
(drawn out) and with- 
Out (dotted) correlation. 
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Fic. 7. Diagrams con- =z 
tributing to the properly 
correlated part of p; in the 
second order of density. 
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The summations (x) and (A) are restricted to the labels 
which are occupied by holes in Wo, and the summation 
over (8'8") goes over all pairs of valence state indices. 
Since all indices a, £, 8’, 8", y refer to valence states, 
only those indices x and contribute to (42) which 
correspond to valence like states. The interpretation 
of (42) is therefore meant to apply only to valence 
states. 

The first term in (42) describes a decrease in the 
occupation probability for those states which were 
originally fully occupied by valence holes. The second 
term spreads the holes which were freed by the first 
term over the whole available reciprocal space of 
valence character. The situation is schematically shown 
in Fig. 6. The kinetic energy and crystal-potential terms 
in the total energy can be obtained immediately. 


10. THE SECOND-ORDER DENSITY FUNCTION 


The closed linked diagrams which give contributions 
to the properly correlated part of p; have to have at 
least two solid and two dotted lines. On the other hand, 
as long as we are interested only in terms up to the 
second order in the density of holes, the only diagrams 
to be considered have not more than two solid lines. 
This shows that the only contributions of interest come 
from the diagrams in Fig. 7. Again these diagrams are 
at most of the second order in às, so that the determina- 
tion of d» by minimizing the expectation value of the 
total energy for the correlated wave function Y is 
straightforward up to the second order in the density 
of holes. 

The case of small overlap between valence states is 
of particular interest. The largest terms in the Hamil- 
tonian are the Coulomb integrals with all orbits belong- 
ing to the same lattice site. The contribution of the 
diagrams above can then be evaluated quite easily, 
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- Correlation (44) forces fewer holes to be scattered 
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Fic. 8. The diagrams which contribute to the formula (43). 


because in the lowest order of density and overlap only 
the diagrams in Fig. 8 with all external points at the 
same lattice site as that of à» make a contribution. 

The computation is straightforward for the same 
assumptions as were used in the derivation of (42). 
Together with the contribution of the valence hole term 
in (37) to the first term in (35) one finds for the terms 
in p» which are quadratic in the density 


ea; (X) (3) 
Qa; (X) Paz (E) e (9) e 4,1) 


Qj Q2 Qj Q2 
x » E jw ) pens 
(pL Nei Be 1 f» 
qm e fifa ^e 
fi “rar” 7)] es 
1 Be 1 B» 


at each lattice. The summations go over all different 
couples of valence states (o10:), (8182), and (yry2). The 
function ô= 1 if the couples (050?) and (8162) are equal, 
otherwise 0— 0. 

The function (41) is positive definite for x— y and 
=n. For given values of the densities mg the à» can be 
chosen such as to make p; vanish. Such a requirement 
is reasonable if the one-center Coulomb and exchange 
integrals are much larger than the nearest neighbor 
overlaps divided by the atomic distance. This is just 
the assumption of narrow bands, and it leads im- 
mediately to a formula for #2, namely 


"ai Q2 O1 Q? 
2a F =| ): (44) 

M 2 Yı Y2 
With this value for V+ one can compute the corrections 
to p up to the second order in the densities mg according 
to the preceding section. The redistribution of electrons 
in reciprocal space because of the strong correlation (44) 
is different according to the distribution of electrons 
in Yo Therefore, (44) already leads to a difference 
between ferromagnetic and nonferromagnetic arrange- 

ment of electrons in Y. 

'The advantage which correlation (44) gives to the 
ferromagnetic state, arises from having to overcome 
less accidental crowding of holes on the same site in Yo. 


ga) ea) 


(a1a2) (y1y2) 
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outside of the Fermi surface if the holes all have the 
same spin direction. Correlation (44), however, prevents 
the electrons from taking advantage of the energy gain 
they experience when two of them crowd into the same 
lattice site with their spins parallel. 

Quite generally some energy gain can be expected if 
the 9» as given in (44) is reduced by some small 
quantity e, 


Qj Q2 Qj Q2 Qj; Q2 
oo( ) -( Jed ) , (45) 
2m E vt V2 net qe 
since the correlation energy connected with (43) in- 
creases with €, whereas the single-particle energy due 
to the redistribution of electrons in reciprocal space 
decreases with e. The coefficient of the former increase 
is of the order of the one-center Coulomb integral C, 
whereas the coefficient in the latter decrease is of the 
order of the valence bandwidth Æ. Stabilization occurs 
at e= E/C ; the energy gain per lattice site is E?/C times 
the density of holes squared. 

'The constant C to be inserted again depends on the 
spin arrangement in V. The expression (44) has to be 
integrated over x= y and =n with the kernel ¢*/ |«— £|. 
The resulting integral over the products of the 2 by 2 
determinants is just equal to the Coulomb integral 
minus the exchange integral in the important case 
(0302) = (yry2). The exchange integral differs from zero 
only when o; and os have the same spin, so that C is 
smaller in the ferromagnetic case, i.e., the energy gain 
from relaxing the correlation (42) is greater. 

Among the off-diagonal elements (aia) (yry») some 
lead to exchange integrals (spin-flip transition). In the 
nonferromagnetic case, these off-diagonal elements may 
be expected to enforce Hund's rule locally to some 
extent. 


11. TWOFOLD DEGENERATE VALENCE BAND 


As the simplest possible application of the formulas 
(42) and (43), the case of a twofold degenerate band 
will be treated. This corresponds to the conditions in 
the Ni-Cu alloys, where the d states are in a cubic 
environment and there are some holes in the Bloch 
states made up of the a?— y? and 22’—a?— y? states. 

With the single particle Hamiltonian Ho for the 
kinetic energy and the crystal potential one has 
Hwxx(%)=Epex(x). In computing the expectation 
value of Ho for the correlated wave function Y a 
quantity similar to the left-hand side of (39) arises. By 
the same group theoretical arguments as in Appendix C, 
it can be shown for valence states a and y that 


1 
— D c(h) Eau (E) — maEs s, , (46) 
L (kx) 


which defines the energy E, the average energy of the 
valence holes in the state a. The expectation value of Ho 
becomes with the help of (42) for the correlated wave 
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function, omitting the contribution of the conduction- 
like states as being unaffected by the correlation, 


Y: (Ba—malia) +4 Ds (C cg) 


BGPA NB' g" 


a 
[manga meme Ee ( P ) (47) 
B Be 
where E, is the average energy of all the states in the 
band a. E, arises formally if in (46) the index xk is 
summed over all values, and the factor ma on the right 
hand side is dropped. The Hermitian character of 
Vo, i.e., 

07 (8/8" aB)=0:(06,8'B"), (48) 
has been assumed in writing (47). The correlation term 
in (47) is positive, because E47» Ea, i.e., the holes in Vo 
have chosen to occupy the highest states available. 

In treating the twofold degenerate band, the total 
density of valence holes is denoted by m, the two orbits 
by a and b, the spin directions by attaching an arrow to 
the orbital label. In the ferromagnetic case each band 
is occupied by m/2 holes per lattice site, which gives an 
average energy Ey according to (46). In the non- 
ferromagnetic case the corresponding quantities are 
7/4 and E,. In computing the Coulomb-interaction 
terms according to (43) we shall assume that only the 
ordinary Coulomb repulsion and exchange integrals are 
different from zero. Therefore, only the terms containing 


did: 
aze] f 2 
|x—yl 
dxdy 
Bae | |> mee lg»), 
xdy 


Cae | es o(y) |? 

e[ [= ol er ar, 
dxdy 

r-eff ea(x) & o(x)- e) e'«, 
x—y| 


are taken into account for the evaluation of (43). Corre- 
spondingly, among the 2» only those will be assumed to 
be different from zero which immediately relate to the 
above integrals. Therefore, the following abbreviations 
are introduced for the nonferromagnetic case 


aj al 
28: Jae 
at al 
4 [eo 
(^ Dex Jerem (50) 


| es) [2] e (1? 


(49) 
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In this terminology the expectation value for tbe 
energy per lattice site is given in the ferromagnetic 


case by 
AE— mE, (n3/2)( —e'(E,— E) 
-cFom/4)8(C—J), (51) 
and in the nonferromagnetic case by 
4E— mE, (m*/8)(En—E){ (1—e4)?+ (1— eg? 
-F2(1— ec--2(1— ep) + 2e} 
+ 2/16) (A €4?+ Beg?+2(C—J)ep? 
-cF2C(ec--er)—4Jece;). (52) 
The remaining terms in the Coulomb repulsion are 
the same in the ferromagnetic and in the nonferro- 


magnetic situations. 
The minima are reached for 


2(E,— E) 
CFG 
with the energy (51) becoming 
m? 2(E,— E)(C—J) 


4E—mEprE——————— (54) 
EMO NAG) EI) 

Since E; is very near the top of the band structure if 
m is small, the quantity 2(Z;—#) can be interpreted 
as the width of the band. For the nonferromagnetic 
case one finds 


(53) 


2(E,— E) 
—A4TX(E,— E)" 
—OBA2(—É)' 
2(E,—E)(C+2E,—2E) 
eo ———————————, (55) 
(C+2E,—2EY—J? 
2(E,—E) 
"Cea 
2(E,—E)J 
 (C--2E,— Ey — ^ 
with the energy (52) equal to 2 
m?[ 2(E,—E)A 
-net 
161 4--2(E, — E) 
n 2(E,—E)B 20. E)(C—7) 
BUB) GCIHPONERUS 
ECHE» —25)— J? PEE 
2———— ——— ———2Ó(E,—E)i. (56 
(Ct2E,—2Éy— 7 ) | ca 
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The quantities e are related by the equation 
€p— ec ej. By virtue of this relation the sum of all 32 
operators commutes with every component of the total 
spin of the atom. A group of atomic states, e.g., the 
triplets made with the orbitals a and b which differ only 
in the direction of the total spin, occur with the same 
weight in Y, provided they had the same weight in Wo. 
‘The expectation value for any component of the total 
spin vanishes at each lattice site, in Y as well as in Yo. 
But the expectation value for the square of the total 
spin at any given lattice site may have changed. 

The occurrence of ey can be viewed in the following 
way. p» as given by (43) becomes diagonal in the two- 
particles wave functions (afal), (bf bl), (et df), 
(al bl), (at b!)+ (al b1), (at 51)— (al b1). The coeffi- 
cients of the last two are m?(1—ec-4-e;)/16 and 
m*(1—ec—e7)/16. The correlated wave function Y 
prefers therefore the triplet over the singlet, even if 
there is no ferromagnetism present. V is an example of 
how Hund's rule can be enforced even in unfilled bands. 

In comparing (54) and (56) one notices that the last 
term in (54) is still smaller than the last two terms in 

(56) provided (E,—E) < (E;— E) (1+-J/C). This condi- 
tion should be satisfied since one expects the d band to 
have a reasonable energy density at its top, so that 
E,—E;XE;—E for small density of holes. If one 
assumes for simplicity's sake that the last term in (54) 
is equal to the last two terms in (56), and also that 
4 — B, then the question is whether or not m(E,— E s) 
is smaller than m? (E, —)A/4(A4-2E,,— 2E). If so, the 
ferromagnetic correlated state V; has a lower expecta- 
tion value for the total energy than the nonferro- 
magnetic V. 

If the band is parabolic at its top, E,— Em, and 
ferromagnetism is not possible in the present theory. 
If the density of states as a function of energy reaches 
a nonvanishing value vo at the top of the band, 
m(En— Ez), is quadratic in m, and ferromagnetism 
occurs, provided 


2v (E, — E)» [14-2(E, — E)/A]. (57) 


The density vo is meant to refer to only one of the four 
bands, arising from the spin and the orbital degeneracy. 
One may consider 2(E,— E) as the width of the d 
band; 2vo(En—E) is then the density of states at the 
top measured against the density of states resulting 
from a uniform distribution of the states throughout 
the band. The inequality (57) requires vo to be larger 
than the average of v throughout the band by a factor 
2[14-2(E,—E)/A ] In particular, this condition is 
satisfied if there is a peak in the density of states at its 
top, such as one may find in the fcc cubic structure. 

It may be interesting to note the following: In a non- 
degenerate band with the Coulomb repulsion integral 
between localized states given by A, the condition for 
ferromagnetism is again (57). However, the ferro- 
magnetic state in the nondegenerate band does not 
have the additional advantage which is due to the last 
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term in (54) being smaller than the last two terms in 
(56). This advantage of the degenerate band over the 
nondegenerate band vanishes only in the case of very 
strong Coulomb repulsion between the two degenerate 
orbits. Therefore, the results of this section bring out 
the importance of degeneracy for achieving ferro- 
magnetism. 


ACKNOWLEDGMENTS 


The author wishes to express his indebtedness 
to Professor R. Brout, Dr. S. Nettel, and Dr. H. 
Thomas for many very helpful discussions and 
suggestions. 


APPENDIX A 


Equation (29) has to be solved in the special case 
where O(? differs from zero only if the set of band 
indices (yi---y,) coincides with the set (a1---a,), the 
factor associated with O being independent of the 
set (yr y,) m (m 7:05). The 2, will be assumed to 
have these same properties. It is then not necessary to 
write down two sets of indices for every operator OO? 
or 9,, and even the common set of band indices no 
longer has to be mentioned since the factors associated 
with O depend only on the number v for O™ or u for 
v, We shall, therefore, only discuss these factors. Since 
the indices (yi---y,) in 9, can still be permuted with 
respect to the indices (o1:::o,) which are always 
supposed to be in some standard order; the quantity u! 
2, will always occur in the forthcoming argument. 

The formulas (28) and (29) have now to be matched. 
We take a set of m indices (1,---,8m and collect the 
coefficients of atg,,---atgbg,---bg,, in (29) and in the 
sum over v of (28), where we have to insert the definition 
(19) of the projection operator Pg. In this manner we 
find that 


1 m! 
— p» Qu... 
Q m-2gi --- =m, uilual- - - 

> O,u22 0,- - - d 


m 


=r cot y? (A1) 


y 


In order to solve this system of equations we multiply 
each side with 2”/m! where z is an arbitrary real num- 
ber, and add over all values of m. This gives 


1 lm 
— exp (Oiz-Eooz^----)— 67 > —00, 
[0] 


v=0 yl 


(A2) 


after some elementary algebra. The terms with m=0 
were put equal to one; w can be fixed by the condition 
QO — 1/o. 

The case 0 —0 presents difficulties in this formula- 
tion. Although they could have been avoided, it seems 
important to bring out quite clearly that equation (29) 
is useless as O tends to zero. O —0 means that any 
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configuration with at least one lattice site completely 
empty is excluded in the correlated wave function. A 
requirement like 09 =0 is absolutely reasonable when 
there are at least as many particles as lattice sites. The 
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Xy—---—- e—a- X 
Fic. 9. Simple p; diagrarns 
containing ^m Rhee sa ccc 
XEM ——MÓÓ 


present theory can, therefore, be expected to break 
down if it is physically necessary to discriminate against 
configurations with completely empty lattice sites. 
However, as long as such a condition does not have to 
be imposed, we can set w=1, since only the relative 
weights between the O(? and between the 2, are 
physically significant. 
The case of greatest interest is 
0=0M=1, QO-e., (A3) 


from which we obtain 


Pizo- =m(1424ć)— 


P »=n—1 [1 n— y k 
-Ecoczz —(C "C)- eo 
n=1 =0 n— y v 
This gives for 31, 2», --- the values 
21—0, 2,— —}(1—e), 


ds=4- (4/2), 9-—1(1—6e2), (AS) 


The chain of equations can be broken off at any point. 
If w is left explicitly in the relation between 9, and 
O the conditions 


etc. 


1 
OO= (A6) 


@ 


Qe, 


oW=1, 


lead to the equations 


(1 +oz+ «-) —z-—d0--)0:2---- (A7) 


[0] 
91—o—1, Pairs sand 


w € [^2 
seo 7) , $4— ——(e?—0ve4-€), etc. (A8) 
$ 2 4 
The case e—1, w— © is physically sound, but 
mathematically useless in terms of #1, 9», etc. 


APPENDIX B 


The two diagrams of Fig. 3 which contribute to (37) 
can be interpreted as follows. The first diagram gives 
pı for the wave function where all Bloch states xk are 
filled. The second diagram subtracts from the first the 


contributi 
Ontribution X Va GV a (y) (B1) 
(xk) 


which comes from the empty Bloch states of Vo. 


If we combine this last contribution to p; with the 
three contributions of Fig. 9, we can write the result as 


E. Ya (2) a (y) , 


(d) 


(B2) 


with the new Bloch state 


War (x) — Va (x)-- E Vos (x) (E) agii (By)u, (E). (B3) 


[71:52 
We have used the Fourier transform 


S8. 2. S (ao, Bg)e'*v. (B4) 
g 


The result of all the other diagrams with only 21 can 
be stated with the help of a matrix T(&) which has for 
every k as many rows and columns as there are holes 
with wave vector k in Yo. We have 


Ta=bat 22 (tarSasti (Bye pH tað (Pa) Sg or 
apy 
Tua i(B'a)Ss s 18" y)yu A), (B5) 
where all the sums over band indices are taken at fixed 
wave vector k. Instead of subtracting (B1) or (B2) from 
the pı of a wave function with all Bloch states «k filled, 
one has now to subtract 


È Jala) T Jy) 5 


KAR 


(B6) 


The inverse of T which is inserted in (B6) as com- 
pared to (B2), simply normalizes the functions J,;(x). 
The term coming from the second diagram of Fig. 3 has, 
therefore, been replaced by a very similar term, namely 
one that arises from replacing the Bloch states V. (x) 
by the properly orthonormalized modification of 
Vx). Indeed, the states J,;(x) are not orthonormal, 
but one can find linear combinations which are ortho- 
normal, say V,«(x), and (B6) transforms then to 


2 Jav a). 
(d) 


Let us now examine an arbitrary diagram without 
any 2, in it. The particle lines connecting two vertices 


(a) ev 
Fic. 10. The four possible (b)  €---———9———-—--4 
particle lines between vertices 
if 21 c0 (c) e-—-----———»——. 
Qo 
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A940 
———9---------i p —9——— 
e. Fig. 11. The three possible in- 
sertions of 2; into a solid line. 
a 


or connecting a vertex to an external point are of the 
four different types shown in Fig. 10. (An external 
point is equivalent to a connection with a vertex.) The 
occurrences of 9; can arise in the following two ways. 

First, the solid lines in Figs. 10(b), 10(c), and 10(d) 
are broken up an arbitrary number of times by inserting 
one of the three pieces shown in Fig. 11. The effect of 
these insertions is equivalent to the substitution 


1 
Wef,yh)=- pa OP rs pU 
L (xk) 


1 
L Qu U aaa (B8) 


Second, a ® can be inserted at the ends of the solid 
lines in Figs. 10(b), 10(c), and 10(d) without breaking 
them. This gives the five diagrams listed in Fig. 12. 

Combining Fig. 10(b) with Figs. 12(a), 12(b), and 
12(c) is tantamount to substituting 


W(G'G) — Y; [orre to (F"G")] 
F"F' e 
XW("F)5pe-401(FG)] (B9) 


in the diagram 10(b). The substitution (B9) in Fig. 
10(c) or Fig. 10(d) can be justified only if the inter- 
pretation of the vertex to the right of Figs. 10(c), 10(d), 
12(d), and 12(e) is changed by introducing a fictitious 
label G” into Figs. 10(c) and 10(d), and G’ into Figs. 
12(d) and 12(e). This is done by the substitutions 


8o(H’1H'2,P' F's) > > 95 (H',H',G 1G!) 
G'1G'2 


X [onra toC E) [Serer 01 (G'2F’2) I, 
Ji (HA HUS F" F”) = p» [ornen 4-03 (F^ 1G" 3) ] 


G'^1G''» 
X [bren tHe (E" 1G" ;) 19° ;(H"H";,G",G";), (B10) 


and similar formulas for #3, etc. If the vertices in 
Figs. 10(c) and 10(d) are interpreted as 9», ds, etc., 
then the combined eifect of Figs. 10(c) and 10(d) as well 


(a) e-—--—--e —_»>_----- -. 
(b) Ga  ——HM————s------ e. 
Fic. 12. The five possible 
(C) | e-—————————.——--. Ù insertions without break- 
d ing a solid line. 
Pace ee 
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as 12(d) and 12(e) is again described by the substitution 
(B9). In all three diagrams 10(b), 10(c), and 10(d), the 
effect of 2, insertions is now described by the combined 
substitutions (B8) and (B9). 

Now, it follows immediately from (4) that 


VL [voe Gd Uox, (B11) 


which shows with (32) that 


W (F"F')= IE pr (x)dx 


x J Yr OYE VWO). (B12) 


K is used as abbreviation for the index pair xk. On the 
other hand, if we insert into this formula the sum over 
all K, rather than only the K of the holes, then we can 
write 


-X UsosUpxo [vss | yr Od 
x ve (x)S (G'G "a (9) 
= pues (G'G")R(G"G") 
—R(P'F. (B13) 


If we take the Fourier transform as in (B4) and insert 
(40), it follows that 


Y wu (Eu (E) =Rayr= >, R(aoyyg)ei*v. (B14) 
K g 


From this last relation, we find that 
VL [sopa Uert} v (GF) Urx o (B15) 
F 


Therefore, the combination of (B8) and (B9), i.e., the 
elimination of all 21 from the diagrams, is effected by 
substituting 


WR") > f PAO jl Ve (dy 


X X Pal) Tuo) (B16) 


(XXE) 
and replacing all 2, 33, --- by de, s; ---. 


APPENDIX C 


The Bloch waves V,.(x) are assumed to arise from 
some secular problem as a linear combination of the 
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functions vg, (x). Under a coordinate transformation 
x= T«'--t compatible with the lattice, one has 
Va(x)—e* uu) with k=Tk. (Cl) 


Also one has for the localized orbits 


Vgs(x)— 2) Thapas with g=Tf+t, (C2) 


where the matrix Ts, gives the transformation of the 
orbits corresponding to the coordinate transformation 
ge 

In view of the orthonormality of the functions y, (x) 
one finds for the coefficients Ugx of (4) the formula 


1 
—Vox= [ v*x(a)vols)de. (C3) 
AL 
If one inserts (C1) and (C2) it follows that 
Uer=e ^ TgoU Fr, (C4) 


with F—(af) G-—(8g, K=(kk), K'—(xk). Any 
expression depending on Ux, Vx, Vc, K, G, etc., can be 
written in terms of U pg, Wx, Vr, K', F, etc., with the 
help of (C1), (C2) and (C4). In the new expression in 
which only the latter quantities and the coefficients 
Tga occur, it is possible to replace again F — G, K' — K, 
etc. If the resulting expression coincides with the 
original one, it is called invariant with respect to the 
transformation T. 

Now one has for W as defined in (32) the transforma- 


tion 


W(G"G')s« Y: T prar Tyre W EVR’ ES) 


according to (C4). In order to obtain (39) we now put 
t=0, g mg =f" =f’. The arguments in W are then 
simply 66’ and wc’. 

The formula (C5) is now averaged over all symmetry 


operations of the Jattice which leave the lattice site g=f 
fixed. This gives 


1 
W (8"8^) = > (- »» Tore Tyu) W (ala!) (C6) 


a'a! 


If the orbits have been grouped according to the 
irreducible representation to which they belong, it is 


possible to apply one of the important results of group 
representation theory." 


1 1 
7 D» DO grat (T)D® p (T) =p 8g pBatarr, (CT) 
J 


where DPga(T) is the matrix representing T in the 
representation j, the indices a’, 6’, o/', 8" being re- 
stricted to the values relevant to j' and j". j is the 
dimension of the representation j. | is the number of 
operations in the group. If (C7) is inserted into (C6) 
the formula (39) and the explanation preceding it 
follow immediately. 


7Cf. Eugene P. Wigner, Group Theory and its A pplication to the 
Quantum Mechanics of Atomic Spectra, translated by T. T. Griffin 
(Academic Press, Inc., New York, 1959), p. 83. 
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Mechanism of the Ferrimagnetic to Antiferromagnetic Transition in Mn,_,Cr,Sb 


H. S. JARRETT 
E. I. du Pont de Nemours and Company, Central Research Department," Wilmington, Delaware 
(Received 16 December 1963) 


The mechanism of the first-order ferrimagnetic to antiferromagnetic transition (exchange inversion) in i 
Mns-;Cr,Sb, x«0.41, was investigated by exchange magnetostriction. A second-order term has been 
included in the strain energy which gives an effective elastic constant dependent on magnetization and "i 
magnetic ordering. It also introduces a discontinuous change in the magnetization at the transition and 
hence a contribution to the entropy change in addition to that caused by the change of lattice dimension. 
For Mne_,Cr,Sb, the effect of this term is small but necessary to obtain the qualitative behavior of the 
discontinuous change in lattice dimension and the shift in the Curie temperature with composition. The a 
anisotropy behavior of Mnz-zCrzSb was investigated for various values of x, and it is shown that an aniso- 
tropy in the critical lattice dimension of only 4X(107* À accounts for the observed change in magnetic 
anisotropy with exchange inversion that occurs for transition temperatures between 200 and 350°K. For 
transition temperatures above 350°K, anisotropy is positive, and spin-flopping measurements can be made 
on the antiferromagnetic state. An expression for the critical field in the presence of exchange inversion is 


obtained and compared with experiment. 


INTRODUCTION 


HE basic features of exchange inversion have been 
investigated theoretically by Kittel! In this 
paper Kittel’s theory is extended to include Curie-point 
shifts, magnetization, entropy, and magnetic anisotropy 
and is applied to Cr-modified MneSb. Kittel proposes 
that a transformation from ferromagnetic (F) to anti- 
ferromagnetic (AF) spin ordering will take place by a 
first-order process when there is, during normal thermal 
contraction, a critical lattice dimension c, at which the 
sign of the exchange interaction changes from F to AF. 
Accompanying the first-order transition is a discon- 
tinuous change in dimension of the crystalline unit cell. 
The magnitude of the discontinuous change, as Kittel 
has pointed out, is equal to the difference between the 
magnetoelastic expansion of the ferromagnetically 
ordered lattice and the contraction of the antiferro- 
magnetically ordered lattice. Experimental data that 
have been obtained support qualitatively the essential 
features of this theory. 

The ternary composition Mns .Cr;Sb is an ideal 
magnetic material on which to check semiquantita- 
tively the features of an exchange-inversion theory. 
MnsSb forms sets of 3-layer sheets? of strongly coupled 
spins (J/k~450°K); the spins between sets are only 
weakly coupled by exchange forces (J/k~80°K at the 
Curie temperature). The net saturation moment of this 
3-layer set is approximately 0.9 Bohr magneton.? We 
shall assume that detailed knowledge of the interatomic 
exchange forces is not needed and that this 3-layer set 
may be replaced with a single plane of spins strongly 
coupled ferromagnetically. Effects of magnetoelastic 
energy within this plane will also be neglected. 

Thus, the model representing Mn2Sb is a stack of 
weakly coupled ferromagnetic planes, the interplanar 

* Contribution No. 830. 
1 C. Kittel, Phys. Rev. 120, 335 (1960). 
2T. Heaton and N. S. Gingrich, Acta Cryst. 8, 207 (1955). 


3C. Guillaud, thesis, University of Strasbourg, 1943 (un- 
published). L, Neel, Ann. Phys. 3, 137 (1948). 


exchange changing sign at some accessible lattice 
dimension. A main function of the Cr modifier is to 
provide an initial lattice contraction so that, depending 
on the amount present, the critical lattice dimension 
for exchange inversion occurs at some temperature 
below the Curie temperature. 


FREE ENERGY 


In order to account for the exchange energy depend- 
ence on lattice dimension, the exchange energy is 
expanded in a Taylor series to quadratic terms about 
the critical lattice dimension c, at which the exchange 
energy vanishes: 


J (c)=p(c—6e)+ (9/2) (c— e. (1) 


Here p and p’ are the first and second derivatives of the 
exchange interaction evaluated at ce. This expansion for 
the exchange energy is identical to that used by Kittel 
except that he considered only the linear dependence 
of J on lattice parameter. 

When the elastic energy R(c—cr)? is included, the 
interplanar free energy per unit volume becomes 


F/V=tR(c—cr)? 
—[p(c—ce)+ (p’/2) (c—c.) Jmm: cosp. (2) 


Here, R represents the elastic constant divided by the 
square of the average lattice dimension, and cr is the 
lattice dimension in the absence of magnetic forces. 
The parameter m; is the reduced sublattice magnetiza- 
tion obtained by normalizing the magnetization to the 
saturation value M^ at absolute zero. 

The equilibrium value of c is given by 0F/dc=0, 
which yields 


p mati» 1 
c= ei coso | 
R 


pmm ‘MyM 
x| : cosé-+ Gas: ihe cox | (3) 
R R 
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and, upon substitution into the free energy, Eq. (2), we 
obtain 


F 1/pmymN? p 
excu ent cos'ó — —nym»(cr— c.) 
R 


VR ZN UR 


p | pif» 1 
SS E ens | , 
2p J R 


Both the elastic energy and the second term in the 
exchange energy appear quadratically. A quadratic term 
in the exchange produces an effective elastic constant 
that is dependent on the spin orientation. 

The condition for a transition to occur between F 
and AF spin configurations is no longer cr=c, of the 
Kittel theory but is readily obtained by equating the 
free energy for F ordering to the free energy for AF 


ordering to give 
p | p mmo 2712 
riu r1. 
2 R 


This expression reduces to the Kittel condition when 
p'—0, and approaches the Kittel condition as the 
exchange-inversion temperature approaches the Curie 
point, where the sublattice magnetization is small. We 
now obtain from Eq. (3) the differencen in interplanar 
spacing Ac between the F and AF states, 


CT—6c— (5) 


Ac=cF—cAF 


2pmymo p! 
[1+ (cr 2 (6) 
R[1— (p'myms/ Ry] p 


Substitution of Eq. (5) for the transition condition gives 
the discontinuous change in lattice dimension at the 


transition 
2 pnma pmymaN?- 12 
Ac, — 1 j 
R R 


which is similar to the Kittel theory but is modified by 
the quadratic exchange term. 

So far, this development has paralleled that of Kittel, 
but the results given demonstrate how the quadratic 
exchange term alters related conditions of the linear 
exchange theory. These expressions will also be neces- 
sary when attempting a fit to the experimental data 
using p/R and p'/R as adjustable parameters. 

The free-energy expression, Eq. (4), is not complete, 
since the intraplanar exchange energy Jm? and magnetic 
ordering entropy S have not been taken into account. 
Therefore, the following terms must be added to 


Eq. (4): 
— (J1/2)m2— (J2/2)m2—kTS1—kTS2, (8) 


where the classical entropy for N spins‘ with reduced 


(7) 


* J. S. Smart, Phys. Rev. 90, 55 (1953). 


ERROMAGNETIC TRANSITION 
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moment m is given by 


IC 6569-455] 


for each sublattice. 


ENTROPY CHANGE 


Addition of the terms (7) to Eq. (4) gives the total 
free-energy expression of the system including, in a 
rather disguised form, the lattice entropy. This fact 
may be demonstrated by differentiating the free energy 
with respect to temperature and taking the difference 
between the total entropy for F ordering and AF 
ordering. At the transition, given by the condition of 
Eq. (5), we obtain 


(9) 


p 10m Ac dcr 
AS= rad ] 


R moaT 2 aT 


Thus, the change in entropy of the lattice at the 
transition is included in the dimension cr. The first term 
in the bracket of Eq. (9) is of interest as it represents 
the magnetic contribution of the effective elastic energy 
to the change in entropy. Magnetization terms do not 
enter otherwise in Eq. (9) because the sublattice 
magnetization is assumed to be unchanged at the 
transition. The magnitude of the first term of Eq. (9) 
at room temperature is of the order 


1p’ 1 Om Ac 0.3 0.03 
-—— — — 2 (l. 0015)(— —) 10-5/?K. 
WI DOM wm X 6.5 


for Mn2Sb, while 
1 dcr 
- — ~20% 10-9/°K, 
c oT 


and hence the first term is negligible. Therefore, the 
change in entropy at the F/AF phase transition arises 
largely from the lattice distortion and becomes 


Eee 


where C is the appropriate elastic constant in the 
direction of the strain Ac/c. 


(10) 


MAGNETIZAT*ON 


Minimization of the free energy, Eq. (4) plus (8), 
with respect to the sublattice magnetization and 
rearrangement of terms gives 


mM o | ; an 


m- tanh Hw. 
kT 


5 A. E. Austin, E. Adelson, and W. H. Cloud, Phys, Rev. 131, 
1511 (1963). 
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where 


2R ,J pnwN? pam? 
Hy= E (i-e) 
NMoNR R 2R 


+ (=09| ee) | cos 


pn? Po 
x] a= cose | ) o (ly) 
R 


It has been assumed in Eq. (12) that the two sublattices 
are equivalent and that Ji=J2. The magnetization 
given by Eq. (11) is actually the Brillouin function for 
spin 4. This form arises because the entropy expression 
assumes only two spin orientations. The first term of 
Eq. (12) represents the Weiss field due to intraplanar 
exchange, while the second term is due to interplanar 
exchange and, it is important to note, is dependent on 
the spin configuration through the cos@ term. If the p. 
term were not included, the cosó dependence disappears, 
and there is no difference in the Weiss fields for F and 
AF ordering at the transition. 

Thus, the magnetization is discontinuous at the 
transition because of the second-order p’ term. However, 
even for fairly large values of p'/ R7, the discontinuous 
change of sublattice magnetization is of the order of a 
few percent and would be difficult to detect by neutron 
diffraction, for example. A large change in sublattice 
magnetization would alter all previously derived 
expressions, but for practical purposes we may ignore 
the small change in magnetization as long as p'/R «3. 
"This restriction is no stronger than the initial assump- 
tion that the exchange energy dependence on lattice 
dimension can be represented by Eq. (1). 


uu. UEM 


CURIE TEMPERATURE 


Also contained in Eq. (11) is the unusual effect of a 
temperature-dependent Weiss field. In most molecular 
field treatments it is assumed that exchange interactions 
are constant over the entire temperature range below 
the Curie temperature. In exchange inversion, however, 
the exchange interaction is dependent on temperature 
through lattice contraction. Such effects should, there- 
fore, influence the Curie temperature. 

The Curie temperature for the magnetization ex- 
pressed by Eq. (11) is 


R p p J 
r=] (cr,— | rr en a) | cose] +—, (13) 


where (cr. — ce) indicates that (cr—c,) is to be evaluated 
at Te. We thus have a self-consistent equation in which 
— T, is contained explicitly on the left-hand side and 
nplicitly on the right-hand side of Eq. (13). The second 
J/Nk in Eq. (13) represents the Curie temperature 
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of the ferrimagnetically ordered 3-layer set of planes, 
while the first term is the interplanar Curie temperature, 
The Curie temperature is dependent on cos. For 
positive p the Curie temperature for F ordering is higher 
than the Curie temperature for AF ordering by twice 
the interplanar exchange energy. Thus, as temperature 
is decreased, ferromagnetic ordering first occurs and is 
the most stable configuration. 


APPLICATION TO Mn:Sb 


In this section, the foregoing model is applied to 
Cr-modified Mn2Sb.° It should be emphasized that the 
foregoing expressions represent a single composition 
whose F/AF transition occurs at a temperature T;. In 
applying such a model to Mn2,Cr,Sb for various 
values of x, in which T, varies with chromium content, 
one must investigate whether the behavior of all 
compositions can be represented by a model whose 
parameters are independent of composition, and yet 
whose thermodynamic quantities exhibit the observed 
functional dependence on composition. 

Compositional changes are introduced into the func- 
tion (cr—c.) by assuming that the temperature depend- 
ence of cr remains the same as that of Mn2Sb, but that 
the modifying element (chromium) reduces the absolute 
value of cz. Thus, the requirement on the modifying 
element is that the lattice contraction introduced by it 
is just the amount required to satisfy Eq. (5) at the 
transition temperature T. 

There are probably some deviations from this 
assumption at high chromium content, but the theory 
is not quantitative enough to warrant consideration of 
such deviations. At this point, we must mention a 
metallurgical property of this material. An increase in 
the value of x does not represent a proportional decrease 
in the value of cr. It has been found that the chromium 
tends to segregate into an ever-present MnSb 
Widmanstáütten precipitate,’ and there is not a corre- 
sponding increase of chromium content in the Mn:Sb 
phase proportional to the amount introduced into the 
melt. It must therefore be assumed that the transition 
temperature T, occurs when Eq. (5) is satisfied and 
that the chromium in Mn2_,Cr,Sb has introduced just 
the reduction in cz required to satisfy Eq. (5) at Ts. 

We shall also assume that c; is constant throughout 
the entire range of transition temperatures. The value 
of c, is critical only when calculating the F and AF 
lattice parameters for a given transition temperature. 
Otherwise c, always appears in the expression (¢r—¢c) 
and, if this difference can be obtained, ce needs to be 
known only approximately. A third assumption is that 
the magnetization of the sublattice of our model is given 
by the measured magnetization rather than by Eq. (11). 
As has already been discussed,5 the spins of the 3-layer 


$ F. J. Darnell, W. H. Cloud ; 1 as ;. 130, 
647 (1963). oud, and H. S. Jarrett, Phys. Rev 


1J. D. Wolf and J. E. Hanlon, J. Appl. Phys. 32, 2584 (1961). 
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sets of atomic planes remain ordered antiparallel and 
appear to be strongly coupled by exchange forces. We 
assume, therefore, that the details of the magnetization 
behavior of Mn 1 and Mn u ions within this plane do 
not contribute to exchange inversion except through the 
temperature dependence of the magnetization of the 
entire 3-layer set. One would not expect that Eq. (11) 
should represent the magnetization of Mn;Sb. On the 
other hand, the conclusions drawn from Eq. (11) by use 
of the Weiss field is semiquantitatively correct even 
though the functional form is not. This functional 
dependence arose because we have used classical 
statistics in the magnetic entropy and assumed only a 
single sublattice. 

To obtain the parameters p/R and p’/R, consider the 
discontinuous change in the c lattice parameter given 
by Eq. (7). The elastic constant R does not need to be 
known until absolute energy comparisons are made. The 
best fit to Ac, is given by p/R=0.02 A and p’/R=0.4. 
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Fic. 1. Discontinuity in c axis as a function of transition 
temperature T.. Experimental results (Ref. 6) are shown by the 
open circles. 


Within the limits of experimental error, the value of 
p’/R is not too critical. For example, a 0.1 variation in 
p'/R can be tolerated and still obtain a fit with the 
experimental data. The value of p/R is more critical 
for only 0.001 À variation can be tolerated here. This 
fit of experimental data to the theoretical equation is 
shown in Fig. 1. Since the introduction of chromium 
reduces both T, and the saturation moment, the 
magnetization of MnsSb cannot be used in Eq. (7). 
Instead, the magnetization measured at 7'— T; of that 
composition whose F/AF transition temperature 1s T 
is used. This magnetization is lower than the magnetiza- 
tion of Mn2Sb at the corresponding temperature. 

We may now obtain the temperature dependence of 
(cr—c,) for unmodified Mn2Sb. From Eq. (3) we obtain 
cr, the dimension of the unit cell for ferromagnetic 
ordering at $—0. Rearrangement of terms gives 
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Fic. 2. Dimension (cr—c.) as a function of temperature 
calculated from the thermal contraction of MnsSb. 


where the values of the magnetization to be used in 
Eq. (15) are those of Mn2Sb. It is seen in Eq. (14) that 
ce must be known. The experimentally measured lattice 
dimension of Mn2Sb is cg. The value of c, within reason- 
able limits does not affect appreciably the empirically 
determined temperature dependence of (cr—c.). Selec- 
tion of the value of c,— 6.506 A is discussed later in the 
paper. Figure 2 shows the temperature dependence of 
(cr— c.) obtained from Eq. (14). 

The change in entropy may now be calculated by use 
of Fig. 2 and Eq. (10). The value of the entropy is 
dependent only on the rate of change of cr with tem- 
perature. The calculated change in entropy is shown in 
Fig. 3 in units of the elastic constant. Agreement with 
the experimentally measured AS given by the open 
circles is good, if it is assumed that the elastic constant 
C is 4.7 X10? ergs/cm?. Some experimental values? of 
AS in units of the elastic constant are shown for com- 
parison. Such agreement is also indicative of the validity 
of the assumption that the temperature dependence of 
cr is nearly independent of chromium content. We have 
been unable to measure an elastic constant along the c 
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Fic. 3. Entropy change versus transition temperature. The data 
of Flippen (Ref. 9) are shown for comparison. 


Flippen and F. J. Darnell, J. Appl. Phys. 34, 1094 


8R. B. 
(Eros Gea m2 |—(p/R)m?, (14) (1963); W. A. Doerner and R. B. Flippen (to be published). 
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axis because of the severe cracking normal to the c 
direction that always accompanies samples large enough 
for ultrasonic measurements. The value required for C 
is comparable to Ci; of tungsten, which is large com- 
pared to that of most materials, but not improbable. 
'The few measurements that have been taken to deter- 
mine the elastic constant by ultrasonic methods have 


fit, so the density of magnetic ions in the 3-layer set, 
6.2 102/cm?, is selected as the density JV of ions for 
the model. The effective value of R is obtained from the 
change in entropy, Eq. (9), and the scale factor is 
R/Nk-1.5x 10! SKVA?. 

'The parameters which have been evaluated allow 
calculation of the sublattice magnetization from Eq. 


. . H ^ta 
indicated that the elastic constant is greater than (11). The sublattice magnetization for (cr— c.) given 
3X10! ergs/cm?, which is, at least, consistent with the in Fig. 2, which would correspond to unmodified 
required value from the entropy data. MnsSb, is essentially a Brillouin function with J = 3, 

The Curie temperature is obtained from Eq. (13) as For (cr—ce) modified to give T,>0°K, the sublattice ; 
a function of T,. It must be remembered that a different magnetization differs slightly from the unmodified 
(cr— c.) curve is required for each chromium composi- (cr—¢-) because of the reduction of Curie temperature, 
tion. The new (cr—c.) is found by reducing cr by the but the shape is still essentially a Brillouin function. 
contraction of the MnsSb lattice introduced by the The calculated discontinuity in M for T,—300?K is 
chromium. The temperature which satisfies the equality only 0.8%. There is little agreement between the 
in Eq. (13) is the Curie temperature for the composition. experimental and the theoretical magnetization com- 
The Curie temperatures for various ternary composi- pared to the rather remarkable agreement in the fore- 
tions were thus calculated from the experimentally going calculations. This result is to be expected, of 
determined lattice contractions and the results are 
shown in Fig. 4 plotted against the associated F/AF Coro 
transition temperature 7,. The measured Curie tem- 0.085 
peratures are included for comparison. 0050 
For the foregoing calculation the intraplanar ex- 3 0.045 
| change energy must be known. From the measured Š oo 
l MmnsSb Curie temperature J/Nk is found to be 470°K f oos 
by subtracting the value of the first term for Mn2Sb $^ 0.030 
on the right-hand side of Eq. (13). Actually an unknown 3 oo 
parameter is introduced by the scale factor R/Nk, which $ a0z0 
converts elastic energy to temperature units. Our model $ oos 
; compresses the 3-layer set of planes into a single layer. 0.010 
1 "T'herefore, the effective density JV of the magnetic ions 0.005 
j is not known, and the value we select for JV influences oL 
i the value of the intraplanar Curie temperature. How- DRARSIHON TEMPERRTIRE K 
j ever, JV cannot be varied widely because the T, calcu- 
| lated for the modified Mn;Sb materials would deviate Fic. 5. Reduction of the ferromagnetic lattice dimension 
i 3 2 with increasing T;. 
widely from the experimentally measured values. There 
is no real advantage gained in making a least-squares ` ; : 
course, since the functional form of the magnetic 
ae entropy: is based on a molecular field model of a single M 
sublattice. 
550 A. value for c, has been used in the foregoing calcu- i 
T lations, but its derivation has not been established. The P 
expression for cp is obtained from Eq. (3) for $—0 and 
Y 530 m=m,, the measured value of the magnetization at Ts. 
E: 6 The lattice contraction in the F state due to the intro- 
z duction of chromium is then given by 
= 510 
È E 6»— 6c (p/ Rm? Qs) 
t 500 = jec / 
3 1—(0'/R) m2 Sie er 
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Fic. 4. Dependence of Curie temperature on F/AF transition 
temperature. Experimental results (ones and Flippen, to be 
submitted) are shown for comparison. 


and is shown in Fig. 5 with the experimental points 
obtained from Fig. 1 of Darnell et al.® 

_ The value of c, was selected to fit the low T, mate- 
rials. It is seen that deviations by a factor of 2 occur 


for high T, materials, which indicates that our initial 


assumptions about the passive role of the chromium 
may not be entirely valid. Either c, is not constant over 
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the range of transition temperatures or the temperature 
dependence of cr changes at high chromium content. 
However, in view of the success of the theory which 
assumes the functional form of cr is not changed with 
the addition of chromium, it seems more probable that 
c, is not constant. 


ANISOTROPY 


The anisotropy energy has not been taken into 
account in the foregoing treatment. On the whole, the 
anisotropy energy is 1 to 2 orders of magnitude smaller 
than the smallest exchange interaction, and its inclusion 
has not been necessary. There is however, an effect of 
anisotropy which is quite apparent. Consider Fig. 10 
of Darnell et al.6 It is seen that the temperature T4 at 
which the anisotropy changes sign for F ordering is not 
continuous across the T, boundary into the AF region, 
but lies on this boundary from 200 to 350?K before 
entering the AF region. This effect is ascribed to an 
anisotropy in the interplanar exchange energy. In 
Fig. 6 the free energy is represented schematically as a 
linear function of temperature. Although the actual 
functional dependence is more complex, this representa- 
tion serves this discussion. The anisotropy contribution 
is also exaggerated in proportion to the free energy so 
that it can be seen in the figure. The curves are labeled 
according to the orientation of the spins with respect 
to the crystallographic axis, parallel or perpendicular 
to c, and according to F or AF ordering. 

It has been shown by Darnell et al.* that the anisot- 
ropy energies for F and AF ordering, which arise from 
dipole-dipole and crystal-field effects, are so nearly the 
same that the small differences can be ignored. Thus, 
the first-order contribution to the magnetic anisotropy 
energy to be added to the free energy is the same for F 
and AF ordering, and anisotropy is continuous across 
the F/AF transition. However, in second order, the 
anisotropy mixes with the exchange energy producing 
an anisotropic exchange. The effect of an anisotropic 
exchange is shown in Fig. 6 by the difference in the 
temperature at which Fart and Fr* intersect and Far!! 
and Frll intersect. The free energy at the intersection 
for parallel ordering is lower than that for perpendicular 
ordering by the anisotropy energy. For transition 
temperatures between the Far’ and Fr intersection 
and the Far!! and Frl! intersection, Fig. 6(a) results. 
Since the magnetic system follows the lowest free 
energy, the F/AF transition takes place with a con- 
comitant transition from positive to negative anisot- 
ropy. This situation is the condition for a transition 
between 200 and 350°K. 

Figure 6(b) represents the free-energy relations for 
F/AF transition temperatures 7.7 350^K. Here the 
Far!! and Fr intersection is at a temperature lower 
than the F/AF transition temperature T., and the 
anisotropy remains positive at Ts. Figure] 6(c) shows 
the situation for a low F/AF transition temperature, 
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Fic. 6. Schematic representation of the effect of anisotropic 
exchange on magnetocrystalline anisotropy: (a) 200?K «T, 
<350°K, (b) 7,» 350*K, (c) T, «200*K. 


T,<200°K. Here the Fy'! and Fr' intersection is at a 
temperature higher than T, and the anisotropy changes 
from positive to negative before the F/AF transition 
occurs. 

Relative to the free energy for parallel orientation of 
spins, the free energy for perpendicular orientation is 


Far, r — Far, sl - AFAy, r tK.. 


The differences between the F* and F!! curves of Fig. 6 
are due to the temperature dependence of the (AF--K) 
terms. Thus, the anisotropy of Mn;Sb must contain not 
only terms of the type described by Darnell e£ al. and 
designated here by K? but also an anisotropic exchange 
AF whose contribution cannot be distinguished experi- 
mentally in a given composition. 

These effects may be described more quantitatively 
by considering the variation of the free energy with the 
interplanar exchange p(c—ce). If it is assumed that only 
p is anisotropic, and that its anisotropy Ap— pi — p: is 
small, the first-order deviation in the free energy is 
obtained by differentiation with respect to p, giving 


em 
XC) cose (S nete) cosg E NEU 


From Eq. (15), the magnitude of the anisotropy in the 
free energy AF depends on the spin ordering through 
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cosġ. In the Kittel approximation, p’=0. At the F/AF 
transition (¢7—c.)=0, and AF becomes independent of 
spin ordering for the F and AF configurations. Without 
the second-order exchange term p’, the shift in the free 
energy is the same for both F and AF ordering, and 
behavior of the type described by Fig. 6(a) does not 
arise. Explicit calculation of A(F/R) from Eq. (15) 
shows that this contribution tó the anisotropy of the 
AF state is smallest near Ts. Therefore, although there 
may be a second-order contribution to the temperature 
dependence of the anisotropy from an anisotropic p/ R, 
no significant difference in the temperature at which 
the anisotropy changes sign in the F and AF states 
occurs. Similarly, an anisotropic p'/R suffers the same 
deficiencies. 

The most reasonable additional source of anisotropy 
is a variation Ac,=c!—c,!! of the critical lattice 
dimension with spin orientation, 


d p' r5 
a(-) = E = (5e cos | 
R R 
p Pp 
x| =+ er e) þe cosóAc,. (16) 
IR IR 


Here, in the Kittel approximation p’=0, the incremental 
changes in the free energy are in opposite direction for 
the F and AF states,-i.e., this contribution to the 
anisotropy from Eq. (16) changes sign at T,. Such 
behavior is just that which is necessary to account for 
the qualitative arguments summarized by Fig. 6. Thus, 
the first-order anisotropy constant may be expressed as 


K=K°+pm’Ac, coso, (17) 


where K’ contains all other contributions to the anisot- 
ropy that are predominately independent of spin 
ordering, e.g., crystal-field anisotropy and contributions 
such as expressed by Eq. (15). 

In order to show that the additional contribution to 
the anisotropy in Eq. (17) is plausible, we compare the 
temperatures at which the anisotropy changes sign for 
the F and AF states. For the situation described by 
Fig. 6(a), anisotropy changes sign when exchange 
inversion occurs for a range in T, of 150°. By use of the 
value of p/R found previously, Ac, must be 4X10~ A. 
Such a value for Ac, is reasonable, unfortunately how- 
ever, it is difficult t» detect directly. It is significant, 
however, that such a small anisotropy in the exchange 
can play a dominant role in the magnetic anisotropy 
whenever magnetoelastic effects are present. 


SPIN FLOPPING 


The appearance of positive anisotropy in the anti- 
ferromagnetic state suggests that spin-flopping experi- 
ments can be performed on the high-chromium Mns»Sb 


compositions. Such experiments have been reported by 
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Flippen.? Spin flopping has not been treated in an 
exchange-inversion material, and we outline briefly 
here the derivation of the spin-flopping conditions. 
Exchange inversion and spin flopping are quite different 
phenomena and should not be confused. Besides the 
obvious fact that exchange inversion can take place in 
the absence of an externally applied field, the most 
important feature is that the F/AF transition can be 
induced by an external field applied along any crystallo- 
graphic direction. Spin flopping, however, can occur 
only when the anisotropy of the antiferromagnetic state 
is positive and when the external field is applied parallel 
to the anisotropy field. The spin configuration in the 
"spin flopped" state is canted, and this state is stable 
only in the presence of the applied magnetic field. The 
state is not stabilized when the external field is applied 
perpendicular to the anisotropy field or when the 
anisotropy is negative. 

The condition for an exchange-inversion transition 
has been treated thus far in the absence of external 
fields. However, an F/AF transition can be induced at 
temperatures below T, by application of a sufficiently 
strong magnetic field. The condition for an F/AF transi- 
tion in the presence of an applied field is given by 
equating the free energies for F and AF ordering, 
including the magnetic energy of the applied field, 


F(r)=F (0)+2H-M (18) 


from which is obtained the critical field for the F/AF 
transition 


m ( /pm?? 
gae (=) p -- (cr— ce) 
Mo 


[EAT m 


It can be verified that the derivative of Eq. (19) with 
respect to temperature and application of the magnetic 
form of the Clausius-Clapeyron equation gives Eq. (8). 
Also, it can be seen by direct substitution that the 
critical field vanishes when Eq. (5) is satisfied. 

The anisotropy has been ignored in Eq. (18). Anisot- 
ropic exchange introduces anisotropy in the critical 
field for the F/AF transition. If this were the only 
contribution to the anisotropy, the calculated value 
of Ac, gives a critical field perpendicular to the c axis 
about 2000 G larger than the field applied parallel for 
a given temperature. When all sources of anisotropy are 
included, one would expect the anisotropy of the critical 
field to be proportional to the magnetocrystalline 
anisotropy field. Such anisotropy in the critical field 3s 
mentioned by Bierstedt!? in connection with his rests- 


? R. B. Flippen, J. Appl. Phys. 34, 2026 (1963). 
1' P. E. Bierstedt, Phys. Rev. 132, 669 (1963). 
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tivity measurements. More detailed unpublished data 
of Bierstedt show that the lower critical field is parallel 
to the easy direction of the anisotropy. Thus, the lower 
critical field changes from parallel to perpendicular to 
the c axis for compositions with T,< 200°K, the tem- 
perature below which the anisotropy changes sign 
before the exchange inversion occurs. 

The second-order quantity p’/R is small, and its 
inclusion contributes only additional complications. We 
shall neglect it in the succeeding calculation of spin 
flopping. Eq. (19) becomes 


m 
HAFIF =—»(cp—c,). (20) 


Mo 


To obtain the spin-flopping conditions we again equate 
the free energies 


F(1)=F(0)+2H-M+2K sin'é. (21) 


Here the solution of Eq. (21) is anticipated by assuming 
that H is directed along the anisotropy field of the 
antiferromagnetic state and that the “spin-flopped” 
state is canted with the moments of the two sublattices 
making the same angle 0 with respect to the applied 
field. Equation (21) may be rearranged to give 


mplcr—c) +K 
cos] cost ( R 1) 


Pmt 


xcos04- R 


HAFICeM KR 
——|- =0, (22) 


pn pim 

where K is the anisotropy energy of the canted state. 
In order to obtain the critical field H,^F/€ for spin 
flopping, the equilibrium value of 0 must be determined 
by setting the torque equal to zero, 9F/800—0, where F 
is understood to contain the anisotropy and magnetic 
energies. Thus, the equilibrium condition 


[-—— DU 


cos*@—— 
2 2pm 


HAFCM KR 


6 C0580 peers 
pm pn 


R=0 (23) 


must be satisfied simultaneously with Eq. (22). 
General solution of Eqs. (22) and (23) are complex, 
however, the essential features may be obtained by an 
approximate solution applicable to the experimental 
conditions of Flippen. Substitution of the field depend- 
ence of the F/AF transition and the observed critical 
field for spin flopping in Eq. (23) indicate that 62x90", 
and cos'ó can be neglected. Therefore, by solving 
Eq. (23) for cosó and substituting into Eq. (22), the 


spin-flopping condition is obtained 


1 | pm we 
nne (a ortnm] 
Pi 


f pm | 1/2 
-K{K [| ptcr~co | ): (24) 


When K is small compared to the magnetic strain 
p^m*/ R, the second term of Eq. (24) may be neglected 
for any value of (cr—c,). Substitution of Eq. (20) gives 
the relation between the spin-flopping field and the field 
for exchange inversion 


2 | pmi l2 
Hane] Leu "p z (25) 


The anisotropy field of the canted state is found to be 
of the order of 1000 Oe or 2 10° erg/cm? from analysis 
of the data of Flippen. 


INTERMEDIATE STATE 


Darnell et al. and Austin et al.5 show that at low 
transition temperatures a third magnetic state appears 
as an interruption in the F/AF transition. The fore- 
going model does not account for such behavior 
principally for two reasons: First, effectively only 
nearest-neighbor interactions are contained in the 
model, and there is no competition between exchange 
interactions for spin direction that might give rise to a 
helical structure. Second, the model assumes a Bravais 
lattice and even if the details of the exchange inter- 
actions were included, only proper helical structures 
would result. The spin configuration of the intermediate 
state must be represented by a mixture of two 
configurations, 


S, — D4-U cos(k-r,)-- V sin(k-r,) , (26) 


a k=0 ferromagnetic component and a helical com- 
ponent commensurate with the lattice periodicity given 
by k:=2x/3c. For x«0.022 in the composition 
Mno -Cr-Sb the intermediate state is stable at absolute 
zero. In order to account for such a spin configuration, 
one must treat the actual crystal structure of MnsSb 
by the generalized Luttinger-Tisza method proposed 
by Lyons and Kaplan and Lyons e! al." Preliminary 
calculations indicate that the spin eigenfunction given 
in Eq. (26) can lower the energy eigenvalues below the 
ferromagnetic and antiferromagnetic states. Details of 
this calculation will be published in a forthcoming 
paper. 
DISCUSSION 

The foregoing theory based on a highly simplified 
model of Mn;Sb demonstrates considerable success in 
accounting for the behavior of the F/AF transition in 

1 D. H. Lyons and T. A. Kaplan, Phys. Rev. 120, 1580 (1960). 


2D. H. Lyons, T. A. Kaplan, K. Dwight, and N. Menyuk, 
Phys. Rev. 126, 540 (1960). 
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Mns-Cr;Sb. As pointed out by Bierstedt," this transi- 
tion is accompanied by hysteresis. For transitions near 
room temperature, the hysteresis is less than 1 deg of 
temperature. However, below about 200°K, hysteresis 
begins increasing sharply and reaches a value of about 
20° for a transition occurring near the temperature of 
liquid nitrogen. First-order transitions to the ferro- 
magnetic state in MnAs and the associated hysteresis 
have been investigated by Bean and Rodbell.? They 
point out that the maximum hysteresis arises when the 
spin system remains at the initial minimum of the free 
energy, even though it is not the lowest minimum, until 
the free energy is “down-hill-all-the-way.” The meas- 
ured hysteresis in MnAs is about 2/3 of this maximum 
value. 


15 C. P. Bean and D. S. Rodbell, Phys. Rev. 126, 104 (1962). 
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This situation does not hold for the F/AF transition 
in Mn; ,Cr;Sb. Kittel has shown that the maximum 
hysteresis in ar is Aar=2pm?/R. The free energy used 
in this treatment does not alter appreciably this 
expression. At room temperature, therefore, the 
expected hysteresis in Aar using p/R obtained pre- 
viously is of the order of 10? A, which may be trans- 
formed to a thermal hysteresis of about 50° by use of 
Fig. 2. This large discrepancy with the observed 
hysteresis indicates some very efficient mechanism 
nucleates the new state so that for transitions near 
room temperature the most stable state is nearly the 
equilibrium state. At low temperatures this nucleation 
mechanism must become less efficient, since the thermal 
hysteresis increases rapidly. A satisfactory model has 
not been obtained. 
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Magnetic Susceptibilities of Transition Elements in Host Crystals. 
II. Ni?+ in ZnO and CdS] 
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The magnetic susceptibilities of Ni?t-doped ZnO and CdS crystals have been measured along and per- 
pendicular to the trigonal crystalline axes over the temperature range of 28-500°K. The magnetic suscepti- 
bilities of both crystals approach constant values at low temperature and decrease more rapidly with in- 
creasing temperature at T>60°K. An unusually large magnetic anisotropy is observed for Ni?*:ZnO. The 
non-Curie behavior can be explained on the basis that the ground state of Ni? is nonmagnetic (41) and the 
temperature-dependent susceptibility arises mainly from the ions in the first two excited states (4» and E). 
By fitting the theoretical susceptibilities to the experimental values, the spin-orbit coupling constant of 
Ni**:ZnO is obtained as —175--25 cm"! and the trigonal field splitting of the T1[3T (F) ]state (the lowest T: 
state) as 100-10 cm™. The corresponding quantities for Ni?*: CdS are — 1702-10 cm and 102-4 cm™. In 
both crystals the Az component of the T3[?T (F) ] state lies below the Æ level. The large reduction of the 
spin-orbit coupling constant from the free-ion value indicates a rather strong covalency between the Ni?* 
ion and the ligands. Combination of the trigonal splittings of the T,[*71(F) ] state with those of T»[371(P) ] 
Observed in the optical spectra leads to a determination of the trigonal field parameters. The experimental | 
values of the trigonal parameters are consistent with those calculated by using the point-charge model and » 
assuming a local contraction of the lattice with a slightly larger contraction for the three anions off the 
trigonal axis than for the one on the axis. 


y 


‘I. INTRODUCTION tures. By combining these results with those of electron 


N a previous paper we have pointed out that the mag- 
netic susceptibilities of V?*-doped corundum may be 
used quite effectively for determining certain structural 
parameters of the crystals. For example, the trigonal 
field splitting of thé lowest 37, level of V**:Al;0s 
was determined from the Van Vleck temperature- 
independent susceptibility which was evaluated from 
the magnetic data in the temperature interval of 77— 
295°K, and the zero-field splitting of the ground state 
was obtained from the susceptibilities at low tempera- 


Í Supported by the U. S. Office of Naval Research. 
Alfred P. Sloan Foundation Fellow. 

_ 1 W. H. Brumage, C. R. Quade, and C. C. Lin, Phys. Rev. 131, 
949 (1963). 


spin resonance experiments and optical spectra,? the two 
trigonal field parameters and the spin-orbit coupling 
constant were estimated. The trigonal field parameters 
were found to be in reasonable agreement with the ones 
calculated from the empirical point-charge model. In 
this paper we shall report some magnetic measurements 
of NP*:CdS and Ni**:ZnO. The susceptibility data 
make it possible to locate the first two excited states 
which hitherto have not been determined accurately- 


ED S. McClure, J. Chem. Phys. 36, 2757 (1962). 
sAn estimation of the energy of the first excited state of 
D o has been, made by intensity measurements. S 3 
rdo, D. L. , and R. C. Li ., J. Chem. Phys 
SW Eae r C Liases, Jr oue 
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Values of the spin-orbit coupling constant and the 
trigonal field parameters can then be obtained, and the 
latter are compared with the calculations of the point- 
charge model. 

In both CdS and ZnO host crystals, a Ni?* ion is 
situated in a tetrahedral site.*5 Since Ni?* has an elec- 
tron configuration of (3d)5 and since the pattern of the 
energy level splitting due to a tetrahedral field is just 
the inverted form of that of an octahedral field, the 
ordering of the crystalline Stark levels of Ni?*: CdS and 
V**: AlO; are identical if one ignores the efiect of the 
trigonal distortion. However, the spin-orbit coupling 
constant of Ni** is considerably larger than that of V**. 
Thus, in Ni**:CdS the spin-orbit splittings are much 
greater than the trigonal terms, while these two effects 
are of comparable magnitude for Ni*:ZnO. It is 
convenient to consider the spin-orbit interaction as the 
major perturbation to Ni?* in a cubic field and then 
introduce the trigonal terms as the second perturbation. 
Figure 1 shows the energy level diagram of a Ni?* ion 
in tetrahedral site. 

The ground state A; is nonmagnetic, i.e., the average 
magnetic moment for this state is zero. At very low 
temperature the susceptibility should approach a 
constant value which is equal to the sum of the 
diamagnetism of the host crystal and the Van Vleck 
temperature-independent susceptibility. If we choose 
the trigonal axis as the z axis, it is easily shown that the 
matrix elements of uz and y, connect an A, state with 
only the E states, while u, has elements between 4; 
and A» only. It follows that for an external magnetic 
field along the z axis, the Van Vleck term is dictated 
primarily by the first A» level (in the trigonal field) 
while in the case of a perpendicular field, the lowest E 
state gives the major contribution to the temperature- 
independent paramagnetic susceptibility. As the tem- 
perature is increased, the excited states begin to receive 
appreciable population and the susceptibilities then 
depart from the low-temperature asymptotic values. 
Because of this strong dependence of X. and X. on 
temperature, the positions of the first 4» and the first E 
States can be determined quite accurately. In the case 
of Ni?*: CdS, we are able to obtain from the magnetic 
data a trigonal splitting of the lowest 4;—E pair as small 
as 10 cm~. Furthermore, since both the first excited 45 
and E levels are a few hundred cm~ above the ground 
state, direct determination of the location of these 
levels is difficult by other means and the magnetic 
measurements are particularly effective for this purpose. 


II. EXPERIMENT 


The crystals of Ni?*: CdS were furnished by Dr. T. L. 
Estle of Texas Instrument Inc, and Ni'*:ZnO by 


iR. W. G. Wyckoff, Crystal Structure (Interscience Publishers, 
Inc, New York, 1948), Vol. 1; G. Heiland, E. Mollwo, and F. 
Stockmann, Solid State Phys. 8, 195 (1959). 5 

5H. A. Weakliem, J. Chem. Phys. 36, 2117 (1962). 
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Fic. 1. Energy levels (?F) of Ni?* in a trigonal field. 


Dr. H. A. Weakleim of the RCA Laboratories. The 
measurements of the magnetic susceptibility were made 
by means of the Faraday balance used in our previous 
work. In Tables I and II are shown the magnetic 
susceptibilities of Ni?**:CdS and Ni?*:ZnO crystals 
along and perpendicular to the trigonal axis over the 
temperature range of 28-500? K. The concentrations of 
the nickel ions in ZnO and CdS are 0.076 and 0.84%, 
respectively, as determined from the analysis of the 
magnetic data. The accuracy of the measured suscepti- 
bility in ZnO (2-475) is somewhat lower than in CdS 
(+2%) because of the smaller concentration of Ni** in 
the former. 


TABLE I. Magnetic susceptibilities (per gram sample) 
of Ni?*-doped CdS crystal. Concentration: 0.847. 


JE XuX107 1 X1% 107 
(°K) (cgs-emu) CK) (cgs-emu) 

77.3 8.30 e 3113 7.90 

94 7.75 86 7.60 
108 7.12 101 7.18 
128 6.32 121 6.35 
154 5.40 134 5.85 
174 4.68 164 4.80 
202 3.86 184 4.16 
217 3.51 203 3.72 
232 3.11 219 3.48 
255 2.63 249 2.77 
296 1.98 294 1.98 
391 0.90 389 0.80 
489 0.21 448 0.37 
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'TABLE II. Magnetic susceptibilities (per gram sample) 
of Ni?*-doped ZnO crystal. Concentration : 0.076%. 


It XX 10? T xix 107 
(°K) (cgs-emu) (°K) (cgs-emu) 
28.0 0.00 28.0 —1.85 
77.3 —0.27 77.3 —1.85 
97 — 0.53 107 —1.85 
108 —0.75 114 —1.95 
130 —1.08 118 — 2.00 
140 —1.30 151 —2.10 
203 —1.90 167 —2.13 
253 —2.16 201 —2.35 
273 —2.22 250 —2.46 
300 —2.33 300 —2.57 
335 —242 330 —2.13 
403 — 2.66 401 — 2.86 
501 — 2.90 501 — 2.90 


IH. THEORY AND ANALYSIS OF DATA 


The theoretical analysis of the magnetic suscepti- 
bility of a (d?) or (d$) atomic system in a trigonal field 
has been treated in detail for the V3*: A1;05 crystals." 
As pointed out in Sec. I, for the case of Ni?* in a slightly 
distorted tetrahedral site, it is advantageous to consider 
the spin-orbit coupling as the major perturbation on the 
perfect tetrahedral field and the trigonal distortion as 
a minor one. Thus, a slight modification will be made 
to the theory presented in Ref. 1. 

The Hamiltonian for a Ni? ion is written as 


H=HotV-+dL-S+Vitwo(L+gS)-#. (1) 


The various members in the right-hand side of Eq. (1) 
represent the Hamiltonian of a free Ni?* ion, the cubic 
(tetrahedral) crystalline field, the spin-orbit interaction 
term, the trigonal field, and the interaction with the 
external field 3€. As was done before, the two crystal 
potential terms are taken as 


= » r3A ni Y4,0(0:0:)4- (10/7)!2 


i=1,2 
<[V1,3@ib:) — Ys.-3(0:0:) ]) , 
V,  {r2B2V 2,0(0ib:) +r BF 4,9(0:0:)) - 


i=1,2 : 


(2) 
(3) 


Here the summation is to be extended over the two (3d) 
holes rather than electrons. The crystal-field parameters 
are defined as 


B— — án) m, 
y= —Ber m, 
T= — (VS) BÉ(P)/1MO m, 


and, in the case of a pure cubic field £ is equal to 2Dg. 
Let us first consider the part of the Hamiltonian 
H'— Hg VAL: S. (4) 


*See, for example, D. S. McClure, Solid State Phys. 9, 399 
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The lowest configuration (3d)? of a Ni ion gives rise 
to the terms P, 1D, *P, 1G, and 1S which are split by the 
cubic field, e.g., 

sp 'Tiı H’ THA, (5) 


The spin-orbit coupling causes further splitting of the 
levels as 


3ip— 3T. 


87, 9 Ait E--T- T5, 
$T, — As - E-- Ti To, 
345 =? T». 


(6) 


The eigenfunctions of Ho corresponding to the various 
terms of the (3d)? configuration are taken as the usual 
Clebsch-Gordan type combinations’ of one-electron 
orbitals, and the multiplet separations are treated as 
parameters to be determined from the optical spectra. 
From the eigenfunctions of Ho we can construct the 
cubic-field functions according to Table IV of Ref. 1. 
These functions are characterized by the irreducible 
representations of the cubic group to which they belong. 
Since V, has matrix elements connecting *7\(?F) and 
a 


37, (3P), a transformation 
a p 
3p =s ay, 


must be applied to these two states in order to diago- 


(7) 


TABLE III. Spin-orbit functions of ?F and ?P. 


Da 
(Ta: $(A1,871,0) = (Y T1, +, —1) —¥ (971,0,0) HI GT 1, —, +1) }/ V3 
$(T1371,0) = (Y G73, +, —1) — (71, 7, +1) }/ V2 
$(Ty3T1, +) = {F GT1, +,0) —¥ Q71,0, +1)}/ V2 
$ (71371, —) ={ —¥ CTi, —,0) +¥ (71,0, 1)]/ V2 
$(ET1, +) 2 ( —¥ CTs, —, — 1) +Y Ti, +,0) +Y GT3,0, +1) / V3 
$ ET —) 2 {+ QT, —, —1) +1 GTi, +,0) HY G73,0, +1) ]/ V3 
$(T23T1,0) = (Y (371, +, —1) +2 (371,0,0) E (371, —, 4-1) ]/ V6 
$(T23T1, +) — (2Y (T1, +, +1) — (371, —,0) — Y (71,0, — 1) ]/ /6 
$ (T2371, —) = ( —2Y (T1, —, —1) — (71, +,0) — (71,0, 4-1) }/ V6 
Ta: $(Az3T2,0) = [ —¥ (72,0,0) —¥ (372, —, +1) +¥ GTa, +, —1)/ /3 
$\T23T2,0) = (Y (T5, +, —1) EY QT, —, +1) }/ 2 
$(T23T5, +) = [Y ETa, +,0) —¥ (372,0, 2-1) ]/ V2 
$(T23T2, —) = (Y 973,0, —1) +Y (972, —,0) }/ V2 
ETa +) — [V GT2, —, 1) +Y (75,0, +1) +Y GT, 45,0) )/ V3 
(ETa, —) = [V (T2, +, +1) +Y (972,0, —1) —Y (372, —,0))/ V3 
$(11,3T2,0) = (2 (172,0,0) — GT 2, —, +1) HY GT, +, —1))/ V6 
$ (T33T5, +) ={2¥ (T2, +, +1) —¥ (72,0, —1) —v ETa, —,0)]/ V6 
$ (T1373, —) ={ —2¥ (T2, —, —1) — (973,0, +1) — Qs, +.0)}/ V6 
34s: $(T2,3A2,0) —Y (345,0) 
$(T2A2, +) =F (142, +1) 
$(T2,3A2, —) =Y (345, —1) 
3P 
Ti: $(A1,371,0) 5 (Y (71, +, —1) — T1,0,0) - 971, — +1) 7 V3 


$(T1371,0) = (Y 973, +, 1) Y 971, 2, 2-0) )/ V2 

$ (T1371, +) = (Y 071, +,0) — 973,0, 4-1) ]/ 4/2 

$ (T1311, —) = f 971,0, 1) — (731, —,0) )/ 4/2 

$(ET1, H 2 ( Y (471, —, —1) +¥ QT1, +,0) Fr 071,0, +1) )/ V3 

9$ (E371, —) = [E GTi, +, +1) +Y Q71,0, 21) +¥ 973, —,0))/ V3 
$(T2371,0) = (Y 971, +, —1) +2¥ 073,0,0) +¥ 871, —, 41))/ V6 

$ (T3371, —) ={ —2¥ 71, —, —1) — GTi, +,0) —¥ 671,0,1) ]/ V6 

$(T23T3, +) — (2w GTi, +, +1) —Y 071,0, —1) —¥ (T1, —.0)}/ V6 

EM a S D 0g aa 


TE. U. Condon and G. H. Shortley, The Theory of Atomic 
Spectra (Cambridge University Press, TUTO 1951). 
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nalize Hot V. This results in a mixing between the F 
and P states. When the spin-orbit coupling is taken 
into consideration, the nine functions associated with 
the *T (also *7y) must be regrouped in such a way as 
to exhibit the decomposition shown in (6). The symbol 
$ (T2? T3,4-) is used to denote a new function obtained 
in this manner corresponding to the T» representation 
and originating from the ?T, state. The last symbol 
inside the parenthesis specifies a particular component 
of the degenerate state. The functions ¢ are listed in 
Table III. In the representation with $ as the basis, 
H' is not completely diagonal since L-S does connect 
members of the *7 manifold with those of ?T. 

When the trigonal field is introduced, the T; and 7; 
split into A2+£ and A,+£, respectively. The energy 
matrix (in the absence of external field) then factorizes 
into blocks corresponding to Ai, 4», and E. For mag- 
netic susceptibilities below 500°K, the highly excited 
states are insignificant, so we will consider only the 
manifold associated with the ?F state (with F-P mixing 
included). The energy matrix is shown in Table IV. Any 
mixing of these states with higher excited states can 
arise only from the spin-orbit coupling or from V, and 
are too small to affect the susceptibilities. 


A. Ni?*:CdS 


The optical studies reported by Weakliem? suggest 
that the environment of the Ni?* in CdS is very nearly 
cubic because of the apparent isotropy of the spectrum. 
The same conclusion can also be reached from the small 
observed magnetic anisotropy (Table I). It is therefore 
convenient to diagonalize first the Hamiltonian H’, 
defined by Eq. (3), and subsequently treat V, by first- 
order perturbation. To diagonalize H’ we shall take the 
values of 8 and Ap (the spacing between *F and *P for 
the limiting free ion) as 257 and 8550 cm“, respectively, 
according to the results of Weakliem.5 This gives the 
F-P mixing coefficients as 


2,—0.991, a2=0.138. 


The wave functions which diagonalize H' are now 
labeled by the irreducible representations of the cubic 
group. Thus, a wave function of a particular symmetry 
species can be expressed as linear combinations of the 
functions listed in Table IIT, e.g., 


W(T2,+) =06(T23Tr,+)+00(T2,322,+). — (8) 


When the trigonal potential is introduced, these func- 
tions may still be used as the approximate wave function 
of H'-- V,, and the effect of V, is to produce splittings 
or shifts of the energy levels. The wave functions and 
energies of the lowest nine states which are associated 
with the ?T,(F) group are determined in this manner. 
With the wave functions for these nine states, the 
magnetic susceptibilities have been calculated by means 


of the standard procedure? and are expressed in terms 
of the energy levels of the low-lying states and the 
matrix elements of L and S between these states. The 
formulas for X,, and X, are 


| (n, D]? 
X= arp] — 2 exp(— AV/ET) 


Ay 


KO D]? bas 
X T lot exp(—Ao/kT) 
Ay Ay— Ay 


[(2|ms|2)|? _ | (2{ue|é)|? 
xm: n exp(— As/kT) 
2kT s A Nz | 


Ila]. Glati] 
x r t2 rn | enti) 
i$ A;— 3 


Lula 
e [n4] 4)]*] 


pe exp(—A;/kT) 
A1— A, 


[/ Giu Eee ^ 
; ) 
LU RT 5 PRR ( 


[Olazti]? 
x= 4NBE| ONIN Cm 
[Glas f 
x[z E epar) 
i AL.) 
21n42)]?. [Cle DI? 
xp [uz] Eas | lj | +exp(—ae/e7) 
2kT 172 Aj— Aa 
(3|u213)/? _ | Gluzl DI? 
x is d= : | tepat) 
2kT P3 Aj—â2 
| (lezli)? 
x[z RE epar) 
i A;—As 
[Gled [Glad De 
Ses Sm 
2kT i85  A;—As 
where 


Bo=1+exp(—A1/kT)+2 exp(— As/ET) 
+2 exp(—A2/kT)-rexp(—As/kT) +2 exp(—A;/AT), 
and - 
=D Sass 
7=0 L 23,49: 
In the above equations the energy spacing between 


the &th level and the ground level is represented by A; 
and the numbers 0, 1, 2, 3, 4, 5 represent the Ar, T145, 


3 T. H. Van Vleck, The Theory of Electric and Magnetic Suscepti- 
bilis (Oxford University Press, London, 1932), p. 182. 
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TABLE IV. Energy matrix. 
3734 (F) STATS CF) STATS GF) 34» T2 GF) 3TA GP) TTP) 
—98--3A— Ty V2(—10y—7) —1G0)54y—7 3 (15)"2(y—-27) —68—-(14/3)y (5/3) V2 y—BV2r 
—98—4-b3y-br 3S) mA—4(15)5(4,—7) —8(30)^(y—27) (5/3)V2y—BV2r —68— (11/3) y+ (8/3)r 
38—1A—y—5r 2V2) 2(8)!2(3y—8r) (5/3) (34 —87) s 
188--14y V3 (45 — 207) (6) (— 454-157) 
Ap—2d —14v27 
Apta+l4r i 
TiTi CF) 3TA CF) ?TT VP) STAT GP) 
—98-- —1Ty—7 — G0) (4 —7) 3015) —3(15! ^ 4y— 7) —68—3y—8r 
38—M-r (/3)v 32(—10y—157) —3 (30)? (—3y— 87) 
3H (1/3)y —3(15)!"(—3y—87) 
Ap—d— 14r 
TTA (°F) STEQF) ?T,TS GF) ?TSTS (SF) STE QF) 
—98--H —21-H (6 (—5y—57) +5 (5y—47) —2() HG) —1 30) ^(&—7) 
—98—3A.— Ty v2 (—Sy—47) $ (30) ^ —4-F7) $5): 
—98—3.-12y—4e — 4S ALS) 7) 3010) ^(4—7) 
38—D-Er- ir 19)! ^ (— 105—157) 
38--5A-- (/3)v 
§T2T1 CF) 3A 2T2 GF) STAT AP) STE (SP) STATS (P) 
15) 
309/md— —6—7) — GO oy 2) — —68- Doer (6) (— $y—47) V3 (Ey-F4z) 
(15)1/2 
3 (10)? (45 —7) (27—47) (62 (—$y—47) —68— (14/3)y —$V2y — 42r 
(30)1? 
(5)12(2y—4r) (—2y+47) \3 (Ey--47) V2(—$y—4r) —6B-+4V2 (—23y— 24r) 
$3 10y-H157) 2/2 3) (9-87) (B! (5-87) (5/125 (—35—87) 
4v2(—10y—157) 0 (B1! (3-87) OE 3(10):5(—3—87) 
SBHH iy- őr 0 (5/12) ^(—3.—87) 3(10)/5(—3,—87) 3(5)/2(—3y—87) 
188--14y V2 (6y —307) V3 (—4y--207) (6)! 2 — 107) 
£y teta -—T(91; +r 
x Ag —Hir 
Ag TA Ir 


M 


on the various crystal field parameters.” The algebraic 


T4E, EE, T2A;, and T2E levels, respectively. Here the 
work, which is quite lengthy, will not be reproduce 


matrix elements of L and S are to be evaluated using 
the wave functions of the type of Eq. (8). These 
elements can be written as functions of the mixing 


fici A E *W. H. B i duc , 1964 
S cients as defined in Eq. (8) which, in turn, depends rumage, Ph.D. thesis, University of Oklahoma; 


(unpublished). 
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here and the readers are referred to Ref. 9 for the 
mathematical details. 

The excited states beyond the *T,(F) manifold are at 
least about 3000 cm~ above the ground state, so they 
can be ignored as far as the magnetic susceptibility is 
concerned. It was found that the theoretical suscepti- 
bilities at temperatures below 500°K depend on à and 
a parameter K which is defined as 

K=—30y—3r, (10) 
but is practically independent of the individual values 
of y and r. The calculated susceptibilities can be fitted 
to the experimental data by choosing \=—170 cnr}, 
and K — — 20 cm (see Fig. 2). These give the positions 
of the first two excited states (A2 and E) as 230 and 
240 cm~, respectively. It is rather remarkable to note 
that one is able to determine a splitting as small as 
10 cm from the magnetic measurements. This occurs 
because the magnetic anisotropy depends quite sensi- 
tively on this 45— E splitting. When the uncertainty of 
the experimental data is taken into consideration, the 
trigonal splitting is determined as 10-4 cm-!. Like- 
wise, the accuracy of ^ and K is estimated to be 


A-— —1702:10 cm, 


K=—20+8 cm. (11) 
The determination of the individual values of y and 7 
will be discussed in Sec. IV. 


B. Ni?*:ZnO 


The rather large magnetic anisotropy of Ni**:ZnO 
(Fig. 3) indicates that the trigonal field here is consider- 
ably greater than that in Ni?*:CdS. In fact, from the 
susceptibility data we find that the spin-orbit coupling 
constant and the trigonal field parameters are of 
comparable magnitude. The approximate method for 
determining the energy levels and wave functions used 
in the previous section is no longer applicable here; 
rather one must solve the entire 21X21 secular equation 
as given in Table III. The amount of F-P mixing in this 
case is, however, quite small and can be neglected. With 
a given set of values of A, y, and 7, one can solve the 
secular equation numerically and calculate Xı and X; 
according to Eq. (9). The wave functions for the lowest 
nine states are given in Ref. 9. Again the calculated 
susceptibilities are governed chiefly by \ and K, and by 
fitting the theoretical values to the experimental data, 
we found 


(12) 


Figure 3 shows the comparison between the theoretical 
and observed susceptibilities. The energies of Ge mai 
two excited states (4+ and E) are 160 and 260 cm™. 


IN ZnO AND CdS 
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Fic. 2. Magnetic susceptibilities (per gm-ion) 
of Ni**: CdS from 77-500^K. 


IV. DISCUSSION 


From the optical spectrum, Weakliem? has deter- 
mined the trigonal splitting of the level T.[*7,(P) ] of 
Ni?*:ZnO as 41 cm™. By combining this splitting with 
the parameter K given in Eq. (10), the two trigonal 


xe? 


Fic. 3. Magnetic susceptibilities (per gm-ion) 
of Ni*:Zn0 from 77-500°K. 
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field parameters can be determined as 


y=7.5 cm ; 
, for NP*:ZnO. 
T= — 1.4 cm 


(13) 


Here we found that the splitting of the T:PT:(P)] 
state is determined primarily by 7. On the other hand, 
K depends mainly on y, the contribution of 7 on K being 
quite small. For Ni*:CdS, the trigonal splitting of 
T4[371(P)] was not observed in the optical spectrum, 
presumably too small to be resolved. In fact, the optical 
spectrum of Ni**: CdS shows nearly complete isotropy. 
If, however, it is assumed that the contribution of the 
7 term in K can be neglected also for Ni**: CdS, as is in 
the case of Ni?+:ZnO, one can make an estimate of y 
from K as 


for Nit*: CdS. (14) 


It has been shown that for the case of V+: Al;O; the 
trigonal field parameters can be accounted for by the 
empirical point-charge model.! It is therefore interesting 
to make a similar analysis here. Following Ref. 1, we 
shall first compare the values of y/B calculated by the 
point-charge model with the experimental results. The 
use of y/B, rather than y, gives a better test of the 
point-charge model, since the former is independent of 
(r^) as well as the effective charge assigned to the ions 
of the host crystals to account for the departure from 
purely ionic bonding. Let us first assume that the im- 
purity ion is situated exactly at the site of Zn?*. From 
the crystal structure data* and using only the four 
nearest-neighbor oxygen ions, we obtain from the point- 
charge calculation y/8— 0.04 for Ni** :ZnO as compared 
with the experimental value of 0.027. This computation 
has been repeated by including in the calculation of the 
crystalline potential all ions of the host crystal within a 
radius of 20 A from the impurity center (rather than 
four nearest neighbors). The ratio y/8 now becomes 0.05 
which does not differ substantially from the value 
calculated with the nearest neighbors only. To explain 
the deviation of the point-charge value from experi- 
ment, we have considered the possibility of small 
displacement of the substituent along the trigonal 
axis. ? The minimum value of 7/6 that can be obtained 
by such a displacement is about 0.04. Another possible 
distortion is a local contraction of the lattice due to the 
difference in ionic radii between Ni^* and Zn?'. This 
type of distortion has been considered by several 
authors.* If the degree of contraction of the oxygen 
ion on the trigonal axis (called the unique oxygen ion) 
differs slightly from those of the other three, a change 
of 7/6 wil result. For example, a contraction of the 
three off-axis oxygen ions by 0.028 A relative to the 

“unique ion in the direction of the cation site will give 
the experimental value of y/B 0.027. 


y¥=0.67 cnr, 


wR, S, Title, Phys. Rev. 131, 2503 (1963). 
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The calculation of 7 from the point-charge model is 
considerably more difficult, as it is necessary to intro- 
duce an additional assumption of using the self- 
consistent field atomic wave function of N?* to evaluate 
(7?) which is not always valid. Also, the convergence 
of this parameter in summing over the lattice is rather 
poor. We have carried out the summation to 30 A and 
the results still fluctuate considerably. However, we are 
able to obtain a rough estimate of 7 as 1 cm™ for the 
case of zero displacement of the Ni?* from the cation 
site and no distortion of the lattice. With the distorted 
lattice as used in the previous paragraph (and no dis- 
placement of the impurity ions), the value of 7 becomes 
— 1 cm™ which is of the same sign as the experimental 
value. In view of the assumptions involved in the 
calculation of 7, it is not reasonable to expect anything 
more than a qualitative agreement with experiment, and 
thus the point-charge calculation of this parameter will 
not be pursued further at this stage. 

In the case of Ni?*:CdS, the point-charge model 
gives, for zero displacement and no distortion, y/8 
—0.018 which is much larger than the experimental 
value. We have used only the nearest-neighbor anions 
for this calculation since the inclusion of ions at lattice 
sites further away, as we have seen earlier, will not 
change this parameter greatly. From the apparent 
isotropy of the optical spectrum? as well as the smallness 
of y/8, we note that the crystalline field around the NP* 
in CdS has considerably smaller trigonal components 
than is inferred from the crystal structure of CdS. Like 
the case of Ni?*:ZnO, the experimental value of y/8. 
cannot be explained by the displacement of the sub- 
stituent alone. On the other hand, the picture of local 
distortion where the lattice relaxes toward the impurity 
ion in such a manner as to restore back to a nearly 
perfect tetrahedral configuration, does account for the 
small apparent anisotropy. Indeed, the experimental 
value of y/8 corresponds to a contraction of the three 
off-axis anions through 0.02 A relative to the fourth 
one. The idea of local lattice contraction in Ni**: CdS 
was suggested by Weakliem in his optical studies.? 

Thus, in both Ni#+:ZnO and Ni**:CdS, the experi- 
mental values of the trigonal field parameters are 
consistent with those calculated from the empirical 
point-charge model! with a local distortion of the Jattice 
such that the three anions of the trigonal axis relax 
toward the Ni? slightly more than does the one on the 
axis. Of course, the point-charge model is a highly 
idealized picture. When it is applied to crystals with an 
appreciable amount of covalency, a “displacement” of 
the anion may result from an actual movement of the 
lattice site as well as a shift of the electron cloud due to 
a change of the degree of covalency. In our application 
here no sharp distinction can be made between these 
two effects and the term "lattice distortion" really 


u D, S, McClure, J. Chem. Phys. 38, 2289 (1963). 
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refers to a movement of the effective center of charge 
of the anion. 

The spin-orbit coupling constants of Ni?* in both CdS 
and ZnO are about one-half of the free-ion value 
(—340 cm™). They are also substantially smaller than 
the spin-orbit constants of Ni in octahedral field, 
e.g., —245 cm" for Ni*:MgO,"? and —250 cm"! (or 
— 270) for Ni Tutton salt. Similar reduction from 
the free-ion values was also found for the Slater-Condon 
parameters for Ni** in CdS and ZnO from the analysis 
of the optical spectra.5 These results point toward the 
idea of strong bonding between the impurity ion and 
the ligands. One also notices the fact that the Slater- 
Condon parameters are appreciably smaller for Ni?* in 
CdS than in ZnO which is suggestive of a stronger 
covalency in the former. However, the same sort of 
difference in X(— 1704-10 cm™ versus — 1752-25 cm~!) 
is not so obvious from the magnetic measurements, 
although a larger covalency in Ni?+: CdS is compatible 
with our results within their limits of accuracy. It is 
interesting to note that a substantially larger reduction 
of the spin-orbit constant has been reported in 
Cu?*:ZnO.5 

Our analysis leads to the conclusion that the magnetic 
susceptibilities of Ni** in CdS and ZnO can be explained 
quantitatively by using the ligand field theory. The 
considerable reduction of the spin-orbit coupling con- 
stant indicates delocalization of the valence electrons of 
the Ni*. It has been pointed out that even in the case 
of Cu**:ZnO where an unusually strong bonding be- 
tween the impurity ions and the ligands is found, the 
delocalized electrons in Cu?*:ZnO extend only over the 
Cut ion and its nearest neighbors. Thus the use of 
the ligand field theory in our case should be justifiable. 
The effect of the covalent bonding is taken into account 
by making the spin-orbit constant an adjustable 
parameter. In fact, if one assumes different degrees of 
covalency between the one-electron orbitals f2 and e as 
was done for Cu?*:ZnO, the spin-orbit interaction is 
then described by two parameters, i.e., (t2|He-0|f2) and 
(2| Hs-0|e). However, since at T<500°K the magnetic 
susceptibility is governed primarily by the ground state 
and the two trigonal components of T;[*71(F)], and 
since all these two states belong to the (¢2)? configuration 

? W, Low, Phys. Rev. 109, 247 (1948); J. H. E. Griffiths and 
J. Owen, Proc. Roy. Soc. (London) A213, 459 (1952). Fe 

3R. E. Dietz, H. Kamimura, M. D. Sturge, and A. Yariv, 
Phys. Rev. 132, 1554 (1963). 


(in the notation of the strong-field scheme), the suscepti- 
bility should be rather insensitive to (la| Heale) and 
hence the value of À determined here corresponds to 
the coupling constant of the l; state. It has also been 
customary to introduce an additional parameter, the 
"orbital reduction factor" k, which relates the matrix 
elements of the orbital angular momentum evaluated 
by using the ligand fiéld LCAO MO with those evalu- 
ated by the pure atomic functions.?^4 Generally, this 
factor is not much less than one even when there is a 
considerable reduction in X. For instance, in the case 
of Fe'* in ZnF5,5 the spin-orbit constant is about 60% 
of the free-ion value and £ is around 0.95. Even in the 
very strong bonding case of Cu?*:ZnO where A is less 
than 10% of the value of a free Cu, the orbital reduc- 
tion factor is only 0.46.7 In our theoretical analysis 
here, this reduction ratio is not used, i.e., it is taken as 
unity. When the orbital reduction ratio is included in 
the theory, all the matrix elements of u which are 
relevant in the calculation of the susceptibility, are 
modified by the same multiplicative constant. The 
inclusion of & merely alters the theoretical x by a 
constant factor and has the same effect as changing the 
concentration of the nickel ion. Thus the orbital 
reduction ratio cannot be obtained from our suscepti- 
bility data as the precise concentrations of Ni^* in both 
ZnO and CdS are not known. On the other hand, the 
crystal field parameters \ and K affect the shape of the 
curve of x versus T rather than the absolute magnitude 
of the susceptibility, hence the values of these param- 
eters which we gave in the preceding paragraphs still 
stand correct even when the orbital reduction factor is 
taken into consideration. 
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The partition function of an electron gas in the presence of scattering centers, such as exist in a dilute 
solution, is obtained by using the method first introduced by Matsubara. It is shown that the electronic 
specific heat at low temperatures, in such a case, contains a term arising from virtual electron scattering. 
If the Fermi limit in the pure metal lies just beyond a sharp peak in the density-of-states curve, this term 
increases the specific heat. Comparison with the observations by Rayne on the dilute solid solutions of zinc 
and germanium in copper indicate that the effect discussed is probably the explanation of the observed initial 
increase in the low-temperature specific heat at small concentrations. 


1, INTRODUCTION 


TESS electronic properties of disordered alloys are 
difficult to analyze because there are no simple 
quantum numbers in terms of which the single-electron 
wave functions can be described. Equilibrium proper- 
ties, however, are fully determined when the partition 
function is known, and this can be obtained as the trace 
of an operator with respect to any complete orthonormal 
set of total wave functions. For dilute solid solutions the 
Slater determinants of the Bloch functions of the pure 
metal form an obvious choice for such a set. In this 
representation, the wave function of the ground state of 
the alloy is not a single Slater determinant, as in the 
Hartree-Fock approximation for a pure metal, but an 
infinite series of such determinants. The first member of 
the series corresponds to the ground state of the pure 
metal, and the other members to various excited states. 
In wave-vector k space the alloy has not therefore a 
sharply defined Fermi surface in its ground state but one 
which is broadened by the effect of virtual electron 
scattering. There must, of course, always be a precisely 
defined Fermi limit in terms of the true one-electron 
states of the alloy but these are not representable in k 
space. When the solute concentration is a few percent, 
the broadening of the Fermi surface of the ground state 
will be large compared to the thermal broadening in the 
pure metal at low temperatures. 

If the density of states in the pure metal has special 
features lying just below the Fermi limit, these will not 
affect the low-temperature electronic specific heat of the 
pure metal, but they will affect that of the dilute alloy 
if the ground-state broadening, just referred to, is large 
enough to include these features. This appears to be 
exactly the situation which exists in copper and its solid 
solutions. In pure copper it is believed that the Fermi 
limit lies just beyond the peak of the density-of-states 
curve. Hence, in dilute solid solutions, the influence of 
this peak may be felt on the equilibrium properties and, 
in particular, on the low-temperature electronic specific 
heat. It will be shown that in such a case it leads to a 
slight increase in the specific heat even when the electron 
concentration remains constant. This increase is closely 


* On leave of absence from Imperial College, London, England. 


related to the residual electrical resistance and is deter- 
mined by the matrix elements of the scattering potential. 


2. PERTURBATION EXPANSION OF THE 
PARTITION FUNCTION 


The partition function is obtained as a particularly 
simple example of the method introduced by Matsubara’ 
and developed by Thouless.? Let H denote the complete 
Hamiltonian, V the number of electrons in a volume v, 
B—1/RT, and œ= uf. If u denotes the partial potential a 
is the activity. The grand partition function Z is given 
by the trace of V, where 


jenem, () 
and 
Z-Tm -—1.(n|e*W-88 |n). (2) 


The wave functions |), which are used for the evalua- 
tion of the trace are Slater determinants of Bloch func- 
tions including the spin factor and relating to the pure 
metal. 

To simplify the notation let 


H—yN — Ho4-V (x), (3) 


where Hy is the Hamiltonian of the pure metal minus 
uN, and V(r) is the potential energy of an electron due 
to the randomly distributed impurity atoms. V(x) is 
therefore the difference between the potential in the 
solid solution and that in the pure metal. 

The interaction representation is defined by 


Vr eB, (4) 


and if we write 
His (u) = eos y (y)e-Hov | (5) 


or in terms of the creation and annihilation operato» 
Cx (u), Cr (u), 


Hau) =D (k| V (DEC), Q 
then i 
9U1/08— —Hs(8)r, () 


1 T. Matsubara, Progr. Theoret. Phys. (Kyoto) 14, 351 (1955): 
*D. J. Thouless, Phys. Rev. 107, Yo (1957); The Quanti 
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which may be solved by iteration as follows: 


B B uj 
u=1- f dutta) | tuf dusH (u) Hy (12) 
0 0 6 


B uz u2 
-f daf du | dusH (1) Hs(u2) H (15) 3- etc. , 
0 0 0 
(9 


8) 
where 
Bu£€u D. 


The above solution not only satisfies (7) but also the 
boundary condition, y; —1, when 8—0. 

From (4) and (2) it follows that, with v; as 
given by (8), 


Z jn|e- Apr n) 
Zt 3 n(n e708 | n) i 


where Zo is the partition function relating to the pure 
metal. 'The terms of the sum (8) can be evaluated in a 
manner exactly analogous to that used for the deter- 
mination of the energy of the ground state in the many- 
body problem. 

Writing 


(9) 


Ho 2 z(e u)Ci Cr, 
it follows from relations similar to (5), and applicable 
to any operator, that 

Cilu)= Cp lu, 
C. (u) S Cyte imu, 
An essential feature of Matsubara’s method (cf. Ref. 2) 
lies in the following definition of the normal product 
which ensures that its trace in (9) vanishes: 
NCC, ]= (1— fir) Cit Ck fire. (11) 
For example, the contribution to Z/Zo of any normal 
product occurring in (8) is given by 
(A/Z) X (n | e- &98N[Cé*Cx]| n) 
= (ZYE (a "ot foc ei RCM 


(10) 


which is zero, if 


ei (1/29: (n| e- CC; n) fr, T 


(1/Z9) £ (n| e- 9 °C, Cy* | n)  1— fe- 


The number fx, defined by (13), is just the occupation 
number for the state &, i.e., 


fi (12-8692). (14) 


The contractions then introduce factors fe or 1— fr as 
indicated below. Using the linked-graph theorem, the 
result to second order can be given in the following form. 


In(Z/Zj) -In(Y: all graphs) =» (linked graphs) 
= -8 Vixfick8 ZIC APTE 


xLf—f0/(e-e)] ete. (15) 
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Kk J V be V Se bey 
XJ V ME yu Bs 
We Xu cA 


Order |. 2. 3. 4. 


Fic. 1. Diagrams for the-perturbation series of the partition 
function involving only two wave vectors. 


The second-order term in (15) diverges logarithmically 
near the Fermi limit, and it is therefore necessary to 
make a partial summation over all orders of (8) in 
which only two wave vectors, k and J, are involved. If 
we represent the vertices by points with the values of 2 
increasing upwards, the graphs of the terms to be 
summed are as shown in Fig. 1. 

When the operators are u ordered as in (8), it follows 
from (13) that an upward line corresponds to C,Cz* and 
gives a factor 1—f, and a downward line to Ci*C; 
which gives a factor fı. The energy denominators for all 
these particular graphs are powers of ej—e, and the 
matrix elements at the vertices either Vir, Viz, Viz, or 
Vu. [Henceforth we use this abreviated notation rather 
than that of (15).] It is now easy to see that the above 
sequence of graphs gives the following contribution to 
In(Z/Z;). 


|Val*fi(4— fü) 
—BY Vaiss s e 
k EL 


€1— Ek 
VifitVau(1— fi) 
xh- Je r-e P+] 
€1— €k 
syan E 
F p AES td eed (ifi Vul—f))- 


(16) 


which remains finite at the Fermi limit. It may be noted 
that the lower limits of the integrals in (8) (Le., the 
zeros) give no contribution because factors like e(*«-*95, 
etc., combine with f; and f, in such a way as to lead to 
exact cancellation. There are, of course, many other 
terms in the expansion of y, but all will involve higher 
powers of the nondiagonal matrix elements Vz, and will 
therefore be neglected in the present approximate 
considerations. 3 i 

The range of ¢, and e; (now written for convenience 
as e and 7, respectively) over which the summand of the 
second sum of (16) differs appreciably from zero is shown 
in Fig. 2. 

Since all quantities in (16) apart from Vi: depend 
only on the energies of the states kand}, the first step in 
the summation is to average | Vi]? over all relative 
directions 0 of the vectors k and /, and to assume that the 
result depends only on e and 7. This assumption is 
correct if the matrix elements are calculated with 
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Fic. 2. Range of integration 
for the second-order correction 
to the partition function. 


respect to free-electron wave functions. It will be 
assumed, therefore, that 


1 T 
2 i | Vial? sin8d6— K (e,n) = K (1, €) o (17) 


I" To simplify the calculations the following assumptions 
ME and approximations are also made: (i) that in the 
pu demoninator of (16) f; and f; may be replaced by their 
{fe values at absolute zero, (ii) Vi; is independent of k and 
$ equal to a constant V. Hence, if N (e) denotes the 
density of states, the sum (16) may be written as an 


(18) 


where 
€2 uo 
(19) 


integral as follows: 
N(9N ()K (em) 
= Oli f@)ildedy 
J i n—e+2V 2 1 
-faf PAE GD N (n)K (em) 
mey 
N (ƏN Q)K (en) 
E E 
n—e+2V 
d (9N (2) K (em) 
pe eti) ) 
n—e+2V 
(14-e8(—20)1, 
(= io 
<(1FePO}-!, ex, 
and the second and third integrals are taken along the 
boundary of the shaded region of Fig. 2. The term aris- 
ing from the immediate neighborhood of (uo,uo) has 
been. omitted as negligible compared with the others. In 
_ the last integral the symbols e and n may be inter- 
— changed; and therefore, if 
“NMK (en), f° NK (en) 
aS eam | 
o €—n+2V 


d», (20) 
nean 2 V. 


_ the whole ot the second sum of (16) may be written 


—BE,—8 | KOO (21) 


o E; is the second-order energy due to the perturba- 
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tion and is independent of temperature. If E, denotes 
the energy correction in the first order, then 


nZ- Indo BUE) 8 | N (e)g(e)e (ode. 
0 


(22) 
The last term of (22) can be written 
-6 iL N()e(96 (94e 
^ N(e)g(e)de * N(e)g(e)de 
-e| 14-e-86-» Ji 1+ ¢8(—») 4 (23) 


Let x—(e— p), then 


ax x x 
Ix | JG) 
e7 B B 


co * dy 
N(e)e(e)o(e)de- 
ef OOO E 


AM] m 


in which the usual approximation has been made of 
replacing the lower limit —8y by — in one integral. 

The partition function for the pure metal is well 
known and may be expressed 


mZ Bn) B) I nate) 
x| N(ut- --N(u—-) lds, Q3) 
B B 


n= | Nod, and B)= [ v «te (26) 


where 


are both functions of temperature through their de- 
pendence on u. On substituting (24) and (25) into Q2) 
it follows that for low temperatures, 


InZ—an(u)—BLE(u)-4- Ei4- Ez] 


2 


ivo [9e] |. (27) 


It may be noted that N (u) is the density of states includ- 
ing the factor 2 which arises as a consequence of eac 
state being doubly occupied by electrons of opposité 
spins. 

The properties of an assembly consisting of à fixed 
number of electrons are obtained by using the relation, 


(9 InZ/ào)g—, (28) 


to determine u when ^ is constant. The energy is then 
given by 


— (8 InZ/08).— 


(29) 


es 
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The specific heat of the system C, (it has been assumed 
throughout that the volume is constant) is given by 


B? /0E 
GO J =yT. (30) 


Although y may readily be evaluated, it is sufficient for 
the present purpose to note that 


T mad 
nji- E xe | | ; (31) 
N (u) 


K 


where yo refers to the pure metal. 


3. EVALUATION OF MATRIX ELEMENTS 


It will be assumed that the potential V (r) arises from 
v screened unit positive charges randomly distributed 
through the volume v which contains JV electrons and 
the same number of lattice sites. The solute concentra- 
tion is therefore given by c=v/N. Hence 


V(r)- —eé»;.;exp(—-o|r-R,|)/|r-R;|, (32) 


where R; denotes the position of the solute atoms, and 
æ is the reciprocal of the screening length. 

If in order to estimate the magnitude of Vz: we use 
free-electron wave functions defined by 


ERU. r (33) 
a/v 
the following well-known result is readily obtained. 
7 Zd : Coa qe 
Va=— " TELE "8 (34) 


As a result of the random nature of the distribution of 
solute atoms, 


[Vial = : (33) 
uem ea eae 5 
A ( v / (k Fe 


The average of this expression over all directions of 1 
relative to k depends only on #? and 7? and therefore is in 
accordance with assumption (17). 

In order to simulate the effect of the peak in the 
density of states of copper just below the Fermi limit 
andat the same time enable the integrals to be evaluated 
without too much difficulty we adopt the following 
model. 


N(n)=NotNpi(n—a), &«nuo, (36) 


where Nois a constant. Here p is a fraction of order 10^. 
It is further assumed that K (u,n) — 0 for large values 
of 7 as would be the case with (35). Substituting (36) 
into (20) gives 


Ho ] 2 Nok (e, 
g(9- E poen. nt | NS 


e—7+2V eur 2 
NpK (ee) 


s D 
e—e2V 
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When euo it will be seen that the first two integrals 
are of opposite sign and tend to cancel each other, and 
their contribution to g' (u) will not be considered further. 
The third term represents the effect of the peak in JV (c) 
and its variation with e results mainly from the de- 
nominator. Hence a reasonable approximation is 


£ (y) 7 —N pK (uyà)/ (a—e--2V)? , and N’(u)=0. (38) 
Thus 


[See |. (39) 


(u—e+2V)? 


To estimate the additional term in 4 we use (35) and 
(17) and find 


K(upg)Np (Axe), hw? y 
AMET 
uo? deep) NN. 


where 2 is the atomic volume which may be written 
Q—izrj, and w=(h?/2m)e®. Hence using the free- 
electron approximation to determine us and writing for 
simplicity, w= p, 

TUE LE 4 256\ 1^ (€2/r,)? 
2 d 


p? Q? \2m 


c, (41) 


a 
p B 


which gives finally, neglecting 2V compared with u— ei, 
0.13% - (e/r,)? i 
EN s 
(1—e/p) p? 
Suppose the ratio of the two energies is unity, that e 
lies {p of uo below the Fermi limit, and that 5710-1, i.e., 
one-tenth of the electrons lie under the effective part 
of the peak. These rough estimates which seem reason- 


able enough then give y=~yo(1+-c), where c is the con- 
centration expressed as a fraction. 


y= + 


4, RELATION TO RESIDUAL RESISTIVITY 


Imagine a hole to exist in the Fermi distribution at 
absolute zero in the neighborhood of e. Elastic scatter- 
ing due to impurities will result in a finite lifetime 7 of 
any particular state k of this hole. This lifetime will be 
approximately equal to the relaxation time for impurity 
resistivity. Each level therefore will have a width of 
order 2/7. The delta function of (36) is therefore effec- 
tively replaced by a function of width 4/7, and if u—e, 
is small part of the tail will overlap the Fermi limit and 
give a contribution to the linear electronic specific heat. 
This argument can be made semiquantitative as follows. 
The linewidth at e may be written 


hír 
z((e—0*-- uv?) | 


and hence if the density of states in the pure metal is 
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No(m) in the solid solution it is 


3 No(n)h/r 
Vel <a a 
aS | E OO 


For the model in which the density-of-states Mo(n) is 
as given by (36) it follows that 


(h/ 1)Np 
a{ (u—ex) 2+ (8/7))) 


N(u)=Not (45) 


Hence when 7;/rp—e1, which implies that the correc- 
tion is small and corresponds to a second-order effect, 


Np(h/7) 


N (w) = Not (46) 
wo Tu’ (1—6/u)? 
and 
2/7) | 
=F) lo = (47) 
j n| a í 


where the approximation JVouc-N has been used. The 
relaxation time is given by the well-known formula, 


h mkv 


T 2mh 


| | Vii 2(1— cos) sin0d0 , (48) 
0 


where v=QN is the volume which contains v impurities 
and with respect to which the electronic wave functions 
are normalized. If we define half of the integral as 
K' (uu), then a simple calculation shows that 


h/r—37NK'/2u, (49) 


and 


Y= 01+ [SKN p/2(u— 41)" ]) , (50) 
which is similar to (39) apart from the numerical factor 
3/2, the neglect of 2V in the denominator and the differ- 
ent definitions of K and K’. Both derivations show that 
the addition to the electronic specific heat is proportional 
to the residual resistivity. 
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5. COMPARISON WITH EXPERIMENTAL RESULTS 


The electronic specific heats of dilute solid solutions 
of Zn and Ge in Cu have been measured by Rayne? ang 
by Veal and Rayne.‘ It is found that initially for smal] 
concentrations of the solute y increases in both cases. 
Since the Fermi limit lies beyond the peak of the density 
of states a simple theory would suggest a decrease of y 
due to the increase of the electron concentration, This 
decrease is easily calculated and is indicated by Vea] 
and Rayne. If we attribute the difference between the 
observed increase and the expected decrease to virtua] 
electron scattering and express the results for the initial 
slopes by giving the values of b in y/yo— 1--bc, where 
c is the fractional concentration, then for Zn we find 
b=1.4 and for Ge, 5— 5.8. Equation (42) appears there- 
fore to give values of b of roughly the right magnitude, 
Moreover, as might be expected, Ge with the greater 
scattering power has the higher value. 

'There are other examples where the observed elec- 
tronic specific heat is larger than would be anticipated 
from a simple band model. For example, Keesom and 
Kurrelmeyer® find for a copper-nickel alloy containing 
approximately 20 at.% Ni, a y value which is 1.56 times 
as great as the y value for pure Cu. Conventional theory 
visualizes the d band as just filled in CugoNiso. At 80% 
Cu the Fermi limit will be beyond the point where N (e) 
rises steeply due to the d levels. A similar situation to 
that discussed in the preceding theory therefore exists 
and may perhaps be explained in the same way. The 
measurements of Hoare and his collaborators$? on 
Ag-Pd and Au-Pt point in the same direction. 


ACKNOWLEDGMENTS 


The author wishes to thank the members of the 
Physics Department of the Oklahoma State University 
for their kind hospitality and the National Science 
Foundation for the award of a Senior Foreign Scientist 
Fellowship. 


3 J. A. Rayne, Phys. Rev. 108, 22 (1957). 

4B. W. Veal and J. A. Rayne, Phys. Rev. 130, 2156 (1963). 

5 W. K. Keesom and B. Kurrelmeyer, Physica 7, 1003 (1940). 
a ue 5 Hoare and B. Yates, Proc. Roy. Soc. (London) A240, 42 

7D. W. Budworth, F. E. Hoare, and J. Preston, Proc. Roy. Soc: 
(London) A257, 250 (1960). 


= 


^ 


f 
| 


i 


x” 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


PHYSICAL REVIEW VOLUME 


134, NUMBER 4A 13 MAY 1964 


Motional Narrowing of Nuclear Magnetic-Resonance Lines 
in Manganese-Doped Lithium Fluoride* 
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Nuclear magnetic-resonance experiments have been performed between 25 and 550°C in LiF powder 
doped with manganese. Motional narrowing of the Li? resonance line occurred somewhat below 400°C. 
Resonance absorption lines were not observable in the temperature range 200-390°C, probably because an 
effective loss of paramagnetic relaxation centers increased the spin-lattice relaxation time. Electron spin- 
resonance experiments verified that the paramagnetic manganese does cluster upon heating above 200°C. 
A method of analysis is developed which eliminates errors due to constant linewidth contributions in the 
analysis of motional narrowing data in terms of diffusion activation energies. The activation energy for 
motion of Li ion vacancies in LiF is found to be 0.74 eV. 


I. INTRODUCTION 


OTIONAL narrowing of nuclear magnetic-reso- 
nance lines has been used to investigate extrinsic 
diffusion in several doped ionic crystals. Examples in- 
clude! LiBr where vacancies present on Li sites gave rise 
to extrinsic Li-ion diffusion, and?? CaF, where F ions 
were seen to diffuse. In the present study, motional 
narrowing is observed in manganese-doped LiF powder. 
The substitution of Mn? ions for Lit ions in LiF gives 
rise to an equal number of Li-ion vacancies to maintain 
charge neutrality. For this reason, it is possible to 
interpret motional narrowing in this system in terms of 
the diffusion of Li-ion vacancies. 


IL. EXPERIMENTAL TECHNIQUES 


Powdered LiF samples were prepared by precipitation 
with NH,F from an aqueous solution of LiCl containing 
0.1% MnCls. They were subsequently filtered, washed, 
and then dried at 100?C for a short time. The Li? nuclear 
magnetic-resonance absorption line was observed at 7.5 
Mc/sec using a modified Varian wide-line spectrometer. 
Experiments were performed between 25 and 550°C by 
inserting an air-cooled high-temperature Dewar in the 
Varian probe. 


III. METHOD OF DATA ANALYSIS 


To date, the analysis of motional narrowing curves to 
give diffusion coefficients has been only moderately 
successful. In metallic aluminum? and lithium;?:9 experi- 
mental activation energies for diffusion have varied 


* This work was partially supported by the U. S. Atomic Energy 
Commission as well as the Advanced Research Projects Agency, 
through the Center for Materials Research at Stanford University. 

1R. R. Allen and M. J. Weber, J. Chem. Phys. 38, 2970 (1963). 

2W. J. Veigele and A. W. Bevan, Jr., Phys. Rev. 131, 1585 
(1963). E 

* W. J. Veigele, Bull. Am. Phys. Soc. 9, 25 (196). —  — 

* E. F. W. Seymour, Proc. Phys. Soc. (London) A66, 85 (1953). 

5H. J. Sukenik, thesis, University of California, 1958 (un- 
Published). 

* H. S. Gutowsky and B. R. McGarvey, J. Chem. Phys. 20, 1472 
(1952). 


Írom values obtained by other techniques’ by up to 
30%. This is due both to changes in lineshape and 
constant contributions to the linewidth, which make 
data analysis uncertain. 

A method has been developed which enables analysis 
of motional narrowing data to produce accurate values 
of diffusion activation energies. It eliminates errors due 
to any constant contributions to the linewidth, in- 
cluding magnetic-field inhomogeneities, and is appli- 
cable in temperature regions where the lineshape is 
nearly constant. 

Since narrowed lines are nearly Lorentzian,” the 
actual dipolar linewidth 6H. can be related to the 
diffusion coefficient D by the relation 


8H ,— CóH,D-1— A exp(E/kT), (1) 


where E is the activation energy for motion of Li-ion 
vacancies. The constant A contains the concentration 
of vacancies in the system and the factor relating the 
linewidth to (1/7:). The relation between the measured 
line width ôH m and the actual dipolar linewidth can be 
written 

ôH ,,— aóHa- 06H, (2) 


where 8H; is a constant. 

One of the methods of data analysis that has been 
used by others consists of plotting InôH „m versus 1/T and 
equating the slope of the curve to E/k. If, however, a 
constant addition to the line width is present such as 
given in (2), the slope S obtained in this manner is not 
E/k but rather 


dlnH, E 1 
d(1/T) k [1--(08Hs/aA)e-£'*T] 


Equation (3) shows that S becomes smaller than E/k at 
higher temperatures, which is the region of interest in 
the present case. 


(3) 
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? D. F. Holcomb and R. E. Norberg, Phys. Rev. 98, 1074 (1955). 
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Fic. 1. Linewidth of Li? resonance line in Mn-doped LiF as a 
function of temperature. 


If, however, ôH m is first differentiated with respect to 
1/T, followed by taking the logarithm of this derivative, 
we have 
i dH m E 
. InA=1n =const-+-—. (4) 
(1/T) kT 


Bs The slope S of a plot of nA versus 1/T will now yield 
i5 the true value of E/k, independent! of ôH. This analy- 

ü sis is easily performed, since A can be evaluated at 
several temperatures on a plot of the experimental data 
for ôH m versus 1/T. 


IV. EXPERIMENTAL RESULTS AND DISCUSSION 


2 The temperature variation of the observed linewidth, 
iS defined as the width in gauss between the points of 
i maximum and minimum slope, is given in Fig. 1. The 
; E limits of error shown were determined from estimates of 
F the experimental accuracy in determining the linewidth 
; and from the variations between values obtained from 
several experimental measurements at the same temper- 
ature. The room-temperature values of linewidth may 
be compared with the half max-min linewidth of about 
6 G given by Chang ef a^ in similar samples at 
779K. Nuclear-resonance lines of F° in this material 
narrowed by about 25% in the same temperature range 
in which the Li? narrowed. 


11 Tt should be noted that if we include the bilinear term in the 
expansion of Eq. (2), (céHa5H 1), nothing is changed in the result 
except the constant a. 

n'T. T. Chang, W. H. Tanttila, and J. S. Wells, J. Chem. Phys. 
39, 2453 (1963). 

iT. T. Chang, thesis, University of Colorado, 1962 (un- 
published). 
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Fic. 2. Plot of In[d3H /d (1/T) ] versus 1/T for the data of Fig. 
1, which yields activation energy of 0.74 eV. (Nole adked in proof. 
The scale on abscissa and ordinate should contain a factor of 105.) 


Absorption lines could not be obtained in the temper- 
ature interval 200-390?C because of a long spin-lattice 
relaxation time. This was probably caused by an 
effective loss of paramagnetic-relaxation centers from 
the system. Electron spin-resonance experiments were 
performed, indicating that the manganese ions are 
uniformly distributed in the lattice at room tempera- 
ture. After heating the samples for short times above 
200°C, the electron spin-resonance line observed at 
room temperature loses its hyperfine structure due to à 
strong exchange interaction. This is probably due to 
clustering of Mn in the Li sublattice, similar to the 
clustering observed by Jennings in Mn-doped LiCl 
single crystals. The electron spin-resonance spectra be- 
fore heating are very similar to those observed by Chang 
et aj 32,18 

The analysis described in Sec. III has been performed 
on the high-temperature data of Fig. 1. The resulting 
straight line, shown in Fig. 2, yields an activation energy 
of 0.74 eV, with a precision of about 10%. This is in 
good agreement with the activation energies obtained in 
the extrinsic region by investigators using other tech- 
niques; electrical-conductivity results? have given 
E=0.65 eV and spin-echo experiments in single crys- 
tals! have given 0.71 eV. By comparison of the present 
data with the results of electrical conductivity experi- 
ments,” it can be determined that the vacancy concen- 
tration in the present work is about 5X 10-5. 


uD. A. Jennings, thesis, University of Colorado, 1962 (un- 
published). 

15 Y. Haven, Rec. Trav. Chim. 69, 1471 (1950). 

16 M. Eisenstadt, Phys. Rev. 132, 630 (1963). 
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We consider a harmonic crystal with a single isotopic impurity whose coupling to its neighbors includes 
a cubic anharmonic term. The impurity is chosen to be light enough that a localized mode exists. The 
Hamiltonian for this system is truncated, in a physically reasonable way, so as to yield an exactly solvable 
Schrödinger equation. The solutions give a new insight into the nature of the localized mode, which in this 


approximation is exactly analogous to an unstable 


particle in the Lee model of field theory. The wave 


functions are exhibited and the lifetime is calculated for a light impurity in a simple cubic lattice at T=0. 


I. INTRODUCTION 


N isolated impurity in a harmonic crystal gives 
rise to certain changes in the vibration spectrum!-5 
of the system. In particular, if the impurity is lighter 
than the host-crystal atoms and the impurity-mass to 
host-mass ratio is less than a certain critical number 
dependent on the atomic arrangement, then a so-called 
localized mode appears above the quasicontinuum of 
phonons. The characteristics of this mode are that when 
it is excited only the atoms near the impurity participate 
in the motion (the attenuation length decreases as the 
frequency increases); and that its frequency is higher 
than the maximum phonon frequency. The latter may 
be looked upon as the reason why the mode is localized; 
its frequency is too high to propagate in the lattice. The 
localized mode is a single normal mode of the system, 
with a sharp frequency. 

It is the purpose of this note to investigate what 
happens to the localized mode when the interatomic 
force is allowed to contain certain anharmonic terms.® 
The “single-particle” description of independent oscil- 
lators (harmonic forces only) then goes over into the 
analog of an interacting many-body system, and the 
physics and mathematics encountered is similar to that 
which occurs in a wide variety of problems. The local- 
ized mode state of the harmonic approximation becomes 
mixed with the multiphonon states whose energies lie 
nearby. The strength of the anharmonic interaction 
determines the width of the energy interval in which 
the mixing occurs; the energy width in turn governs the 
coherence of the localized mode (now only an approxi- 
mate eigenstate) and therefore its lifetime. 


* Work done under the auspices of the U. S. Atomic Energy 
Commission. : 

t This work was reported at the Moscow Conference on Solid 
State Theory, December 2-12, 1963. M 
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42, 259 (1962) [English transl: Soviet Phys.—JETP 15, 182 
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6 Thi ion has been studied before by different methods. 
Sea Poe eens Phys. Rev. 122, 443 (1961) and A. A. van 
dudin, 1962 Brandeis Lectures in Theoretical US (W. A. 
Benjamin and Company, Inc., New York, 1963), Vol. 2. 


Lowest order perturbation theory involves the calcu- 
lation of the diagram shown in Fig. 1, with time in- 
creasing from left to right. The single localized mode 
which is present in the initial state (dashed line) decays 
into two phonons in the final state (wavy lines) through 
a cubic anharmonic interaction. The matrix element of 
the interaction operator 


h=}, Brinda ay (1) 


im 


between a localized mode state (k) and a state in which 
two phonons (/ and m) are present is all that is involved 
in the perturbation calculation. Such a calculation has 
been performed by Klemens,? who used the phonon 
operators of the peifect crystal, and the localized mode 
operator’ of Montroll and Potts.? 

The interaction Eq. (1) is one of the terms which 
arise from the expansion of a cubic anharmonic term 
in the lattice potential energy 


H'—», Ci(xi-xj* (2) 


in terms of the phonon and localized mode creation and 
annihilation operators. The cubic anharmonic term 
has no lower bound for large relative displacements; 
hence one would not expect any solutions of a problem 
in which it is the only perturbation. For our anharmonic 
interaction, however, we choose only a part of Eq. (2), 


Fic. 1. Feynman 
diagram for decay of EEE 
localized mode into 
two phonons. E 


7 The localized mode displacements assumed by Klemens were 
those given by Montroll and Potts as asymptotic forms, except 
that he inadvertently omitted the alternation of sign with each 
successive lattice distance from the impurity. This omission tends 
to make Klemens' result for the width too large, as does the fact. 
that he used phonon operators for the perfect harmonic crystal 
rather than those for the harmonic crystal with a. single isolated. 
impurity. 
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such that the resulting systems may easily be solved 
exactly. Our method exhibits the solutions for the wave 
function explicitly, hence has an esthetic advantage 
over a perturbation calculation, although the quantita- 
tive results should be taken only to lowest order in the 
anharmonicity parameters. 

We shall keep only Eq. (1) and its Hermitian conju- 
gate and consider their sum to be the anharmonic 
interaction and drop the rest of the terms in Eq. (2). 
The principal justification for this is that it allows an 
exact solution to be easily obtained, but it is possible to 
argue as follows that at least some of the dropped terms 
are unimportant. The anharmonic interaction is large 
only when the displacements are large, and the displace- 
ment of any atom in the crystal due to the excitation of 
a single phonon is proportional to VW”, where N is the 
total number of atoms. On the other hand, the localized 
mode displacements do not vanish as one lets N>, 
but approach a finite limit which decreases more or 
less exponentially with distance from the impurity 
atom. It might therefore be reasonable to expect that 
the most important terms in the expansion of Eq. (2) 
are those which involve the creation or annihilation of 
one or more localized modes. Most of the terms in the 
expansion of Eq. (2) do not satisfy this criterion. 

We therefore take 

H'=hFht (3) 


as our anharmonic interaction. The basic Feynman 
diagram is Fig. 1 and the time-reverse of it, and we have 
essentially defined a Lee model®® of the anharmonic 
crystal. 

The total Hamiltonian for this model is 


H = H ot 1 ? (4) 
where Ho is the harmonic Hamiltonian 


H=} ho,(n,-3) 5 


1, 0410; . 


Il. EIGENSTATES AND EIGENVALUES 


The ground state of H is the same as that of Ho, and 
the one-phonon states are identical too. It is only when 
we get to the one localized mode state and the states 
containing two phonons that the effects of H’ begin to 
be felt. 

We therefore consider the localized mode state, which 
is mixed by H’ with two phonon states, and let 


V — Cibi >, Cini; (5) 


where the C's are constants subject to a normalization 


condition 
[Cx] *--X2| Cis]? 1, (6) 
im 


8T. D. Lee, Phys. Rev. 95, 1329 (1954). 
» V. Glaser and G. Kallen, Nucl. Phys. 2, 706 (1956). 
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and | 
p= Q4 bo, 
-1/2 (7) 
Pim = 2 Ju O1. o , 
with p the ground state, and k always referring to the 
localized mode, J, m to phonons in the quasicontinuum. 
By requiring that Eq. (5) be an eigenstate of H with 
eigenvalue Æ one immediately obtains 


(E— e)Cx— 21 P 2 Bin Cin , 
lm (8) 
(E— €in) Cin 21 PBiimC k ) 


where e, and e, are eigenvalues of Ho with ; and Pim, 
respectively. Therefore unless E happeus to coincide 
with one of the ems or C; vanishes we must have 


[Bimli 
i-o PR = (9) 


Im [;— Elm 


which has as many solutions E= Æ, as there are terms 
in the sum, plus one. Equation (9) is very similar to 
that obtained by Glaser and Killen’ in their investiga- 
tion of a Lee model? for an unstable particle. A graph 
of the right- and left-hand sides of Eq. (9) appears in 
Fig. 2. The intersections define the set of eigenvalues Ær. 

Certain features of Fig. 2 deserve comment. One is 
that for a given size crystal it is always possible to have a 
single negative eigenvalue Ej for the anharmonic coef- 
ficients Bj, large enough. However, because as it is 
defined in Eq. (1) Brim should decrease as NV!” in order 
to correspond with Eq. (2) with C;; independent of N; 
the point of appearance of the negative eigenvalue as 
a function of anharmonic-force strength will be inde- 
pendent of crystal size for large crystals. The negative 
eigenvalue appears only for very large Bi. Its exist- 
ence is due to the fact that our Hamiltonian is not posi- 
tive definite. Another feature is that an eigenvalue 
pops up out of the quasicontinuum, reminiscent of the 
appearance of the localized mode in the harmonic case. 


Fic. 2. Graphical solution of E r ymptote 
E q. (9). The lowest asymp. 
will generally be at an energy of order fwz/N, the highest slightly 
below 27ior, where or, is the maximum phonon frequency. 
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This state appears at a smaller, but still unphysically 
large, value of the anharmonic coupling. Like the 
negative eigenvalue, the critical-coupling parameter at 
which the positive energy state appears becornes in- 
dependent of crystal size for large crystals. 


III. COMPLETENESS 


Since it is our purpose to study the evolution of the 
localized mode state as anharmonic forces are intro- 
duced, it behooves us to be sure that the set of states 
given by Eqs. (5) and (8) really spans that part of the 
Hilbert space which can be reached by operating 
repeatedly with the anharmonic Hamiltonian on the 
harmonic localized mode state ,. To do this it is suf- 
ficient to show that 


25 WW, =B,.0F +2 6,5, (10) 
` lm 


where Y, is an eigenfunction of the Hamiltonian [ Eq. 
(4) ] with eigenvalue £y [a root of Eq. (9)], and which 
has the form of Eq. (5). The calculation is straight- 
forward and proceeds as follows. Upon substitution of 
Eq. (5) into (10), we obtain 


E Wy =T CLAC ADD 
X 5 


+> Ga (A)Cimt (A) BiB tmnt 


im 


+ 2D Crm (A)Cimt (br ibit 


Um!’ Alm 
+> Cg (A)C imt (A)®.Bint 
lm 
HE Cim(A)Cet(A)Pimeb', (11) 


Um 


and illustrate by calculating the coefficient of &,®,!. 

We introduce a function C,(Z) of the complex 
variable E, which has the property that it coincides 
with C;(X) at E= E, and is analytic in a small region 
surrounding each E. Then 


EO0)-— | C&(GE)C (Ed InD(E), (12) 


where 
D(E)- E- &—F(E), (13) 
and Bul? 
F(E)=25 — —, (14) 
tm E— €im 


The contour C encircles the roots of D(E) in a counter- 
clockwise sense, and excludes any singularities of C,(E). 
Equation (12) follows from the fact that the eigenvalues 
Ex are the roots of D(E). Now, from Eqs. (6) and (8) 
We see that to within a phase factor 


CX(E)- [17 F' (C217 (15) 


ANHARMONIC CRYSTAL A967 


has the properties we require for C4(E), and therefore 


1 dE 
EGNEN | .— 1, (6) 
^ 2ri J c E— e, — F(E) 


which follows by expanding the contour to an infinite 
circle. Using some care in excluding the singularities 
of the C's, the other coefficients in Eq. (11) can be 
similarly shown to be unity or to vanish, and Eq. (10) 
follows. 


IV. LIFETIME OF LOCALIZED MODE 


If we imagine that in the anharmonic crystal we can 
somehow excite the harmonic localized mode, its life- 
time may easily be calculated. To do this it suffices to 
calculate 

(E (0),v (0) — (bv (1), (17) 
with 
Y (D —55 CA) a iE, (18) 
Li 


and the C's chosen so that 
Y (0)—9,. (19) 

Equation (19) implies that 
CA)=C A), (20) 


and the scalar product of Eq. (17) is easily seen to be 


1 1 

Y (0), (D) - — | EdE. (21 
eO | ccpit @ 
Because of the presence of the exponential we can't 
evaluate the integral as easily as before, and must 
carefully study the denominator in the integrand. 

If we are interested in times short compared to the 
longest periods of vibration of the crystal, then we may 
as well consider the infinite crystal. Therefore, we study 
the behavior of F(E) in the limit N —<. It will become 


an integral 
her p(k e)de 
FG | (22) 
9 JC 


with p(k,e) real and positive, of form similar to that 
encountered in many fields, but in particular in the 
theory of lattice-vibration spectra. F(Z) has acquired a 
cut in the locus of the poles it had before N +=; the 
poles, in other words, have become" dense along the real 
axis from 0 to 2/e;. By analogy with similar integrals 
which have been calculated in other connections, we 
can sketch (Fig. 3) the real and imaginary parts of 
F(E—i6), where ô is an infinitesimal positive number. 
The qualitative features of Fig. 3 which are certainly 
independent of the crystal model used are the general 
behavior of the real part for E«0 and E> ?2žwz; the 
fact that the imaginary part is positive for 0« E<2hw, 
and vanishes outside this range, and the rapid increase 
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(a) (b) 


Fic. 3. Real (a) and imaginary (b) parts of F(E—38) for 
N — e. The dashed line in (a) is E— ex. Notice that, in this model, 
for a very large anharmonic term, the localized mode (defined now 
by the intersection) is an eigenstate, since then the imaginary 
part of F vanishes. For a realistically small anharmonic force, 
however, the intersection will not be far removed from e. 


in general of the absolute values of the real and imagi- 
nary parts of F(Z) as E increases from 0 to 2ñwz. This 
latter follows from the rapid increase of the density of 
states of two phonons which is a factor in p(k,e). 

D(E) and hence the denominator of the integrand of 
Eq. (21) can never have any roots but real ones, be- 
cause of the Hermiticity of the Hamiltonian. We can 
therefore contract the contour around the cut and obtain 


( (0) (2) 
1 per h(E)dE 
=| on ?— MEOS) 
T LE— e«—g(E) J-- (E) 


where we have set 
F(E—418) 2 g(E)-- iR (E). (24) 


The form of Eq. (23) is such that if 4(E) is small and 
g(Z) is slowly varying in the neighborhood of Ey where 
Eo is chosen such that 


Eo— e&—g(£9) —0, (25) 
then it becomes approximately 
(v (0) (D) — e re (26) 
with 
T/2=h(Eo)=mp(k, Eo). (27) 


This is the same result which would be obtained from 
first-order perturbation theory, except there Eo would 
be replaced by ex. To obtain a numerical answer from 
Eq. (27) one needs first to find out what Bitm is implied 
by a reasonable form of C;; in Eq. (2); this is not a 
trivial task because ^t involves finding the eigenvectors 
of the secular matrix of the imperfect harmonic crystal. 


V. CALCULATION OF F(E) 


We shall now take a simple model for the harmonic 
crystal with a single impurity, for which we already 


"know the eigenvectors,” and calculate F (E) as expressed 


in Eq. (14). 
V. M. Visscher, Phys. Rev. 129, 28 (1963). 
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A simple and perhaps fairly reasonable form for the 
anharmonic energy is 


H' =b(x000—2100)?— b (4:000— 3100)? , (28) 


where «;;, is the displacement in the x direction of the 
atom in the (ijk) simple cubic lattice site from its equi- 
librium position. In the model that we are using mo- 
tions in the 3 Cartesian direction are independent and 
separable, so Eq. (28) represents a general centra] 
cubic anharmonic interaction between the impurity at 
(000) and its nearest neighbors. For positive b, H” gives 
rise to an extra repulsion for small interatomic distances, 
an extra attraction for large ones. It can be thought of as 
representing a physical situation where the impurity 
is a misfit in an otherwise perfect harmonic crystal. 

If we now expand *;;, in terms of the creation-an- 
nihilation operators @,', a, according to 


Xp >, (Cia. -- Curae) =D, Xx, 29) 


(where the sum is over the normal modes of the im- 
perfect harmonic crystal) and substitute into Eq. (28), 
we can pick out the terms having the form of those 
contained in 7 [ Eq. (1) ]. 


h=3b $^ [ (Ci,o00— C1,100) (Cm, 000— C m, 100) 


l,m 


— (C1,000— C 1,190) (Cm,000— C m,—100) ] 
X (Cx,000'—Cz,100')aitamta,. (30) 


Here, as before, / and m refer to normal modes in the 
quasicontinuum of the imperfect harmonic crystal 
(phonons), and k refers to the localized mode. 

In arriving at Eq. (30) we have made use of the fact 
that the localized mode is always an even excitation of 
the lattice about the impurity position, i.e., Cz,100 
= Cr, 100. If we were to use boundary conditions such 
that the phonons, too, are either even or odd with re- 
spect to the impurity position, then the contributions 
to the sum in Eq. (30) will have (Z even, m odd) or 
(l odd, m even). Thus 


Bim — 6b (C1,009— C1,100) (Cn,—100— Cm,100) 
s (C, 001 — C 100!) aitam 1a (31) 


for | corresponding to an even mode; Bim vanishes 
otherwise. 

Specializing now to the case of a mass impurity (no 
change in the harmonic forces connecting the impurity 
to its neighbors) we can calculate the C's. 

Equation (5') of Ref. 10 may be written 


Kijk = £ijk, 000 0c0 5 X000" , (32) 


where g*=g(w,”) is given there in Eq. (23^), and ôa 
is the difference between the impurity and host mass. 
Equation (32) implies that the frequencies of the normal 
modes for which the displacement of the impurity atom 


|- 


|- 
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docs not vanish are given by the roots of 


r (33) 
and that the C's of Eq. (31) satisfy 


£000,009 ĝuas = 


Cia (gao*/ £oo")C 2,9 (34) 
for the even modes, and for the odd modes the C's are 
the same as for the perfect crystal. In Eq. (34) we have 
denoted the triplet (ijk) by a. For the even modes, then, 


a complete specification of the C's awaits only a normal- 
ization condition, which, upon requiring that 


[hata ] — —thbaa’ (35) 
and 


[25,251 ]— ôs: ? (36) 


can be straightforwardly shown to be 


i h ôu f 
Csal’ — que? / [= Ge) qu ]. (37) 
2 7 H 


wsh 
Then, if we use the fact that 
i £095— £195 — 2 (we/wr)?g00°-+ (2/uor?), (38) 


which can be verified from the forms of the g's given in 
Eq. (25’) of Ref. 10, Eq. (31) can be rewritten 


| Butm|?= (240)*(u'/ p)*k*I* | Cx, o|*| Cz.e|? 
x |Cm1—Cm,1|?, (39) 


where u'— u4-óp and /=w,/wr, k—ox/er, and where m 
refers to an odd mode in which the impurity is located 
at a node. It then follows that Cm.a is not different from 
that for the perfect lattice, and therefore 


|Cm,1—Cm,-1|?= (4/2 uNom)4 sinoz, (40) 


where JV is the number of atoms in the crystal and ez" 
is the x component of the wave vector of the mth 
phonon. 

We are now in a position to write down an integral 
form for the sum in F(E), Eq. (14). The sum over / 
can be replaced by an integral as follows: 


1 h dor 
ZIC, 9|? — imn 
2ri J c 21 (goo!) *— One 
TE ie 
2my Jo 


where the contour encircles the zeroes of the denomi- 
nator in the range 0<w:<wz, and the second step is 
achieved upon taking the limit N > and Ee. 
the contour about the cut which then appears from 
to or. 


Imgw'(er—1e) 
UE UAR ECT EEUU 
so Ge uwt Rego) T (ôu? Imgoo')* 


— 
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Fic. 4. Real and imaginary parts of 2-go0(w). 


and y is the harmonic force constant. The sum over m 
can be similarly expressed: 


h h re 
=> —— Í dos Go(m) , (43) 
m 2uNos, — 2my Jo 


where [ 
Go(m) = 2er Ymgos(o,2— ie). (44) 


Now, if we take E to be removed from the real axis, we 
can write 


24bfio :N? / uv 4 
P(E) =2( ) (5) k*1 C; ol? 
TY H 


f didml*G(1)Go(m) (sin? e," 
0 


E— €1— En 


» (45) 


where the bracket denotes an average over all modes 
with energy em. The dimensionless function G(/) has 
been calculated numerically in Ref. 10; its shape for - 
various impurity masses is exhibited in ‘Fig. 9 of th 


and imaginary parts of goo. |Cz,o|? is, as can be see 
Eq. (29), just the contribution of the localized mode 
the mean-square displacement of the impurity; t 
has also been calculated in Ref. 10. We can, 

quantitatively, identify ids 


bCzo/y—r 


as the ratio of the anharmonic potential energy 
impurity atom to its harmonic energy site he 
ized mode is excited. 

The imaginary part of F(E), according to E 
is equal to the half-width of the localized mode 


NA 


ra- MB)- Quy (s 
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An approximate numerical integration” for the case 
where E=1.12hw,, which is the harmonic localized 
mode energy for this model if p'— 0.5 p, yields the result 
that the integral in Eq. (47) is about 5X 1077. For this 
case, then, we find 


T/2z0.2horr^. (48) 


Because of the fact that r is identified with the an- 
harmonic interaction of an isolated impurity atom with 
its neighbors, it is difficult to make a numerical estimate 
for it. The usual parameter of anharmonicity, the 
Griineisen constant,” is measured for interactions be- 
tween atoms of the host crystal. But if we associate 
these quantities with each other, we find that a reason- 
able Griineisen constant (of the order of unity) for a 
crystal implies that r for the force between each atom 


11 An analytic approximation can be made in the limit of small 
impurity mass. 

12See, for example, P. G. Klemens, in Solid State Physics, 
edited by F. Seitz and D. Turnbull (Academic Press, Inc., New 
York, 1958), Vol. 7, p. 1. 
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and its nearest neighbors is of the order of 10-2. There- 
fore, from Eq. (48) we see that the order of magnitude 
of the ratio of the half-width to energy of the anhar- 
monic localized mode is 1074. Within the limitation of 
the calculations, this result is not in disagreement with 
the widths obtained in the references cited above.9* No 
careful quantitative work will be done here, because we 
do not believe the model we use to be realistic enough to 
warrant it. The significance of the present work lies 
more in the form than in the magnitude of the answers, 

The real part of F(Z) is just Eq. (45) with the inte- 
gral replaced by a principal value integral. It will be of 
the same order-of-magnitude as the imaginary part; 
hence the shift of the localized mode (downward in 
energy for frequencies close to the phonon cutoff) is 
comparable to its width. 
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Energy Bands for the Iron Transition Series* 


: L. F. Matruetsst 
Solid-State and Molecular Theory Group, Massachusetts Institute of Technology, Cambridge, Massachusetts 
(Received 16 December 1963) 


Preliminary energy-band calculations for elements of the iron transition series have been completed 
using the augmented plane-wave method. The results include plots of energy as a function of wave vector 
along a line of symmetry for elements crystallizing in the face-centered cubic (Ar, Co, Ni, Cu), body- 
centered cubic (V, Cr, Fe), and hexagonal close packed (Ti, Zn) structures. These results indicate the 
presence of systematic trends in the band structures for the various elements and provide some justification 
for the application of the rigid band model to transition metals and their alloys. 


I. INTRODUCTION 


NERGY bands have been calculated for a 
majority of the elements in the iron transition 
series using the augmented plane-wave (APW) 
method.12 While the present results are preliminary in 
nature and not in any sense complete, they may be of 
some interest to experimentalists and theoreticians who 
are concerned with the electronic structure of the 
transition-series elements. The present results represent 
energy bands for three.different crystal structures, with 
a variety of lattice constants. Despite the detailed dif- 
ferences that are imposed by symmetry requirements 
and variations in lattice constants, the results suggest 
some interesting and rather clear-cut trends in the band 
structure of these elements as one proceeds through the 


* This work was supported by the National Science Foundation. 

1 Presently employed at Bell Telephone Laboratories, Incorpo- 
rated, Murray Hill, New Jersey. 

1 J. C. Slater, Phys. Rev. 51, 846 (1937). 

2 T. H. Wood, Phys. Rev. 126, 517 (1962). 
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transition series. These calculations lend some support 
to the rigid band model for the transition series. They 
support the hope that systematic studies of the band 
structure of the transition-series elements can provide 
useful qualitative, and perhaps quantitative, informa- 
tion concerning their electronic structure. 

As in all calculations involving d electrons, the results 
are sensitive to the choice of potentials. The crystal 
potentials used in these calculations were all constructed 
in an analogous manner, and were approximated by a 
superposition of atomic potentials. The method involves 
the use of Hartree-Fock solutions to the corresponding 
atomic problem? and the free-electron-exchange ap- 
proximation.’ The details of this method for construct- 
ing approximate crystal potentials have been described 
earlier? though a brief resumé is presented in Sec. II of 


*R. E. Watson, Phys. Rev. 119, 1934 (1960); R. E. Watson 
and A. J. Freeman, Phys. Rev. 123, 521 Qe 
J- C. Slater, Phys. Rev. 81, 385: (1951). 
L. F. Mattheiss, Phys. Rev. 133, A1399 (1964). 
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ENERGY BANDS FOR Fe 
this paper, along with other information pertaining to 
this present series of calculations. The energy bands 
along a single line of symmetry in the appropriate 
Brillouin zone are presented in Sec. III for A SEES 
Cr, Fe, Co, Ni, Cu, and Zn, while the last section 
contains a brief discussion of these results. 


II. DESCRIPTION OF THE CALCULATIONS 


In these calculations, the crystal potential has been 
approximated by a superposition of atomic potentials. 
The Coulomb and exchange contributions to the crystal 
potential are treated separately. An approximate 
crystal Coulomb potential and charge density in a given 
atomic cell is obtained by expanding the neutral atom 
Coulomb potentials and charge densities of neighboring 
atoms about the origin, using Lówdin's alpha function 
expansion, keeping only the /—0 or spherically sym- 
metric terms in these expansions. Using the free- 
electron-exchange approximation, the exchange po- 
tenial is proportional to the cube root of the super- 
imposed atomic-charge densities. 

The potentials obtained by this method are generally 
rather flat near the boundaries of the atomic cell, at 
least in the case of metals, so they are readily approxi- 
mated by a “muffin-tin” type potential, as required by 
the APW method. The constant value of the potential 
outside the APW spheres is taken as the average value 
of the potential in this region. This usually results in a 
discontinuity in the potential at the sphere radius 
amounting to a few hundredths of a Rydberg. 

In the construction of approximate crystal potentials 
for transition-series elements, there is frequently some 
ambiguity in choosing the most reasonable atomic con- 
figuration. This sort of difficulty can only be answered 
satisfactorily by experimental information and/or self- 
consistent energy-band calculations. For the present, 
we have been content to study the effect that changing 
the atomic configuration has on the band structure. In 
addition, there are magnetic effects which create addi- 
tional complications in this series of elements. F or sim- 
plicity, all magnetic effects have been neglected in these 
calculations, and the crystals have been assumed to be 
nonmagnetic in character. xe 

The lattice constants which have been used in these 
calculations have generally been the room-temperature 
values as tabulated by Pearson.’ The exceptions are 
those for Ar and Zn. The lattice constant for Ar is E 
low-temperature value obtained by Dobbs and Jones. 
In the case of Zn, Harrison? has extrapolated the room- 
temperature lattice constants to low er 
the results are expected to be sensitive to the choice o 


op öwdi . Phys. 5, 1 (1950. — 

"WB. Pearson, A Handbook of Latice Spacings and Smee 
of Metals and Alloys (Pergamon Press, Inc., New Yors, Physics, 

* E. R. Dobbs and G. O. Jones, in Reports on Progress im Sar 
edited by A. C. Stickland (The Physical Society, London, 4254, 


Vol. 20, p. 516. 
ow. ns Harrison, Phys. Rev. 126, 497 (1962). 


TRANSITION SERIES 


TABLE I. In this table, we list the elements of the iron transition 
series, their structures, the lattice constants used in this series of 
calculations (in atomic units), and the assumed atomic 
configurations, 


Element Structure a(au) c(au) Configuration 
a fcc 10.0346 (3s)2(3p)* 
Ca 
Sc 
Ti hep 5.5755 8.8503 (3d)3 (4s)! 
V bcc 5.7225 (3d)*(4s)! 
Cr bec 5.4512 (3d)* (4s)! 
Mn 
Fe bec 5.4168 (3d)? (4s)! 
Co fcc 6.6975 (3d)* (4s)! 
Ni fcc 6.6590 (32)9 (4s)! 
Cu fcc 6.8309 (3d) (45) 
Zn hcp 5.0120 9.1453 (3d)'* (4s)? 


c/a ratio. For purposes of comparison, his values have 
been used in these calculations. Table I contains a 
summary of the elements considered in these calcula- 
tions, their structures, the values of the lattice con- 
stants, and the assumed atomic configurations. 


IIl. RESULTS 


The principal results of these calculations are pre- 
sented in Fig. 1. These results represent plots of energy 
as a function of wave vector along lines of symmetry 
from the center to a boundary of the appropriate 
Brillouin zones. For the face-centered cubic structure 
(Ar, Co, Ni, and Cu), the bands are plotted from T 
along the A direction to the point X, using the notation 
of Bouckaert, Smoluchowski, and Wigner." In the body- 
centered cubic structure (V, Cr, and Fe), the bands are 
plotted from T along the A direction to the point H. 
Finally, in the hexagonal close-packed structure (Ti and 
Zn), they start at T and proceed along the line T in the 
k:=0 plane which terminates at the point K, one of the 
vertices of the hexagon (in the notation of Herring). 

The energy is in Rydbergs and the wave vectors for 
the different elements are drawn to scale for purposes 
of comparison. The horizontal dashed lines represent 
rough estimates of the Fermi energy for each element. 
For simplicity, some of the more highly excited bands 
have been omitted in some cases, particularly in the 
face-centered cubic structure, or in other situations, 
they have been sketched in by dashed lines. 

In Fig. 2, it is shown what effect-varying the atomic 
configuration has on the band structure of a typical 
element, namely vanadium. The bands to the right are 
the ones shown in Fig. 1 for vanadium; the ones to the 
left are those obtained from a potential which results 
from an atomic configuration containing an additional 
4s electron and one less 3d electron. In general, this 
results in a narrowing of the 3d band and a decrease in 


YL. P. Bouckaert, R. Smoluchowski, and E. Wigner, Phys. 


. 50, 58 (1936). 
RNC JE J. Franklin Tnst. 233, 525 (1942). 


CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar _ 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


A972 i. I, ING ad BS WSS 


CHROMIUM 


VANADIUM 


9 
3 
w 
8 
z i 
z 
w 
o 0.575 S315) o 0.58 1.16 
COBALT NICKEL COPPER 
(fcc) (fcc) (fcc) 

[7] 

o 

[4 

ul 

o 

[*] 

> 

x 

z 

wW 

» 


o 0.47 0.94 o 0.47 


priate Brillouin zone. For the face-centered cubic structure, the bands are plotted from T along A to the point X. For the body-centered 
cubic structure, they are plotted from T along A to the point H. Finally, for the hexagonal close-packed structure, they are plotted fro. 
T along T to the point K. The energies are in Rydbergs and the wave vectors are in atomic units. | 


Fic. 1. Energy bands for Ar, Ti, V, Cr, Fe, Co, Ni, Cu, and Zn as a function of wave vector along a line of symmetry in the appro: P 


the energy separation between the top of the 3d band same crystal structure. This seems to lend some support 
and the bottom of the 4s-4p bands. to the rigid band model for the transition-series ele- | 
i ments, an approximation which has been of considera; | 
IV. DISCUSSION ble value in understanding the electronic properties 0f ' 


E : these elements and their alloys. 
— Tn the simplified picture of the energy bands for the 1 y 
iron ME nosse elements, one finds a narrow 3d There is a gradual narrowing of the 3d band as one 
: band in the midst of a rather broad 4s-4p band. The Progresses through the series. This effect was dT 
- width of the 3d band and especially its position relative by Slater in order to explain the occurr ie a e. 
. £o the bottom of the 4s-4p band depend rather critically Magnetism in the latter part of the senes." n E 0.5 
EU potential. Nevertheless, the results of Fig. 1 from Cu to Zn, the 3d band suddenly drops about ": 


exhibit a reasonably smooth variation from element to Ry below the bottom of the 4s-4p bands, and its widt 


ent, especially for those substances having the ^ 7.C. Slater, Rev. Mod. Phys. 25, 199 (1953). 
M 7 
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decreases to less than 0.1 Ry, As a result, the energy 
bands for zinc are very free-clectron-like. For those 
elements where the 3d band falls in the middle of the 
45-45 bands, the interactions between states having the 
same symmetry causes considerable modification to 
the free-electron bands, though at points of symmetry, 
the effect is sometimes small. The bands for Ti and 
Zn demonstrate this effect nicely. 

The results of Fig. 2 emphasize the uncertainty which 
is inherent in any energy-band calculation for a tran- 
sition-series element. These uncertainties have been 
pointed out previously in the literature, particularly by 
Callaway.? These difficulties can only be cleared up 
satisfactorily with the aid of more detailed experimental 
information regarding the band structure of these ele- 
ments in addition to self-consistent energy-band 
calculations. 

The results presented here are not complete enough 
to permit detailed comparisons to be made with experi- 
ment or a discussion of the resulting Fermi surfaces. 
However, there are some striking similarities between 
the energy bands shown in Fig. 1 and the results ob- 
tained by earlier calculations. In particular, there is 
good qualitative agreement between the Cu results 
shown in Fig. 1 and the bands calculated by Segall! 
and also by Burdick.!5 Similarly, the results for Fe are 
in good agreement with the published results of Wood.? 
The agreement for Ar with the results of calculations 
by Knox and Bassani!® is good, and has been described 
previously. In the case of Cr, it is difficult to compare 
the present results with those of earlier calculations by 
Asdente and Friedel!’ since they neglect the inter- 
actions between the 3d band and the 4s-4p bands. How- 
ever, the present results do justify, to some extent, the 
treatment of Cr by Lomer,!® who used the results of 
Wood’s iron calculations to discuss the energy bands in 
antiferromagnetic Cr. 

In the case of Zn, the ordering of levels is identical 
with that obtained by Harrison? This ordering differs 
from that obtained in earlier calculations for hexagonal 

13 . 99, 500 (1955). 

«E. Segall Hoo Rev 125, 109 (062). 

!5 G. A. Burdick, Phys. Rev. 129, 138 (1963). 

!5 R. S. Knox and F. Bassani, Phys. Rev. 124, 652 (1961). 


MC d J. Friedel, Phys. Rev. 124, 384 (1961). 
ww Arras and T Plus Sec (London) $0, 439 (1962) 
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Fic. 2. Energy bands for vanadium along the line A obtained 
from two different potentials. The bands to the left resulted from 
an atomic configuration of (3d)3(4s)? while those to the right in- 
volved a (3d)*(4s)! configuration. 


close-packed metals by Herring and Hill for Be!? and 
Falicov for Mg.” This change in ordering might be due 
to the presence of an occupied 3d band just below the 
4s-4p bands. The results for Ti agree qualitatively with 
those obtained by Altmann and Bradley.” Finally, the 
Ni results are in good agreement with those obtained 
by Hanus.” 
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The steady-state properties of transverse circularly polarized waves propagating along a static magnetic 


field in a uniform plasma (metal) are considered. Using 


the Landau theory of Fermi liquids, we compute the 


effect of correlations on.the reflection properties of a semi-infinite metallic plasma. The expressions are 
numerically evaluated and discussed for a range of parameters pertinent to the alkalis. 


I. INTRODUCTION 


Tias properties of electromagnetic waves propa- 
gating in a plasma along a magnetic field have 
been extensively studied.—-? In previous calculations 
the electrons were treated as the current carrying con- 
stituent of a noninteracting gas. For real metals the 
a priori neglect of electron-electron interaction effects 
is unsatisfactory since the mean energy of interaction 
of the electrons (Coulomb energy) is of the order of 
their mean kinetic energy. Under such conditions sub- 
stantial correlation effects in the motion of the electrons 
may be expected. From a practical point of view, how- 
ever, it is known that for a wide range of physical 
parameters, treatments which neglect explicit correla- 
tion effects provide an accurate description of the elec- 
tromagnetic properties of metals and semimetals. We 
are interested in determining the range of physical 
parameters, frequency, external magnetic field, carrier 
density, etc., where the effect of electron-electron cor- 
relation may, hopefully, be measured experimentally. 

In 1956, Landau* constructed a semiphenomenologi- 
cal theory of a system of fermions, such as Hes, inter- 
acting via a short-range two-body force. This work was 
later extended by Silin® to a Fermi liquid with long- 
range Coulomb interactions. We make use of the Fermi- 
liquid theory to compute the reflection properties of a 
semi-infinite slab of metal placed in a magnetic field 
oriented perpendicular to its surface. In this geometry 
for electromagnetic waves incident normally, it is 
known? that the metallic sample will exhibit an ab- 
sorption edge at approximately the Doppler shifted 
cyclotron frequency. 


w./o=1+ (g/g) (V v/0) , (1) 


where we is the electron cyclotron frequency, q is, 
crudely speaking, the wave number of the electro- 
magnetic field in the medium, go the wave number in 
free space, and Vr is the Fermi velocity. The resonance 


1 P, B. Miller and R. R. Haering, Phys. Rev. 128, 126 (1962). 

2P, M. Platzman and S. J. Buchsbaum, Phys. Rev. 128, 1004 
(1962). 

3P. M. Platzman and S. J. Buchsbaum, Phys. Rev. 132, 2 
(1963). 

4L. D. Landau, Zh. Eksperim. i Teor. Fiz. 30, 1058 (1956) 
[English transl. : Soviet Phys.—JETP 3, 920 (1956) ]. 

5 V. P. Silin, Zh. Eksperim. i Teor. Fiz. 33, 495 (1957) [English 
transl.: Soviet Phys——JETP 6, 387 (1958) ]. 


in the electrons motion which exhibits itself as a rapid 
fluctuation in the surface impedance of the sample is 
shifted from the value w,/w=1. This comes about since 
the electrons traveling with the phase velocity of the 
wave see a Doppler shifted frequency. It is these elec- 
trons which interact most strongly with the wave. As a 
result the resonance in the impedance is shifted. Speci- 
fically we are interested in possible shifts in the position 
of, and modifications of the shape of the Doppler shifted 
electron-cyclotron resonance as the result of electron- 
electron interactions. 


II. CALCULATION 


'The basic assumption of the Fermi-liquid theory is 
that it presupposes that as the interaction is turned on, 
the single-particle states in the neighborhood of the last 
occupied one remain approximate eigenstates of the 
interacting system, and that there is a one-to-one cor- 
respondence between these states and the single-electron 
states of the noninteracting Fermi gas. These approxi- 
mate eigenstates are called quasiparticle states. For 
slowly varying external disturbances, the transport 
properties of the Fermi liquid are completely described 
by the quasiparticle distribution function f(P,X) in 
momentum and configuration space. (The external dis- 
turbance varies slowly enough in space so that the lack 
of commutivity of P and X is unimportant.) In equi- 
librium and at zero temperature the distribution of 
quasiparticles fo(P,X) is 


fo(P,X)=1, |P] «Pr; 


zx). | P| 2 Pr. Q) 


For the spherically symmetric electron gas the quasi- 
particle energy is ES(P)— P?/(2m*), where m* is an 
effective mass whose value depends on the dynamics of 
the interaction between quasiparticles. In nonequilib- 
rium situations, the quasiparticle distribution function 
satisfies a transport equation similar to the Boltzmann 
equation. We are interested in the transport equation 
in the linear approximation, i.e., when the deviation 
from fo is small. If we write f=fot/i, with 


*'The distribution function f(P,X) is a matrix in the spin 
variables F(P,X) — f(P,X)-I-m(P,X)-o. The function m(P,x) 35 
pertinent only when quantities which depend on the spin (the 
Spb ity) are of interest. We will be concerned only wit 
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fi=6LE(P)—Ev |g the linearized transport equation 
for g becomes 


ðg/It+V: Vx(g+E1)+ (e/c) (V x H)- Ve(g-- E) 
—eE-V=—yg. (3) 


The quantity v, is a phenomenological short-range col- 
lision term which permits the system to relax to equi- 
librium; V.=0E°(P)/dP.= P,,/m* is the velocity of the 
quasiparticles and 


E- if S(P,P’)é[Er—E°(P’) ]g(P^ X) 
X[dGP'/Qxhy] (4) 


is the change in energy of the quasiparticles produced 
by a change in their distribution function. 

The correlation, or interaction function S(P,P") is 
the basic quantity characterizing the Fermi liquid. 
Owing to the presence of the delta function in the dis- 
tribution function fo and in the transport equation, the 
value of S(P,P") is only of interest for | P| =| P’| = Pr, 
where Pr is the Fermi momentum. S(P,P’) may then 
be considered to be a function of the angle between P 
and P'. It is conveniently represented by an infinite 
sequence of Legendre polynomials. 


S(P-P')  Y:, S.P, 9 (cos) , (5) 


where P is a unit vector in the direction of P. For con- 
venience, we define a set of dimenionsless quantities 


o= Som? P p/ (2r) h 
à, [Sum*P r/ Qu) JL (n—1)V/ (4-1) ]u2 1. 


The correlation function S(P-P’) contains all the in- 
formation pertaining to explicit dynamical many-body 
effects. The limit S — 0 is the free electron or Hartree 
limit. In this limit the transport equation, Eq. (3) 
becomes the usual Vlasov equation and the only “cor- 
relation" effect which is included is the self-consistent 
electromagnetic field. 

The correlation function is as important to the many 
electron problem as the shape of the Fermi surface is in 
the one-electron problem. Unfortunately, the function 
S(P-P") (for the region of metallic densities), cannot 
be computed from fist principles." Hopefully then a 
transport experiment of the kind to be considered here 
might enable one to measure the characteristics of the 
scattering function S(P- P"). Ideally one would like to 
perform an experiment in which a specific moment ij 
the scattering function is measured. It was first bone 
out by Landau‘ that the zeroth and first moments of e 
Scattering function were simply related m pu T F 
mentary properties of the quasiparticle gas. The 5 o i : 
mass m* of the quasiparticles is related to the irs 


(6) 


1 C. Herring has recently computed the SHE foncion > ES 
a power series in z, (rz, 22n!/5 ay where n = the e Zoe See 
@x is the Bohr radius) for metals r= 2-6 (pri 
tion). 


moment Sı by 
m*/m=1+46, (7) 


and the velocity of ordinary sound 2 is related to the 
zeroth moment So by 


vs= (05- 1) PL P p/ (3mm*)!!?]. (8) 


A cyclotron resonance experiment in the extreme 
anomalous limit, in the so-called Azbel’-Kaner 
geometry,? measures only the effective mass? On the 
other hand, the cyclotron resonance experiment in the 
so-called Galt geometry? involves, as we shall see, all 
of the moments. In the Galt geometry a static magnetic 
field perpendicular to the surface of the sample is 
utilized and the reflection (or absorption) of circularly 
polarized electromagnetic waves propagating along the 
magnetic field is measured as a function of the applied 
field. The solution, neglecting correlations, of the coupled 
Maxwell-Boltzmann equations in the presence of a 
single boundary (assuming specular reflection of the 
carriers at the surface) may be reduced to the solution 
of an equivalent infinite medium problem.’ The final 
expression for the surface impedance Z is 

Z 7 dq 
77 Oils) | ———, 
Zo o [d—«(q9)] 


where Zo is the impedance of free space. The quantity 
€(q,o) is the finite wave number and frequency-depend- 
ent dielectric constant for the infinite medium, 
[e(g,9) — 1H-* (,9)/ ico ]. 

The quantity E, is Eq. (4) is related to a first-order 
change in the distribution function by the relationship 


(9) 


Ey l S(P,P^ X,X)oLEp— E(P’) ]g(P" X^) 
diP'qix' 


y. fü 

(2x yi? “ 
In writing Eq. (4) we have assumed that S(P,P’,X,X’) 
=5(P,P’)é(X—X’). With this assumption the spatial 
part of the problem is identical with that in the non- 
interacting case. For an interacting gas of quasiparticles 
in real materials correlations are not local in space but 
do in fact extend over distances of the order of a Debye 
length (Ap2-V r/wp). For metallic densities this is of the 
order of the interparticle spacing. The Debye length is 
small compared with distances over which the field 
varies appreciably, typically distances of the order of 
the high-frequency classical skin depth (As\c/wp), 
so that to a good approximation we can neglect nonlocal 
effects. With the locality assumption on the function 


8 M. Ya. Azbel, E. A. Kaner, Zh. Eksperim. i Teor. Fiz. 30, 811 
(1956) [English transl.: Soviet Phys.—JETP 3, 772 (1956)]. 
9 This statement has been proved by J. M. Luttinger (private 
ication). 
ony. K. C UD W. A. Yaga, E. R. Meith, Dx B Coal 
A. D. Brailsford, Phys. Rev. 114, 1396 (1959). 
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x 


Fic. 1. Polar coordinate system used for solution of the trans- 
port equation, Eq. (3). H is the magnetic field, g is the propagation 
vector, and P is the quasiparticle momentum. 


S(P,P’,X,X’), the expression for the impedance within 
the framework of the Fermi-liquid theory is stil] given 
by Eq. (9). However, the dielectric constant e(g,o) will 
now include the effects of correlations. 

This dielectric constant is obtained by linearizing the 
transport equation, Eq. (2), with an rf field varying as 
exp(iq:X—wt) and solving for the induced current. The 
current is given by 


ne BP 
I«g)-— I BEE E) (10) 
m 


(2r) h 
or, equivalently, using Eq. (4) 


ne 


I@(q) et P*(g+-E;)6(E 
a = a 5 FW — F). 2 
D=o | Go EDEP- 0D 


m* 


We choose our coordinate system (see Fig. 1) so that 
the static magnetic field H and the propagation vector 
q point along the z axis. The angles 0 and ® are polar 
angles specifying the direction of P. For a right-handed 
circularly polarized wave (E=E,Ux+iE,U,) the 


transport equation for g(P,9) is, 


UEM igP Og 
 —iag(P,q)-+-— cos6g (P,g) + w.*— 
m* Ob 


mp à 
E cossas | 
m* 9o 


x Í PP'/ (2rh)'S(P,P')5(Er—Ep”)g(P’,9) 


ip 
XI sind, (13) 


‘= |eH/m*c| and 6=ot ive. 


En 
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If we set g(P,g) (0b)  g(0)e*, then it is easily 
shown that 


ie EV r“ n (qV r* cos0--o.*) 


g@)= A(0), (14) 
£ 
where V p*=P r/m“ and 
t= (0— w" — qY r* cos0) . (15) 


The quantity A(@) is 


A(8)— X 8nPn™ (cos) 


n=0 


0 
xj (0^) P, (cos0’)d(cos6’) . (16) 


Equation (14) is an infinite set of coupled linear equa- 
tions for the quantities Q,— Jz? P, (cos)g(8)d (cos). 
If we terminate the spherical harmonic expansion for 
S(P-P’) after a finite number of terms, then Eq. (16) 
may be solved self-consistently for the quantity g(0). 
Knowing g(0), we may substitute into Eq. (11) and 
evaluate the current. 

For the purposes of this paper we will include the 
first three moments of S(P-P’) So, S1, and S2. We have 
no real justification for terminating this series after so 
few terms; however, there are at least some crude argu- 
ments which suggest that a Legendre expansion for 
S(£-P’) converges rapidly. Landau has shown that the 
function S(P-P’) is proportional to the negative of the 
forward scattering amplitude of two quasiparticles. A 
typical type of exchange scattering diagram which 
enters into a microscopic calculation of S(Ê- P’) is 
shown in Fig. 2. The dashed line is a bare Coulomb 
line and the solid lines represent electron (hole) prop- 
agators. This type of scattering diagram contributes à 
term to S(P-P’) of the form 


2 


Sea. (P - P") ~ " (17) 
|P/—P |*e; (|P' — P], 0) 


where ez(q,0) is the zero-frequency longitudinal di- 
electric constant. If ez is computed by summing the 


“MING = 


m 


Fic. 2. A typical type of exchange scattering diagram which 


contributes to S(P- P"). Solid lines are electron propagators 2P 
dashed lines represent Coulomb propagators. 
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simple set of bubble diagrams shown in Fig. 2 (RPA 
approximation), we find 


eL (q0) -1-- (qp*/g?)F (q) , (18) 
where F(g)ei for q«&gr and F(q)~} for q=2qr. Using 
this expression for the dielectric constant, we find 

S(0)/5(x) S [1+ 8P z/qp7]. (19) 


For potassium S(0)/S(x)2«0.7 so that on the basis of 
this crude computation the scattering is roughly 
spherical. 

If we truncate Eq. (5) after three terms, solve 
Eq. (16) and substitute into Eq. (11), we find for the 
scalar conductivity 


teda 1445,/3—a8,0* 


(14-43,/3) a 
E "aW" ELE ] , (20) 
[1+ (V*/aW*) (1/ (952) -4/15 —eyW*)] 


where y= (@—w.*)/qV r*, and wp= (4rne?/m)!?. The 
functions V* and W* are defined by 


2 sin?0d(cos0) 
yx : ; (21) 
z (@—w.*—qV r* cos) 


À 9 sin? cosód (cos) 
z (@—w.*—gV p* cos) 


The conductivity, although it is reasonably compli- 
cated, approaches some rather simple and physically 
meaningful limiting values. In the limit g—0, c; 
approaches 

(o) q20=twp?/(G—w.) . (23) 


There are no effects due to correlations in this limit. 
"This is a consequence of the fact that total momentum 
is conserved in electron-electron collisions. The induced- 
current (for a zero wave vector) electric field is not 
affected by the electron-electron interactions. In the 


12 
z Ye /w = 0.03 
o 10 Ve /C = 0004 
A ô,= 0.1875 
3 8 
S 
So 
N 
T 
4 
D 
3 
mo 


wpf% 


ive shift in the 
Fic. 3. A plot as a function of «»/ of the relative s 


Position of the Doppler shifted absorption edge due to correlation 


effects. 
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Fic. 4. A plot of the Doppler shifted absorption edge as a 
function of magnetic field. The plasma frequency, collision fre- 
quency and Fermi velocity are fixed. The four curves show the 
effect of correlation on the edge. The curve labeled (1) has a à1 0 
and a 62=0, i.e., uncorrelated. Curve (2) has à; —0.1875, 6:=0. 
Curve (3) has 5,—0.1875, 5,— —0.166, and curve (4) has 
à; — 0.1875, 3, —0.05. 


limit w/w, — 0, the effect of correlations to lowest order 
in y-/w, vanishes. This is true not only for our truncated 
scattering function, but for a general scattering function 
as well. If we examine the transport Eq. (3) and the 
second expression for the current Eq. (12), then 
neglecting the relaxation term and the time derivative 
term we see that (g-+Z;) satisfies the usual transport 
equation, without correlations. The energy delta func- 
tion in Eq. (12), is just sufficient to produce a factor m* 
canceling the 1/m* in front of the integral leading to 
the uncorrelated conductivity. 


III. EVALUATION AND DISCUSSION OF THE 
SURFACE IMPEDANCE 


Typically, the absorption coefficient of a semi- 
infinite metallic plasma exhibits a rather sharp reso- 
nance at the Doppler shifted resonance frequency." If 
we substitute for g in Eq. (1), the value which is ob- 
tained from a simple analysis which leaves out nonlocal 


effects, i.e., 
qqé-— —o;'/[w(w—w-) J . (24) 


Then we may rewrite the approximate Doppler shifted 
resonance condition Eq. (1) as 


w./o=1+[(V r/c) (w/w) F^. (25) 


At low frequencies, where (w,/wV /c)>1 and w./w>>1, 
the effects of correlations are not observable. At higher 
frequencies, where w,/wVr/c-2w./w~1, correlations 
are important. 

Since the simple resonance condition, Eq. (24), at 
these “high frequencies” is now a function of the “mass” 
of the particles, in this case the bare mass, and since 
correlations do play a role, it seems natural to ask what 
mass comes into play. There are, of course, two masses 
in the problem; the so-called bare mass and the clothed 
mass [see Eq. (7)]. In Fig. 3 we have plotted the 


u J, Kirsch and P. B. Miller, Phys. Rev. Letters 9, 421 (1963). 
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shift in frequency of the Doppler edge from its un- 
correlated value as a function of (wp/w). For fixed values 
of (v./«) 0.03 and V p/c— 0.004 there is a shift away 
from what might be called a bare-mass resonance 
toward higher values of the magnetic field, i.e., towards 
a clothed mass resonance. We have taken 0— j' so that 
the effective mass equals 1.25 bare electron masses (the 
observed effective mass in potassium). The three 
curves in Fig. 3 are for three values of à» (850, 02— $, 
8,— —1). A positive à» tends to decrease the mag- 
nitude of the computed shift in frequency whereas a 
negative 52 increases the shift. The important thing to 
note here is that the resonance occurs at neither the fre- 
quency determined by the bare mass nor by the clothed 
mass. The shift is a dynamical function of the frequency. 
In Fig. 4 we have plotted the actual absorption line 
for fixed values of the plasma frequency wp/w= 100, 
Fermi velocity, collision frequency, and à;. The effect 
of a negative à» on the line shape is rather striking. It 
produces an over-all smearing of the line relative to the 
uncorrelated line. The magnitude of the steeply rising 
portion of the absorption curve is severely reduced. In 
addition, a negative à» causes the absorption edge to 
exhibit a rather prominent minimum. In some cases 
(i.e., for the case 85— —$ or —35) the minimum or hole 
in the absorption edge effectively causes a splitting of 
the edge into two peaks. 

Physically, we can understand the origin of this de- 
crease in absorption. Suppose for the purpose of this 
rather crude argument that the electric field inside the 
metal could be characterized by a single wave number 
g, as it can be in the local or classical theory. The edge 
in the absorption is due to a sharp increase in the con- 
ductivity when w./w~(1+qVr/w). The electrons at 
the Fermi velocity traveling in the direction of the wave 
see a static dc field spiral out around the lines of force 
and pick up energy from the field. Suppose we now ask 
ourselves if it is possible to introduce a mechanism 
which, in this region of increasing conductivity, would 
tend to decrease the induced current or conductivity, 
thus producing an associated dip in the absorption edge. 
Electron interactions can produce just this effect. As 
the quasiparticles (the current carriers in a Fermi liquid 
theory) are dragged through the surrounding, now 
slightly incompressible fluid, they create a backflow by 
pushing other quasiparticles out of their way. This 


13 C. C. Grimes, Bull. Am. Phys. Soc. (to be published). 
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backflow carries current. At a definite wave number q 
and frequency w, it is possible for the backward flowing 
current to exactly cancel the forward flowing current 
so that the conductivity goes to zero. For the circularly 
polarized mode we simply require that 


[eo sind (cosh) —-Q,— O0. (26) 
This implies that 
y* /1 
em ( re ay )-o. (27) 
WNS 15 


For a finite 52 (second moment) it is possible to find at 
least one solution to Eq. (27) (to zeroth order in v./o). 
The dips in the conductivity, which show up in this 
simple model are characteristic of the interacting 
Fermion system. Had we retained more terms in the 
multipole expansion of the scattering function, it is 
likely that there would exist multiple solutions of 
Eq. (27). In the actual boundary value problem, q is 
not well defined and the zero in the conductivity appears 
as a minimum in the absorption curve. If the minimum 
happens to fall in the high field or classical region of the 
curve, then the magnitude of the effect is reduced since 
in this limit g— 0 and there are no effects due to 
correlations. 

The observation of correlation effects in the alkali 
metals, unfortunately, is extremely difficult since the 
magnetic fields required in order to keep the Dopper 
shifted edge in the neighborhood of the cyclotron fre- 
quency are enormous. However, experiments could be 
performed in semimetals, semiconductors, or doped 
insulators. 

One should look for a material, preferably one with 
a simple band structure, which has an 7,> 1. In addition, 
wp/v0r/c~w/w~1 and e/v2»1. The frequency range 
in which one can work will primarily be limited by the 
magnetic fields which are available. 
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The de Haas-van Alphen effect h 


as been observed in a copper single crystal by means of a null-deflection 
torsion balance technique. Detailed measurements have been carried out which provide an 


accurate deter- 


mination of the topology of the Fermi surface necks in copper. Effective mass values are found to be in 
excellent agreement with cyclotron resonance results. Unusual behavior observed in the angular dependence 
of the amplitude of the oscillatory torque is attributed to the spin-splitting factor in the theoretical expression 


for the magnetization. 


I. INTRODUCTION 


HE first observation of the de Haas-van Alphen 
effect in copper by Shoenberg was accomplished 
by means of an impulsive field technique. His detailed 
results! confirmed the model of the Fermi surface 
proposed by Pippard,? and demonstrated conclusively 
that the Fermi surface deviates from sphericity and is 
multiply connected along the (111) directions, forming 
"necks" at the hexagonal faces of the Brillouin zone. 
The precise size and shape of the necks, however, could 
not be ascertained with pulsed magnetic fields. In this 
paper we report on detailed low-field measurements of 
the de Haas-van Alphen (dHvA) effect in copper 
resulting in accurate determinations of the neck topog- 
raphy and the associated effective masses. In addition, 
we report on what we believe to be the first unequivocal 
observation of the vanishing of the amplitude of the 
dHvA oscillations due to the zero value of the spin- 
splitting factor for m*=0.5im (and g— 2). 


II. EXPERIMENTAL 


The measurements reported herein were made with a 
torsion balance apparatus which has been described in 
detail in a previous publication. Magnetic fields up to 
40 kG were supplied by a 22-in. Varian magnet, and 
were measured to within 0.1% with a rotating coil 
Probe. The magnetic fields were swept automatically 
at the slow rate of —200—400 G/min to reduce the 
effects of eddy currents in the sample. The copper single 
crystal, supplied by K. R. Garr of Atomics Inter- 
national, was grown by the Bridgeman technique from 
a 99.999075 pure rod purchased from the American 
Smelting and Refining Company. A suitable sample 
Was spark cut in the form of a 0.1-in.-diam X0.15-in.- 
long cylinder and was mounted so that the [110] axis 
Was along the axis of suspension. The magnetic field 
TIDPADNENS ati a, 

Supported in part by the U. S. Atomic Energy Commission. 


j Shoenberg, Phil. Trans. Roy. Soc. London Á255, 85 (1962). 


A250, 325 
A. B. Pi il . Roy. Soc. London » 
(1957) ippard, Phil. Trans 


3A.S. Joseph and W. L. Gordon, Phys. Rev. 126, 489 (1962). 


could thus be rotated in the (110) plane of the crystal 
and was oriented in this plane to within 1°. 


IH. RESULTS AND DISCUSSION 


The values of the period of the dHvA oscillations 
arising from extremal orbits around the necks are shown 
in Fig. 1. The period has a maximum value of 4.64 10-5 
G~ along the [111] axis (— 1297 greater than that 
reported in Ref. 1) corresponding to a minimum cross- 
sectional area of 2.06X 10! cm-^?. As the field is rotated 
away from the [111] direction the period decreases and 
can be determined over an angular range of approxi- 
mately 48°. To within the experimental accuracy, the 
data can be represented by the expression 


P(8) m*(0) m, 1/2 
= = cou 1 tao) f (1) 
P(0) m*(8) n, 


with zu/m,-— 2.55-:0.10, indicating that the shape of 


| 

4.70 uie 
4.60, 

4.50} 


PERIOD 


-24*-20*-6*-12*-8* -4° O° 4° 8* 12° 16° 20° 24° 
8 
. 1. de Haas-van Alphen period in copper versus orientation 
LES (110) plane. The solid line is a plot of Eq. (1) with the 
parameter 7/77 2.55. 
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the neck closely approximates the form of a hyperboloid 
of one sheet, and is hence similar to that of nickel.* In 
the above equation, 0 is the angle measured from the 
[111] axis in the (110) plane, and » and m are the 
transverse and longitudinal effective masses. 

At several angles the effective mass was determined 
from the temperature dependence of the amplitude of 
the oscillations. These results are shown in Fig. 2 and 
are compared with Eq. (1), indicated by the solid line. 
The theoretical curve was normalized to the value 
m*(13°)= 0.5 at 0— - 13? for reasons to be discussed 
below. The data indicate a value of effective mass along 
the [111] axis of 0.455,2-396. These results are in 
excellent agreement with the masses determined by 
Koch, Stradling, and Kip? by means of cyclotron 
resonance. 

In Fig. 3 the amplitude of the oscillatory torque is 
plotted as a function of 0 for a constant magnetic field. 
At 0—0, this amplitude vanishes as is always the case 
for an extremum in the magnitude of the period. The 
torque amplitude increases initially with 0, passes 
through a maximum and goes to zero at 0— +13°+1°. 
For larger angles the amplitude again increases and 


@ dHvA MASSES 
© CYCLOTRON RESONANCE MASSES 


=20° -16° -12° -8* -4* O 4° 8° 


8 


Fic. 2. Effective masses in copper versus orientation in the 
(110) plane. Open circles are data of Koch, Stradling, and Kip 
(Ref. 5), found by cyclotron resonance. The solid curve is a plot 
of Eq. (1) normalized to a value m*=0.5 mo at the points denoted 
by crosses. 
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* A. S. Joseph and A. C. Thorsen, Phys. Rev. Letters 11, 554 
(1963). 

6 J. F. Koch, R. A. Stradling, and A. F. Kip, Phys. Rev. 133, 
A240 (1964). 


AND 


CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


As deu THORSEN 
3 
12f- 
g 10 Frc. 3. Amplitude 
3 g of the oscillatory 
He torque at constant 
$ : magnetic field (in 
E arbitrary units) ver- 
z 6 sus orientation in 
a the (110) plane. 
B4 
zZ] 
& 3 
«X 
2 -tooi pni] {io} 
1 
JB VAST 1L 116—113 
0 —50*-165*2*-8*-4* 0* 4* 8 12° 16° 20° 


8 


then decreases, finally vanishing at 0=+24°. We 
attribute the vanishing of the amplitude at 0— 3-13? to 
the effect of spin-splitting of the Landau levels. This 
effect is manifested in a cosine multiplicative factor in 
the theoretical expression for the magnetization. The 
factor can be expressed as cos[7g7:*/2mio ], where g is 
the spectroscopic splitting factor, and m* is the effective 
mass. It was introduced by Sondheimer and Wilson? 
and Dingle’ for the case when m* differs from mo, and 
is given above in the generalized form derived by Cohen 
and Blount.’ If the reasonable assumption that g=2 is 
made, then the amplitude should vanish when m*=0.5 
mo. Our experimental mass values do indeed approach 
this value for 0—:4-13?, evidencing remarkable agree- 
ment with the theory. 
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R. A. Cowrey* 
Crystallographic Laboratory, Cazendish Laboratory, Cambridge, England 
(Received 16 December 1963) 


_ The crystal dynamics of strontium titanate has been studied both experimentally and theoretically. The 
Irequency versus wave-vector dispersion curves for some of the normal modes propagating along the [0,0,1] 


direction have been measured by neutron spectrometry at 90 and 296° 
at the Chalk River Laboratories of Atomic Energy of Canada L 


K. The experiments were performed 
td., using a triple-axis crystal spectrometer. 


The temperature dependence of the transverse optic mode of the lowest frequency has been found to be in 
agreement with the temperature dependence of the dielectric constant, as predicted by Cochran. The ex- 
perimental results have been used to obtain the parameters of severa] models, more than one of which gives 
reasonable agreement with the experimental results. It is suggested that the anomalous behavior of the 
elastic properties and the phase transition at 110°K are associated with an accidental degeneracy of two 


branches of the dispersion curves; the longitudinal acoustic branch and the 


transverse optic branch of 


lowest frequency. The origin of the temperature dependence of this transverse optic mode and the relevance 
of lattice dynamics to the phase transitions in other perovskites are discussed. 


I. INTRODUCTION 


HE lattice dynamics of strontium titanate has 
been studied both theoretically and experimen- 
tally, in an attempt to understand the lattice dynamics 
of a ferroelectric material. Anderson! and Cochran? have 
independently suggested that the anomalous tempera- 
ture dependence of the static dielectric constant of 
ferroelectrics is associated with the temperature depend- 
ence of a transverse optic mode of vibration. Recently, 
this low-frequency mode of vibration has been observed 
experimentally in barium and strontium titanates, by 
using infrared spectrometry?^ and in strontium titanate 
by using neutron spectrometry.® 
In this paper experimental measurements are pre- 
sented of some of the frequency versus wave-vector 
dispersion curves for normal modes propagating in the 
[0,0,1] direction of strontium titanate. These dispersion 
curves were determined by neutron spectrometry using 
the triple-axis crystal spectrometer® at the N.R.U. 
reactor at the Chalk River Laboratories of Atomic 
Energy of Canada Limited. t 
The simplest model which is used to describe the 
lattice dynamics of ionic crystals is the rigid-ion 
model.7:$ In this model the ions interact with one another 
through both long-range electrostatic forces and short- 
range repulsive forces between neighboring ions. This 
model has been shown to be inadequate to describe the 
measured dispersion curves of two alkali halides, sodium 
—__—_ 
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! P. W. Anderson, Izd.AN. SSR., PT (1960). 

? W. Cochran, Advan. Phys. 9, 38 ` z5 : 

? A. S. Barker and M. dre Phys. Rev. 125, 1527 s 

^W. G. Spitzer, R. C. Miller, D. A. Kleinman, and L. E. 
Howarth, Phys. Rev. 126, 1710 (1960). 

ER. A. Cowley, Phys. Rev. Letters 9, 159 (1962). F 

5B.N. Brockhouse, Inelastic Scattering of Neutrons in Sign at) 
UR (International Atomic Energy Agency, Vienna, , 

TE. W. Kellermann, Phil. Trans. Roy. Soc. A238, 513 o 

* M. Born and K. Huang, Dynamical Theory of Crystal 
(Oxford University Press, London, 1954). 


iodide”: and potassium bromide.” These experimental 
results showed that a more satisfactory model was the 
shell model? which takes account of the polarizability of 
the ions in both electrostatic and short-range forces. 

The experimental measurements on strontium titan- 
ate have been used to obtain the parameters of both 
rigid-ion models and shell models. These parameters 
show that it is possible to obtain the large changes in the 
frequency of the lowest transverse optic mode by 
changing the values of the parameters only slightly. 

Recently, it has been found that the ultrasonic atten- 
uation in strontium titanate increases dramatically 
below 110*K, while the elastic constants show an 
anomalous temperature dependence.!-? A possible ex- 
planation for these observations is put forward in terms 
of an instability of the crystal arising from the acci- 
dental degeneracy of two branches of the dispersion 
curves. Finally the results are discussed with particular 
reference to the origin of ferroelectricity in the perovskite 
structure. 


IL THEORY OF THE LATTICE DYNAMICS 
OF STRONTIUM TITANATE 


1. Symmetry 


Some of the properties of the normal modes of vibra- 
tion of a crystal are a direct consequence of its sym- 
metry. The degeneracies between the different branches 
of the dispersion curves, and the separation of the 
normal modes into longitudinal and transverse vibra- 
tions for some wave vectors afe examples of these 
properties. For simple crystals these relations can be 
obtained by inspection, but for more complicated crys- 


9A. D. B. Woods, W. Cochran, and B. N. Brockhouse, Phys. 
; 1960). 
R E Woods B. N. Brockhouse, R. A. Cowley, and W. 
h hys. Rev. 131, 1025 (1963). 
Goba P ind G. Rupprecht, Phys. Rev. 129, 90 (1963). 
K. S. Krogstad and R. W. Moss, Bull. Am. Phys. Soc. 7, 192 


(1962). 
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Fic. 1. The crystal structure of strontium titanate. 


tals a more systematic procedure is given by using 
group theory. 

The normal modes of vibration are classified by as- 
signing them to an irreducible representation of the 
space group of the crystal. A representation for the 
normal modes is obtained by choosing as a set of basis 
vectors the displacement of each ion in turn along a 
coordinate axis. If the character of a symmetry opera- 
tion S is x(S) in this representation, and its character in 
the à irreducible representation is x^ (S), then 


x(S)=2 XCS), (1) 


where C) is the number of times the A irreducible 
representation occurs in the original representation. 

When the normal modes of the crystal are described 
by Bloch waves, the irreducible representations of the 
symmorphic space groups can be obtained from the 
irreducible representations of the little group, which is 
that point group which leaves the wave vector un- 
changed. The irreducible representations are then ob- 
tained from Eq. (1) and from the characters of the 
irreducible representations of the point groups.!* 

The structure of strontium titanate is cubic perov- 
skite, and the space group is Pm3m. The ions are 
situated on five interpenetrating simple cubic lattices as 
shown and labeled in Fig. 1. The characters of the 
different symmetry elements in the original representa- 
tion are listed in Appendix I. 


(a) ¢= (0,0,0) 


The little group is m3m, and the irreducible repre- 
sentations are 
“4D i5+T os. 


This result is, however, incorrect because the effect of 
the macroscopic electric field has been neglected. This 
electric field splits the degeneracy of some of the normal 
modes because the boundary conditions for the longi- 
tudinal and transverse modes differ.* The degeneracies 


1» V, Heine, Group Theory in Quantum Mechanics (Pergamon 
Press, Inc., New York, 1960). s i 

“4G. F. Koster, Solid State Physics, edited by F. Seitz and D. 
‘Turnbull (Academic Press Inc., New York, 1957), Vol. V. 
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will be lifted for those normal modes which transform 
like an ordinary vector. Since an ordinary vector trans- 
forms like I's, the degeneracies associated with three of 
the four irreducible representations T15 are split. The 
fourth irreducible representation corresponds to a uni- 
form translation of the whole crystal. There are, there- 
fore, the three acoustic modes, «— 0, a triply degenerate 
normal mode Tzs, and three longitudinal and doubly 
degenerate transverse optical modes. 


(b) &— (0,0,5) 


The little group is 4mm, and the irreducible repre- 


sentations are 
4A;+ AzT-5A;. 


The normal modes A; and As are longitudinally polar- 
ized modes, while those represented by A; are doubly 
degenerate transverse modes. 

At the zone boundary the little group is 4/mmm, and 
the irreducible representations are 


2M -2MY -M3-3M4-2MS. 


M; and Mz’ correspond to doubly degenerate transverse 
modes, while the other modes are longitudinally 
polarized. 


(c) ¢= (65,0) 


The little group is mm except at the zone boundary, 
where it is 4/mmm. The irreducible representations are 


52H22 +52 +42, 
and 


M +Mo+M} +M; +2M} +M +M +3M5'. 


There are no degeneracies at general points, but at the 
zone boundary there are four doubly degenerate normal 
modes. 


(d) (= (564) 


The little group at a general point is 3m and its 
irreducible representations are 


4Aid- Ag T-5As. 


At the zone boundary the little group symmetry is in- 
creased to m3m and the irreducible representations are 


DzZ-ET 3? -ETss- E Tos +20 15. 


The irreducible representation As corresponds to doubly 
degenerate normal modes, while at the zone boundary 
there are four triply degenerate normal modes, one 
doubly degenerate pair and a single normal mode. 


2. Models 


In the last section we found the properties of the 
normal modes which can be deduced from the symmetry 
of the crystal. In order to evaluate the frequencies 9 
any of the normal modes, a particular model must be 


&. 


SAA 


DNA 


le 


&-. 
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employed for the interactions between the ions. The 
simplest model which has been used for ionic crystals is 
the rigid-ion model.” The equation of motion for this 
model is then written in the harmonic approximation as 


M ke? U(K) =} (Rew +ZxCxxZx-)U(K’), (2) 


K* 


where the notation is the same as that of Woods et al? 

This model does not satisfactorily account for the 
dispersion curves of the alkali halides: because it 
neglects the polarizabilities of the ions. These polar- 
izabilities can be introduced by using the shell model.2° 
The equations of motion of the shell model in the 
harmonic approximation? are 


M ko?U(K) = (Rex +ZxCxxZx)U(K’) 
a 
+} (Tre t+ZrCrx Vx )W(K’), 
= 


(3) 
0=S (Text VxCxxZx)U(K’) 
K 


+2 (Szr t+YxCrr Yr )W(K'). 
T 


The matrices Txx- and Sx; give the short-range 
interactions between the displacements and the elec- 
tronic dipole moments, and between the electronic 
dipole moments, respectively. In general, the elements 
Rex, Tx and Ski are all different and must be 
specified by different short-range force constants. How- 
ever the shell charges Y x can be chosen so that Rx 
=Txx when the wave vector is zero,? and it was 
further assumed that they are then equal for all wave 
vectors. The validity of this approximation and the 
further approximation that Sxx-=Rxx-+kxéxr are 
discussed by Cowley, Cochran, Woods, and Brock- 
house.!5 The model then corresponds to a shell model in 
which all the short-range forces act through the shells. 

In actual calculations the short-range forces were as- 
sumed to be axially symmetric. If the equilibrium con- 
dition is imposed then these forces become central 
forces and the elastic constants obey Cauchy's relation, 
Ci5— C44. In practice, this relation is nearly satisfied, but 
the equilibrium condition was not imposed on the 
models described below. s 

For each type of short-range interaction two parame- 
ters are needed to specify axially symmetric forces. 
These parameters were chosen to be the derivatives of a 
potential function parallel and perpendicular to the line 
joining the interacting ions. These were Genes m 
similar way to those used for the alkali halides.’ 

The forces were specified by 


22) eA; (=) eB; 
=) TES a a} 
HC. e u 20 or/, 2v 
ER, A. Cowley, W. Cochran, A. D. B. Woods, 
Brockhouse, Phys. Rev. 131, 1030 (1963). 


and B. N. 
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SrTiO; A983 
where the suffix i is one for strontium-oxygen, two for 
titanium-oxygen, and three for oxygen-oxygen forces. 
Each type of force was assumed to act only between 
nearest neighbors. The short-range contributions to the 
dynamical matrix were then evaluated by the usual 
procedure,’ and the elements are listed in Appendix II. 

The ionic charges were specified by two charges; that 
on the strontium ion was Zye and on the titanium ion 
Zæ. The charge on the oxygen ion is then Zae 
=—3(Z,+Z,)e. 

The polarizabilities of the ions were specified in a 
similar way to that used for the alkali halides.?-5 These 
polarizability parameters were chosen so that they were 
independent of the choice of the shell charges. The 
electrical polarizability was defined as 


oo Y 2/[R;4- (T ijo], 
and the short-range polarizability as 
d; —Yi(Tu)/[E-t (T:)o], 


where the suffix i has values 1 for the strontium ion, 2 
for the titanium and 3 for the oxygen, and (711)s— 241 
+4B1, (Ta)o=A:+2B2. The oxygen-ion polariza- 
bilities are not isotropic, so that the polarizability was 
defined in terms of a mean (735); given by, 


(7T33)o7 3 QA i FABi- Ag 2B -4A 4H- 8B3) . 


The most complicated rigid-ion model of the crystal, 
then, has eight adjustable parameters, six for the short- 
range forces and two for the ionic charges, while for a 
shell model there are fourteen parameters, the same 
eight and six polarizability parameters. 


3. Calculation of Dielectric and 
Elastic Constants 


The frequencies of the normal modes of vibration can 
be obtained by solving Eq. (2) for a rigid-ion model, or 
Eqs. (3) for a shell model. Once a particular set of 
parameters has been chosen, their solution is quite 
straightforward when the electrostatic coupling coeffi- 
cients are known. These coefficients were calculated 
using the EDSAC II computer at the Cambridge Uni- 
versity Mathematical Laboratory, and their values for 
the (0,0,1) direction have already been reported.’ These 
and the coefficients for the (1,1,0) and (1,1,1) directions 
are listed in Appendix III. : 

'The dielectric constants can be evaluated in terms of 
the parameters introduced above.*"" The high-frequency 
dielectric constant is given by Eq. (2.44) of Ref. 17 and 
the static dielectric constant by Eq. (2.34). In practice 
the high-frequency dielectric constant was evaluated 
explicitly and the static dielectric constant obtained 
from the Lyddane-Sachs-Teller relation as extended by 


Cochran? 


18 R. A. Cowley, Acta Cryst. 15, 687 (1962). 
11 R, A, Cowley, Proc. Roy. Soc. (London) A268, 121 (1962). 
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The elastic constants were evaluated explicitly using 
the method of long waves as developed by Born and 
Huang? and Cowley." Since every ion is situated at a 
center of symmetry, the polarizabilities do not affect the 
elastic constants. The electrostatic contributions were 
evaluated numerically.!6 The elastic constants are given 
by: 

Cu- Qe/w)[3 (241+ 2Bit-A2t24 5+2Bs) 
+0.16485(Z2+-Z;7) — 0.78348Z2 
—1.0721Z3Z3--2.67981Z;Z5— 1.27803Z;Z5], 

C= Qe/vr)Ls (A d 5B1— B +A Ss 5B3) 

—0.55532(Z2+Z27) — 1.609462; 
--0.26873Z1Z3— 1.43585Z5Z54-0.05648Z;Z; ], 

Cyu= Qe/vr)Ls (A 1+ 3Bi4+ Bot A 3+3B3) 

— 0.08242 (Z:?2- Z2?) 4- 0.391742? 
4+.0.53605Z1Z2— 1.33991Z.Z54-0.63902Z;Z; ]. 


The electrostatic cohesive energy is (£/r)aur, where aa, 
the Madelung constant, is given by 


1.25950(Z2-+ Z2?) 4-4.57109Z 2-1-0.79260Z3Z3 
—0.667170Z5Z354-0.48362Z3Z5. 


The equilibrium condition is then 
4au+12(Bı+B:)+3B:=0. 


This last equation when substituted into the equation 
for Cy; gives C12= Ca. 7 is half the lattice parameter and 
v the volume of the unit cell, v= 87°. 


1j Il. THE EXPERIMENTS 
: ja 
Di 1. Experimental Methods 
i The excellent single crystal of strontium titanate was 
ft a 150-carat boule provided by the Titanium Division of 


the National Lead Company. The mosaic spread was 
measured against that of a silicon crystal and was 0.25 
deg. The crystal, which had [1,0,0] aligned along the 
length of the boule, was mounted in a metal cryostat, 
and could be either cooled or heated from above. It was 
aligned im situ by using neutrons. 

The experiments were carried out by using a triple- 
axis crystal spectrometer.* 

The integrated intensity of a neutron group, obtained 
using the “constant Q” technique,® is given in the 
harmonic approximation by? 


h |E] a 


H N (a5) | 
8x? |ko| (a5) 


N(aj) +1 
H (qj) is the structure factor for the mode and is given 
E 
Ha) — |Z exp Cris: RQR))O- e(K,aj) bM 
: Xexp(—W(K))|. 
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Fic. 2. The (1,0,0) plane of the reciprocal lattice of strontium 
titanate. The Brillouin zone is labeled ABCD and the area in 
which the reduced structure factor repcats is enclosed by (0,1,1), 
(0,1,1), (0,1,1) and (0,1,1). OLM and OL’M demonstrate the 
“constant-Q” technique in which the spectrometer is controlled so 
as to keep the momentum transfer constant while varying the 
energy transfer, while OGH and OG'H' shows the constant-energy 
technique in which the momentum is kept along the desired 
direction. 


If a normal mode is propagating along the [0,0,1] 
direction then the motion of all the ions is either 
transverse or longitudinal. The dependence of the 
structure factor on the wave-vector transfer can then be 
separated out if we assume the Debye-Waller factors for 
the different atoms to be equal. We can then introduce a 
reduced structure factor for the normal mode 


h(q7)=|d exp Qzi«- R(0K))e(K,q) M rbr]. 
K 


This reduced structure factor is periodic in reciprocal 
space. Figure 2 shows the (1,0,0) plane of the reciprocal 
lattice of strontium titanate. The Brillouin zone is 
labeled as ABCD, while the reduced-structure factor 
repeats over a larger unit in reciprocal space, enclosed on 
Fig. 2 by the reciprocal lattice points (0,1,1), (0,1,1), 
(0,1,1), and (0,1,1). Because the structure factor repeats 
over a larger volume in reciprocal space than the fre- 
quencies, there are several different points in reciprocal 
space at which phonons of the same wave vector have 
different reduced-structure factors. In the (1,0,0) plane 
there are four corresponding to reciprocal-lattice vec- 
tors, =: (0,0,0), (0,1,0), (0,0,1) and (0,1,1). 

The reduced-structure factor squared has been calcu- 
lated for each of these different positions for normal 
modes propagating in the (0,0,1) direction, by us!né 
model IV of Sec. IV. The results are shown in Fig. 3- 


2. Measurements of the Dispersion Curves 


Measurements have been made at 90°K of four of the 
transverse branches and of most of three of the long! 
tudinal branches. The measured branches are those with 
the lower frequencies. The results are illustrated n 
Figs. 7 and 9, where they are compared with several 0 
the models described in the next section. The highest 
frequency transverse optic branch and the two highest 
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frequency longitudinal optic branches were not meas- 
ured because of the extra difficulty in measuring these 
high frequencies. Unfortunately it was not possible to 
complete the measurements on the longitudinal acoustic 
branch with the crystal in the (1,0,0) plane. 

Less complete measurements were made of three of 
the transverse branches and the longitudinal acoustic 
branch at 296°K. The results are shown in Figs. 7 and 9, 
where they are compared with calculations using several 
of the models described in the next section. 

The frequencies of the normal modes with long wave- 
length, q—0, can also be obtained by using infrared 
spectroscopy. Strontium titanate has been studied by 
Last,? Barker and Tinkham? and by Spitzer et al.4 The 
frequencies of the transverse modes can be obtained 
directly from these measurements, and those of the 
longitudinal modes by using a result due to Kurosawa,” 
as explained by Cochran and Cowley.? In Table I the 
results of the infrared and neutron techniques are 
listed, for the q=0 modes. 

Since in strontium titanate each ion is situated at a 
center of symmetry, its Raman spectrum is a second- 
order spectrum and depends upon the joint density of 
states for the phonons. Narayanan and Vedam?! have 
erroneously interpreted their results as a first-order 
Spectrum. 


LONGITUDINAL 


a NR a ELÁ 


bs] 
[9] 


STRUCTURE FACTOR (arbitrary units) 
ki 


2n woo OD t=(1.19) 
WAVE VECTOR 


. Fic. 3. The reduced structure factor for normal modes propagat- 
ing along the (150,0) direction calculated using model IV of N 

he solid lines show longitudinal modes and the tansy: om Xm 
are polarized along the y direction and shown by dotis : 
modes are labeled in order of increasing frequency. 
c RUM ^ 

=J. T. Last, Phys. Rev. 105, 1740 (1957). 

2T. Kurosawa, J. Phys. Soe, Japan 16, 1298 (1961). 

aW, Cochran and RA. Covley, Inelastic Scattering of Slow 


Veutrons (International Atomic Energy Agency, Vienna, 1963), 
ol. I 


aips] Narayanan and K. Vedam, Z. Physik 163, 158 (1961). 
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, TABLE I. The frequencies of the q=0 optical modes in strontium 
titanate (units; 10 c/sec) as determined by neutron and infrared 
spectrometry. The measurements with an asterisk are at 90°K and 
the others at 296°K. 


Neutron P 
à spectroscopy Infrared measurements 
Normalmode measurements (3) (4) 
Transverse 
yi (fa 1.20* 
2.13 3.00 2.63 
ya 5.10* e 5.34 
v 7.95% N- e 
v4 e 16.5 16.3 
Longitudinal 
v 5.10* e 5.25 
va 7.95* ee 
v3 EE 14.5 13.8 
7i Weis 24.5 24.9 


3. The Temperature Dependence of the Lowest 
Frequency Transverse Optic Mode 


Cochran? has shown that, if the static dielectric con- 
stant of a crystal follows a Curie law temperature 
dependence, 


e-C/(T—T,), (4) 


the temperature dependence of the q=0 transverse 
optic mode with lowest frequency is expected to be 


wr -— K(T—T.). (5) 


Mitsui and Westphal? and Weaver? have shown that 
the static dielectric constant above 70°K follows a 
Curie Law with a Curie temperature of about 35°K. 

The q—0 mode of the transverse optic branch with 
lowest frequency has been studied at five different tem- 
peratures, and the neutron groups obtained at three of 
these temperatures are shown in Fig. 4. The increase in 
intensity on the low-frequency side of the 90°K group 
is due to the elastic and acoustic mode scattering as- 
sociated with the (0,2,0) reciprocal lattice point. The 
asymmetry of some of the groups and the change in the 
width of the neutron groups with temperature are 
probably largely due to effects associated with the finite 
resolution of the spectrometer. 

Figure 5 shows a plot of the frequencies squared 
against temperature for this normal mode. The excellent 
straight line gives ample evidence 'for Eq. (5), and the 
validity of discussing ferroelectrics as a problem in 
lattice dynamics, as suggested by Cochran.? The Curie 
temperature is given by the intercept of the straight line 
with the temperature axis, which yields a Curie temper- 
ature of 322: 5?K, which is in excellent agreement with 
the dielectric constant measurements.*-” 


Mitsui and W. B. Westphal, Phys. Rev. 124, 354 (1961). 
2H M ester Phys. Chem. Solids 11, 274 (1959). 
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Fic. 4. Neutron groups for the q=0 lowest frequency transverse 
optic mode at three different temperatures. 


4. The Temperature Dependence of the 
Transverse Acoustic Branch 


Recent measurements of the elastic constants of 
strontium titanate” have shown that on cooling from 
room temperature, C44 at first increases, but near 110°K 
it decreases quite dramatically." 

Neutron measurements have been made of the 
¢= (0,0,0.2) transverse acoustic mode to see if a similar 
temperature dependence is found at these higher fre- 
quencies. The results are shown in Fig. 6 and there is 
clearly no evidence for any discontinuity at 110°K, 
within the limits of experimental error. 


E 


@ 


8 
m 
(FREQUENCY)? 102% sec? 


o 


RECIPROCAL OF DIELECTRIC CONSTANT. 
^ 


o 100 200 300 400 500 
TEMPERATURE (*K) 
Fic. 5. A plot of the frequency squared against temperature for 


the q=0 lowest frequency transverse optic mode. The dotted line 
Es the reciprocal of the static dielectric constant (Refs. 22 


and 23). 
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Tic. 6. A plot of the 
frequency — (10? ¢/sec) 
against temperature (°K 
for the €= (0.2,0,0) trans- 
verse acoustic mode. 
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TEMPERATURE 


The ultrasonic measurements also showed anomalous 
attenuation below 110°K, but no anomalous broadening 
of the neutron groups at 90°K was detected for either 
the longitudinal or transverse acoustic branches. 


IV. MODELS 


The theory of the lattice dynamics of strontium 
titanate was described in Sec. II. In this section we 
discuss the results obtained when the parameters of 
these models are fitted to the experimentally measured 
elastic and dielectric constants,?225 and to the fre- 


LONGITUDINAL | | TRANSVERSE 
90°K 90° x 
ODELI 


MODEL 
Che om MODELI 


TRANSVERSE 
296° K 


24 
296 K 


LONGITUDINAL 
. 


o [I aso (— os o {t— oso 
WAVE VECTOR 


Fic. 7. The dispersion curves for model I (solid line) and model 
III (dotted line) for the normal modes propagating along the 
(¢,0,0) direction. The experimental measurements are taken from 
both neutron and infrared results. (See Refs. 3 and 4.) The ute" 
ducible representations at the zone boundary are for the longi- 
tudinal modes My, M 1, Ms, Mı, Ms! in order of increasing fre: 
quency, while for the transverse modes they are Ms, Ms’, Ms, Ms» 
Ms (model I) and Ms’, Ms, Ms, My, Ms (model ILI). 


FREQUENCY Cto?c/;2 


m Poindexter and A. A. Giardini, Phys. Rev. 110, 1069 
5 S. B. Levin, N. J. Field, F. M. Plock, and L. Merker, J. Opt- 
Soc. Am. 45, 737 (i855) zi 
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TABLE II. The par: a : ores 7 ; 
D Drs of re models described in Sec. IV. Those parameters which were varied in each model are given with the 
errors of the parameters, while for those which were kept fixed there are not any errors. 


Short-range forces e?/2v 


: Tonic charge (e) 
Strontium-Oxygen Titanium-Oxygen 


z Oxygen-Oxygen Strontium Titanium 
Model En B, A, B; A; B; Zi Z: 
90°K A ae E 7 = is Eu ie à e CM DNE ENG SER 1,48 
I 9.02-4.7 2.52.1 110.44-18.7 —36.64 5.7 24.8-- 6.0 1.0 5 t 
0 542. 42:18. <€ 3+6. 0+1.: 1404-0. 2040.36 
II 26.53-2.1 —2.02-1.3 288.5--10.3 —42.74 24 —3.744.3 -0109 2 H : ri ; 
III 26.9--2.5 —3.841.3 292.2-+10.7 —41.4 5.4 —6.84-5.0 0.8--1.2 2 4 
IV 12.9+7.4 0.31.5 313.7419.7 —64.0+18.0 5.348.8 0443.9 1.26--0.47 4.64-0.16 
V 6.91.5 2.6+0.9 3214+ 9.8 —70.14 4.8 8.8-2.9 —2.1+0.7 0.83-£0.07 4.94-£0,08 
VI 6.50.7 2.80.4 317.74 2.4 —68.8+ 0.7 9.8--0.6 —2.5-E0.3 0.32625 4.9123 
296?K 
I 2.92.3 6.7+4.6 112.12-20.9 —36.3+ 6.5 28.347.7 0+1.8 1.20+0.16 2.3240.35 
II 27.6428 2841.7 28964129 — —392-- 42 — —29:34 IÍ a e 
III 28.7+3.2 —4.2-+2.0 293.32-12.0 —40.8+ 9.2 —5.5+6.2 0.11.4 2 4 
IV 16.5+7.0 —1.42-2.3 300.52-36.2 —46.12-14.8 5.58.6 —1.6+2.8 1.34+0.50 4.39+0.50 
V 8.6--2.1 2121.3 315.0+12.5 —70.24- 5.3 9143.9 —2.340.9 0.83--0.07 4.91+0.09 
Polarizability (10) cm? or (e) Especie 
Strontium ion Titanium ion Oxygen ion obseryable/ 
Model [771 di a2 dz a d; exptl. error 
90*K Em 
I 0. 0 0 0 0 0 4.1 
II 0.025 0 0.003 0 0.0282-0.002 0.693-0.11 2.6 
III 0.025 —0.350.29 0.003 —0.08+0.15 0.030+0.002 0.68+0.13 + 2.5 
IV 0.025 —0.29+0.59 0.003 0.01+0.06 0.027 +0.004 0.82+0.11 2.3 
V 0.025 0 0.068-+-0.011 —2.13+0.46 0.0182-0.003 0.51+0.14 1.7 
VI 0.025 0 0.79441 —2.596 0.01535 0.42162 1.6 
296°K 
I 0. 0 0 0 0 0 44 
II 0.025 0 0.003 0 9.028+-0.004 0.68+0.15 FA 
III 0.025 —0.482:0.23 0.003 —0.05+0.16 0.0292-0.003 0.6940.15 2.8 
IV 0.025 —0.18+0.53 0.003 0.032-0.09 0.027 4-0.003 0.78+0.16 2.7 
V 0.025 0 0.0792-0.011 —2.60 +0.41 0.015-++0.605 0.42+0.17 1.9 


quencies of the normal modes as measured by infrared? TABLE IIT. The agreement between the experimental elastic and 
and neutron spectrometry, as described in the last dielectric constants and those calculated using the models, 

) — = 
section. The fitting was performed by a nonlinear least- 


squares analysis and the numerical work was done using Pere onsale Diem 
the EDSAC II computer. Model Gi eu Cu e E 


Results have been obtained for several models at = 
both 90 and 296°K. The parameters of these models are 90°K 


listed in Table II, while their agreement with the ex- Exptl 2E ross ie ea "e 
perimental elastic and dielectric constants is shown in H 326 107 10 id: t 
Table III. The agreement with the measured dispersion II 3.19 1.07 1.71 1171.8 3.83 
relations is shown for models I and III in Fig. 7 and for W oe 10; ; 2 d 204 
models IV and V in F ig. 9. The dispersion relations in VI Eo rie ES md me 
the (1,1,0) and (1,1,1) directions have also been calcu- 
lated for some of the models and are shown in Figs. 8 296*K 
and 10 Exptl. 3.303 1.014 1.244 301.05 5.5e 
cone z . s j I 2.98 1.12 1.02 37.9 1 

A particularly interesting feature of e ees is T en s E e i 
that the large charges in the frequencies of the trans- — qq 331 127 171 292.9 38 
verse optic mode with lowest frequency have been ob- IV 3.45 1.24 1.65 215.1 3.73 
tained with comparatively small changes in the parame- V 3.46 1.19 1.28 244.2 5.78 


ters of the models. Model VI, for example, has the ionic eae sae ; 
charges and polarizabilities of model V at 296°K but fits |» These have been obtained from Ea. (8) neglecting the effects of the 


he 90°K with on mall changes in th b See Refs. 22 and 23. 
the °K measurements wit ly S [o S the an Ras an 

d 
short-range force constants. See Ref. 11 
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Tic. 8. The dispersion 
curves in the (¢,¢,0) and 
din (¢,6,5) directions for models 

III and V at 90?K. The 
irreducible representations 
in order of increasing fre- 
10 quency are at the point (A); 
My, My, Ms, My (model 
Be Pocus IID), and My, My, Ms, My 

ccce NT UN (model V). At the point (B) 
“5 they are AM», Ms, Ms, My, 

Mi, Ms’, M4 for model III 
and Ms, Ms’, M», My, My, 
Mi, M4 for model V, while 


at the zone boundary (C) 
they are Tyo’, Tis, Tos, Iis, 
T», P»' (model IIT), and 
si Ias, Vis, Vis’, Vos’, Pis, Tf 
(model V). 
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V. THE 110°K PHASE TRANSITION 


1. The Temperature Dependence of the 
Elastic Constants 


The temperature dependence of the elastic constants 
of crystals has been discussed by several authors.*?$ The 


potential energy of the crystal is expanded in a double 
power series of the deformation parameters and of the 
phonon coordinates, Eq. (40.4) of Ref. 8. 

If this expansion is taken about the equilibrium con- 
figuration of the lattice at each temperature, then the 
isothermal elastic constants can be obtained from*^* 


(88/8) - Sas (=) FE E Sarl, af—49) 0//2(85)) Qn(aj) J-1)--2 27 Sal, 95-49’) 


2n(qj) +1 


X$4(—, —9j07') 


The notation is similar to that of Born and Huang.? The 
elastic constants are then given by 
Capysi*?= (1/4V)[ (8,79) -- (8,y) + Ba,y8)+ (8o) ]- 

'The adiabatic and isothermal elastic constants are 
related by macroscopic relations, which in Voigt's 
notation are 
C383 — Cis? — C15: — C155 

= (TVo2/Cy)(Cu*e3-2C1599? , 
C,454— C441 —0. 

"The third term in the expression (6) for the elastic 


/G. Leibfried and W. Ludwig, in Solid State Physics, edited by 
tz and D. Turnbull (Academic Press Inc., New York, 1961), 


^ 


a 
2w(qj) Ne? (87) —9* (7) 


s 
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constants is particularly of interest. The summation is 
over all the different pairs of normal modes with @ 
particular wave vector and belonging to different 
branches of the dispersion relations. Figure 11 shows the 
longitudinal-acoustic and transverse-optic dispersion 
curves at 296°K and at 90°K. At the higher temperature 
the curves are well separated, but at 90°K they appear 
to be almost degenerate over a considerable region 0 
wave-vector space. Since at zero wave vector the tem- 
perature dependence of the transverse optic mode 1S 
given by Eq. (5), it might be expected that the temperar 
ture dependence of the denominator in the third term 0 
Eq. (6) for these two normal modes is given by 


9*(qj)]—«(q7) a T—-Ta, 


C uu ML LC I 


De AU Cmm DT he E 
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where T4 is the temperature at which they become 
degenerate. The above temperature dependence can 
also be obtained by using a lowest order anharmonic 
theory of the temperature dependence of the normal 
modes. 

The other factors in the expression for the elastic 
constants are either independent of temperature or as a 
first approximation can be written as proportional to 
temperature in a limited temperature range. The elastic 
constant is then given by an expression of the type 


Ca8,457 A-BT--C/ (T— T4). (7) 


The coefficients 4, B, C can, in principle, be obtained 
from the matrix elements and summations over the 
normal modes. The coefficient C would be particularly 
difficult to evaluate, because it depends on the detailed 
shape of the dispersion curves near the degeneracy. 

The temperature dependence of the elastic constants 
has been measured above 110°K by Bell and Rupprecht," 
who obtained 


C7 3.341 X 102[1— 2.62 

X10-*(T— T4)—0.0992/ (T—T4)], 
C:5— 1.049 X 10?[1— 1.23 

X10-(T—T4)4-0.1064/(T— T4)], (8) 
Cas= 1.267 10?[1— 1.30 

X10-(T— T4)—0.1242/ (T— T4)], 


with Ta=108°K. 

"These results are clearly in agreement with the tem- 
perature dependence predicted by Eq. (7). Bell and 
Rupprecht!! also found that the ultrasonic attenuation 
increases dramatically near 110°K. The ultrasonic 
attenuation can be calculated in an exactly similar way 
to the elastic constant?’ and arises when the frequency 
Q of the ultrasonic wave is such that 


9—o(q7)—o(a7.)- (9) 


. Fic. 10. The dispersion curves of 
model IV in the (5,5,0) and (5,60) 
directions. In order of increasing 
frequency the irreducible represen- 
tations are for the zone boundary 
(A) My’, M>', Ms, Ms’ (90*K) and 
M’, M3’, Ms, Ms! (296°K), for the 
zone boundary (B) Ms, Ms’, M 5 
M», Mi, Ms’, M4 (90°K) and Ms’, 
Ms, M», Ms’, Mi, Ms’, Ma, while 
for the zone boundary (C) they are 
Dos, Pis, Vie’, Pis, Dos’, Te! (90°K) 
and Tis, Tes, Tie’, Tis, Des, Te 
(296°K). 
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*! R. A. Cowley, Advan. Phys. 12, 421 (1963). 
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Fic. 9. The dispersion curves for model IV (solid line) and 
model V (dotted line) for normal modes propagating along the 
(1,0,0) direction. The experimental measurements are taken from 
both neutron and infrared spectrometry. (See Refs. 3 and 4.) The 
irreducible representations at the zone boundaries are for the 
longitudinal modes My, Mi, Mz, Mi, My except for model V at 
90*K which gives M»', Ms, My, Mı, Mo! while for the transverse 
modes they are Ms, Ms’, Ms, Ms, Ms. The irreducible repre- 
sentations are listed in order of increasing frequency. 


If the transverse optic mode is almost degenerate with 
the longitudinal acoustic mode in this temperature 
region over a considerable region of q space, as suggested 
by Fig. 11, then there will be many normal modes for 
which Eq. (9) can be satisfied and the ultrasonic 
attenuation will be large. 


2. The Transition 


The elastic constants of the crystal will become nega- 
tive just above T4, Eqs. (8). When the elastic constants 
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become zero the crystal will undergo a spontaneous 
deformation to a new structure, the extent of the de- 
formation depending upon the third- and fourth-order 
elastic constants. Since the elastic constant C44 becomes 
negative at the highest temperature, a tetragonal low- 
temperature structure can result by combining three 
shear distortions along different axes in the crystal. A 
tetragonal structure of strontium titanate below 110°K 
has been found by Rimai and deMars.?5 In this transi- 
tion the volume of the crystal does not change, and it is 
extremely likely that the crystal splits up into small 
domains, with the major tetrad axis, the z direction in 
the domain, along different cube axes. 

The elastic constants and the free energy of the 
crystal depend not only on the frequencies w(qj) and 
w(gj’) which are nearly degenerate, but also on the 
number of these normal modes which are nearly de- 
generate. The temperature dependence of the distortion 
can then be estimated if we assume that its magnitude 
is proportional to the difference in the number of normal 
modes just above the degenerate frequency to those 
just below this. The frequencies squared of the trans- 
verse optic modes are linearly temperature-dependent, 
while the deformation is limited by the third-order 
elastic constants which are assumed to be independent 
of temperature. The temperature dependence of the 
distortion is then proportional to (74— 7)!? as found 
experimentally by Rimai and deMars.?8 

The discussion given above is very crude and cannot 
be expected to give satisfactory results at temperatures 
far away from the transition temperature. In particular 
the temperature dependence will depend on the effect of 
the distortion on the other normal modes, particularly 
near the degeneracy in the longitudinal acoustic and 
transverse optical branches, the temperature depend- 
ence of the population factors of the normal modes, and 
the effect of the domain structure on the free energy. 

Below the transition the behavior of the elastic con- 
stants and ultrasonic attenuation will be very complex. 
The elastic constants will be an average over the 

different orientations of the domains, as well as being 
strongly affected by the exact details of the degeneracies 
of the normal modes and their behavior in the distorted 
crystal. The ultrasonic attenuation will also be anoma- 


28 L, Rimai and G. A. deMars, Phys. Rev. 123, 702 (1962). 
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lous because of reflection of the ultrasonic waves at the 
surfaces of the domains, as suggested by Bell and 
Rupprecht." 


3. The Dielectric Properties 


Below 110°K the frequencies of the normal modes of 
vibration will be altered by the distortion of the 
crystal. It is likely that the frequency of the transverse 
optic mode will be particularly sensitive because of the 
large changes which occurred when small changes were 
made in the parameters of the models of Sec. IV. If the 
lattice spacing decreases by ôa along the x and y 
directions and increases by 28a along the unique z axis, 
the frequency of a transverse optical mode depends upon 
whether it is polarized along or perpendicular to the z 
axis. 

The changes in the frequencies due to the distortion 
can be estimated from the volume dependence of the 
frequencies in the cubic structure. Equation (5) for the 
frequency will be modified to 


DRE R) +2008, 
QUK T—85B, 


where c; is the normal mode polarized along the z axis 
and B—de?/da. 

The distortion of the crystal then changes the Curie 
temperature by —28aB/K for displacements along the 
z axis, and by 6aB/K for those perpendicular to it. 

The dielectric constant of a large crystal can then be 
estimated by assuming that there are an equal number 
of domains orientated with the z axis along each of the 
three original cube axes. The average dielectric constant 
is then 


TNT = ne QiaB/K) T- T,— ue) 
G e í 


& 3 


and therefore, 
&=C/(£—T,). 


The dielectric constant of a bulk sample of the 
distorted crystal is then identical with that of the 
crystal if there was no distortion. This result explains 
the rather surprising experimental result"? that the 
reciprocal of the dielectric constant does not even 
change slope through the 110°K transition. 

The distortion does however influence the low-tem- 
perature dielectric properties. Since the Curie tempera- 
ture is raised in one direction, some of the domains can 
become polarized above T.. This spontaneous polariza- 
tion will occur along different directions in the crystal, 
corresponding to the different orientations of the do- 
mains, and will be frozen in to a particular configuration 
by the domain pattern produced by the 110°K transition. 

The dielectric behavior at low temperatures Wl 
therefore be extremely complicated. Deviations from 
the Curie law will be caused by the spontaneous 
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polarization of some of the domains. 
probably larger deviations will be caused by quantum 
effects. Barrett? demonstrated how these occ ur by using 
a Slater model of a ferroelectric, and very similar con- 
clusions can be obtained by using a lattice dynamical 
approach. These two effects can probably account for 
the deviations from a Curie law found experi- 
mentally.2223 

When an electric field is applied the crystal shows a 
remanence polarization? at temperatures above 50°K, 
the size of this remanence polarization increasing as the 
temperature decreases. This polarization may be pro- 
duced by the electric field aligning the spontaneous 
polarization of some of the domains, which is then frozen 
in by the domain structure when the field is released. 
The increase in magnitude with decreasing temperature 
arises because more and more domains become spon- 
taneously polarized. The distortion required to shift the 
Curie temperature to 50°K, an increase of about 15°K, 
can be estimated from the measurements of the pressure 
dependence of the Curie temperature,” while the sign 
of the distortion is given by the change in free energy of 
the crystal. The change in the vibrational part of the 
free energy due to the changes in the frequency of the 
transverse optic mode show that the domains are 
probably elongated along the unique z axis. The de- 
pendence of the Curie temperature on lattice spacing is 


Further and 


dT, 1.210! 


da a 


For an increase in Curie temperature of 15°K, the 
increase in length along the tetragonal axis is 0.005 A, 
while the contraction perpendicular to the axis is 
0.0025 A. These distortions are very small, and could 
easily have been missed in the x-ray diffraction work 
quoted by Bell and Rupprecht.” 


VI. DISCUSSION 
1. The Parameters of the Models 


The parameters for the rigid-ion model (I) show that 
both the short-range forces between the titanium and 
oxygen ions and the titanium and oxygen ionic charges 
are considerably smaller than for the shell models. This 
decrease in magnitude must in some way compensate 
for the neglect of the polarizabilities of the ions. 

The parameters of all of the shell models are far more 
similar to one another than they are to the rigid-ion 
models. The parameters specifying the short-range 
forces in models II and III, in which the ionic charges 
were kept fixed, are very similar, but when ino lonic 
charges were allowed to vary in models IV and V there 
was a decrease in the strontium-oxygen forces and a 
slight increase in the titanium-oxygen forces. These 
changes are accompanied by a decrease in the ionic 


? J. H. Barrett, Phys. Rev. 86, 118 (1952). 
æ W. J. Merz, Phys. Rev. 77, 52 (1950). 
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charge of the strontium ions and an increase of the 
charge on the titanium ions. 

It is of interest to compare the parameters of the 
short-range forces with those obtained by the method 
suggested by Fowler? Devonshire,” Kinase,” and 
Dvorak and Janovec“ have used these forces to describe 
the forces in perovskite ferroelectrics. In the notation of 
Sec. II.2, the parameters of their forces are: 


A 1= 301 4232:429.1 437 63.5 
By=—2.74 B,=—41.53 B,;=—6.16. 


Although these force constants are significantly different 
from those of any of the models of Sec. IV, they do 
exhibit some of the principal features; for example, Ae 
is by far the largest. Nevertheless, the frequencies of the 
normal modes of vibration are so sensitive to small 
changes in the parameters that these short-range force 
constants do not give an adequate description of the 
interactions. 

When the ionic charges were allowed to vary in the 
shell models (LV, V), the charge on the titanium ion 
becomes larger than four, and on the strontium ion 
about one electronic charge. The charge on the oxygen 
ions remains at nearly two. These results suggest that 
the bonding in strontium titanate is more nearly ionic 
than covalent in character. 

There is a considerable reduction in the error of the 
models when the short-range polarizabilities of the posi- 
tive ions are allowed to vary (III, IV). These polariza- 
bilities of the positive ions come out either very small or 
negative, and cannot be readily understood in terms of 
the shell model. These results are exactly similar to 
those found for the alkali halides, potassium bromide 
and sodium iodide.!* In an exactly analogous way to the 
alkali halides, when the titanium ion’s electrical polar- 
izability was allowed to vary (V) the polarizability of 
the positive ion was larger than the crystal polariza- 
bility,?? while the polarizability of the oxygen ions was 
reduced. It was shown! that these surprising polariza- 
bility parameters could arise in the alkali halides from 
the neglect of the quadrupole moments produced on the 
negative ions. A similar explanation can be advanced 
here in terms of the quadrupole moments of the oxygen 
ions. 

A surprising feature of the polarizabilities of the 
oxygen ions is their anisotropy. For example, the elec- 
trical polarizability of the oxygen ions for the 90°K 
model V is 23 times as large in the strontium-oxygen 
plane as in the titanium-oxygen direction. The short- 
range polarizability in the titanium-oxygen direction is 
—1.41, while it is 0.615 in the strontium-oxygen plane. 


3: R. H. Fowler, Statistical Mechanics (C 
Press, Cambridge, 1936), 2nd ed. 

? A. F. Devonshire, Phil. Mag. 40, 1 

33 W. Kinase, Progr. Theoret. Phys. 

3: V, Dvorak and V. Janovec, Cze 

35 J. R. Tessmann, A. H. Kahn, an 
92, 890 (1953). 


y85,427570 41955). 
J. Phys. 812, 461 «1 
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These polarizability parameters suggest that the elec- 
tronic wave functions of the oxygen ions may be con- 
siderably distorted from spherical, and that the shell 
parameters Y; and ks, which were assumed to be 
isotropic, should be different for different directions. 


2. The Motion of the Ions in the 
Normal Modes 


There has been considerable discussion as to the 
nature of the ionic displacements in the transverse optic 
modes of strontium titanate. The polarization vectors 
for these normal modes have been obtained for several 
of the models described earlier, and are listed in 
Table IV for model IV. The lowest frequency transverse 
optic mode is found to be predominantly a vibration of 
the titanium ion against the rest of the structure. 

If the crystal becomes unstable against the lowest 
frequency transverse optic mode of vibration, it might 
be expected that the displacements of the ions would 
resemble the pattern of the displacements in the trans- 
verse optic mode of vibration.” It is therefore of interest 
to compare the displacements of the ions with the meas- 
ured displacements of the ions in tetragonal barium 
titanate,? and these are also shown in Table IV. Con- 
sidering the difference in the masses, sizes and polariza- 
bilities of strontium and barium ions, the displacements 

of the ions in barium titanate are fairly similar to those 
in the lowest transverse optic mode of strontium 
titanate. 


3. The Temperature Dependence of the 
Transverse Optic Modes 


One of the most interesting features of strontium 
titanate is the temperature dependence of the transverse 


TABLE IV. The displacements of the ions in the q—0 modes of 
model IV at 90°K, polarized along the z direction, and expressed 
so that the eigenvectors of the dynamical matrix are normalized 
and orthogonal. The numbering of the modes is in order of in- 
creasing frequency. 


Model IV Ti Sr Ont Or Orr 
‘Transverse 

(1) 0.077 0.022 —0.093 —0.129 —0.129 

(2) 0.097 —0.074 0.035 0.040 0.040 

(3) 0 0 0 —0.177 0.177 

(4) 0.007 0.004 —0.217 0.087 0.087 
Longitudinal ^ 

(1) 0.080 —0.077 0.050 0.065 0.065 

(2) 0 0 0 —0.177 0.177 

(3) 0.057 0.001 0.107 —0.143 —0.143 

(4) 0.075 0.008 —0.207 —0.032 —0.032 


Displacements of 0.053 0.003 
ions in BaTiO; 
transition® (A) 

SS ———— ——É 


a See Ref. 36. 


—0.051  —0.047 


36 B. C. Frazer, H. R. Danner, and R. Pepinsky, Phys. Rev. 100, 
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optic mode of lowest frequency. The difference between 
the parameters of the models which were fitted to the 
measurements at 90°K and those fitted to the 296°K 
measurements are not large, and model VI shows that 
it is possible to obtain the temperature dependence by 
changing the parameters of the short-range interactions 
by quite small amounts. These results show that only 
quite small changes in the parameters of the interactions 
can give rise to the observed temperature dependence, 

The temperature dependence of the normal modes in 
potassium bromide has been investigated both experi- 
mentally and theoretically.” The changes in the 
parameters of the models, which are needed to reproduce 
the changes in the frequencies of the normal modes from 
90 to 400°K, are fractionally very similar to those re- 
quired for strontium titanate. We can therefore con- 
clude that, broadly speaking, it is not necessary to 
invoke any mechanism to explain the temperature de- 
pendence in strontium titanate, other than what is 
present in the alkali halides. 

The temperature dependence of the normal modes in 
sodium iodide and potassium bromide has recently been 
shown to arise largely from the anharmonic interactions 
between the normal modes.*? Quantitative calculations 
of the anharmonic effects in strontium titanate, which 
will be reported elsewhere, suggest that a similar 
mechanism is responsible for the temperature depend- 
ence of its normal modes. 

The anomalous behavior of the normal modes in 
strontium titanate compared with the alkali halides is 
associated with the difference in the dynamical matrices 
of the two crystals. In the alkali halides, the frequencies 
of the normal modes are all fairly similar in magnitude 
(apart from the elastic constants region of the acoustic 
branches). For strontium titanate, on the other hand, 
there is a very large difference between the frequencies 
of the optical modes. At 90°K the ratio of the highest to 
lowest eigenvalue (frequency squared) of the dynamical 
matrix is as large as 400: 1. A change in the parameters 
of only 0.25% will then hardly alter the largest eigen- 
value but can drastically alter the smallest. The 
ferroelectric character of these perovskites then arises 
from the almost perfect cancellation of the strong short- 
range and Coulomb interactions for one particular 
normal mode. A small change in either of these strong 
interactions has a drastic effect on the frequencies of 
this normal mode. 


4. Unstable Normal Modes in the (1,1,0) 
and (1,1,1) Directions 


Several of the models described in Sec. IV show 
instabilities against some of the normal modes of vibra- 
tion propagating along the (1,1,0) and (1,1,1) directions. 
Examples of these are shown in Fig. 8 for models IH 
and V. It is unlikely that these instabilities occur 10 
strontium titanate and in this respect those models 


*' R. A. Cowley, Phys. Chem. Solids (to be published). 
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which show them are unrealistic, However, these in- 
stabilities may well occur in some of the perovskite 
materials which have phases that are slightly distorted 
forms of the cubic perovskite Structure. For example, 
CdTiOs, NaTaO;, Ca'TiO;, NaNbO, show multiple unit 
cells which are 2X22 times the cubic cell when re- 
ferred to the original cube axes while NaNbO; shows a 
4X2X2 unit cell, and PbZrO; shows a 4X4X2 unit 
cell. 

All of the unstable normal modes are rotations of 
groups of the oxygen ions. In models I and V octahedra 
rotate around the titanium ion while in model III 
squares of oxygens rotate about the points (4,0,0), 
(0,3,0) and (0,0,4). A further feature of these unstable 
normal modes is that their frequencies are very de- 
pendent upon the details of the models. For example, 
the normal mode T%;' are unstable for models I and V 
but for model III have a frequency of 2.6 and for model 
IV of 3.5. 

The transitions to the multiple unit cell perovskites 
may well be instabilities against these normal modes as 
suggested by Cochran? Unfortunately the displace- 
ments of the ions in real distorted perovskite structures 
are far more complex than mere oxygen octahedra 
rotations. However, in several of these materials, the 
rotations of the oxygen octahedra are the predominant 
displacements.? The displacements of the other ions are 
not, however, negligible and a detailed treatment would 
be very complicated. The unstable normal mode must 
be strongly coupled both to the macroscopic strain 
parameters and to some of the other normal modes in 
the crystal. 


5. The Models of Other Authors 


Several authors have previously used models for 
the lattice dynamics of perovskite ferroelectrics. Devon- 
shire? used a rigid-ion model with Fowler's?! short-range 
forces to attempt to relate the parameters of his 
thermodynamic theory to the microscopic interactions. 
Recently Dvorak and Janovec™ have calculated the fre- 
quencies of the transverse optic modes of this model, 
and have found the structure unstable unless the formal 
lonic charges are multiplied by a factor 0.16. The ire- 
quencies are not then in agreement with the results of 
neutron or infrared spectrometry. 

Kinase? has used this model, and included the elec- 
trical polarizabilities of the ions, but the results were no 
more satisfactory. 

Last!® suggested that the infrared spectra could be 
explained by assuming that the titanium-oxygen ies 
hedra was a strongly bound complex which was weakly 
coupled to the other anion. As the lowest C 
optic mode is a vibration of the titanrum against the 


33 F, Jona and G. Shirane, Ferroelectric Crystals (Pergamon 


P : k, 1963). ; l 
SER A Ferroelectricity in Crystals (Methuen and 


Company, Ltd., London, 1957). 
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rest of the structure this model is unsatisfactory. 
Rajopal and Srinivasan? studied the frequencies of 
vibration at q—0 when all the Coulomb forces are 
neglected. The high- and low-frequency dielectric con- 
stants of this model are then equal (to unity!) and the 
frequencies do not agree with the experimental results, 
A similar model has been used by Silverman and 
Joseph“! for the transverse optic modes with q=0. 


VII. CONCLUSIONS 


The frequency versus wave-vector dispersion curves 
of strontium titanate for several normal modes propa- 
gating in the [0,0,1] direction have been measured ex- 
perimentally by using neutron spectrometry. These 
results have been used to find the parameters of several 
models of the crystal within the harmonic approxima- 
tion. It was found that quite reasonable agreement with 
experiment could be obtained by using shell models in 
which the ions interact with one another both through 
short-range forces between neighboring ions, and through 
long-range electrostatic forces, determined by the ionic 
charges. The effects of the polarizabilities of the ions are 
also included. The principal features of the more suc- 
cessful models are that the titanium-oxygen short-range 
forces are very large, while the ionic charges on the ions 
are also large and very nearly the formal charges. 

Of particular interest in strontium titanate is the 
temperature dependence of the transverse optic mode 
with lowest frequency. This was measured experi- 
mentally and for the q—0 mode the square of the fre- 
quency was found to be proportional to temperature 
above 90°K, in agreement with the temperature de- 
pendence of the static dielectric constant as predicted 
by Cochran? This result shows that it is a valid ap- 
proach to treat the problem of ferroelectricity as an 
instability of the crystal against one of the normal 
modes. The motion of the ions in this normal mode was 

obtained for several of the models used, and was found 
to be predominantly a vibration of the titanium ion 
against the oxygen octahedron. The displacements of 
the ions at the ferroelectric transition in barium titanate 
are very similar to this. 

It was found that the experimentally observed 
changes in the frequencies of this normal mode could be 
obtained by changing the parameters of the shell models 
only slightly. The fractional changes were very similar 
to those needed to reproduce the temperature depend- 
ence of the normal modes in the alkali halides. This 
result suggests that it is unnecessary to introduce any 
mechanism to explain the temperature dependence of 
this normal mode, other than those which occur in the 
alkali halides. The indications are then that the temper 
ature dependence of this normal mode can be expla 
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in terms of the anharmonic interactions between the 
normal modes of vibration. 

The dispersion curves along the (1,1,0) and (1,1,1) 
directions of the models have been calculated and some 
of these show normal modes against which the crystal 
would be unstable. Although it is unlikely that these 
instabilities occur in strontium titanate, they may well 
be associated with the transitions in other perovskite 
materials. 

The transition in strontium titanate at 110°K has 
been discussed in terms of an accidental degeneracy of 
two branches of the dispersion curves. The experimental 
results from neutron spectrometry show that this de- 
generacy occurs very near 110°K. The anomalous 
elastic properties near 110°K and the smooth behavior 
of the static dielectric constant can be easily obtained 
with this theory. It is suggested that the conflicting 
experimental results on the low-temperature dielectric 
properties, and the temperature dependence of the 
distortion of the crystal can also be explained by this 
approach. 

It is clearly of interest to try to obtain more satis- 
factory models of strontium titanate, in the hope of 
obtaining a better understanding of both the tempera- 
ture dependence of the static dielectric constant and of 
the 110°K transition. Experimental measurements of 
the dispersion curves in the (1,1,0) and (1,1,1) direc- 
tions would enable the parameters of the models to be 
fitted more accurately. The models could then be ex- 
tended by including anisotropic oxygen polarizabilities, 
more short-range forces, and abandoning the assump- 
tion that R=T=S. 

The difficulties in describing the results in terms of a 
harmonic model might then become more serious, and 
it might well prove necessary to use a complete anhar- 
monic theory to describe the lattice vibrations. 
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APPENDIX I 
A. The Characters of the Symmetry Operations 


The representation chosen for the normal modes is 
obtained by displacing each ion in turn along the 
coordinate axes by a unit distance. The characters of the 
different symmetry elements are: 


identity operator 
x(£)=15, 
triad axis 
x (Cs) =0 D 
tetrad axis along the z axis 
x (Ca) 219-2 exp (2g) , 
diad axis along the z axis 
x (Cs) = —1—exp (2.7) 
—exp(i2g,7) —2 exp (2 (qz qy), 
diad axis at 45? to both x and y axes 
x(Csz) - —2— exp(i2q77) , 
inversion operator 
x()— —3[1--exp G2(g:--q- q2)r) - exp G2 (qe qr) 
-Fexp G2 (gy--q2)r)-- exp G2(q4-2)] 
plane of symmetry perpendicular to the z axis 
X(c2y)=2+3 exp(i297) , 
plane of symmetry at 45? to x and y axes and including 2 
axis 
x(ca) - 5, 
inversion tetrad axis along the z axis 
x (S4)  —1— exp (2927) — exp (i2 (g44-q2)7) ; 
inversion hexad axis 
x (Se) => 0 . 


al 


v 


Jllc 


gz 


LATTICE 


Trreducible 
representation 


“C= (00,5). 
Ai 


As 
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B. The Displacements of the Ions in Different Normal Modes 


Normal mode 


Sra, Ti, Ors, Or Orr. 
Ori — Orr 


Srz, Ti,Or, Orrz, Orrr-- 
Sry; Tiv, Ory, Orr;, Orry- 


Tiz, Om Omma 
Se Oje 

Orr: —Orriz 
Tiz, Orrz, Orriz- 
ign Ort Ofw: 
Sre, Orz- 
Ory, On: 


Tiz=Ti,, Sie Sin Or O 
Orrz 7 Orrrs, Orrrz Orr. 
Ori — Orr: 

Tiz=—Ti,, Srz— —Sry, Orz- —Ory, 
Orrz 7 —Orrr,, Orrrz7 — Orr; 

Tis, Sra Or; Orr-2 Orri-- 


Orr--Orir,- 
Orriz7 —Orty. 
Orrz 7 — Orr. 
Orrrz 7 Orry. 
Sn 
MAOR 
Or: 
Onn: 

Die Sry Ore 

1b SD Ory- 


Sr. —Sr,—Sr,, Tir=Tiy=Tiz, 
Orrrz 7 Orr, 7 Ors, Orrry 7 Orrie 
—Ornz-O0r:-0r 70r. 
Orz Orr - Orr; 7 —Ory 
=—Onz=—Onr:- 
Ti,=—(Ti,+Ti.), Sr: 2 — (Sr,4- Sr), 
Onte=—(Ony+Or:), Or-7 — (Orr, +O), 


Irreducible 
representation 
C= (6,60) 
As Orr; 
Ti:= —Ti,, Sr, —Sr,, 
Or 7 —Orr;, Or 
= —ÜOrris; Orr — Ory. 


Normal mode 


— (Orzy-FOrrr)- 


r^ Or: Orr, 7 Orre 


T Or; Orry7 — Ori. 
Or: —ÓOrr,. 
Fa; Or; —Orr.. 
O1,7 —Orr. 
Orrz 7 —Orrrs. 
T5 do 
du 
Tus 
lis Nx, Or ss Orye- 


Sry, Or: s Ori. 
Sra Orrz 7 Orrrs. 


Ai Sra Ons Oti 

Ay’ Orn. = —Orry. 
As Or. = —Ontty. 
A» Ti, Orriz 7 Orr,, Ore. 
As Tiz, Sty, Ore, Orrrz. 


Tin, Sre, Ory, OTe 


X, Tis, Sr, 2 Sr,, Orr: - Oris, 


Or, 7Or-. 


23 Sr.— —Sr,, Orrrz7 — Otte, 
Ory7 —Or.. 

2 Ti Ti, Sr. Ors Orr, 7 Orrrz 
Orrz 7 Orrrs. 

Ss Tiz=—Tiy, Ont, = —Ony, 
Orrz— —Orrr,- 

€= (2,0,0) 

Sy Tiz, Sr; Ore, Orrz, Orts 

2 Sry, Orry- 

z : 

-3 
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C. The Compatibility Relations 
= (0,0,6) with = (0,0,0): 
T35— Ag As, Tas— Ag As. 


t= (00,7) with ¢= (0,0,7): 
" A1— My, Ai M y', A»— Ms, As— Ms, As— My. 


— €— (550) with £— (0,0,0): 
T35—21--Zs-24, L'o5— Zi Z3 - 22. 


o t£ (6,0) with ¢= (3,3,0): 
Mi—Zy Mo—Zs, M/—Z4, Ms— Ds, M4 — Za, Mi— Zi, 
b. Ms—DetDa, M,/— D1 Es. 
= (Es) with {= (0,0,0): 
Lt Tys—AatAs, T55— Aot As. 


t= 655) with Ge 222): 


Pos — Ax- As, Lis—AitAs, les—AtHAa, T? — A, 
Ti — As. 
E ¿= (252539) with = 2550 JE 
Mg M3— Ar, My — As, Ma— Ao, Ms — 
Mss’, Ms— As, Mi — As. 
ip: 


S£) with ¢= (2,7,2): 
/— Ap’, Ti — AtA, I'5— AtA, Tis — At As, 


R. A. COWLEY 


= (5653) with = (0,0,3): 


My—3y, My —Z4, M3—Z3, My— Zi Za, My — Ej EX. 


= (6062) with C= (52,2): 
T» —Z4, Fi —Zz- Z4, Vg Zr Ze 25, 
T454—Z14-Zi Z4, To Zr Xe ZA. 


t= (3,0,£) with ¢= G,0,0): 
M4—2Z3, Mz —Zs, Ma—21, Ms—Zs--Zs, M? —Zı +2. 


c= (3,0,£) with ¢= G,0,2): 
M,—21, Mo—Zs, MZ — Za, M3— Èi, M; — 
M5;—Zs-2a, My —Z4-Zs 


Èo, M4—23, 


APPENDIX II: THE CONTRIBUTION OF THE 
SHORT-RANGE FORCES TO THE 
MATRIX R(KK’) 


The complete (1515) matrix can be split up into the 
(5X5) matrices for the (xx), (yy): +- components. In 
this appendix the (xx) and (xy) components are tabu- 
lated, in terms of the force constants introduced in 
Sec. II.2. Cz, Cy, Cz are the cosines of gar, qyr, qz, while 
Sz, Sy, S; are the analogous sines, and 2r is the lattice 
parameter. 

The matrix Rz in units of &/v is 


Ti Sr Or Orr Orr 
As1-2B; 0 —BC: —BC, —AsCz 
0 2A,+4Bi — (Ai4-B3)C4C, — (Ai4-B3))C.C, = IBCs 
— BC, —(ArFBjC.C, 41+ Bit B.+A;4+3B; —2B:;C,C: — (Ast Bs)CiCz 
o B:C, —(41+B))C.C, —2B;C,C, A; +B,+B.+A3+3B;3 — (Ast Bs)CCy 
Om —A4.C: —2B,C,C, — (4s-B3)C.C; — (4s4-B3)C4C, 2Bi- As 24st 2Bs 


The mo trix Rz, is, in units e/v: 
Ti Sr 


0 0 

0 0 (d 
mo (41—81)548, 

0 0 


B3)S5S, 0 0 
0 0 
0 (43—Bs)S Sy 


(43—B3)528, 
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APPENDIX III Ti-Ori 
TABLE V. Coulomb coefficients. T »y = ay a e 
oulomb coefficients. 05 0 0 0 o 0 " 
= (00,7) ) —9436 6.240 3.196 2.003 © 0 
huc eee i cca ee ie 6160 3.779 0 O0 
5 = = | O 07523350 tara? 8.618 5147 O O 
Ti-Ti Ti-Sr O1 —26838 16.570 10.268 5.997 0 0 
pa 5 & 1 
wae " M Mese - ve 3 27.988 17.135 10.852 6.28 0 © 
0.5 —4844 —4841 9.687 05 0 0 0 Sr-Or 
Ne B. E 9.563 0.4 —1.281 — 1.281 2 | t xx yy zz xy yo xz 
3 —4.618 —4.6 9.236 0.3 —2443 —2.443 4.286 " 2 x B Ps 
0.2 —4416 —4416 8832 02 —3.375 —3.375 6750 | 5 0 0 0 17.060 0 9 
0.1 —4.252 —4.252 8.503 0.1 —3.979 —3.979 7.958 | 04  —O0210  —0210 0420 15.924 0 0 
0 —4.189 —4.189 8378 0  —4189 —4.189 8.378 | as bete E 1 on E o 
. — bd. =LA A In Es 
: 010500957 05 008898677071 4044 7220 0 0 
Ti-Or Ti-Orrt 0 —2.239  —2239 4419 6283 0 0 
3 Ei yy zz € xx yy zz SrOin 
05 0 0 0 0.5 —33.622 10.323 23.299 ; 3 3 š ; 
0.4 3.339 3.339 —6.678 0.4 —33.684 10.373 23.311 s vu » gs TE UNE 
0.3 6.359 6.350 —12.717 0.3 —33.845 10.505 23.341 | 0.5 0 0 0 0 0 
0.2 8.765 8.765 —17.530 0.2 —34.046 10.668 23.377 | 0.4 4013  —1535  —2477 1892 0 0 
0.1 10.316 10.316 —20.632 0.1 —34.208 10.801 23.407 | 0.3 7.22) —2420 —4.792 3:568 0 0 
O 10.852 108532 —21.704 0 —34271 10.852 25419 | 02 9316  —2381 -674 4991 0 9 
1 10423 — —2.370. 8053 5946. 09980 
Be: SRM 0 10:762 —2:240: 8.503 6285 
is xx yy zz € xx yy zz EU n E PEE EE an 
0.5 —7.996 —7.996 15992 05 0 0 0 $= (5.5) 
0.4 —8.046 —8.046 16.091 0.4 1.370 —2.619 1.249 | === 
0.3 —8177 —8.177 16.354 0.3 2.614 —4.989 2.376 ae 
0.2 —8.340 —8.340 16.679 0.2 3.610 —6.880 3.271 
0.1 —8.472 —8.472 16.944 0.1 4.255 —8.101 3.846 G zx yy zz xy ys xz 
0 —8.523 —8.523 17.04 0 4.479 —8523 4.044 | gs 0 0 0 0 0 0 
- = 0.4 0 0 0 0.283 0.283 0283 
0.3 0 0 0 1170 1.170 1.170 
o 0.2 0 0 0 2499 2499 2499 
C= (65,0) 0.1 0 0 0 3.706 3.706 — 3.706 
- 0 0 0 0 4.189 — 4.180 — 4.189 
qu Ti-Sr 
s y d E. v ee ig xx yy 55 xy yz zz 
0.5 2.677 2.677 —5.354 0 0 ORE 0 0 o 0 0 0 
0.3 RE mi 24120 Gm 103 0 0 0 4.6855 — 4.655 — 4.655 
0.2 2.313 2.31 m 394 0.2 0 0 0 4.934 4.934 4.934 
0.1 2.157 2.157 —4.313 5.787 0 0 041 0 0 0 4472 4472 4472 
0 2.094 2.094 —4.189 6.283 0 0 0 0 0 0 4.189 4.189 4.189 
Ti-Sr Ti-Ori 
t a yy zz xy yz az e xx xy zz ay yz xz 
i 0.5 0 0 0 0 0 0 
0.5 0 ADAE EI 1090 0 0 |04  —086. 4930. 29301 1:121 Ost 
0.4 0.196 "U13  —126 8840 0 0 |03 18557 9278 92718 2.108 0.987 198 
0.3 0:13 EIS X50 75155 0 o |02 25.047 1254 125234 3178 2404 3.178 
0 i 190 1:890 “3790 639 0 0 | 01 -—28856 14428 14428 3911 3686 3911 
0 2.094 2094  —4180 6283 0 0 0  —30082 15.041 15041 4.180 4189 4.189 
Sr-O1 
Ti-O1 & xx yy zz ^ ay yt zz 
t xx yy as me MT phe! OS 0 0 0 MAS oe 
a 3 FED. ee 0 0 0 |04  —0353 -—0.53 1107 13.462 466 
OS T MEE S T31 Quo 0 0 |03 —1858 —1353 3715 10842 1618 1618 
04 16008 ot 33482 2426 0 o |02  —3189. —31489 6310 TL641 2.008 X008 
03 16.741 — 1674 “330 4353 0 0 |Ort  —4052 4052 S04 5:105 3851 m22 
02 EHO 16919 = 571 0 0 | O 4334 —4334 8668 4180 4180 4189 
0.1 ur Goa swa : 
0 17.135 17135  —34271 ; 
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numbers are multiplied by (c?/2)ZeZx: they give the Coulomb contribu- 
tions to the dynamical matrix. 
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A previous calculation of the effect of an external, uniform 
ciated with a direct transition between bands is extended. The 


steps whose width is determined by the separation of the discrete levels produced by the field. 


N a previous calculation! (which will be referred to 
as D), it was shown that the discrete levels which are 
formed when a uniform electric field is present in an 
insulating crystal? produce a periodic structure in the 
energy dependence of the absorption due to interband 
transitions. The expression for the absorption coefficient 
which was given in that paper is a slowly convergent 
Fourier series which cannot easily be interpreted. It is 
the object of the present work to transform this series 
in such a way that the energy dependence of the optical 
absorption is evident. I hope that this work will help 
to stimulate experimental investigation of the discrete 
levels. 

An infinite crystal containing a uniform electric 
field &, in the x direction is considered. From Eqs. (14) 
and (21) of I, the following expression is obtained for 
the absorption coefficient o, produced by a direct 
transition between bands 7’ and m (the notation con- 
forms to I as far as possible): 


2i 
TES 


Faroneoc =| 


2 


Ca) 
c4 4l nn (T 
(27)? ; 


eo dk, 
432» | ——|Man (a0) |? costal | > Q) 
isi J Qn)? 


in which x is the width of the Brillouin zone in the x 
direction (which coincides with a reciprocal lattice 
vector); ki represents components of the wave vector 
perpendicular to the field; Go is the magnitude of the 
vector potential of the radiation field (which has circular 
frequency o); 7 is the index of refraction, and ej is the 
permittivity of free space. In addition, F=e6, and 


K 
X9 — (Ann! — 1o) , 


2a 
2mF Qu) 


where 
«lB 


Ann (ki)=«7] [En (Raku) —Ew (R2,kx) dk. 


—x[2 


(2b) 


* Consultant. Permanent address: Department of Physics, 
University of California, Riverside, California. 

1J. Callaway, Phys. Rev. 130, 549 (1963). Other calculations 
of the effect of an electric field on optical absorption have been 
reported by: W. Franz, Z. Naturforsch. 13, 484 (1958); L. V. 
Keldysh, Zh. Eksperim. i Teor. Fiz. 34, 1138 (1958) [English 
transl.: Soviet Phys.—JETP 7, 788 (1958) |;and K. Tharmalingam, 
Phys. Rev. 130, 2204 (1963). 

2G. Wannier, Phys. Rev. 117, 432 (1960). 
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in an Electric Field 
electric field on the optical absorption asso- 
absorption is shown to consist of a series of 
The matrix element Mn» is given by 
; 2e G8 5 ; k 
M nn =— (E Paw) Ai (aß!) , (3) 


WK 


where pnn is the usual interband matrix element of the 
momentum and e is the polarization vector of the 
radiation. Both bands are treated in the effective mass 
approximation, so that the energy difference between 
bands is 

E,—En = E+ ((282/2u). (4) 


In this equation, E, is the energy gap at k=0, and pis 
the reduced mass for the two bands: u 1— m; 1+ Mart. 
We also have 


B—2uF/Ih; o=F(E,—hot?k2/2u) , 
and Ai is the Airy integral, which is defined by 


1 
Ai(s)=— 


7T 


] cos(sz-+s3/3)ds. 
0 


Other quantities which are not explicitly defined above 
have their usual significance. 

In I, approximate expressions were worked out for 
the terms of the series, Eq. (1). It is, however, evident 
from Eqs. (1) and (2) without approximation that 
whenever the photon energy fiw increases by A (fw) with 


A (iw) — 2m F /k , (5) 


all the cosines return to their original values. This 
energy change is just the separation between the dis- 
crete levels so that the absorption must contain some 
structure with this period. In order to investigate this 
structure, we proceed as follows. The sum of trigono- 
metric functions is evaluated using the identity? 


LDS canhy=e 935 NOI (6) 


t=1 m=—o 


Then with the use of (3) 


2u Ga 
a=K— | dk, Ai? (of! e| 12 > costa | 
wh! 


l=1 
= (4K u/6'%0) Y | dk, Ai? (oB"8)8(Qmx0—20j), (? 
j= 


3G. Goertzel and N. Tralli, Some Mathematical Methods, of 
Tyit (McGraw-Hill Book Company, Inc., New York, 1960), 
p. 123. 
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in which 
K —2e|e- Dane I2/z72m*nesc. (8) 


From Eqs. (2) and (4), we obtain for Xo 
2rxo=K (e-I-x8715) , (9) 
in which 
X= K°/126?!2 , 
Then we change the variable of integration in (7) 
to y=of!8 and obtain 


Ta^ K ys? B o. f 
gelu [^ AP) 


OK j——- j 


Xé[y--x— Qv j8'*/«)]. (10) 


If we allow k, to vary between 0 and €^, then y varies 
from a lower value 
p=; R-QuA)E- 1e) — (M) 


to ©. Note that R? may be either positive or negative. 
We then find that the absorption coefficient is 


Ax? Kup? us 25 jg /2 
OCHO ar( -x) Pisa 
0 


CK i=j K 


o 
o 


ABSORPTION COEFFICIENT, C (m |) 


5 1.60 
10 0.64 0.96 1.28 
o. Eo 0.0 0.32 , 
64 0.32 Sores SU 


5 : i rn as a function 
Fre. 1. The optical absorption coefficient a 15 shown asa iu 


of fi — E, as calculated from Eq. (12) using numbers abe 
to the transition between the heavy i 
band in gallium arsenide. For the purpose o 
large field, 10° eV/m has been chosen. 


-hole band and the conduction 
illustration, a very 
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ABSORPTION COEFFICIENT, a (m^!) 


-0.06 -0.04 -0.02 0.0 0.02 0.04 0.06 
Ta-£g(ev) 


Fic. 2. The optical absorption coefficient is shown as a function 
of energy in the weak-field limit, Eq. (18), neglecting the discrete 
levels. Numbers appropriate to gallium arsenide, including both 
light and heavy holes, have been considered. Each curve is marked 
by the value of the field F in electron volts per meter which has 
been used in the calculation. The field is oriented along the [100] 
crystal axis. 


in which the lower limit of the summation jo is deter- 
mined by the condition that js be the next integer 
larger than qo, where 


(27-qo8!!?/x) — x — RYB.. (13) 


Let us interpret this result. Observe that the argu- 
ment of the Airy functions in (12) is independent of 
energy. The energy dependence of the absorption 
coefficient is contained in the external factor o~! and 
implicitly in the lower limit of the summation. As the 
photon energy fiw increases, R? decreases, and conse- 
quently yo and jo decrease. For each decrease in fo, a 
new term appears in the sum in Eq.(12). Consequently, 
a graph of the absorption coefficient a as a function of 
energy resembles a staircase with slanted steps. The 
width of each step is the energy required to produce a 
unit change in jo. This is easily seen to be given by 
Eq. (5). Hence, the width of a step is just the separation 
of the discrete levels. The height of a step is proportional 
to AP(K8?5), evaluated at the energy at which the 
jump occurs. The downward slant of each step is 
determined by the factor «^. 

For large positive values of 78^, the change in a, 
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Aa, produced by bringing in one more term (the height 
of the step), is easily obtained by using the asymptotic 
form of the Airy function: 


Ai(s)= (1/270"?2"4) exp(— 223). (14) 


Hence, in this region 


Aa= (m KB/exR) expL—4R/38 ] 


E 2u ] 
m I2(E,— ho) 


4 /2uN M? (E, — fio? 2 
eof "EE ). as 
3\ i? F 


Thus, the height of the steps decreases rapidly with 
increasing E,—7&e and decreasing field. For large 
negative values of R?8-? we use 


Ai(—z)- (1/1214) sin (223/24 1/4) 


(16) 


and find 


ArKF 2u NE 
E 
xo Lf? (fio— Ej) 


T 2/2u 12 (hw— E, T E 
xs (2) a] . (QU 
3NP F 4 


These results are illustrated in Fig. 1. 

The mathematical argument may be concluded by 
observing that the weak-field limit of the absorption is 
obtained by replacing the sum in (12) by an integral. 
Put 


z— 2m 385 /k—x , 


CALLAWAY 


and let z be continuous. Then we obtain in place of (12) 
a= ck] Ai? (z)dz 


Vo 


d Ai) " 
-amika ( — yo Aton |, (18) 
dy Vo 


where yo is given in Eq. (11). This is equivalent to an 
expression given by Tharmalingam,! and may also be 
obtained by discarding all but the constant term in 
Eq. (1). Expressions valid in the limit of large positive 
or negative values of yo may be obtained by inserting 
the asymptotic expansions of the Airy functions 
Eqs. (14) and (16) into (18). The results of this can be 
obtained in an obvious way from Eqs. (23), (29), and 
(31) of I, and also from Tharmalingam’s paper, so they 
will not be repeated here. However, since the weak-field 
absorption coefficient has not been exhibited in the 
region near the band gap in which asymptotic forms 
cannot be used, we present in Fig. 2 the absorption 
coefficient as calculated in this approximation using 
numbers appropriate to gallium arsenide. 

Finally, we should note that although the detailed 
results of this calculation depend on the use of the 
effective-mass approximation, Eq. (4), the existence of 
structure in the absorption constant with the period of 
the separation between the discrete levels is rigorous. 
In addition, one expects the effective-mass approxima- 
tion to be valid in the region near a band gap which is 
accessible to experimental observation. The quantity x, 
which cannot be calculated reliably in this approxima- 


tion since it involves an integral over the entire band . 


structure, acts only as a phase angle in (12). Therefore, 
uncertainty as to the value of x does not affect these 
results in any essential way. 

I am indebted to Dr. P. E. Kaus for a valuable 
discussion. 
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The value of the exchange integral J between pairs of donor centers in silicon has t 


been investigated by 


looking at the ENDoR spectrum of phosphorus-doped silicon in a range of concentration of the order of 


8X10" per cc. The microwave field is saturating the cluster electronic line. The line sh 


ape of these ENDOR 


lines gives an unambiguous determination of the sign of this exchange integral which turns out to be anti- 
ferromagnetic. It is also possible to determine an order of magnitude of J and to have an idea about its dis- 
tribution for two samples. We are able to understand all the features of the ENDOR spectrum at least in a 
qualitative way by calculating all the energy levels using a second-order perturbation calculation. A line 
coming from an ionized phosphorus center, weakly coupled toa pair of neutral phosphorus is also identified. 


I. INTRODUCTION 


HE substitutional donors like phosphorus or 
arsenic at low concentration in silicon give rise 
to a paramagnetic resonance spectrum! of 27+1 lines, 


LT 


Fic. 1. Electron-reso- 
nance spectrum of phos- 
phorus-doped silicon; 
8X 1015 P/cm? at 1.3°K. 


where J is the nuclear spin of the donor (J=4 for 
phosphorus). When the concentration of centers is 
above’ 6X10!§ per cc, new lines appear between the 
(21--1) lines of the low-concentration spectrum’ (Fig. 
1). Slichter? gave an explanation for the occurrence of 
these satellite lines. When two centers are close, there 
is an overlap between the wave functions of the two 
electrons, giving rise to an exchange integral, which 
may be written as /Js;*s», where Si and s; are the 
electronic spins of the two electrons. We limit ourselves 
to phosphorus donors from now on. The spectrum is 
analyzed by diagonalizing the Hamiltonian for a system 
of four spins 4, two electronic spins Si, $2 and two 
nuclear spins I, and I. If the value of J is larger than 

the hyperfine interaction A,‘ the total electronic spin 

defined by |S|?=|s:+sz2|? is nearly a good quantum 

number. We have two electronic states; a triplet S=1 

and a singlet S=0. 


external magnetic field; we have three electronic sub- 
levels M,=+1, 0, and —1. Each of these sublevels has 
a fourfold nuclear degeneracy, partially lifted by the 
hyperfine coupling and the nuclear Zeeman energy. 
The energy level of the singlet state crosses one of the 
triplet sublevels for a certain value of the external field 
(Fig. 2). As the hyperfine coupling has matrix elements 
between singlet and triplet states, in the vicinity of the 
crossing, there will be large change of energy levels 
and wave functions. By studying with the ENDOR 
technique, the structure of the nuclear sublevels, and 
the magnitude and the sign of J will be determined. 
The microwave field at 9206 Mc/sec is saturating the 
cluster electronic line, therefore we select nuclear spins 
of pairs of exchange-coupled phosphorus nuclei (Fig. 1). 
In a given sample, there is a distribution of values of J 
due to the random distribution of phosphorus centers. 
We have been able to determine the value of J which 
has the largest probability for two samples with different 
phosphorus concentrations. 


Il. THEORY 


1. First-Order Perturbation Theory 
of Energy Levels 


We assume that the overlap of the wave functions 
belonging to two neighboring phosphorus atoms, leads 


Fic. 2. Energy lev- 
els of the system of 
two electronic spins 
in the static mag- 


Zeeman energy kvels 


: te ac lift zan netic field coupled by 
The degeneracy of the triplet state 1s picster the exchange inter- Jl 

1R. C. Fletcher, W. A. Yager, G. L. Pearson, A. Ne CE action. | 
W. T. Read, and F. R. Merritt, Phys. Rev- 4 E 
RC. Fletcher, W. A. Yager, GL. Pearson eA QUE 

Y Rey 95, Mt Ceetrher, and E. A. Gere, Phys. Rev. 100, T 
1784 : = TS 

: COSS ter, Phys. Rev. 99, 479 (1955). Magnetic field H, 

* A. Abragam (unpublished lectures). A1001 


CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


— o | 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


A1002 D. 


m e 
Inti,» A? 
T rins» b» _A? 
! (V) 
hid; eee 
4Ye V+) 


A 
lot ay 

A? 

L CD ltoips? 47 


A: 
Hot) - ay, 


OAAS 
l3t4» +g 


Az A 
lSto»lS> -1 +r) GRA) 


JÉROME AND 


J. M. WINTER 


Fic. 3. Second-order shift of 
energy nuclear sublevels in the dif- 
ferent triplet and singlet electronic 
states (not to scale). The good 
wave functions in zero-order ap- 
proximation are noted beside each 
level. We assume an antiferromag- 
netic coupling lower than ve. 
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to a coupling between their spins //si*$». The total 
Hamiltonian will be written in frequency units. 


K= ve(5151-522)--J $1 SotA (si-Li+- $3: 12) 
+ y7(11-+T12:) J (1) 


where v, and vz are the electronic and nuclear Larmor 
frequencies, respectively. J is the exchange coupling 
value which has been measured in cycles per second.* 

To a first approximation, taking into consideration 
only the first two terms of the Hamiltonian (1), the 
total electronic spin is a good quantum number: 


|S|?=|sit-se|? and S.=si.+S2 


define good quantum numbers. 
The Zeeman electronic Hamiltonian removes the 


degeneracy of the triplet state S=1 and gives the three 
following electronic states: 


|T), M.=+1; | T9; M.=0; 


The singlet state |S), S=0 is diamagnetic. In the 
spin multiplicity S £1, s; and sz may be replaced by 
S/2. This gives for the Hamiltonian of the whole 


Mr Ameis 


system: 
30 »,SH-3J4-3AS- (li--Db)- rrt). (2) 
In lowest order, the energy of (2) will be given by 
Ey,oim73J M vM. 34M. mid ma)-vi(mid- m2). 


6 From now on, we assume 4 positive value for J. We shall see 
further that this hypothesis will be verified. 


EM 
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In the Hamiltonian (1), ve and A are positive quanti- 
ties and vr is negative for a positive nuclear moment as 
that of P?!, The first-order energy levels are shown in 
Fig. 3. The electronic transition frequencies AS —0, 
AM ,—2c1, Amı= Am:=0 are the following: 


v=v H44 (mid ms). 


The electronic resonance spectra exhibit two lines at 
frequencies v,--14 which are not observable, due to the 
lines of the isolated donors and a line at the frequency 
ve twice as intense corresponding to --7— 0.^^ 

We use the same considerations for clusters of three 
phosphorus atoms in the spin multiplicity S=% which 
is completely symmetric with respect to the three spins 
Sı, Se, and ss. We obtain for this case $1—$5— $4— 15. 
The only electronic frequencies are y,2-24 and veteA. 


The two eigenstates of spin S=} give electronic, 


resonance frequencies which vary from one cluster to 
another giving rise to a continuous background as has 
been shown by Abragam.^ 

The only discrete lines for four-atom clusters are 7e 
vettA, ve-tiA. A line at frequency ve broadened by 
hyperfine interaction with Si® spins appears only for 
clusters of an even number of phosphorus atoms. 

At a temperature of 1.3°K the electronic resonance 
spectrum shows lines at frequencies ve+%A which 
merge almost completely in the background line. W e 
conclude that the number of four-atom clusters 15 
negligible in the samples (Fig. 1). 


«pP WwW 


* 


| 
| 
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ELECTRON SPIN 


| D1,to) > |T nt) (Tnt) — | Tih) 
A? 
—— y +—___. 


4(ve—J) 


|T_1,h) > |T ito) ITut) — |Ty,t 


2. Second-Order Perturbation Theory 
of Energy Levels 


There are nondiagonal matrix elements of the hyper- 
fine structure Hamiltonian 3ep— A (L-s;--Ls;) be- 
tween different triplet states and between singlet and 
triplet states. The Oz component of the total spin 
F,=S:t+lh:+Ie, and the total symmetry of wave 
functions with respect to interchange of two atoms are 
good quantum numbers. We defined a total parity 
number Pas negative when the sign of the wave function 
changes by simultaneous permutation of the two 
electronic spins and the two nuclear spins. The per- 
turbed states will be classified according to the two good 
quantum numbers as indicated in Fig. 3. 

The coupling of nuclear spins in every electronic level 
gives three symmetric triplet states |4), |£o), and |4.), 
and an antisymmetric singlet state |s). The even- 
interaction 3C p couples states which have same quantum 
numbers F, and the same total parity. This is true for 
any order perturbation calculation. 

Starting with the eigenfunctions of zero order (Fig. 
3), we obtain immediately the nondiagonal matrix 
elements from which one can obtain the shift of energy 
levels. 

We present the calculation of the shift for triplet 
states | 71) and | 7). We deduce shifts of | Z_1) levels 
by exchanging v, in — v, in the formula. 

(a) | T,4). This state is not coupled with any other 
state triplet or singlet because |7,4) is the only state 
with F,—2 and there is no change in its energy beyond 
the first-order value. 

(b) | Tuto) and |Ts,s). | Tto) is coupled to | To,fo) and 
|T3,5) to |.S,5) by the nondiagonal matrix elements. 
"There are no diagonal matrix elements for any of these 
States, 

(T319|3€;| Ty) —A/2 , 


(Tys|365|.5,5)— z4/2- 
The second-order shift of the state | 71,5) is 
E-— A2/A(veT-J), 
and that of state | T1,/o) is 
E=A 2/4ye . 


(c) | T1,-1)- This state is coupled to | Toto) and | S,s). 
Its shift is 
A2/4y_+A2/4 (Het J) A 


RESONANCE ON 
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TABLE I. Allowed nuclear transitions and ENDOR frequencies. 


INTERACTING DONORS IN Si A1003 
A 43 At At 
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2 4(v,--J) dve 4J 


(d) | Tuto) and | Ts). The same method as that of 
Sec. 2(b) does not remove the degeneracy between the 
two states in second order. To that order both states are 
shifted by the same amount, 


E-AUAJ. 


(e) |To) and | TL). The shifts are respectively 
A*/4y, and — A?/4y,. This computation is only valid 
under conditions: A«&y,, J, ye—J,v-+J. It is not valid 
at the crossing of energy levels. This restriction is im- 
portant as we shall see further. 


3. Line Shapes and Frequencies of 
Nuclear Transitions 


(a) The coupling with the rf field inducing an 
electronic or nuclear transition is an operator propor- 
tional to 


a(s1z- 502) +8 (132-2) . 


The selection rules for electronic or nuclear tran- 
sitions are thus AP, —-E1, AP=0. With a perturbation 
like 5127-52. or Jız+I2; the total parity is preserved. 
The allowed transitions are shown in Table I and Fig. 3. 

(b) From Table I, it appears that nuclear lines are 


J Value 


Fic. 4. (a)-Theo- 
retical dependence 
on J of the ENDOR 
frequency in the 
group 4A+y, and 
estimate of J distri- 
bution in dashed line. 
(b)-Experimental 
line shape for sample 
B. 


Electronic signal 


Endor frequency 
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J Value 


Fic. 5. (a)-Theo- 
retical dependence 
on J of the ENDOR 
frequency in the 
group j4—»; and 
estimate of J distri- 
bution in dashed 
line. (b)-Experimen- 
tal line shape for 
samples A and B, 
respectively, show- 
ing the frequency 
gap. 


fs 
Endor frequency 


AR eas 
2 Vte 


Electronic signal 


Endor frequency 


split into three groups, two whose frequencies in first 
order are 4A—yv; and¥4A+y7, and one whose mean 
frequency is near the Larmor frequency of a bare 
phosphorus atom. We assume now J>0. The values 
of the coupling constant are distributed_in the clusters 
with a distribution function P(J) normalized to unity 
when J is varied from zero to infinity. It is assumed to 
have a maximum for a value J=Jp. 

The 14--»r group includes two frequencies." One is 
J-independent $4--»r— (4?/4v,), the other is J-de- 
pendent £4--»r—[.4?/4(v.4-J)]. The distribution of 
J produces a distribution of the latter frequencies (Fig. 
4). The +A —vr group always includes one J-independent 
frequency line. The others show a singular energy 
denominator for J=ve. Thus, a second-order pertur- 
bation-frequency treatment is no longer valid for such 
clusters. However the frequencles of these clusters 
(J) far from $4—vr, are not observed. It is im- 
portant to notice that there is a forbidden frequency 
band of width 42/4», in the 424—»r group. Such a 
phenomenon does not occur in the 3A+ v7 group as- 
suming an antiferromagnetic coupling (Figs. 4 and 5). 
The |vr| group includes two lines whose frequencies are 
J-dependent (Fig. 6). 


III. EXPERIMENTAL 


1. Principles of the Experiment 


— A microwave field at the frequency v,— 9206 Mc/sec 
is applied. The populations of the electronic levels 
responding to the nuclear sublevels m-+m:=0 are 
efore equalized. Then a rf field of frequency v in 
he vicinity of one of the nuclear-transition frequencies 
so applied. This field produces a change in the 
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population of one of the states with m-+m:=0 and 
the electronic signal varies. By sweeping the radio- 
frequency » we reach all the nuclear frequencies 
possible. For a given » the change of the electronic 
signal is very likely proportional to the number of 
clusters having the nuclear frequency transition v. For 
instance, if we look at the ENDOR frequencies vr-- (4?/ 
4v,)-- (42/4J), the spectrum reproduces the distri- 
bution of the values of 4?/4/ in our sample. 


2. Experimental Technique 


The frequency of the electronic spectrometer is 
stabilized by a Pound discriminator on an external 
cavity. We use a constant-level symmetric detection." 
All these experiments have been done at 1.3?K and 
because of the length of the electronic relaxation time, 
the dispersion mode is observed. The signal-to-noise 
ratio for the ENDOR signal on clusters varies between 


3 and 5. We use two samples of phosphorus-doped 
silicon from Merck; 


N-—8X10!5P/cc p=0.110-cm, 
N=6X10'§ P/cc p-—0.1390-cm. 


Sample A: 
Sample B: 


3. Experimental Results 


Taking for the phosphorus nuclear Larmor frequency 
the value |vrz|=5.67 Mc/sec and for the hyperfine 
constant the value? A=117.53 Mc/sec, the nuclear- 
frequency transition calculated in a first-order approxi- 
mation is given by 44—vz=64.11 Mc/sec, $4--»r 
=53.09 Mc/sec, and |vz|. A typical second-order 
correction is given by 42/4v,— 0.375 Mc/sec. 
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Fic. 6. (a)- Theoretical dependence on J of the ENDOR frequency 
in the group |v;|. (b)-Experimental line shape for sample A. 


6 G. Feher, Phys. Rev. 114, 1219 (1959). 
7G. Feher, Bell System Tech. J. 26, 449 (1957). 
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The ENDOR line in the vicinity of y= 1A — vr shows 
two peaks separated by a frequency gap of the order 
of 4?/4v,. This fact shows without any ambiguity that 
the singlet level crosses the group of M, —1 sublevels 
and the coupling is antiferromagnetic (Fig. 5). 

The line for M ,— 1 sublevels shows only a broadening 
(Fig. 4) coming from the different possible value of 
the second-order shift 42/4(v,4- 7). 

The ENDOR line for the M,—0 group of levels (Fig. 
6) occurs at the frequencies v given by v= |vr| -- (42/ 
4y.)= (A?/4J). 

As stated above, the shape of the line reproduces the 
distribution of 4?/4/. If we call Jp the value of J 
which gives the maximum probability for A2/4J, we 
deduce 4?/27 p by measuring the distance between the 
maximum of the two peaks. For the sample A we find 


A?/2J p—0.94 Mc/sec. 


Knowing the distribution for A?/4J, it is possible to 
deduce the function P(J), giving the probability of 
finding a given J and in particular, the value Jp giving 
the maximum value for P(J). The results are 


Sample A: Jp-15 0002-3 000 Mc/sec, 
Sample B: Jp= 13 500--3 000 Mc/sec. 


The shape of the two other lines can be discussed in 
a qualitative way. Figure 5 shows the lines 34 —»r for 
the two samples; for the sample A the low-frequency 
peak is more intense than the high-frequency peak. 
The result is opposite for sample B. 

Since the low-frequency peak corresponds to values 
of J larger than v, and the other peak to values of J 
smaller than ve, this result is consistent with the fact 
that Jp is smaller in sample B than in sample A (Fig. 5). 

The width of the experimental frequency gap is 
smaller than 4?/4v,; the time constant due to electronic 
saturation is probably reducing the experimental gap. 

Coming back to the line at v; we notice (Fig. 6) the 
two broad lines. The intensity of these lines is very 
small in sample B. The width of the line, being of the 
order of 4/47 p, is larger for sample B and is responsible 
for the small signal. 


IV. INFLUENCE OF IONIZED CENTERS 


A very narrow line (10 kc/sec) occurring exactly at 
the frequency |vz| is observed in the ENDOR spectrum 
of both samples (Fig. 6). Feher® already noticed the 
occurrence of this line in the ordinary EN DOR spectrum 
of a compensated sample and attributed it to an ionized 

center. : 
d that there is an ionized phosphorus in the 
uster of two neutral phosphorus, d 
always will be a small scalar coupling between the E 2 
electronic spins and the ionized ED S 
spin I’. We therefore add, to our Hamiltonian (1), 


the term 


vicinity of a cl 
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This coupling comes because the wave function for 
the two electrons localized around phosphorus 1 and 2, 
has a small admixture of a wave function for an electron 
bound to the phosphorus J’, This is an extension to a 
system of three phosphorus and two electrons of the 
calculation performed by Miller and Abrahams? for 
explaining the conduction of compensated sample. 
Because the admixture is very small a; and az are much 
smaller than A. This new term gives a contribution to 
the energy spectrum, which is for a. state M, (of the 
triplet): 
vnw 3-3 (ay-a;)m' M ,, 
m being the value of 7/. 

As a; and a; are small this term does not influence 
the electronic spectrum. But there are now ENDOR 
lines corresponding to the selection rule Am’ =- 1, and 
occurring at frequencies given by 

y= [vrl] H4 (arta) M, 
(we neglect second-order corrections). For M,=0 a line 
appears exactly at v= | vr]. 

There are also ENDOR lines corresponding to value 
of M;,-—-ki, but they are not observed because there 
is a distribution of a; and as values. 

The important point is to notice that only lines 
corresponding to the M,=0 level of clusters are ob- 
served. There is undoubtedly phosphorus ionized near 
isolated neutral phosphorus, but their ENDOR fre- 
quencies are given by v=|vr|-+4aM,, M:=+} and 
are not observed. We observe only ionized phosphorus 
coupled to clusters of two neutral phosphorus, this 
point is also proved by the following experimental fact. 

Now we saturate the electronic line »—»,24-1A (Fig. 
1) and look at the ENDOR spectrum. The narrow line 
at |vr| appears but its intensity is half the intensity 
of the |vr| line observed when we saturate the y=, 
line. It is known that a cluster of two has an electron 
spectrum with lines at »—»,--14 which are not ob- 
served because they are masked by the large signal 
coming from isolated phosphorus, the intensity of 
these lines is one-half the intensity of the v=», central 
line. So the narrow v=|vz| line observed when we 
saturate the electronic lines at y=y,-£4A is not due to 
the ENDOR spectrum of isolated phosphorus but to the 
ENDOR spectrum of clusters of phosphorus. 

By comparing the intensity of the narrow v=|vy! 
line and the intensity of the broad line near |vrl 
(coming from neutral phosphorus) it should be possible 
to get quantitative informations about the degree of 
compensation of the samples. 

V. THEORETICAL ESTIMATE OF THE 
EXCHANGE ENERGY 

The Hamiltonian of the two-electron system is given 
by 
36(1,2) 2 T (r1) +T (re) + V (1) + V; (11) 

Bee Pa 3E V;(re)+ V;(rz)2- V (ryrz) » 

* A. Miller and E. Abrahams, Phys. Rev. 120, 745 (1960). 
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Fic. 7. Results of the computation of J as a function of the 
average distance between two neighboring donors. The experi- 
mental value of J for sample A corresponds to an average distance 
of 110 A, assuming a—25 A. 


where rı and rs are the coordinates of the two electrons, 
T(r) is the kinetic-energy operator, V;(r) and V (r) 
are the attractive potentials due to phosphorus nuclei 
at sites ? and j. In our spin Hamiltonian (1), J is 
defined as the energy difference between triplet and 
singlet spin states”! 


J=(T|3e(1,2)| 2)—(S|5e(1,2)|5), 


where |T} is the total wave function for the system, 
antisymmetric with respect to the orbital variables, 
symmetric with respect to the spin variables in the 
triplet spin state. The opposite is true for |S}. 

The orbital part of the one-electron wave function, 
taken from the effective-mass theory” may be written 
at a site i: 


6 
Vir) 27 (6) PF) bo), 
p=1 
where ¢,(r) is a Bloch function corresponding to the 
wave vector of the p valley, and F,(r) is an envelope 
function satisfying the wave equation in the effective- 
mass approximation: 


Fs) Gre) exp—| 


+ 


a? b? 


xy z J 
? 


the z axis being parallel to the wave vector corre- 
sponding to the value #. In silicon a and 6 are? 


a=25 Å, b=14.2 Å. 


According to our definition of the exchange energy, 
J takes the following form?: 


pem f vé Gov) if UF G3) Go) ro) dradrs 
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Ko 


? P, W. Anderson, in Solid State Physics, edited by F. Seitz and 
D: Turnbull (Academic Press Inc., New York, 1963), Vol. 14. 

10 W. Heitler and F. London, Z. Physik 44, 455 (1927). 

u J. M. Luttinger and W. Kohn, Phys. Rev. 97, 869 (1955). 

x2 W. Kohn, in Solid State Physics, edited by F. Seitz and D. 
Turnbull (Academic Press Inc., New York, 1957), Vol. 5. 
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Since V (r) is an attractive potential the first term is a 
positive one, the second is negative. If one neglects the 
anisotropy of the envelope function within a given 
valley, the estimate of J is simple. But one should 
notice that each term of the latter expression is smaller 
than the one obtained by using a pure hydrogenic wave 
function because there is destructive interference 
between parts of the wave functions coming from 
different valleys. 

Utilizing Eq. (II 17), Eq. (II 18), and (paragraph) 
IV of Ref. 8, we evaluated J (as function of the distance 
between phosphorus centers) assuming an isotropic 
envelope function and taking for the radius a=25, 
20, or 15 A (Fig. 7). We find that the positive theo- 
retical value of J agrees with the experimental results. 
The average distance between centers is easily deduced 
from the concentration of centers, and we find that the 
experimental value of J is larger than the theoretical 
prediction, even if we use a— 25 À for the radius. If we 
believe in the impurity concentration measurements by 
conductivity at room temperature, this fact seems to 
indicate that the average distance between donors is 
smaller than the average distance deduced from the 
concentration, by about 20%. This may be due to a 
nonuniform concentration of phosphorus donors. It 
has been suggested that impurities tend to concentrate 
along dislocation lines.'? 


VI. CONCLUSION 


The method we described can only give information 
if the exchange integral is larger than the hyperfine 
coupling A, otherwise no electronic line at the frequency 
v, appears in the spectrum. There is also an upper limit 
because when J becomes larger than kT and Ave; there 
are very few electrons in the triplet states. If we remark 
that J varies very rapidly with the distance, our 
method is only valid in a narrow range of concentration. 
At high concentration, another limit appears. The 
electronic relaxation time becomes shorter, and the line 
difficult to saturate. 

The possibility of measuring J is directly related to 
the presence of the hyperfine coupling term; without 
it, the exchange term and the Zeeman term commute 
and the frequencies are not affected by the exchange 
term. It is the small coupling by the hyperfine energy 
of the triplet and singlet levels which allows measure- 
ment of J. The effect of exchange on the energy spec 
trum is very small except when the crossing occurs. 
By a second-order perturbation calculation we are able 
to explain the shape of the experimental ENDOR 
spectrum and to determine the sign and the magnitude 
of J. 

We have explained the occurrence of an ENDOR line 


1], K. Wilson, Ph.D. dissertation, Rutgers University, New 
Brunswick, New Jersey, May 1963 (unpublished). 
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coming from an ionized phosphorus center weakly 
coupled to a cluster of two neutral phosphorus atoms. 
Note added in proof. Dr. J. J. Pearson made a calcula- 
tion of J without neglecting the anisotropy of the en- 
velope function. The results will be published in a 
separate article. 
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The Frenkel-Kontorova dislocation model is modified by replacing the sinusoidal substrate force by one 
which is piecewise linear. Exact solutions are found for the static configuration and for the Peierls stress, 
op. Good agreement is found between these values of «p and those obtained previously for a two-dimensional 


Rosenstock—Newell model. The atoms of the linear chain are then considere 


d in random motion correspond- 


ing to thermal equilibrium and under an applied stress e «ep. The time required for motion of the dislocation 
from one position of stable equilibrium to an adjacent one is computed by means of a rate-theory formulation 
adapted to the present type of problem in which the positions of all the atoms in the chain are required to 
vary in passing over the potential barrier. The theoretical transition times for an infinite chain are compared 
with analog computer results for a six-atom chain and reasonably good agreement is found. 


1. INTRODUCTION 


HE rate of dislocation creep over the Peierls 
barrier has been studied in connection both with 
low-temperature creep! and the Bordoni peak.? Because 
of the complexity of the phenomenon, it is necessary to 
make various assumptions of a mathematical and 
physical nature when treating a realistic model of the 
process. 

The purpose of the present work has been to construct 
a simple dislocation model which is amenable to analy- 
sis with few additional assumptions. It is hoped that the 
results of this idealized analysis may provide insight 
into the nature of the real process, and that the mathe- 
matical techniques employed may be applied to more 
realistic models. 

The dislocation model considered here is a modifica- 
tion of the Frenkel-Kontorova one-dimensional model? 
with the sinusoidal substrate force replaced by one 
which is piecewise linear as in the two-dimensional 
treatment of Sanders.* For this model it is possible to 
; by the U. S. Air Force 


t No. AF-AFOSR-228-63 
under Grant No. 


* This research was supported jointly 
Office of Scientific Research under Grant ? 
and by the National Science Foundation 
NSF-G19010. so Fe 

HIB Santas and J. P. Hirth, Phys. Rev. 115, 543 (1959). ; 

2A. Seeger, H. Donth, and F. Pfaff, Discussions Faraday Soc. 
^ J 7 ae and T. Kontorova, Phys. Z. Sowjetunion 13, 1 
(1938); J. Phys. (USSR) 1, 137 (1939). A paper which api 


S ` : . Kratochvil and 
peared after this manuscript was minii LI nde: the same 


del from the static 


V. L. Indenbom, Czech. J. Phys. 13, 814 (19 
modifcation of the Frenkel-Kontorova mo 
viewpoint only. z j 

* W. T. Sanders, Phys. Rev. 128, 1540 (1962). 
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obtain exact solutions to the difference equation de- 
scribing the static configuration under applied stress and 
to derive an exact expression for the Peierls stress, op 
(Sec. 2). The values of ep computed here agree surpris- 
inglv well with the results of the two-dimensional 
calculations.* 

In order to study the dislocation creep rate, the atoms 
of the model are then considered in random motion 
corresponding to thermal equilibrium and under an 
applied stress « «ep. The problem of determining the 
rate at which the dislocation passes from one stable 
equilibrium position to an adjacent one is in the general 
class of problems considered in rate theory.* We present 
here (Sec. 3) a derivation of the pertinent rate formula 
ab inilio which is somewhat different from the usual one 
and is particularly adapted to the present type of 
problem in which the positions of all the atoms in the 
chain are required to vary in passing over the potential 
barrier. 

For the infinite chain dislocation model it is possible 
(Sec. 4) to calculate explicitly all the quantities entering 
into this rate formula. The theoretizally predicted values 
of the frequency of transition of the dislocation from 
one equilibrium position to an adjacent one are found 
to be in order-of-magnitude agreement with analog 


5H. Eyring, J. Chem. Phys. 3, 107 (1935); C. Wert and 
C. Zener, Phys. Rev. 76, 1169 (1949); C. Wert, ibid. 79, 601 
(1950). A paper which came to our attention after this manuscript 
was submitted is G. H. Vineyard, Phys. Chem. Solids 3, 124 
(1957), which also develops the pertinent rate formula from the 
viewpoint of Sec. 3. We have retained the present discussion, 
sich differs somewhat in emphasis, for the sake of completeness. 
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Fyc. 1. Linear chain dislocation model. 


computer results for a 6-atom chain. Finally, a summary 
and conclusions are presented in Sec. 5. 


2. STATIC SOLUTION 
Model Description 


As a simplified crystal model, we consider a linear 
chain of mass points (hereafter referred to as atoms) 
interconnected by linear springs with spring constant kı 
and equilibrium spacing b (corresponding to the lattice 
parameter) and subjected also to a periodic (with period 
b) substrate potential. This substrate potential, as in 
the original Frenkel-Kontorova model, represents the 
net effect of the remaining atoms in the (3-dimensional) 
crystal. We assume a piecewise quadratic, continuous 
potential U(x), namely, 


U(x) > Shox? , 
kop /b ) 
b— ae ; 


—— "where x is the distance from a potential minimum. This 
. replacement of the trigonometric potential of Frenkel 
.. and Kontorova by the piecewise quadratic potential of 
Eq. (2.2a) simplifies the analysis and introduces the 
- additional parameter $, the distance from a potential 
; um to the nearest point at which the potential 
ature is discontinuous and becomes negative; ¢ is 
thus a measure of the hardness of the crystal. The 
parameter ky in Eq. (2.1) is a measure of the shear 
odulus of the crystal while the parameter kı is a 
of the Young’s modulus of the crystal. 
er to introduce a dislocation into this model, we 


|x| Xo; 


t 


b 2.1 


=kpb— 
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start with the linear chain as in Fig. 1(a) and subject 
the jth atom to a longitudinal force G;, where 


Gy a Gu $hib; 


G;=0 allother j. (22a) 


From this, the following equilibrium configuration 
will result: All atoms 72M are displaced 5/2 to the 
left, all atoms 7 — M are displaced 5/2 to the right, 
and those for which —M « j « M are undisturbed.5 To 
verify that this is indeed an equilibrium configuration, 
observe that the sum of the forces acting on each atom 
vanishes. For the special case M —2, this new equi- 
librium configuration is shown in Fig. 1(b). This con- 
figuration provides a model for a dislocation because 
five atoms, viz., —2, —1, 0, 1, and 2, now occupy a 
region ordinarily occupied by only four atoms. 

Now with these forces still applied, the atoms are 
subjected to the periodic substrate potential defined in 
Eq. (2.1) and sketched in Fig. 1(c), and which is posi- 
tioned relative to the configuration of Fig. 1(b) with 
either a maximum or minimum at the position of all 
atoms so that it exerts no forces on them in this 
configuration. 

We next apply forces —G; to the configuration of 
1(b) and (c), annulling those originally applied, and 
also apply a constant force ë (which represents ex- 
ternally applied shear stress) to each atom. Let the 
resulting displacement of the jth atom from the con- 
figuration of 1(b) be #;. With the substrate potential of 
Eq. (2.1), the substrate force on the jth atom is 


F;=—kð; for |j|2M, providing |d;|<¢, 
2kop 

F;2———À; for |j|<M, 
b—26 (2.2b) 


b 
providing |5;| <(;-#) : 


Note that in the first of Eqs. (2.2b) 5; is measured 
from a potential minimum, while in the second of these 
equations 7; is measured from a potential peak. As will 
be seen subsequently, the value of M is determined for 
a given set of parameters so that the inequalities on the 
atom displacements associated with Eq. (2.2b) are 
satisfied. The final configuration, after forces —G; and 
& are applied is then as shown in Fig. 1(d). The atoms i 
the range | j| <M will be termed subject to weak bonds 
to the substrate potential while the others will be 
referred to as strong bonds. The number of weak bonds, 
1,,— 2M —1, may be odd or even. In the latter case, the 
atoms will be indexed as shown in Fig. 1(e); that is, 76 
understood to take on the values ---, — 14}, —2, h 


* Only the infinite chain will be considered in detail and there- 
fore the location of the origin may always be taken as the midpoint 
of the dislocation. The indices j and M in Eq. (2.22) may be either 
integers or half-odd integers; see discussion following Eq: (2.2b). 
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15, -++ when 2M is odd. Therefore in both cases the 
weak-bond atoms may be taken to correspond to 
—M<j<M and both cases may be treated together. 


The equations of equilibrium for the configuration of 
1(d) are 


bi(0; i9; 4— 25) ha; -Gj+e=0, JÈM, 
2b 


kiüti; 20;)+—3,-G+¢6=0, (2.3) 


—M«j«M. 


With the introduction of the dimensionless variables 


vj—)5;/b, G;— G;/ (kb); 
o=a/(kob), P=ho/ki; 
(2.42) 
2yP 
y=¢/b, Q= ; 
1—2y 
and the reduced displacements 
1;—7j—9c, (2.4b) 
these equations may be written 
uja— QT P)ujyruia-G;, |j|zM; (2.5a) 
(2-0) = 
Uj41— (2— Q)u;d-uj1— G;— , 
j+1 j j—1 j (1—2y) 


—M«j«M. (2.5b) 


Displacement Solution 


We treat here only the infinite chain, V+; the 
solutions to Eq. (2.5) which decay at infinity are 


uj-ABÀ, j€—-M; 
uj-Bg, j2+M; (2.62) 
Um- = BB¢M—3 7 =e AgM-1-1; 
- e B 
1457 C cosj0+D sinjó— -, —M<j<M; 
zy 
g 
tty — C cosM0--D sinM0— Hi; (2.6b) 
Yi 
2 1 
2... — C cosM8— D sinM0———3; 
2y 
where t 
B—3[P4-2— (PHAP)'* ], (2.7) 


cosó—1—Q/2. 


Equations (2.62) and (2.6b) satisfy (2.52) and (2.5b), 
respectively. The four constants A, B, C, and S 
then be determined by the requirement that Eqs. (2.62. 


A1009 


and (2.6b) agree for the four atoms, ~M, —M-+1, 
M —1, M for which the expressions overlap. The result- 
ing expressions for the displacements are 


uo; M)-(cFu--Gu)8; j<—M; 
m(cFu—Gwu)8, j>M; (2.82) 
c(8—1) cos 8 
2y[B cos(M m 1 e— cosM 8] 


(8— 1) sinj@ c 


de : ———, —M«j«M; 
2[sinM6—8 sin(M—1)0] 2y 


where 
cosM0— cos( M — 1)6 
27[B™ cos(M — 1)0—8?*-! cosM6] à 
sinM0— sin(M — 1)0 
Gu= : : : 
2[8™ sin(M —1)0—8*-! sine] 


Fy 


(2.8b) 


In order to complete the solution for the displacements, 
it is necessary to determine the value of M, character- 
izing the number of weak bonds, which corresponds to 
given values of P, the force constant ratio, ^y — $/b, and 
the applied stress, c. We proceed in an inverse fashion, 
regarding M as well as P as prescribed, and determine 
the range of y for which a solution with »,.=2M—1 
weak bonds is (1) compatible with the assumed force 
law (2.2b) and (2) is mechanically stable. 

For compatibility, the following conditions are 
necessary [see Eq. (2.2b) ]: 


Üu, v o42—6, 


b b (2.92) 
Üu — (5-9) , Pass (5-9) 


these will later be shown to lead to sufficient conditions. 
With the use of Eqs. (2.4) and (2.8) these inequalities 
become 


y> —8*G u--o(12-B4F u), Y> —8"G y—3o(1--8MP y); 
and (2.9b) 
4€ —8X7Gyd-o(14-84F y), 
y€ —8'*Gy—oc(1--BY1F y). 


Clearly, if a given bond configuration is not compatible 

without stress it is not compatible under stress. The 

stress-free compatibility range is 
—BMGy vy —8* Gy . (2.9c) 


Through the use of Eqs. (2.7) and (2.8b) together with 
some trigonometric manipulation, the following implicit. 
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equation for the lower limit on y is obtained: 
1p 
ctn 
1+6 


Let the value of y-! that satisfies this equation be 
denoted by Cyr. The corresponding equation for the 
upper limit on y is 


tanM0-— 


(2.9d) 


1— 


e @ 
tan(M — 1)0— —— ctn- (2.9e) 
14-8 2 


so that the range corresponding to that of Eq. (2.9c) is 
Cuai€Y !€Cyv. (2.9f) 


A detailed investigation shows that some of the m- 
equalities (2.2b) other than those of (2.92) will be 
violated unless the lowest values of 0 satisfying Eqs. 
(2.9d) and (2.9e) are chosen. If this is done, then the 
maximum value of 0 in the range (2.9f) satisfies the 
inequality (M — 1)0 «7/2, and this can be used together 
with Eqs. (2.8) to show that for 0« j <M «is negative 
and u;1«w; and that for j>M ww; is positive and 
uj41«u; (only positive j, because of symmetry, need 
be considered). Then all of the inequalities (2.2) are 
satisfied. 

For the examination of the stability of the equilibrium 
configuration defined by the displacements 2; it is con- 
venient to return first to the consideration of a finite 
chain —(V+1)<j<N-+1 [where N may be a half- 
odd-integer; see Fig. 1(e)] and then to let N >. Let 
q; bea small displacement of the jth atom from the equi- 
librium position being examined, with q-y-1— qw41— 0, 
V (q-x;*  *,qw) the resulting change in potential energy, 
and S;;— (1/k:)(0°V/dq:0q;), —N i, j € N, evaluated 
at g;=0. Then 


Sup AEST ESSI MSIN; 
Sy=(Q2-0); —M«j«M; 
= 02-0) J (2.10) 
Sigr—$;447—1; —N+1<jsN-1; 

$520 for |i—j|>1. 


Let g(\)=(@_w(A),¢-wa4i(),° - > ,gw()) be an eigenvector 
of S5; corresponding to the eigenvalue of A. Then q;(\) 
are the solutions to the difference equations 


qizi—- (2+ P—d)Gg+Gqu=0; 
=NŅN<jJ<S-M; M<SJSN; 


ga (2-0) Gj +Gu=0; 


—M<j<M; Q-N-31— Qn1=0 a (2.11) 


In the terminology of Gantmacher” 5; is a normal 
Jacobi matrix. The eigenvalues à are then real and 
simple and are denoted by \9<)\i<-- -Yoyaa- A second 
property of normal Jacobi matrices is that the sequence 


7F. R. Gantmacher and M. G. Krein, Oszillationsmatrizen, 
Oszillationskerne und Kleine Schwingungen. M echanischer Systeme 


(Akademie-Verlag, Berlin, 1960), p. 80. 
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qos dias), a dn), P=, «++, 2N--1, has 
precisely p changes in sign. In order to determine criteria 
for the positive definiteness of 5;; and hence for the 
stability of a solution with 2M —1 weak bonds, it is 
sufficient to consider Ao and we therefore seek a solution 
to Eqs. (2.11) with no changes in sign. This solution, 
for the case N —« may be written in the form, analo- 
gous to that of Eq. (2.6), 


g—-48?, j<—M+1; 
gBB, j2M—i; 
q;=C cosjüd-D sinjd, —M<j<M; (22b) 


where 


cosĝ=1— 0/2; (2.13) 


The four coefficients, 4, B, C, and D are determined 
by the requirement that Egs. (2.12a) and (2.12b) agree 
for the four atoms for which the expressions overlap. 
The resulting system of four homogeneous equations 
may be separated into two sets of two each for sym- 
metric and antisymmetric eigenvectors, respectively. 
We consider only the former set of equations since X 
corresponds to a symmetric eigenvector (as noted above, 
there are no changes in sign in the sequence of its 
components). They are [with F—$(A-- B) ] 


C cosM6—FB“=0, 


a. d (2.14) 
C cos(M —1)0— F8 1—0, 

and equating the determinant of the system to zero 

leads to the following implicit equation for Xo, 


cosM 


— (2.15a) 
cos(M — 1)0 


B= 
where ào is that root for which M6< (/2). We consider 
next the variation of \p=)o(y~!; M) with y+, with P 
constant. The following sequence of inequalities may be 
determined from the equations cited: 


48 
zen [Eq. (2.13)] 
dXo 
Ea —»0. [Eq. (2.152).] 

d(cos@) 


Consequently d(cos8)/dAo7 0, and from Eas. (2.42) 
and (2.13) 


dy" dy3d 2P,dQ 1 2Prd(cos0) 1 
Agua do =| e (c ree. 
d\o dQ dX Ndo 2/7 Q?L dw 2 


The same result, that a decrease in y~! decreases Ào 


(2.122) . 


* 


* 
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may also be seen directly from Rayleigh’s theorems,? 
since, as appears from Eqs. ( 242), a decrease in y 
corresponds to an algebraic decrease of the weak-bond 
spring constants. "Therefore the solution corresponding 
to 2M —1 weak bonds will be stable only for 4717. 
where Sy, is that value of y~! for which XS a M)= 0. 
Since for \5=0, B=6 and 6=6, the transcendental 
equation for Sy, is 


cosM0—$8 cos(M—1)0—0, M0-(x/2); (2.15b) 
or 
1—8 6 
chs 
(iY 


tan(M —3)0— (2.15c) 
where Eq. (2.15c) follows upon use of trigonometric 
identities. Consequently S3r— C y;. 

Therefore, combining the discussions for compati- 
bility and stability, we may conclude that for a value of 
Y in the range C y-43 y 31 X C y a solution with 2M —1 
weak bonds is both compatible and stable, while the 
only other compatible solution for the same y, that with 
one more weak bond, is unstable. 


Peierls Stress 


Consider next a linear chain with displacements 
corresponding to a compatible and stable solution with 
2M —1 weak bonds and apply to it a stress ø which is 
zero initially and increases slowly (so that inertia effects 
are negligible). Equations (2.9b) show that this bond 
configuration will remain compatible while 

Y-FBMGy yte Gu 
cx uum (ej 

1--8MF y TER oo 
We define cp as the stress above which these inequalities 
are invalid; for ¢>ap the problem has no (statically) 
compatible solution, i.e., the chain could not support a 
stress o7» op in static equilibrium. Therefore op corre- 
sponds to the Peierls stress for the model considered. 
The first of Eqs. (2.162) may be shown to be controlling; 
this yields 

= cos(M — A] 


sinM6— 8 sin(M — 1)0 
[ sinM8— ô sin(M — 1)0 


X i (2.16b) 
cosM8— 8 cos(M — 1)0 


where ó—6(2y—1)/(2y—8). We note that ep=0 for 
y= Cy [this follows from Eq. (2.9d) ] and for a) M 
[Eq. (2.15b) ], that is at the limits of compatibility and 
stability of configurations with 2M—1 weak bonds. 


Numerical Results 


Equation (2.16b) has been evaluated, for values of P 
(the ratio of noncentral to central forces) equal to 1, 


8Lord Rayleigh, The Theory of Sound (Dover Publications, 
New York, 1945), Vol. 1, p. 111 ef seq. 
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LINEAR CHAIN 
TWO-DIMENSIONAL MODEL 


Fic. 2. Peierls stress, er, for linear chain; P=0.5. P is the ratio 
of noncentral to central forces and y is the normalized critical 
shear stress of the corresponding perfect crystal. Two-dimensional 
model results are from Ref. 4. 


0.5, and 0.2. For each value of P the parameter + (the 
ratio of perfect crystal shear strength to shear modulus) 
was varied over a range wide enough to show the im- 
portant details of the problem. The results are shown in 
Figs. 2 and 3; for purposes of comparison Sanders’ 
results,‘ corresponding to P=0.5, are also shown in 
Fig. 2. 

The agreement between these results and the earlier 
ones is quite good, in view of the expected sensitivity 
of the Peierls stress to the details of the model. Both sets 
have the quasicyclic pattern, and both have about the 
same exponential dependence on 7“!, while the present 
results have peak values roughly twice those of the 
earlier ones. 

A qualitative argument would indicate two opposite 
effects in passing from the one to the two-dimensional 
model: if the single chain treated here is considered as 
the first atom row above the slip plane, then the presence 
of the rows below it would tend to narrow the disloca- 
tion, increasing the Peierls stress, while the rows above 
it would increase the width and lower the stress. 
Apparently, the latter effect dominates. 

The agreement between the one- and two-dimensional 
cases might serve to substantiate the superiority of the 
accurate calculation on a discrete model, even a rela- 
tively poor model, over the use of a continuum model 
for dislocation studies.? 


3. STATISTICAL MECHANICAL CONSIDERATIONS 


We wish to consider next the creep, due to thermal 
motion, of a linear chain, as described in Sec. 2, with a 
dislocation under an applied stress s<op. For this 


? See also R. Hobart and V. Celli, J. Appl. Phys. 33, 60 (1962). 
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Fic. 3. Peierls stress, cp, for P=1.0, 0.2. 


purpose an ensemble of such chains is assumed with 
distribution function p(p—w,---,pw,9-y,***,9v)=p(P,9) 
where q;(t) is the displacement at time/ of the jth atom 
from its equilibrium position under stress ø, and p;= md; 
is the momentum of the jth particle with mass m. We 
treat a chain in thermal equilibrium with its surround- 
ings at temperature T and accordingly set 


pa) = Ce Een, (3.1) 
where 
H($,9) — (1/2m)p;br- V(g-u;- f -qN) D (3.2) 
C= | eH. OI? TT dp; dq;. (3.3) 
Q i 


In these equations and in the remainder of this section 
latin subscripts assume the values — N, ---, N, where 
by the latter notation we signify zero included. Greek 
subscripts assume the values — N, ---, —1, 1, ---, V, 
zero excluded. The symbol Q denotes the full phase 
space, — co <p;< c», — o» « q;« c», and the summation 
convention is employed. 

The potential energy function V(g) for the type of 
system under consideration admits of a number of 
positions of stable equilibrium with intermediate posi- 

- tions of unstable equilibrium. Let a typical stable equi- 
librium position have coordinates q= (gw; : :,qw) — 0. 

- In a neighborhood of g=0, the potential V(g) may be 

|. approximated by 

Wu V(q)= 0/2)5503;; (3.4) 

ere the matrix S= [Ss], Sij=(1/h1)(0?V (0)/89:99;), 

positive-definite. It is now assumed that the system 


(p,q) is very small elsewhere, and that therefore 
e purpose of computing the normalization constant 


AND W. T. 
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C, Eq. (3.3), the approximate expression for V(q), 
Eq. (3.4), may be used. The result is 


IS] 12k N+} 


NOTUM 3.5 
(Qa kT)N ig Nt ( ) 


where || is the determinant of the matrix S. 
Consider next the position of unstable equilibrium at 
q-—d-— (d.w,-:,dw) nearest to the stable equilibrium: 
position at g—0. We confine attention to a neighborhood 
of the saddle point g=d in which the potential V(q) is à 
adequately represented by its quadratic approximation 


V(g)- V(d)d-(k/2)U (qi—di)(qj—45 , — (3.6) 
where 


U;— (1/k) (0? V (d)/0g;00;) (3.62) 


so that the equations of motion in that neighborhood 


take the form 
(3.7) 


Let X; be the eigenvalues of the matrix U —[U ;j] with 
associated unit eigenvectors a;; and let A9«0 be the 
minimum eigenvalue with the sign of ao; chosen so that 
dod; 0. It is assumed that Ay is the only negative 
eigenvalue.!? With the orthogonal change in coordinates 


mi— — QU ;(qj— d;) a 


Qi ai(gj—d;) , (3.8) 
Eq. (3.7) assumes the form, for Qot), 
Qo—--sQo, s$— —kQw/m, (3.9) 


with the solution 


Qo(!) = 5EQ«(0)4-51Q«(0) Je" 
-HIEQ«(0)—5-9.(0)]e-**. (3.10) 


Therefore, if at an arbitrary instant of time, /—0, | 
Q.(0)=0 and Q,(0)>0, 


then Qo(/) will increase monotonically, at least as long | 
as Eq. (3.5) applies. Since ao; points away from the ` 
stable equilibrium position at g=0, this result implies 
that if the system, in its notion in phase space, reaches 
the hyperplane Qo=0 with positive velocity, it will not 
return to its original equilibrium position but continue 
on towards the next equilibrium position.!! This hyper- 
plane corresponds therefore to the crest of the ridge 
between the two valleys. In order to compute the 
frequency, f, with which the system reaches the hyper- 
plane Qo=0 with Qo>0, we introduce, as new Co- 
ordinates for phase space the Q; as defined by Eq. (3.8) 
and the corresponding new momenta P;=dzjp;- The 
distribution function in terms of these new variables 
will be denoted by p(Po,0o,P.«,0«) where the arguments 
are separated as shown for future convenience. The 


10 For the case of the linear chain described in Sec. 2, it can be 
proved that only one negative eigenvalue exists. t 

u This statement is, of course, intended only to apply to shor 
times. 
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frequency f may now be obtained directly by use of a 
formula derived by Rice.!? This formula, which requires 
the assumption of the ergodic hypothesis for its validity, 
is in the present case, [ 


1 ^ 
yf "m | Pod P, i pP(Po,0, Pa Qa] dP dOa, (3.11) 
n Jo o G 


where 9" is the subspace, — « < Pa< «, — % «Q.«o. 


This leads, after the integrations in P; are performed, to 
k, HS ue 


E (Qm) (RT)N A m 


[7-792] 
xj EN m = II4Q., (3.12) 
J g d E 


where Q" is the subspace — © «Q4,« o and V(Qs,04) 
is the function V(q) expressed in terms of the new co- 
ordinates Q;. The assumption is next made that, for the 
purpose of evaluating the integral in Eq. (3.12), V(0,02) 
may be evaluated from Eq. (3.6), the expression valid in 
the neighborhood of the saddle point. Then 


V (0,Qa)= V (d)+ (k1/2)Z a NQ4^. (3.13) 


Substitution of this expression into Eq. (3.12) leads to 
the following expression for f: 


f= fue VOUT (3.14) 
where 
1 ol S| 1/2 
fo —(Rs/m)!*; -( = ) (3.14a) 
2r P|U| 


and where |U | 2 IT; X is the determinant of the matrix 
U. The quotient of determinants may be written 


SRS ne) |=, (3.15) 


where S-! is the matrix inverse to S, J is the unit matrix 
and D— U— S. In the type of problem considered here 
the difference matrix, D, will have only a few nonzero 
elements; in this case, the determinant |J+-S-1D| may 
be expressed in terms of a correspondingly low-order 
determinant, as will be seen in Sec. 4. 


4, CREEP OF LINEAR CHAIN 


In this section, the rate formula of Eq. (3.14) is 
applied to the linear chain described in Sec. 2 under an 


applied stress e «c. 


Saddle-Point Energy, V(d) 


In Sec. 3, two approximate expressions for V(q), 


Eqs. (3.4) and (3.6) were given, the first valid in the 


i : ilibrium position, q— 0, 
neighborhood of the stable equi! 
and the second valid in the neighborhood of the un- 


5 5). Reprinted in 
1 S. O. Rice, Bell Syst. Tech. J. 24, 58 (1945). nied: 
Resa pons ‘on N ud and Stochastic Processen; edited by N. Wax 
(Dover Publications, New York, 1954), p- 120. 
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stable equilibrium position, g=d. For the model de- 
scribed in Sec. 2, with a piecewise quadratic substrate 
potential, the two regions of validity (referred to as 
stable and unstable regions, respectively) have a 
common boundary B, and the displacements 9,” corre- 
sponding to the Peierls stress op fall on B. Therefore, by 
use of Eqs. (3.4) and (3.6) to evaluate V(q?), it is 
found that 


V(d)= (h1/2) Suq: qj? 
—(k/2)Us(q;P—dj)(qj —dj). (4.1) 


The position g;=0 corresponds to stable equilibrium 
under an applied stress a «a p (bars denote dimensional 
quantities). Since the stress-displacement relation is 
linear within the stable and unstable region, it follows 
that 


9V(q")/9g;— kS iq; — 8p—8, ali. (4.2) 


when the position q? is regarded as arrived at from the 
stable position. Similarly, if it is regarded as arrived at 
from the unstable position at g=d, we have 


9V(qP)/8q; EU ;;(qj* —dj) 28 p—8, all į. (4.3) 


Substitution from Eqs. (4.2) and (4.3) into Eq. (4.1) 
yields 
V(d)=}(Gp—G)>: di. (4.4) 


Because of the linear behavior in both regions, q;^(a), 
the displacements to the common B from the equi- 
librium positions corresponding to the applied stress, c, 
are given by 


qi (2) 9 [(20—2)/25 ]g;" (0) , 
di(&)— qi (2) - [(6p—8)/8 p ][d.(0) — qs (0) J, 
so that 
d;(a)L(a p—8)/a p. ]d:(0) - (4.5) 


Therefore, the energy at the saddle point d(z), corre- 
sponding to an applied stress, 2, is 


V(d(z)) [(s0—2)*/22».] 27; d(0) - (4.6) 


Consider V(d(0)). Let A; be the displacement of the jth 
atom when the dislocation moves from the stable equi- 
librium position at g=0 to the next adjacent stable 
equilibrium position. Then V(0)=V(A)=0, and 
V(d(0)) can be computed by considering either dis- 
placements from q—0 or g=A. In both cases Eq. (4.6) 
applies (with oppositely signed 27) so that 


V(d(0))—is»7:d(0)— —3¢p 2:(d,(0) — A;) 


25::d/(0)— 55; A;—5, (4.7) 


where b is the Burger's vector of the dislocation. Sub- 
stitution into Eq. (4.6) then vields the following formula 
for the saddle-point energy: 


Or 


(@p—a)*b (a p—a)* 
o(d(@))=_——_—_=——#b?. — (48) 


Öp dop 
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! Computation of v 


For the model of Sec. 2 which is stable with 2M —1 
weak bonds, only the elements of S and U corresponding 


Í to i=ņJ=M differ, so that the only nonzero element of 
i D=U—S is 


Duu=—(P+0)= 


(4.9) 


Fic. 5. Variation of the dimensionless frequency », 
Eq. (3.14), with Peierls stress, op. 


AND W. T. 
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Fic. 4. Variation of the dimension- » 
less frequency v, Eq. (3.14), with the 
normalized critical shear stress y. 


Substitution in Eq. (3.15) gives the result 
[S|[U]3— (124- S7 irm Daar)! (4.10) 


so that only the element Sasa; of the matrix S is 
needed. This is found (for the infinite chain) by solution 
of the difference equations S;;20;= ô: with the result 


Signs 


J cosM 0 
2LcosM 8—8 cos(M—1)0 
sinM0 
+ | a (Guild) 
sinM0— 8 sin(M — 1)0 


Finally it is necessary to calculate ^o, the minimum 
eigenvalue of the matrix U. This is found from Eq. 
(2.152) with M replaced by M-+4. For the case in which 
stability corresponds to M — 1, an explicit formula for 
Ao may be derived, namely, 


(21—0)(P--Q4-2)—P—4 | 
x- [ 9 ] : (4.12) cS 

JP EQ ep - [ 

For other values of M, numerical evaluation of Eq. E 
(2.15a) was used. The resulting values of v, the dimen- P 


sionless frequency that appears in the rate equation, 
Eq. (3.14), are plotted in Fig. 4 for P=1.0 as a function ; 
of y. It may be noted that »—0 for those values of y | 
which correspond to the limit of stability, Sar, for the j 
given configuration, as does op (Figs. 2 and 3). How- | 
ever, v increases monotonically within the range j 
Sy y !<Cm, whereas op returns to zero at y 1— Cur i 
It is clear, therefore, that these calculations do not 
reveal a linear dependence of y upon 4/o p as might be 
expected from a model treating the dislocation as 4 
single entity moving in a potential field corresponding | 
to cp. A plot of v as a function of \/ap, given in Fig. 9; | 
reveals, however, that if only the initial portion of each i 
range of y is considered, the two quantities are indee 
linearly related. It may be that these deviations from 


13 
le 
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TABLE I. Compar 


hob? /kT 


ison of theoretical and analog creep rates. 


t 
| 990 560 245 
analog a a 
ÍI fe - | = — —- 
theoretical | 0.456 0.53 0.595 


linearity are due in part to the discontinuous substrate 
potential employed. If only the initial portions of each 
range of y are considered, then the present calculations 
lead to the relationship vea /ce» with a=5-+0.5 for 
0.1X PX 10. This may be compared with the similar 
relationships found, for example, by Seeger,? Weert- 
man, and Lothe! in their treatments of a dislocation 
as a vibrating string, the latter two obtaining values 
of a=5.6 and 2.5 respectively. 


Analog Computer Results 


A REAC Model-400 analog computer was used in an 
attempt to check these rate calculations, at least in 
order of magnitude. Because of limitations in equipment 
only a six-atom model could be simulated; also it was 
not considered feasible to run through a random 
sequence of initial conditions, so initial conditions were 
chosen that would give a minimum frequency of jumps 
of the dislocation. This model, then, was run for P— 1, 
220.2396, and &—0.010 (cp=0.0146), and for three 
values of temperature. Table I shows these results 
together with corresponding results based on Eq. (3.14). 

'The analog data do agree, within a factor of 3, with 
the theoretical results. The analog results show a lower 
frequency, as expected (because of initial conditions). 
They also exhibit a more sensitive dependence on tem- 


15 A. Seeger, Phil. Mag. 1, 651 (1956). — 
1 J, Weertman, J. Appl. Phys. 28, 1185 (1957). 
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perature; this again was expected, because of the fact 
that only six atoms were simulated. In view of the limi- 
tations on the analog model, the agreement seems to be 
good, and appears to lend justification to the assump- 
tions made in the theoretical derivation. 


5. CONCLUSIONS 


Some conclusions which may be drawn from this work 
are as follows: 

(1) The one-dimensional dislocation model yields 
values for the Peierls stress «p in surprisingly good 
agreement with those for the two-dimensional model 
considered by Sanders.* 

(2) In particular, the sensitivity of op to small 
changes in y, the critical shear stress, which was found in 
the two-dimensional calculations‘ is confirmed here. 

(3) Detailed results could be obtained on rate of 
motion of the dislocation under various levels of applied 
stress less than the Peierls stress and for a range of 
values of the other parameters of the problem. The 
relation between these rates and the Peierls stress is in 
good agreement with that found earlier by treating the 
dislocation as a vibrating string, except for systematic 
deviations that are possibly due in part to the dis- 
continuous substrate potential used here. 

(4) The rate formula, Eq. (3.14), which has been 
applied here to a specific one-dimensional model has 
broader applicability. It allows explicit calculation of 
rates of transition, without consideration of the concept 
of entropy of activation, when the potential energy 
function of the system as a function of the microscopic 
coordinates is known. 
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The intensity of small-angle x-ray scattering in single-crystal silicon has been measured with the scattering 
vector K oriented along each of the three symmetry directions. The scattering intensity extrapolates to zero 
angle in reasonable agreement with the thermal scattering, calculated from the velocity of sound in each 
direction. The thermal scattering is calculated from known phonon dispersion relations and subtracted from 
the total to obtain the incoherent, or Compton, scattering. The incoherent scattering is smaller than that 
calculated by Freeman from Hartree-Fock wave functions for the free atom. This conclusion agrees with 


previous approximate measurements at small K on other crystals. Theoretical ideas about the source of the 


deficiency, apparently a solid-state effect, and about small-angle thermal scattering are outlined. The 


incoherent scattering shows little anisotropy and no 


sharp rise with increasing angle, which would suggest 


the onset of excitation of electrons across the forbidden band. 


I. INTRODUCTION AND THEORY 
1 A. Incoherent Scattering 


Na perfect, rigid crystal all the diffuse single scatter- 
ing is incoherent. From the semiclassical or quantum 


1 theory, it is! 
d N 2 
DO DEE [veste rdr (1) 
" A : 
i " 
Í -f |Vo|?| Z e=] 2dr 
i xus : 
= |f ie D» ek tidy (2) 
7 


in electron units (e.u.). K= 4 sin36/X, 0 is the scattering 
angle, À is the x-ray wavelength, Ym is the wave function 
in state m, m=O is the ground state, NV is the total 
number of electrons, r; is the position of the jth electron, 
and the integrations are over all coordinates of the 
electrons. Then /;,,(0)—0. The approximations that 
the x-ray frequency is small enough to avoid recoil and 
relativity corrections, but that it is much larger than 
the Bohr frequency vom for any m which contributes 
appreciably to the sum in (1), are used. The former 
assumption holds in the present work. The latter 
assumption excludes anomalous dispersions, or Raman, 
terms which correspond to excited final electronic states. 
These terms are too small to have been observed in x- 
ray scattering, but are finite at K=0. 

"Thermal motion produces additional diffuse scatter- 
ing Zu. Walker? separated inc from Iin in Al experi- 
mentally, by their temperature dependences. Jin is 
almost unshifted in wavelength, but Zin. is known, from 
— spectral measurements at large K, to peak near a 

wavelength shift A\=(//m.c)(1—cos8), the value for 


* Now at Lincoln Laboratory, Massachusetts Institute of 
"Technology, Lexington, Massachusetts. 
XR. W. James, The Optical Principles of the Diffraction of 
ays (G. Bell and Sons, Ltd., London, 1948). 
B. Walker, Phys. Rev. 103, 558 (1956). 


a slow, free electron. Thus Curien and Deroche? 
separated Jine from Ji, in Al with a dispersive filter. 
Laval! measured Iino in Al, diamond, and KCl by 
subtracting estimated Jm from the total diffuse scatter- 
ing. Measured phonon dispersion relations were not 
then available. Cribier® treated CaF. similarly. Meas- 
urements on pure, perfect crystals at small K have been 
difficult in the past. The weak scattering power requires 
an unusually strong x-ray source and low background. 
For convenient A, 0 is small, so that common diffraction 
equipment is not suitable. 

If Yo is a determinant of one-electron functions yj, 
then from (2), 


N 
Tinc= N— 22; (3) 
17’ 


[rote mera]. 


The most careful calculations have used (3) with 
Hartree-Fock y; for free atoms or ions. A recent review? 
lists the results. They lie above the measurements’ 
at small K, but agree otherwise.®? Coherent scattering 
in crystals (at large K) is also well described by free- 
atom orbitals. Neglect of interatomic overlap in (3), 
which is emphasized at small K, presumably produces 
much of the overestimate of Jino. It may be that overlap 
contributions vanish when eK" varies with large K, 
and that well within the atom, v; is similar in free atom 
and crystal. Overlap effects might be estimated with 
simple approximate atomic orbitals. The solutions of 
the Hartree-Fock problem are Bloch functions, extend- 


E 35 Curien and C. Deroche, Bull. Soc. Franc. Minéral. 79, 102 

4J. Laval, Bull. Soc. Franç. Minéral. 62, 137 (1939); Compt. 
Rend. 215, 278 (1942). 

5D. Cribier, Rev. Mod. Phys. 30, 228 (1958); Ann. Phys. 4 
333 (1959). 

6H. Curien, in International Tables for X-Ray Crystallogra yr 
edited by C. H. Macgillavry, G. D. Rieck, and K. Lonsdale The 
Kynoch Press, Birmingham, England, 1962), Vol. III, p. 241. 

7 A. J. Freeman, Phys. Rev. 113, 176 (1959); Acta Cryst. 13, 
190 (1960). 
P 8B. W. Batterman, D. R. Chipman, and J. J. DeMarco, Phys: 
Rev. 122, 68 (1961); M. Kuriyama and S. Hosoya, J. Phys. S0C- 
Japan 18, 1315 (1963). 
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ing over the whole crystal, but the determinental yo is 
not changed if one takes each y; as a linear combination 
of the Bloch functions which is localized on an atom. 
However, for metallic or covalent bonding, these v; 
have greater overlap than the free atom orbitals. For 
example, the assumption of some free electrons,? 
instead of isolated atoms, reduces Jine at small K 
(except very near K— 0). In the absence of accurate 
measurements on crystals at small K, no realistic 
calculations of Zine, including interatomic effects, have 
been made. 

To be Compton scattered in an insulator or semi- 
conductor at absolute zero of temperature, a photon 
must give up at least the band-gap energy. Suppression 
of scattering with small AX is reported? from spectral 
measurements on diamond, with a 7-eV band gap. 
Analogous reduction of Zine at angles less than a 
minimum, 6,, calculated from the slow, free-electron 
Compton relation, has been suggested??? as an 
alternative explanation of the deficiency in Jine- Si has a 
1.08-eV band gap, and with A= 1.542 A, as in this work, 

«—7?. But the electrons in band-gap solids are not 
free and slow, so that large energy transfers at small @ 
may occur. In some cases the deficiency is by compari- 
son with Zine calculated for free closed-shell ions, which 
themselves have few low-lying excited levels. The idea 
is inapplicable, a fortiori, to a metal. 

There is more justification for requiring conservation 
of quasimomentum, k'—k- K4- K;, from (1). k, K’, 
are the wave vectors of the excited electron in the initial 
and final states. Kr is a vector of the reciprocal lattice. 
Knowledge of the band structure of Si!! shows that the 
addition of this requirement scarcely increases the 
predicted 8., for K along the [001 ] direction. However, 
this kind of description of the excited states seems 
inadequate. 


B. Thermal Scattering 


Thermal scattering involves emission or absorption 
of phonons, but it can be treated classically, and is well 
understood for harmonically vibrating crystals. +1.13 

Since the two primitive sublattices in Si are identical, 
their amplitudes must be equal in each normal mode. 


<2 


Then the harmonic, one-phonon scattering is 


Tin(K) im exp(—2M)K*| fl? 


2 x 
x[(1+cosKz:9) © E? cos(K,e;) 
i=l 


+(1—cosKz: 9) x Ew; cos*(K,e)] G) 


i=4 


? K. Alexopoulos and G. Brogren, Arkiv Fysik 6, 213 (1953). 

1) J, Laval, Compt. Rend. 215, 359 (1942). 324 (1963) 

u F. Bassani and D. Brust, Phys. Rev. 131, 132 : . 

12 T. Laval, Bull. Soc. Franc. Minéral. 64,1 A UE 

13 W. Cochran, Rept. Prog. Phys. 26, 1. (96) E 
dudin, E. W. Montroll, and G. H. Weiss, in Solid i Piece 
edited by F. Seitz and D. Turnbull (Academic Press ^ne. * 


York, 1963), Suppl. 3. 
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in e.u./atom. The two summations extend over the 
acoustic and optical modes, respectively. K is in the 
zone centered on Kz, q= K— Ky, w;/2r is the frequency 
of the ith mode with propogation vector q, E:= 4a; 
Xcoth (t9;/ 2k T), 2| f|? (12-cosK- o) is the usual struc- 
ture factor, m is the atomic mass, and (K,e,) is the angle 
between K and the polarization vector e;. The Debye- 
Waller factor exp( —2M) can be neglected in this work, 
since 2M — 0.3 sin*0/X* in Si at 300°K. n-phonon seat- 
tering is given by the M” term of a power series in 
M. Since M is so small here, multiple-phonon scattering 
is negligible, and it vanishes in general at K=0. 

At K —0, the optical modes do not contribute, since 
their phase velocities &;/K — « as K —0. Nor do 
purely transverse waves, for which (q,e;)-90?, con- 
tribute when K is in the first zone, where q— K. Hence, 
in directions of such high symmetry that the modes 
are purely transverse or longitudinal at q=0((001), 
(111), and (110) in cubic crystals), the harmonic thermal 
scattering is 

In(0) — m-!| f(0) RTV- (5) 


at K=0. V is the adiabatic velocity of sound in the 
direction of K. The correction for anharmonicity is to 
replace V by the isothermal velocity.“ The difference is 
negligible in hard solids. 

The calculated thermal scattering in Si is showa in 
Fig. 1 for Kalong each of the three symmetry directions. 


Il. EXPERIMENTS AND DISCUSSION 


A rotating water-cooled Cu anode was used, at 33 
kV and 98 mA, with monochromatization by a Ni filter 
and proportional counter-pulse-height analyzer. Slits 
0.6-cm high collimated and analyzed the beam. With 
the slit widths used for 02: 3.5^, the rms resolution was 
1.0? at 8— 3.57, and decreased to 0.7? for 02: 10°. The 
resulting intense beam was needed to measure the weak 
scattering of Si at small K. The background was about 
4 counts/min, and the smallest sample scattering at 
622? was about 100 counts/min. Other experimental 
aspects have been described." The accuracy of the in- 
tensity standardization? has been verified'^ within a 
few percent. 

The sample was a (110) disc of 100 2-cm n-type Si, 
made by Philco Corp. Alternate polishing and etching 
reduced the thickness to 0.0102 cm, uniform to 0.0005 
cm over the area used, with x-ray transmission 0.237. 
The final treatment was an etch, to minimize the 
damaged surface layer. Static imperfections in this 
grade of Si should produce completely negligible (vol- 
ume) diffuse scattering. Low dislocation density was con- 
firmed by Laue photographs and examination of etch 
pits. 

The orientation was accurate to 1°. Since the sample 


4S. M. Rytov, Zh. Eksperim. i Teor. Fiz. 33, 166 (1957) 
[English transl.: Soviet Phys.—JET P 6, 130 (1958) ]. 

15 D. L. Weinberg, Rev. Sci. Instr. 34, 691 (1963). 

16 D. L. Weinberg, Phys. Letters 7, 324 (1963). 
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was normal to the incident beam, K varied in direction 
at half the rate at which 0 changed, when the analyzing 
slits were rotated. Figure 1 shows the measurements, 
corrected for background (angularly dependent), ab- 
sorption, polarization, and slit-height smearing. The 
correction!? for this last effect is a factor [1+ (b/a) 
x(8—4.76) /[14-(5/a)0*], b/a in deg, where the 
expression 44-56? is fitted to the experimental total- 
scattering curve at 0. The factor is 0.90 to 0.91 at 0=0. 

The experimental values for 7 (0) are from 16 to 30% 
too large in the three directions. Only measurements 
made during the first month after the final etch are 
reported because the scattering is found to increase with 
time, most conspicuously at the smallest 0. The increase 
must represent detérioration, either surface corrosion 
or possibly x-ray damage. Scattering from surface 
roughness is known!” to be prominent at small 6, 
especially in otherwise perfect crystals, where the 


11 D, L. Weinberg, J. Appl. Phys. 33, 1012 (1962). 
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(c) 


Fic. 1. Scattering of 1.542 A x-rays in Si, measured by transmis- 
sion through a thin plate of (110) orientation, held perpendicular 
to the incident beam. At @=0 the scattering vector K is along the 
(a) [001], (b) [111], (c) [110] direction. For scattering at angle 
0, K deviates by 30 from the symmetry direction. + Incoherent 
scattering at 0—17.7?, theoretical. [A. J. Freeman, Acta Cryst. 
12, 929 (1959).] x Total scattering, experimental. ———— 
Thermal scattering, theoretical. e Incoherent scattering, obtained 
by subtracting the thermal scattering from the total scattering. 
— — — Extrapolation of the experimental curves to 0—0 as 
a-+b62. The two vertical arrows indicate verified, but unexplained 
irregularities. The thermal scattering was calculated according to 
(4) with measured dispersion relations {G. Dolling, in Inelastic 
Scattering of Neutrons in Solids and Liquids, (International 
Atomic Energy Agency, Vienna, 1963), Vol. 2, p. 37. These 
measurements of the dispersion relations in Si agree approximately 
with x-ray scattering measurements by J. Corbeau, Compt. Rend. 
256, 1275, 4641 (1963), and with calculations by T. I. Kucher, 
Fiz. Tverd. Tela. 4, 992 (1962) [English transl.: Soviet Phys.— 
Solid State 4, 729 (1962)].) except near @=0, where (5) was 
used. The polarization of modes with K along the [110] direction 
is not known, except at one point of high symmetry in each zone of 
(periodically extended) reciprocal space, where the vibrations 
are purely longitudinal or transverse. Therefore, the measure- 
ment and calculations for the [110] direction are confined to 
angles near 0—0. 


volume scattering is small. Comparison of surface 
conditions shows that it accounts at least primarily for 
the deviation, at @<4°, of the experimental curves in 
Fig. 1 above the expected a+06? variation. The 
observed excess in (0) is attributed to this cause. The 
time variation in J(6) is approximately independent of 
6, so that it is believed the final curve should be lowered 
uniformly by about 0.05 e.u./atom. 

Similar experiments on quartz, in two directions, also 
show surface scattering, similar agreement with 
(piezoelectric effects are negligible), and Tine less than 
theory. However, the phonon dispersion relations are 
not known. 
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Transmission spectra of thin (10 to 30 uy) CdS single-crystal samples have been recorded in the hitherto 


Li uninvestigated region of 250 to 400 cm-t. A number of new absorption maxima are located which necessi- 
tated a new phonon assignment with considerably lower values of the acoustic modes. Some 30 absorption 
peaks are attributed to multiphonon combinations of the following Brillouin zone-edge frequencies: LO — 295 
j cm™, TO, = 261 cm“, TO:=238 cm, LA —149 cm, TÀ;— 79 cm™, and TAs=70 cmt. The transmission 
d Spectrum of hexagonal ZnS is obtained for the first time in the region 300 to 750 cmi. Again some 27 
absorption maxima are assigned to multiphonon combinations of LO —346 cm 1, TO 5318 cm, TO; — 297 
Fee cm“, LA=181 em™, TA; =92 cm, and TA;—73 cm". The phonon assignment for the optical lattice 
Es absorption in cubic ZnS due to Deutsch is shown to be inconsistent with the requirements of the Brout 
he sum rule and other regularities expected for the zinchlende-type semiconductors. Two new assignments 
sle j (LO=339 cm~, TO=298 cm, LA=155 cm-!, and TA=93 cm, and LO=339 cm-!, TO —298 (an 
nt LA=190 cm™, and TA=115 cm™) only slightly different from each other are proposed. The characteristic 
st. phonon frequencies are discussed in terms of structure and effective ionic charge. 
ed 
e 1. INTRODUCTION and Besson* reported seven absorption maxima between 
a NFRARED lattice absorption arising from multi- 400 and 600 cm™ which were assigned to various binary 
ie phonon combinations has been observed in many combinations using four assumed characteristic fre- 
nal group III-V compounds and group IV elements.!2 The — Quencies. Spitzer" has recently shown that the same 
RE combination bands are a continuous absorption. How- two-phonon bands can also be adei de another set 
P ever, peaks in absorption occur because of singularities of four-phonon neue, ane e M ccr 
cr, in the phonon frequency distribution. These singu- from those used by Balkanski and Besson. Deutsch* has 
A larities arise from critical points in the Brillouin zone P MR additional bands, ae bed 
ion where the frequency wave vector (k) curves have zero 1 radiation, in the samec pon es 
: of slope. Such critical points are believed to occur at or diferent sets of assignments which almost fully ex- 
a near the zone boundary. Usually the critical point char- P ES Pu es pe s im Yn ae eps 
to acterizes a phonon branch and the combination bands of the bands reported oy a XADSERATC UN 
may be classified into summations or differences of Dem BULL absorption bands in the region 600- 
the longitudinal optical (LO), transverse optical (TO), LUGE Aah ness E ee es mee 
longitudinal acoustic (LA) and transverse acoustic On eae Bee be li dm ero 2 fe 
ice (TA) phonons at the zone edge. The interaction of an ne eure : " 1 ae y ADU ae p 
for infrared photon with the lattice vibrations are governed 2 pt pnm doit s 
E z DRE All transmission measurements on CdS to date have 
in only by the conservation of energy and wave vector. b z : EA r haa DO e 
; i A a for ea E confined to frequencies greater than cm, 
he Turner and INGES e teve Rx : n : despite the observation of Collins that the zone-center 
he nents in the lattice absorption of AISb in this manner: POEM optical-mode phonon has a value of 241 cmt 
of M Ony cs Mss earam: 9i earum puts of Thus, it was felt that a transmission study in the region 
el these effects has been obtained for group II-VI com Jow 400 cm! could provide additional important in- 
pounds. Mitra? hasi recently explained cie nia formation, particularly regarding the low-frequency 
lso i absorption observed m d loy Gn GC pc characteristic phonons. These were considerably larger 
(5) » phonon combinations, and it appears be con - than in the case of ZnSe? and the III-V compoun doi 
an tions are similar to those governing the S ede Furthermore, if the proposed frequencies are designated 
ATE III-V compounds. In the case OBIS NE a LO, TO, LA, and TA phonons, there is a lack of con- 
reflection technique located thet zone centem ope sistency with various correlations described by Keyes,” 
transverse mode at 241 cmt. Subsequently, Balkanski Mitra and Mitra and Marshall,"»including the Brout 
sum rule.” It was also anticipated that by using very 
| * Supported by U. S. Air Force Cambridge Research Labora- thin crystals, structure in the broad absorption bands 
I tories, Office of Aerospace Research, under Contract No. 114 be resolved and be of considerable assistance in 
2 l m Ee det Research Foundation of Illinois Institute obtaining a reliable assignment. 
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i 300 to 550 cm™. 
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1 In the case of ZnS, Deutsch? has studied absorption 
i in the cubic modification, over the frequency range 


410-830 cm™. His assignment fails to explain the strong 
absorption at 431 cm. Furthermore, the characteristic 
frequencies used (particularly the low-frequency value), 
as in the case of CdS, do not conform with the correla- 
tions mentioned earlier. Thus a new assignment for 
cubic ZnS, with lower values for the acoustic branches, 
seemed necessary. 

Lattice absorption in hexagonal ZnS is reported here 
for the first time. Thin crystals have been used in order 
to cover the optical-acoustical phonon combination 
region and to resolve the structure involved in the 
anticipated removal of degeneracy of the zone- 
boundary TO mode and possibly the TA mode as well. 


2. EXPERIMENTAL 


Platelet crystals of CdS and ZnS, grown from the 
vapor phase by nucleation on a quartz seed as de- 
scribed by Fochs and Lunn," were used in this investi- 
gation. This growth technique yields thin crystals 
(~10 p) of high optical quality and low impurity 
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— — Fc. 2. Transmission spectrum of CdS in the region 
525 to 750 cm™. 


Deutsch, in Proceedings of the International Conference on 
wuclors, Exeler, 1962 (The Institute of Physics and 
ciety, London, 1962), p. 505. 

'ochs and B. Lunn, J. Appl. Phys. 34, 1762 (1963). 
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content, but relatively small surface areas («0.5 cm?). 
Absorption data was obtained for both compounds 
over the region 300-800 cm! using a Perkin Elmer 
double-beam spectrometer (model 521) with grating 
optics. 
3. ANALYSIS OF SPECTRA 


a. CdS 


The absorption spectrum is shown in Figs. 1 and 2, 
and the positions of the bands observed listed in 
Table I. The absorption spectrum may be conveniently 


Taste I. Assignment of the absorption maxima observed in 
CdS in terms of phonon combinations of frequencies (cm): 


'TO; 2261 TA; 279 
LO=295 LO=14) 
TO2=238 TA2=70 
Balkanski 
Present and 
study Besson Deutsch 
Band (300°K) (77°K) (77°K) Caled. Caled.- 
No. cm^! cm^! emt Assignment cm^! Obsd 
1 308 TO:+TA2 308 0 
2 318 TOs:--TA: 317 -1 
3 331 TO:-- TA» 331 0 
4 340 TOi--TA: 340 0 
a 347 TO1+TO2—-LA 350 +3 
5 365 LO+TA2 365 0 
6 374 LO-TA: 374 0 
7 387 d TO2-+-LA 387 0 
40. 
8 410 TOiTLAÀ 410 0 
b 421 TOi1+TO2—TA1 420 -1 
c 428 432 TOi1+TO2—TA2 429 +1 
9 442 443 LO+LA 444 +2 
d 467 TO2+LA+TA2 466 -1 
10 478 2TO: 476 —2 
11 498 500 502 TOi+TO: 499 -1 
e 511 2LO —TA: 511 0 
12 522 524 526 2TO: 522 0 
13 533 540 LO -TO: 533 0 
14 555 562 562 LO-F-TO: 556 +1 
f 571 579 576 TO1+TO2+TA2 569 —2 
15 590 599 598 2LO 590 0 
g 628 LO --TO: 4-TA2 626 -2 
h 647 TOi TO: TLA 648 +1 
i 660 2LO +TA2 660 0 
j 684 LO+TO2+LA 682 -2 
k 695 LO+TO1+2TA2 696 +1 
1 715 3TO2 714 =i 
m 730 2LO --2T A2 730 0 
795a LO 4-TO: --TO:? 794 -1 
1045 (broad LO-+2TO:+TO: 1055 +10 
ani 
weak)# 

1345 (broad 2LO+2TO:-+TO: 1350 T5 


an: 
weak): 


a Reported earlier (Ref. 9). 


divided into three regions: (1) the group of strong bands 
below 450 cm=!, (2) bands of medium absorption 
between 450 and 710 cm~, and (3) the weak bands 
above 700 cm~. Since the zone-center transverse 
optical-phonon frequency as determined by Collins 15 
at 241 cm-!, these three regions may be considered a5 
arising from one, two, and three optical-phonon proc 
esses. First, considering the second region the SU 
prominent bands [Nos. 10, 11, 12, 13 (Fig. 1) and 
Nos. 14 and 15 (Fig. 2) ] are assigned to various binary 
combinations of the three zone-boundary optical 
phonons, LO —295 cm, TO; — 261 cm}, and TO; 238 
cmi, It may be commented here that the first two char- 


| in 
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acteristic phonons have been considered by all the 
previous investigators, Balkanski and Besson, Deutsch, 


‘Spitzer, and Mitra. Since the zone-center value of the 


transverse phonon is 241 cm~, it seems reasonable that 
the zone-boundary phonons should split into 261 and 
238 cm™ in the hexagonal wurtzite structure. 

Balkanski and Besson have studied the temperature 
dependence of the prominent bands in the two-optical- 
phonon region. In Table II, we have made a comparison, 
based on our assignment, between the calculated and 
observed values of the absorption coefficient at 80 and 
293°K. It may be shown" that the net power absorbed 
in a two-phonon summation band v44-v; is proportional 
to 


a=1+F,+F;, (1) 


F7 [exp (tcv,/kt) —1]7, (2) 


and v,— wave number of the nth phonon. Thus the 


where 


TABLE II. Comparison of the observed and predicted temperature 
dependence of the optical phonon combinations in CdS. 


Peak 
position Observed* Calculated 
cm7! Assignment fo 7 
498 TO,+TO: 0.56 0.556 
522 2TOi 0.57 0.574 
533 LO+TO, 0.60 0.578 
555 LO+TO; 0.64 0.598 
590 2LO 0.63 0.624 
* Determined from Balkanski and Besson's data (Ref. 6). 
terms f. and fo listed in Table II are 
fo=a(80°K)/a-(293°K) , G3) 
and 
fo as (80*K)/ao(293*K)) ; (4) 


. Where ap is the observed absorption coefficient at the 


peak absorption after correction for background. The 
excellent agreement shown in Table II between the 
calculated and the observed temperature dependencies, 
although perhaps a bit fortuitous, is a further confirma- 
tion of the essential correctness of the assignment of the 
optical modes. i 
Having obtained the optical-mode frequencies, the 
acoustical-mode frequencies at the zone boundary are 
obtained from the prominent bands in region (1). A 
perusal of region (1) reveals that one may further sub: 
divide it into two groups: one above and one bolo 
375 cm. The three bands at 387, 410, and 442 cm 
(numbers 7, 8, and 9, respectively, in Fig. 1) seem an 
obvious choice for the bands arising from combinations 
of the three optical modes with the longitudinal acousti- 
cal mode at the zone boundary. Each one of thon yields 
a value close to 149 cm! for LA. Below 375 cm! there 


16 W. Cochran, S. J. Fray, J. A. Johnson, J. E. Quarrington, and 


a 


N. Williams, J. Appl. Phys. 32, 2102 (1961). 
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Fic. 3. Infrared absorption in wurtzite ZnS 
(350 cm™ to 600 cm). 


are six prominent bands (Nos. 1 to 6 in Fig. 1). They 
occur as three pairs, each with an average splitting of 
9 cm™. This splitting is attributed to two transverse 
acoustical phonons, which, like the transverse optical 
phonons, seem to be in order for the wurtzite structure. 
The two zone-boundary acoustical phonons assigned in 
this manner are TA;=79 cm and TA,=70 cm-!. 
The complete assignment for all the observed absorption 
maxima is presented in Table I. 


b. Wurtzite ZnS 


The absorption spectrum of wurtzite ZnS is shown in 
Figs. 3 and 4 and the band positions are listed in 
Table III. In this case, the absorption regions are not 
as clearly defined as in CdS, but similar broad classifi- 
cations can be made: region 1—below 545 cm“, can 
further be divided into regions above and below 430 
cm; region 2—between 545 and 700 cm™, and region 
3—above 700 cmt. 

The absorption in region 2 is again thought to arise 
primarily from the binary combination of optical 
phonons with assumed characteristic phonon frequen- 
cies, LO=346 cm~, TO;—318 cm™, and TOs-— 297 
cm. In region 1 two areas are distinctive, comprising 


60 T 
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TABLE III. Assignment of the lattice absorption in hexagonal 
ZnS in terms of the phonon frequencies (cm): 


TO,=318 TA, =92 
LO=346 LA=181 
T02:=297 TA,=73 
Peak 
position 
Band at 300°K Caled. Calcd.— 
No. en Assignment cm Obsd. 

1 358 TO2+TA2 360 E2 
2 362 2LA 362 0 
a 374 2LO—TO: 374 0 
3 381 'TOs-- TA, 383 +2 
4 390 'TO:- TA? 391 +1 
b 404 TOi+LA—-TAi 406 —2 
5 409^ TO: TÀ; 409 0 
6 419 LO+TA: 419 0 
c 423 TO+LA—TA2 425 +2 
7 437 LO+TAi 437 0 
d 450 'TOi2- TA; 4-TAs 452 +2 
e 463 'TO; -2TAs 464 +1 
8 4755 TO.+LA 474 —1 
9 487 TOi+TAi+TA2 483 —4 
10 498 TO,+LA 500 42 
11 526 LO+LA 528 ar? 
Í 555 LO--TO;—TA; 551 —4 
g 571 TO, +LA+TA:2 572 +1 
12 583» 2TOs 584 +1 
13 612 TO:+TO2 610 —2 
h 622 LO+LA+TAi 620 —2 
14 6348 2TO: 636 3-2 
15 6402 LO--TOs 638 —2 
16 658 LO4-TO: 658 0 
17 694 2LO 692 —2 
i 730 LO+T0O2+TA1 730 0 
j 750 2LO-FTA; 750 0 


a Relatively intense bands. 


the absorption maxima arising from the combination 
of the optical phonons with the zone boundary TA 
(between 370 and 450 cm?) and with the zone boundary 
LA (between 435 and 525 cm) phonons. Six com- 
ponents are observed in the optical-plus-TA region, and 
again it has been necessary to assume that the TA 
modes are nondegenerate. Characteristic phonon fre- 
quencies of TA,—92 cm™ and TA;=73 cm predict 
the observed structure in the absorption spectrum with 
good accuracy. From the optical-plus-LA combination 
bands (8, 10, 11, Fig. 3), a value of LA=181 cm™ has 
been obtained. This also explains the relatively intense 
absorption band observed at 362 cm as a 2LA 
combination. 


c. Zincblende ZnS 


The lattice infrared spectrum of cubic ZnS single 
crystals was investigated by Deutsch.” The reflectivity 
was measured between 15 and 34 u. The reststrahlen 
band with a reflectivity maximum at 30.8 » was located. 
A Kramers-Kronig dispersion analysis of the rest- 
strahlen band gave a value of 312 cm™ for the zone- 
center transverse optical phonon (v;). The transmission 
spectrum of cubic ZnS was measured between 1 and 
24 u by Deutsch. The transmission is essentially con- 
stant up to 14 u. Ten absorption maxima were observed 
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at longer wavelengths between 430 and 820 cm. These 
bànds were attributed to optically active two- and 
three-phonon processes. Using characteristic zone- 
boundary phonon frequencies of A —379 cm, B=297 
cm, C=263 cm, and D=228 cm, Deutsch could 
explain seven out of the ten observed bands. 

Since a diatomic cubic crystal usually has two optical 
and two acoustic branches, the higher two phonons 
must be from optical and lower two from acoustic 
branches. In a predominantly ionic crystal like ZnS, 
one would expect that the LO modes should have a 
higher frequency than the TO modes throughout the 
Brillouin zone. One may thus identify the four char- 
acteristic phonon frequencies, 4, B, C, and D assigned 
by Deutsch as zone boundary LO, TO, LA, and TA, 
respectively. However, a comparison? with the phonon 
frequencies of the other zincblende-type crystals reveals 
that the acoustical phonon frequencies as assigned by 
Deutsch are unusually large. Furthermore, since the 
zone-center longitudinal optical phonon frequency 
(vi) is 377 cm™ as calculated from the Lyddane-Sachs- 
Teller formula, a value of 379 cm™ for the zone 
boundary LO seems improbable. The Brout sum at the 
zone center amounts to 1.200108 sec? as compared 
with a value of 1.754108 sec near the zone edge. 
This large difference (45%) is quite unexpected for a 
predominantly ionic cubic crystal such as ZnS, and may 
be attributed to high values of LA and TA phonons as 
assigned by Deutsch. The discrepancy is even larger if 
one uses the v, given by Mitsuishi, Yoshinaga, and 
Fuyita.l” 

A fresh assignment of the zone-boundary character- 
istic phonon frequencies of cubic ZnS with lower values 
of the acoustic modes has therefore been attempted. 


"TABLE IV. Assignment of the multiphonon absorption 
bands in cubic ZnS. 


Deutsch Present study 
Observed^ Assign- Calc. Assign- Calc. Assign- Calc. 
cm"! ment! (cm7) ment 18 (cm7!) ment 2b (cm) 
b 
A EE M oae ORIYA 432 TO+TA 413 
455 2D 456 TO+LA 453 LO+TA 454 
491b C+D 491 LO+LA 494 TO+LA 488 
526 2C,B+D 526 LO+2TA 525 LO-LA 529 
593b 2B 594 2TO 596 O 596 
6054 e -- TO+2LA 608 LO+LA+TA 005 
642b A+C 642 LO+TO 637 LO+TO 7 
677 A+B 677 2LO 678 2LO Oe 
733° LO+TO+TA 731 71 


ER As LO+2LA ? 
823 2B+D 822 LOXTO42TA 824 LO+TO+LA 82 


a Data from Deutsch (Ref. 13). 

b Relatively stronger bands. be 

e Although Deutsch lists a value of 431 cm-!, this band appears e th 
beyond the range of his observation. A lower value seems probable S 
from the extrapolation of his spectrum and from a comparison of his SP! 
trum with that of the hexagonal modification reported in this paper. 

4 Not listed by Deutsch but is apparent from his spectra. 

* Broad and very weak. 

í A 2379 cm7, B=297 cm^!, C 2263 cm^!, and D =228 cm^!. = 

£ LO =339 cm-!, TO 2298 cm™!, LA =155 cm™, and TA =93 cm". 

h LO =339 cm-!, TO 2298 cm=!, LA =190 cm^!, and TA =115 cm 


16R. H. Lyddane, R. G. Sachs, and E. Teller, Phys. Rev. 5% 
673 (1941). ? Soc 

1 A. Mitshuishi, H. Yoshinaga, and S. Fuyita, J- Phys. 907 
Japan 13, 1235 (1958). 
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TABLE V. Comparison of the obse 


rved and li JE 
dependence of the tw. PDA EE 


0-phonon combinations in cubic ZnS. 


o Observed* Calculated 

-eak (77°K 420° K (7 (420°K 

Position EN ota NIE UU EH, 
cm” Assignment a(300°K)  a(300^K) al300°K)  a(300?K) 
491 LO +LAb 0.50 eB 0.490 1.34 
y TO-+LAs 0.50 e 0.518 1.34 
593 2TO 0.69 1.25 0.616 1.31 
642 LO+TO 0.79 1.14 0.643 1.29 
677 2LO 0.80 1.12 0.671 1.28 


? Determined from Deutsch's spectra (Ref. 13). 
b Assignment 1. 
ssignment 2. 


With assumed values of LO — 339 cm-!, TO — 298 cm~, 
LA - 155 cm^, and TA=93 cm™, all the observed ab- 
sorption maxima could be satisfactorily explained. How- 
ever, the LA-phonon frequency seems to be a bit low 
when compared with the corresponding phonon fre- 
quency of the hexagonal ZnS. An alternative assignment 
with the same LO and TO but LA=190 cm-! and 
TA=115 cm™ seems also possible although the nu- 
merical agreement with the observed band positions is 
a trifle worsened in this case. The assignments are 
shown in Table IV. 

Deutsch's data are used for the comparison of ob- 
served and calculated temperature dependences. 
Table V shows the calculated and observed temperature 
factors for the relatively prominent bands. There are 
usually two contributions to the background absorp- 
tion; the first due to free charge carriers and the second 
to the tail of the resonance absorption at v:. The former 
is expected to decrease with the energy of the absorbed 
photon. Since the background absorption in the high- 
frequency side of the lattice bands of ZnS is very low 
and independent of temperature, the contribution of 
free carrier absorption to the background may be com- 
pletely neglected. This is to be expected for a crystal of 
large band gap. The essentially temperature-independ- 
ent contribution from » to the background absorption 
is obtained from the extinction coefficient calculated 
from the reststrahlen spectrum (the temperature- 
dependent portion chiefly arises from the multiphonon 
processes). For the bands where data is available, the 
absorption coefficient is rather temperature insensitive. 
Thus the agreement between the observed and the cal- 
culated values may be regarded as only fair. The two 
assignments proposed here only differ in the case of the 
band at 491 cm. Temperature dependence alone can 
not therefore decide between the two assignments. A 
fresh high-resolution transmission study of the cubic 
sample seems desirable. 


4. DISCUSSION 


As noted earlier, the multiple structure observed in 
the lattice infrared spectra of the zincblende-type 
crystals can usually be accounted for in terms of i 
characteristic phonon energies belonging to the our 
branches: TA. LA, TO, and LO at the Brillouin zone 
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boundary. In certain cases, notably diamond? and 
GaAs, it has been deemed necessary to assume the 
breakdown of the degeneracy of the transverse modes 
in order to satisfactorily account for all the observed 
optically active multiphonon combinations. Since the 
Brillouin zone-boundary value of a branch may differ 
in different k space directions, it is only natural to 
assume such a splitting of the transverse modes in the 
case of the wurtzite structure. For satisfactory assign- 
ments of the numerous lattice absorption maxima in 
hexagonal CdS and ZnS we have indeed taken recourse 
to two TA and two TO phonons. 

The transverse frequency is smaller than the longi- 
tudinal one for all values of the propagation vector for 
the acoustical branches, except at the zone center where 
both are zero. Therefore there is no difficulty in identi- 
fying the lowest characteristic phonon frequencies with 
definite TA and LA modes. However, the identification 
of the optical-branch phonons to definite transverse 
and longitudinal modes is not so clear cut, and some 
confusion has resulted in the past. 

Keyes” has noted some correlation between the zone- 
boundary optical-phonon energies of the zincblende- 
type crystals and the effective ionic charge q*, derived 
from the Szigeti relation.’ For a purely covalent 
crystal (q*—0) such as diamond or Ge, the transverse 
and longitudinal optical branches are degenerate at the 
zone center whereas TO LO at the zone boundary. 
Although v>», at k=0 (Lyddane-Sachs-Teller for- 
mula!$) for weakly ionic crystals like InSb (g*— 0.42), 
TO is still greater than LO at the zone boundary. In 
the case of a crystal of intermediate ionicity (q*~0.7), 
Keyes predicted that LO — TO at knaz, which indeed is 
the case with ZnSe as shown by Mitra? However, 
strongly ionic crystals (g*70.7) such as the alkali 
halides? are expected to have LO greater than TO for 
all values of k. 

The effective ionic charge on CdS and ZnS is 1.00 
and 0.854, respectively. The highest characteristic 
phonon frequency for these materials may therefore be 
termed the zone boundary LO. The next two character- 
istic frequencies of the hexagonal CdS and ZnS, and the 

next single characteristic frequency of the cubic ZnS 
may now be identified with the TO modes. For pre- 
dominantly ionic crystals, the transverse optical 
branches as a rule undergo only limited dispersion. The 
zone boundary values of TO,=261 cm™ and TO,= 238 
cnr for CdS, TO; =318 cm~ anc TO2= 297 cm™ for 
the wurtzite ZnS and TO=298 cm™ for the zincblende 
ZnS are not too different from their zone-center values, 
241 cm-! and 312 cm~}, respectively, for CdS and ZnS. 
The zone boundary LO, TO, LA, and TA phonon fre- 


18 B, Szigeti, Trans. Faraday Soc. 45, 155 (1949). 

1? A. D. B. Woods, W. Cochran, and B. N. Brockhouse, Phys. 
Rev. 119, 980 (1960); A. D. B. Woods, B. N. Brockhouse, R. A. 
Cowley, and W. Cochran, ibid. 131, 1025 (1963); R. A. Cowley, 
W. Cochran, B. N. Brockhouse, and A. D. B. Woods, ibid. 131, 


1030 (1963). 
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Fic. 5. Correlation of the ratio of the optical frequencies of the 
zincblende- and wurtzite-type semiconductors with the effective 
ionic charge. 


quencies for a number of IV-IV, II-V and II-VI com- 
pounds along with their effective ionic charge are listed 
in Table VI. Keyes noted that (LO/TO)’ is a linear 
function of g* for crystals of diamond and zincblende- 
type structures. Such a plot is shown in Fig. 5. The 
crystals of wurtzite structure also seem to follow the 
general pattern if one uses the mean of the two TO 
values. 
The Brout sum rule? for lattice vibrations, 


Yo? (k) =constant , (5) 
where o;(k) is the circular frequency in the ith branch 
when the wave vector is k, was believed to be strictly 


valid for diatomic ionic crystals of cubic symmetry, 


TABLE VI. Zone-boundary phonon frequencies and effective ionic 
charge of some IV-IV, II-V, and II-VI semiconductors.* 


LO TO LA TA 
Compound cm! cm? cme: cmH g* 
© 1161 1275 991 750 0 
Si 413 482 333 139 0 
Gee 247 280 215 65 0 
SiC 851 710 540 363 0.94 
GaSb 193 215 134 49 0.33 
InSb 155 179 118 43 0.42 
GaAs 234 258 188 70 0.51 
AISb 297 316 132 65 0.53 
GaP 361 378 197 115 0.58 
InP 318 329 150 62 0.68 
ZnSe 212 208 162 87 0.71 
ZnS (cubic)? 339 298 155 93 0.85 
339 298 190 115 
i ZnS(hexagonal)e 346  318;297 181 92;73 
CdS (hexagonal 295 261; 238 149 79;70 1.00 


a Data from Ref. 3 if not otherwise stated. 
b Reference 2. 
e B. N. Brockhouse and P. K. Iyengar, Phys. Rev. 111, 747 (1958). 
4 This study fe 1). 
=e This study (assignment 2). 
í This study. 


» A. A. Maradudin, E. W. Montroll, and G. H. Weiss, Theory of 
altice Dynamics im the Harmonic Approximation (Academic 
nc., New York, 1963), p. 115. 
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Fic. 6. The effect of mass difference on the separation between 
the optical and the acoustical branches. 


whose constituent atoms interact only by Coulomb 
forces between all ions and the repulsive interaction is 
between nearest neighbors alone. However, relation (5) 
seems to hold reasonably well for a number of zinc- 
blende!*-type crystals. Rosenstock’! has recently 
demonstrated that Eq. (5) with little or no modification 
can also be applied to more general force models and to 
nonionic crystals. 

In Table VII the zone-edge Brout sums (vp2+2p?) 
for CdS and ZnS are compared with their zone- 
boundary values (LO?-+2TO?+LA?+ 2TA?). The agree- 
ment is fair. Such a comparison may therefore be re- 
garded as a check on the upper bounds of the zone- 
boundary phonon frequencies of diatomic crystals. 

Other than the numerical agreement between the ob- 
served and calculated positions of infrared absorption | 
peaks involving the acoustical modes, there are but few 
checks on the soundness of the LA and TA assignments. 
In analogy with the diatomic linear chain, Keyes has | 
shown that one may expect a linear relationship between | 
(LO/LA)? and c, the ratio of the larger to the smaller — | 
ionic masses for the zincblende-type crystals. Such a 
plot is shown in Fig. 6, in which the wurtzite crystals 
are included as well. The LA assignment of hexagonal 
CdS and ZnS seems reasonable. For the cubic Zn$ 


: E : * 
assignment, No. 2 seems to be more probable. If suchjs = 
the case, the difference between the hexagonal an 
cubic ZnS characteristic phonon frequencies is not very 


| 
3 
| 
| 
l 


TABLE VII. Comparison of the zone-center and zone-edge 
Brout sums for some 2-6 compounds. 


Brout sum Brout sum 
at k~0 in at Kmax 1D 
Compound Structure 1027 sec? 1027 sec 
CdS Wurtzite 7.73 8.10 
ZnS Zincblende 12.00 11.84 
12.60 
ZnS Wurtzite SS 12.61 
ZnSe Zincblende 5.35 6.13 


a Assignment 1. 
b Assignment 2. 


21 H. B. Rosenstock, Phys. Rev. 129, 1959 (1963). 


tion 
few 
nts. 
has 
veen 
aller 
ch a 
stals 
onal 
Zns 
ch is 
and 
very 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


OPTICALLY ACTIVE PHONON PROCESSES IN CdS AND ZnS A1025 
large, which is to be expected from the essential simi- ACKNOWLEDGMENTS 
larity in the gross features of the spectra of these two 
modifications. Such a similarity was also noted between 
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127 (1959); W. G. Spitzer, D. A. Kleinma IE TF Aeronautical Research Laboratories for providing the 
ibid. 113, 133 (1959). odes specimens. b 3 


PHYSICAL REVIEW VOLUME 134, NUMBER 4A 18 MAY 1964 


Theory of Electron Backscattering* 


Rocer F. DasuEeNT 
Sandia Laboratory, Albuquerque, New Mexico 
(Received 23 December 1963) 


An exact integral equation is derived for the backscattering coefficient as a function of energy and angle. 
The equation has an approximate symmetry which allows one to determine the conditions under which 
backscattering will be independent of the energy of the incident electrons. The validity of some approxima- 
tions is discussed, and a possible method for obtaining numerical solutions is proposed. It is found that small 
and large-angle individual scatterings are of about equal importance. For those electrons that have lost very 
little energy, the equation can be solved exactly, and some interesting results are obtained. 


1. INTRODUCTION follows. Given a uniform, monoenergetic flux of elec- 
trons incident on a semi-infinite block of scattering 
material, what is the energy and angular distribution 
of electrons that are scattered back out? In order to 
solve this problem using standard multiple-scattering 
theory, one must find the distribution function for the 
electrons at each point within the block. A more 
economical method would be to obtain an equation for 
the distribution of backscattered electrons alone, 
thereby freeing ourselves from the unnecessary task of 
finding the distribution within the block. In the fol- 
lowing section we will derive an integral equation for 
the backscattering coefficient as a function of energy 
and angle. An application of this equation to the energy 
dependence of the backscattering coefficient is given in 
Sec. 3. Section 4 contains a critical analysis of some 


A7 HEN a beam of electrons is directed against a 
thick solid target, many of the electrons stop 
in the target but some are scattered back out. These 
are the backscattered electrons. Over the past few 
years a considerable amount of experimental infor- 
mation has been collected ?-9* and from an empirical 
point of view there now exists a fairly complete picture 
of the phenomenon. On the other hand, attempts at a 
theoretical interpretation of the data’ have been 
based on rather severe approximations and have met 
with only limited success. The purpose of this paper is 
to present a reformulation of the problem which cir- 
cumvents many, but by no means all of the difficulties 
which one encounters in trying to treat backscattering 
through the usual methods of multiple-scattering 
theory. 
Mathematically, the problem can be formulated as 


* Work was performed under the auspices of the U. S. Atomic 
Energy Commission. $ i : 

1 Present pem Department of Physics, California Institute 
of Technology, Pasadena, California. p ly. iti 

1 Secondary electrons are also emitted. Experimentally, aT 
usually assumed that an electron emerging from He target wi 
an energy of more than 50 eV has been backscattere E 

2 E. Stenglass, Phys. Rev. 95, 345 (1954). _ 

3 P. Palluel, Compt. Rend. 224, 1492 (1947). 

4H. Kanter, Ann. Physik 20, 144 (1957). 687 (1962) 

* K. Wright and J. Trump, J. Appl. Phys. 33, A 406 (1954) 

* H. Kulenkampft and K. Rüttiger, Z. Physik 137, 449 V797- 

1H. Bethe, Ann. Physik 5, 325 (1940). 

5 G. Archard, J. Appl. Phys. 32, 1505 (1961). à ; 

* T, Everhart, J. Appl. Phys. 31, 1483 (1960). Fic. 1. Kinematics. 

9 W, Bothe, Z. Naturforsch. 4a, 542 (1949). 
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approximate methods along with some interesting 
numerical results based on the assumption that small- 
angle scattering is unimportant. Finally, in Sec. 5 we 
discuss the possibility of more extensive calculations. 


ROGER F. 


DASHEN 


2. THE BASIC EQUATION 
Given a uniform flux of electrons incident on a plane 
surface (Fig. 1), we define the backscattering coefficient 
R to be 


outgoing flux per unit solid angle per unit energy interval 


R(n,E;n’,E’) 


where H=incident energy, E’=outgoing energy, 
n=unit vector antiparallel to the incoming direction, 
n'—unit vector parallel to the outgoing direction. One 
will note that the quantity measured in the laboratory 
is not R, but R(n,E,n'E/) sec cosó', where @ is the 
angle between the n and the normal to the surface. 
This is due to the fact that we have assumed a uniform, 
infinitely broad, incoming beam whereas in the labo- 
ratory the incoming beam is confined to a thin pencil. 
To derive an integral equation for R, we will follow 
a chain of thought that has proven to be very fruitful 
in similar contexts and generally goes under the title of 
"invariant imbedding.” Our application of these ideas 
will be of the most rudimentary nature. For a thorough 
exposition of this manner of thinking and its many 
applications, the interested reader is referred to the 
review article of Bellman, Kalaba, and Wing.” The 
basic idea is very simple. Suppose we add a thin layer 
of the same material to the surface of our target thereby 
increasing its thickness by an amount At. Since we have 
assumed that our target was already infinitely thick, 
the backscattering coefficient must remain unchanged. 
If we take the new layer to be differentially thin, this 
simple device will lead to an integral equation for R. 


D 


TARGET 
Fic. 2. New paths created by increasing the thickness of the 


E Uf 
(a) 
target. The dots represent a single elastic or inelastic collision in 


the new layer, and the broken lines indicate backscattering in the 
original target. 


Py / l, 
Yi, 
) 


(c 


11 R. Bellman, R. Kaaba, and G. Wing, J. Math. Phys. 1, 280 
(1960). 
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incident flux 


As notational difficulties would tend to obscure the 
basic simplicity of the argument, we will not follow the 
derivation in mathematical detail. The reader, if so 
inclined, can easily fill in the gaps. 

Apart from normalization factors, the backscattering 
coefficient R(n,E; n',E") is imply the probability that 
an electron will follow any path which can lead to 
backscattering from the state (n,E) to the state (n', E^). 
By adding a layer of scattering material we have 
created some new paths. These are the paths where the 
electron makes a collision, either elastic or inelastic, 
somewhere in the new layer. They are illustrated 
schematically in Fig. 2. Note that we need not consider 
paths where the electron makes two or more collisions 
in the added layer since the probability that an electron 
will follow one of these paths is of order (A/)* To 
calculate the increase in R due to fact that we have 
added some extra paths, we must add up the proba- 
bilities that an electron will follow each of the new 
paths. Referring to Fig. 2 we see that the sum over 
paths of type (a) is simply a normalization factor times 
the atomic cross section. Again, apart from normali- 
zation factors, the paths of type (b) can be summed 
by multiplying the atomic cross section for scattering 
from the state (En) to a state (E",n") times the 
“original” backscattering coefficient R(n',E" ; n, E) 
and summing over all intermediate states (n'^ E"). 
Paths of type (c) are treated similarly. The sum over 
paths of type (d) contains the atomic cross section 
times the square of the “original” backscattering C0- 
efficient and a double summation over intermediate 
states. 

Since the total variation in R must be zero, the 
increase in probability of backscattering due to the 
increased number of available paths must be exactly 
cancelled by a decrease in the probability that 2n 
electron will follow one of the old paths; i.e., a pat 
along which it makes no collisions in the added layer. 
It is immediately obvious that the net probability that 
an electron will follow one of the original paths 1 
decreased since the probability that an electron will 
pass twice through the new layer without making 2 
collision is less than one. More specifically, the reduction 
in the probability that an electron will follow one of the 
old paths is simply the “original” backscattering CO" 
efficient times the total probability that the electron 
will make a collision as it passes twice through the new 
layer. 
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At this point we make the usual assumption that 
inelastic collisions with the atomic electrons contribute 
only to energy loss and do not change the incident 
electrons’ direction of motion. This simplification is not 
in principle necessary, but it is a very good approxi- 
mation and has the pleasant feature that the cross 
sections for angular deflection and energy loss become 
independent. Combining this assumption with the 


analysis of the preceding paragraphs, one finds that R 
satisfies the integral equation 


[(or(E)+wr(E)) secd+ (or (E’)-+wr(E’)) sec’ | 


XR(n,E;n’,E’) 
=0(E, —n-n’) sec6’6(E— E/) 


E 
«f [« (Z, E") secüR (n, E" ; n', E) 
E 
+oE”,E') secóü' R(n, E; n',E") E" 
[emn sech” R(n”, E; n’, E’) 


+o (E' n" -n') sech R(n, E ; n", E^) dn" 


E 
«f J [Roe n” Eo (E'', na) 
E' 


Xsecó"" R(n” E"; n’, E')dE”dn”dn” , (2) 


where c (E, cosO) is the atomic cross section for elastic 
scattering through an angle O, e(E,E') is the atomic 
cross section for an inelastic collision leading to an 
energy loss E—E', or(E) and wr(E) are the total 
elastic and inelastic cross sections, sec is the secant 
of the angle between n and the normal, and the angular 
integrations run over the hemisphere of unit vectors 
pointing out of the target. The first term on the right 
of Eq. (2) is the probability that an electron will follow 
one of the new paths of type (a). In accord with our 
assumption that inelastic collisions do not cause angular 
deflection, we have included only the elastic cross 
section in this term. The two single integrals are the 
sum over paths of types (b) and (c) and the triple 
integral is the sum over paths of type (d). Finally, the 
left-hand side is the net reduction in the probability 
that an electron will follow one of the old paths. The 
geometrical factors secó are the usual ones which arise 
because the effective volume of the layer available for 
scattering into a direction n is proportional to Af secé. 
In writing down Eq. (2) we have omitted a factor 
AtX (number of atoms per unit volume) which multi- 
plies each term as it is of no consequence. Physically, 
this means that backscattering is independent of the 
density of the scattering material. r 

For future purposes it is convenient to write Eq. (2) 
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in the form 


E 
| Í [8CE— E" op (E) —(E,E"] 
» 


Xsec"6(n—n")R(n", E"  n' EdE” dn” 
E 

ES | | R(n, E; n” E"JC(E" — Earp (E?) 
a 


—e(E" EJ] secba(n'—n")dE" dn” 


=o(E, —n-n') secó'ó(E— E’) 


" 
+f [tiet nit neto 
gu 


Xsecb" 0 (E— E") R(n" E” : n E") E" dn" 


E 
dc / ji [R(nE; n", E")[s (E",n' -n") 


—8(n!—n"or(E") | secf(E"" — E^) ME" dn" 


E pE” 
+f j. | [Renen 
n dig 


Xec(E", E n" n") sec” (qun En 
XRM”, E"! ; n', E)dE'"dE'"dn'dn"' 5 (3) 


where 6(n—n’) is the delta function on the unit sphere. 

In the remaining sections of this paper we will restrict 
ourselves to electron energies that are nonrelativistic 
but large compared to the binding energy of the atomic 
electrons. This is a permissible limitation since from the 
form of Eq. (2) it is apparent that for a given maximum 
value of E and minimum value of £’, R does not depend 
on reflection coefficients whose energy indices lie outside 
this range. Under these conditions the elastic cross 
section is simply the Rutherford cross section” 


Ze 1 


o(E, cosQ) = SSS, 
4E? (1—cosO+7)? 


(4) 


where y is the screening parameter. The inelastic cross 
section can be sufficiently well approximated by” 


eZ 1 G - 
ON TELE) e Cua E—E'> enin (5) 
E (E—Ey 


where emin and €max are the minimum and maximum 
energy losses.? For our purposes it is sufficient to know 
that emax is proportional to Æ and that the usual range- 


2 N. Mott and H. Massey, Theory of Atomic Collisions (Oxford 
University Press, New York, 1949). 
13 M. Livingston and H. Bethe, Rev. Mod. Phys. 2, 245 (1937). 
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energy relation is given by” 
| E dE 2wZe  /2E 
i w(E,E')(E-E')\dE' = ———-— inf — , (6) 
0 dx E I 


| where the unit of length has been chosen so that there 
| is one atom per unit volume and J is a mean ionization 


potential. 
: In writing down Eq. (3) we have placed the factors 
: in a particular order to bring out the fact that the 
| integrations over energy and angle can be considered 


as matrix multiplications. This observation leads to a 


more economical notation and allows us to use the 
; methods of matrix analysis to manipulate the equation. 
We consider R(n,E; n',E') as a matrix with two sets 
i of indices, n and E, and define the following matrices 
i: Sp=4(eZ2)10(E, —n-n’) sech’ (E— E’), (7a) 


Sr=4le Z) [o (E,n-n’)— ò (n—n’)or(E) ] 


Xsecó'8(E—E'), (7b) 
1 W= (QxeiZ)-[5(E— E')er (E) — e EE’) ] 
t Xsecóü'ó(n—n'). (7c) 


Equation (2) can now be written as 
WR+RW = (Z/8r)(Ss+SrR+RS r+RSeR). (8) 


Owing to the fact that none of the matrices Sz, Sr, 
and W commute, Eq. (8) cannot be solved explicitly. 
We can, however, put it in a more advantageous form. 
Tt is a standard theorem of matrix calculus“ that a 
formal solution to (8) is given by 


Z i) 
Ro I exp( — Wi) (Ss-I-SgR-- RS z-- RS5R) 
T 
3 Xexp(—Widt, (9) 


which holds provided that the integral converges. To 
see that the integral does converge one need only con- 
sider the physical meaning of the exponentials. Writing 
the matrix exp(—/W) in the form 


exp(—/W)=P(E,E’;tsecé)6(n—n'), ^ (10) 


we find that P satisfies the differential equation and 
boundary condition 


Bie aw 1 sec) 

E a —Qzez) i [o(B,E”)—8(E-E")wr(E)] (41) 
A y X PE" E! t secü)d E" , 

ae PEE; 0)=6(E—E’), 


m which one easily sees that P(E,E’ ; t sec6) is simply 
probability that an electron will suffer an energy 


Bellman, Introduction to Matrix Analysis (McGraw-Hill 
pany, Inc., New York, 1960), 


a 
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loss E—E! while traversing a path of length /secg 
where the unit of distance is chosen such that 2me!Z 
(number of atoms per unit volume)=1. Now for 
fixed E and Æ’, P tends rapidly to zero for large ¢ and 
the integral converges. Landau has given explicit 
expressions for the function P so we now have an 
equation that can, at least in principle, be solved by 
successive approximations. 

Equations (8) and (9), being quadratic in R, most 
likely have more than one solution. Obviously the 
physically interesting solution has the property that, 
considered as a function of Z, it tends smoothly to zero 
for small Z. We will now give a heuristic proof that 
Eq. (9) has a unique solution with this property. 
Strictly speaking, the atomic screening radius y and 
the minimum energy loss €min are functions of Z. In 
the next section, however, we will see that the de- 
pendence of R on these quantities is rather unimportant 
and to a good approximation we can take the matrices 
Sr, Sg, and W to be independent of Z. With this 
assumption one can easily see that Eq. (9) has a unique 
formal solution as a power series in Z. To see that this 
series is, in fact, convergent we need only observe that 
an expansion of R in powers of Z corresponds exactly 
to an expansion in the number of times an individual 
electron is elastically scattered before leaving the target. 
Now a large number of scatterings requires a long time 
spent in the target, but for a given energy loss very long 
paths are highly improbable. Therefore, for fixed E— E', 
the coefficient of Z must tend to zero for large N and 
the series converges. Since we have found that (8) has 
a unique solution which is analytic around Z =0, any 
extraneous solutions must have some singular behavior 
for small Z. 


3. DEPENDENCE ON THE INCIDENT ENERGY 


Perhaps the most striking experimental observation 
is that the number of backscattered electrons and their 
distribution in energy and angle are, over a broad range 
of energies, very nearly independent of the incident 
energy. The first application of our formalism will 
therefore be to investigate the energy dependence of 
R. Since the parameters y and émin appearing in (5 
are rather poorly known functions of E and Z, our first 
task must be to see if R is sensitive to moderate vari- 
ations of these parameters. For fast incident electrons 
both y and émm/Z are very small numbers, generally 
being on the order of 10-3. In the limit as Y and €min 
tend to zero both the elastic and inelastic cross sections 
diverge, but one can easily verify that they have the 
symbolic expansions 


c(E,n-n^) —cz(E)6(n—n')— eZ: (QE)? 


XIn(1/y)? (1—n-n) +01), (12) 
w(E,E!) Sor (E) (E— E) -2rZe E? 
xImmQE/DX (E—E)2-0(), (9) 


15 L, Landau, J. Phys. 8, 201 (1944). 
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where the symbol O(1) has been used to denote finite 
terms independent of y or emin plus terms which vanish 
as Yy Or Emin tends to zero and we have used Eq. (6) to 
simplify (13). Referring to Eq. (3) we see that the terms 
proportional to ez and wr drop out so the dependence 
of R on eji, and y is only logarithmic. 

Upon inserting the cross sections (4) and (5) into 
Eq. (2), one can verify by a simple change of variables 
that if R(n,£; n', E^) is a solution so is AR(n, AE; n’ NE’) 
where À is any positive number, provided that one 
neglects the energy dependence of the logarithms 
In(1/y) and In(2E/I) appearing in (12) and (13). 
When the latter is a good approximation we can apply 
our uniqueness “theorem” which then yields 


R(n,E; n'E) AR(njE; n AE). (14) 


Physically this means that increasing the incoming 
and outgoing energies in the same proportion will leave 
the whole backscattering process unchanged, except 
for a scaling factor which is precisely that needed to 
keep the integrated coefficient at a constant value. 
While it is reassuring to derive this well verified 
symmetry from theory, it is perhaps more important 
to investigate the conditions under which it is valid. 
'To begin with, Eq. (14) does not hold unless we can 
neglect the energy dependence of the logarithms in 
(12) and (13). Roughly speaking, this requires that the 
incident energy be greater than the K-shell binding 
energy of the scatterer. For carbon the K-shell energy 
is roughly 300 eV and one finds experimentally that the 
total backscattering coefficient is very nearly constant 
for EZ; 500 eV.?? On the other hand, the K-shell energy 
of silver is about 25 keV and one indeed finds that the 
number of backscattered electrons does not become 
constant until E is on the order of 15 keV.?? A second 
breakdown of (14) occurs when the angle of incidence 
is near 90? (Fig. 1). In this case the incoming electrons 
can take full advantage of the forward peak in the 
Rutherford cross section which is extremely sensitive 
to y. With an incident angle of 80°, Kanter* has found 
that the energy distribution of electrons backscattered 
from aluminum varies considerably for incident energies 
between 10 and 70 keV even though the K-shell energy 
of aluminum is on the order of 1.5 keV. Since back- 
scattering is largest at grazing incidence, this breakdown 
of (14) for 62:90? will have a disproportionate effect 
when the incident flux is isotropic. Formally, Eq. (14) 
breaks down near 0=90° because lny multiplies an 
angular derivative which becomes very large when the 
initial direction is parallel to the surface of the target. 
Finally, if the relativistic cross sections, rather dear 
those given by (4) and (5) are used in Eq. (2), one finds 
that the resulting equation has a different dependence 
on the energy variables and (14) no longer holds. Ex- 
perimentally? one finds that in the relativistic ee 
the backscattering coefficient is not independent of the 
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incident energy, but shows a marked decrease from the 
nearly constant nonrelativistic value. 


4. APPROXIMATIONS 


Most theories of electron transport are based on the 
assumption that electrons lose energy continuously 
according to the usual range-energy relationship. This 
is definitely an approximation since energy loss, like 
angular deflection, is a stochastic process. To determine 
the accuracy of this approximation when applied to 
backscattering, we must examine the integrals on the 
left-hand side of Eq. (3). For simplicity, we will con- 
sider only the first integral and assume that the inte- 
gration over the angular delta function has already 
been carried out. The inelastic cross section e(E,E/) 
is very strongly peaked near E=E”, which suggests 
that we change the lower limit of integration from Æ’ 
to zero and, keeping E' fixed, expand R(n,E" ; n',E/) 
in a Taylor series around E"— EF. The zeroth-order 
term in the expansion is exactly cancelled by the 
&(E— E")er(E) term and the first integral becomes 


E ð 
sec | e(E,E")(E— Bde" RE ; n E!) 
0 ðE 


[5 


2 


E € 
-4 se| Í «(E^ E- Eye | 
; PE 


XR(n,E; n',E')4----. (5) i 


According to (6) the first term in (15) can be written 
as 


| 
1 
4 

| 


dE à 
— secü— RUD ; n^, E). 


dx à | 


On the other hand, in the continuous approximation m 
the probability of energy loss per unit path length a 


w(E; E') would be 
(dx)-38[ E— E'— (dE/dx)dx ]. 


In this case wz is simply (dx)-!, and upon inserting 
these expressions into the first term on the left-hand — 
side of (3) and taking the limit dx — 0, one finds that —— 

the continuous energy loss approximation is exactly 
equivalent to keeping only the first term in (15). Since 
w(E,E”) is peaked near E— E", the coefficient of t 
first term in (15) will be considerably larger than t 


these terms will be a good approximation unless R is 
so rapidly varying that the higher derivatives are lar 
To estimate these derivatives one must remember that 
we are considering R as a function of E for fixed E' 
perimentally one finds that for low-Z targets, R 
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slowly varying function and the approximation should 
be very good but for high-Z targets R changes by an 
order of magnitude between E/E'—1 and E/E'—2. A 
rough calculation indicates that for Z greater than 
about 40, the second term in (15) can be nearly as 
large as the first. One cannot, therefore, expect a theory 
based on a continuous energy loss to give detailed, 
quantitative results for high-Z materials. This con- 
clusion is in agreement with Monte Carlo calculations 
carried out by MacCallum, who found that calcu- 
lations based on the usual range-energy relationship 
gave the details of the energy distribution of back- 
scattered electrons correctly only for low-Z scatterers. 

'The easiest way to incorporate this approximation 
into the present formalism is through Eq. (9). If the 
energy loss is assumed to take place continuously, the 
function P(E,E';:sec0) defined in (10) is obviously 
just a delta function whose argument is determined by 
the range-energy relation which in terms of / is given 
by dE/di- —E- |n(2E/1). If we want our solution to 
obey Eq. (14) we must neglect the energy variation of 
the logarithm in which case the range-energy relation 
can be integrated analytically and we find 


P(E,E';isec0)— 2E'B(E?— E'?—Cisecü), (16) 


where C —2 In(2E/J). There is, of course, some ambi- 
guity in assigning a numerical value to the “constant” 
In(2E/I). However, one will note that (16) is simply a 
restatement of the Thompson-Whiddington'law? and 
Terrill!’ has found that this simplification of the range- 
energy relationship is fairly accurate provided that one 
takes C~10. Since one might argue that this value of 
C is not the proper one to use in backscattering prob- 
lems, it is interesting to observe that C can be deter- 
mined directly by studying the behavior of R in the 
limit E'— E. Clearly an electron which has lost very 
little energy cannot have spent enough time in the 
target to have been scattered more than once, which 
means that in the limit E’ — E the first-order term in 
an expansion of R in powers of Z is exact. This term 
can be obtained simply by setting R=0 on the right 
hand side of Eq. (9). 

At this point it is convenient to introduce some 
notation which will be used throughout the rest of this 
section. Since we will be interested only in solutions 
which satisfy Eq. (14), it is advantageous to work with 
the dimensionless function 


7(nn'; £))- E cos R(njE; n',E^), (17) 


where £=£’/E and the factor cosb’ has been added to 
facilitate comparison with experimental data. That 7 
is a function only of £’/E follows directly from (14). 

Substituting (16) into Eq. (9) and integrating over 


16 C. MacCallum, Bull. Am. Phys. Soc. 5, 379 (1960). 


— 31H. Terrill, Phys. Rev. 22, 101 (1923). 
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Fic. 3. Behavior of r for £=1 and normal incidence. The dots are 
the experimental points of Kulenkampff and Riittiger (Ref. 6), 


the delta functions, one easily verifies that to order Z 
Z  £cos0 cosh’ 1 ( 
4v C cos04- € cosh’ (1--n+n’+7)? i 


r(n,n'; £)— 


18) 


Taking C=10, we find that for £—1 and 0=0 (normal 
incidence) r should behave as shown in Fig. 3. The 
curves are in excellent agreement with the experimental 
points of Kulenkampff and Rüttiger. One will note 
that in the limit — 1 it is not an approximation to 
neglect the energy dependence of the In(2E/I) term 
in the range-energy relation which means that for £—1 
Eq. (18) is, within the approximation of continuous 
energy loss, exact and C has the unambiguous value of 
21]n(2E/I) where E is the incident energy. Since the 
experimental points were taken at 30 keV, this would 
yield an ionization potential [~400 eV independent of 
Z whereas one usually takes Iz(13.4 eV)Z. This is 
indeed a remarkable occurrence for which there appears 
to be no immediate explanation. 

Recently there have been attempts to explain back- 
scattering in terms of a single large-angle scattering ^ 
Since a theory that includes only a single scattering 
necessarily corresponds to the first term in an expansion 
of R in powers of Z, it must be exact in the limit Z 7 0, 
and one might hope that it would be valid for low-Z 
materials of experimental interest. Further motivation 
for this approach can be found in the fact that for Z 
between 6 and 40 the total backscattering coefficient 
for normal incidence is about 0.013Z which is suggestive 
of a first-order term in Z. However, for 0—0, Eq. (18) 
yields a total coefficient of [8C ]-[1—102]24-0(2) 
which for C—10 is about 0.004Z. Everhart? has SUE 
gested treating C as a phenomenological parameter 
which should be adjusted to give the observed back- 
scattering. In view of the accuracy with which Terrill’s 
value reproduces the experimental points at £—1 this 
is hardly a permissible procedure. Moreover, the energy 
distribution of backscattered electrons predicted by 
(17) is in violent disagreement with experiment so NE 
are forced to the conclusion that a single-scattering 
theory is not valid for realistic values of Z. Apparently 
the linear behavior of the total coefficient for low- 
materials is accidental. 
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Having found that a theory based on a single large- 
angle scattering is not realistic, one might be tempted 
to go to the opposite extreme and use a small-angle 
approximation in which one keeps only the forward 
peak of the cross section. The possibility of constricting 
a quantative theory of this sort is imbedded in what 
is perhaps the most interesting question in backscat- 
tering theory: Which is the more important, small- or 
large-angle scattering? One way to answer this question 
is simply to chop off the small-angle part of the Ruther- 
ford cross section, calculate the resulting backscattering 
coefficient, and compare the results with experiment. 
We have made a number of such calculations by 
iterating Eq. (9) in the continuous energy-loss approxi- 
mation (16). Other motivations for the calculations 
were the hope that we might obtain quantitative results 
for low-Z targets and to find out if matrix equations 
like (9) are suitable for machine calculations. More 
specifically, we replaced the Rutherford cross section 
Li—cosO+y]~ with [1—cosO ]- exp[ —5?/ 
(1—cosO)?] and iterated (9) starting with R=0. As 
long as Z was not large or b too small, the iterates 
converged satisfactorily becoming almost constant 
after about six iterations. However, for large Z or very 
small b satisfactory convergence could be obtained only 
for values of & close to one. This was due to the fact 
that the matrix Sr is the difference between two terms 
which partially cancel each other, and if either Sp or 
R becomes too large or rapidly varying, the equation 
becomes numerically unstable. For this reason, the 
method is not practical for applications of the theory 
and it would not be worth while to go into the details. 

Some typical results obtained for 5—0.1 are shown 
in Figs. 4 and 5. While the calculated curves lie con- 
siderably lower than the experimental ones, the results 
are remarkable considering that we have in effect said 
that if an electron is not scattered through an angle 
greater than about 25°, it is not scattered at all. Par- 
ticularly surprising is the fact that the calculated curve 
for copper is about as close to the experimental one as 
it is for aluminum, even though the atomic number of 
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Fic. 4. Results of the cutoff calculations TUE d 
incidence compared with the experimental ata . 
Kulenkampf and Rüttiger (solid line). 
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0.15 


Cu(Z*29) 


t Fic. 5. Results of the cutoff calculations (dashed line) for normal 
incidence compared with the experimental data (Ref. 6) of 
Kulenkampff and Riittiger (solid line). 


copper is over twice that of aluminum. This appeared 
to be a general trend. Our results for carbon and silver 
were percentage wise, in about the same agreement as 
those for copper and aluminum. Apparently the physical 
reason for this is that the increased probability of small 
angle scattering in the higher Z materials is being 
compensated for by the shorter time which a typical 
backscattered electron spends in the target. One will 
also note that the relative discrepancy between the 
calculated and observed curves is larger for 6/— 83? 
than for 6’=43°. This is to be expected since the points 
at 6’=83° corresponds to deflection through a smaller 
angle, which is more likely to be effected by small-angle 
scattering. As the cutoff angle was decreased, the calcu- 
lated points rose towards the experimental values. 
However, the difference between the curves computed 
with cutofis of 25° and 12° was only about 15% which 
indicates that the bulk of the discrepancy is due to 
scattering at angles well inside the forward peak of the 
Rutherford cross section. The conclusion is that quite 
independently of Z, about half the backscattering 
arises from a relatively small number of large-angle 
scatterings, with the remainder being due to the dif- 

fusive effects of small-angle scattering. In retrospect, 

this is not a surprising result since the mean-square 

scattering angle is on the order of y so one needs roughly 

y small-angle scatterings to deflect an electron through 

90°, but about one out of every y collisions will give 
rise to a single scattering through an angle greater than 
90°. 


5. A POSSIBLE METHOD FOR NUMERICAL SOLUTION 


The main result of the previous section was that 
backscattering is a very complicated problem. For one 
thing, we found that both the forward peak of the 
scattering cross section and the details of its large-angle 
tail are important. Also, it appears that for high-Z 
materials one must include the stochastic nature of 
energy loss. For these reasons it seems very doubtful 
that a simplified treatment will ever lead to more than 
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a crude estimate of the total coefficient. Barring the 

appearance of an entirely new formalism, if one wishes 

to make quantitative calculations, he is faced with 

either resorting to the Monte Carlo method or at- 

tempting a numerical solution of one of the present 

equations. Leaving aside, for the moment, the diffi- 
culties arising from the peak in the Rutherford cross 
section, let us consider some pertinent properties of 
Eq. (2). 

The factors sec appearing in (2) blow up at 0— 90^. 
Since the laboratory coefficient R secü cosóü' is well 
behaved at 0—90? the integrand remains finite. In 
carrying out our cutoff calculations we approximated 
the angular integrals with a Gaussian quadrature 
scheme which avoids the ambiguous points at 6=90° 
and found that these apparent divergences caused no 
difficulty. 

Since the problem has rotational symmetry around 
an axis perpendicular to the target, R(n,E; n’,E’) has 
the property that its average over the azimuthal angle 
of either n or n' is equal to its average over the azimuthal 
angle of both n and n'. Using this property, one readily 
verifies that averaging (2) over the azimuthal angles 
of n and n' yields an equation for the corresponding 
average of R. The resulting equation is still of the 
matrix form (8) and the same techniques can be applied 
to both the full and averaged equations. This procedure 
will greatly reduce the number of points needed to 
carry out the angular integrations. 

Another simplification arises if the solution is required 
to satisfy (14). As one would expect, (2) then becomes 
an equation for the dimensionless function r defined in 
(17) and the number of energy variables is cut in half. 
We have pointed out that R(n,E; n',E/) is independent 
of reflection coefficients whose energy indices are greater 
than E or less than E’ which means that the integral 
equation for r is of the Volterra type as far as the £ 
variable is concerned. One can therefore compute 7 out 
to some minimum value of £ without having to find the 
function for smaller values of £. The validity of forcing 
R to satisfy (14) is, however, not a friori known. 

The major stumbling block will, of course, be the 
strong forward peak in the Rutherford cross section. 
One will note, however, that the matrix W is also ill 

behaved, but when it was exponentiated to obtain Eq. 
_ (9) its singular behavior ceased to cause any difficulty. 
. The latter equation*can be obtained by multiplying (8) 
= from both right and left by exp(—W/), noticing that 
the left-hand side is a. total derivative and integrating 
rom zero to infinity. More generally, suppose we take 
any matrix function A(t) which has the properties, 
-. =I and A (*») —0, and carry out the same manipu- 
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lations. One easily verifies that the analog of (9) is then 


iz 


A o0 
ec A (D Ss-- Bi() R3- RB«(0) 
OT 
x +RSpRIA (idt, 


8r d Z 
Bil) --c ()—A() — Wc (Ss. ; 
Z di 8m 


8m d 7 
50] -| 4 © jmo- W4- E)r] 
7 dl 8r 


The game is now to choose 4 so that the matrices B, 
and Bo are well behaved. The ideal choice would be 
expl—(W—Z/87Sr)t], but from the analog of Eq. 
(11) one finds that the (n,E; n',E)th element of this 
matrix is the probability that an electron will be 
scattered from the state (n,E) to the state (n, E") while 
passing through a slab of thickness ¢, provided that 
one only includes scatterings such that the electron is 
always traveling into the target. Evaluating this matrix 
would be as difficult a problem as solving the original 
equation. On the other hand, the theory of small-angle 
multiple scattering!$:9 has provided a number of ap- 
proximate evaluations of this matrix. These approxi- 
mations are only valid for small-angle deflections, but 
one will note that this is all that is needed to insure 
that Bı and Bs are well behaved. The scheme is, there- 
fore, to calculate exp[— (W —Z/8mSr)t] in some 
tractable small-angle approximation, use this matrix 
for A(f) in. (19), and solve the resulting equation by 
iteration starting with R=0. Any reasonable approxi- 
mation for the above exponential should remove most 
of the small-angle scattering from B, and Bs, and the 
successive iterates should converge like an expansion 
in the number of large-angle scatterings. Our experience 
with the cutoff calculations described in the previous 
section indicate that such a series would converge 
rapidly after about five iterations. Furthermore, any 
iterative solution to (19) is clearly well behaved for 
small Z, so if the iterates converge they are guaranteed 
to yield the correct solution. The author has not 
investigated the above scheme in detail but preliminary 
investigations indicate that it may lead to a practical 
method for obtaining numerical results. 


(19) 
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An experimental and theoretical investigation is made of 
a semiconductor bar subjected to parallel electric and m 
requiring only a thermal-equilibrium electron-hole pl 
It is shown that in extrinsic material the growth is sp 
tion, while for nearly equal densities of positive and negative carriers the 
corresponds to the oscillistor phenomenon. Experimental observations of the waves wi 
at and above room temperature for frequencies from 20-400 kc. 


growing screw-shaped plasma density waves in 
agnetic fields. A particularly simple mode is used, 


asma, low-recombination surfaces, and moderate fields, 
zatial, corresponding to stable traveling-wave amplifica- 


wave is absolutely unstable and 
ere made in germanium 
and with electric and magnetic fields from 


25-60 V/cm and 0-11 kG, respectively. The growth rates and phase characteristics were found to be in 


excellent agreement with theory and gain in excess of 35 dB/cm was obtained. 
corresponding to nearly intrinsic material, evidence of instability 


cal prediction. 


I. INTRODUCTION 


Hue plasma density waves were originally 

proposed by Kadomtsev and Nedospasov! to 
account for an instability that occurs in the positive 
column of a gas discharge in a longitudinal magnetic 
field. Subsequently, Glicksman? suggested that a slightly 
generalized form of this theory could explain the oscil- 
listor effect in semiconductors.^-? In its simplest form 
the oscillistor is a semiconductor bar that exhibits 
terminal voltage and/or current oscillations when an 
electron-hole plasma is created by injection or light 
generation and sufficiently large parallel electric and 
magnetic fields are applied. Further refinements of the 
theory for semiconductors and experimental confirma- 
tion of the helical nature of the -oscillistor instability 
have been provided by several other authors,* =” leaving 
no doubt as to the correctness of Glicksman's basic 
idea. However, the feature from which the oscillistor 
takes its name, the terminal voltage-current oscillation, 
has not been adequately accounted for up to now, nor 
has it been determined whether the oscillation is caused 


* Based on a thesis submitted by C. E. Hurwitz to the Massa- 
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requirements for the degree of Doctor of Philosophy. 
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At higher temperatures, 
was found in accordance with the theoreti- 


by an absolute instability or a convective instability 
with feedback.4—9 

The present work indicates that the oscillistor in- 
stability is an absolute one and that the terminal oscil- 
lations can result from nonlinearities alone. The bulk 
of the paper, however, is devoted to the theoretical and 
experimental investigation of a completely different 
aspect of these helical waves, that of traveling-wave 
amplification. In this study, a preliminary account of 
which has been given earlier,” a mode is utilized that is 
much simpler than the one considered by previous au- 
thors and which involves only the thermal-equilibrium 
hole-electron plasma. In these other treatments the 
growth was always assumed to be temporal, representing 
instability, and the possibility of stable amplification 
was not considered, perhaps because the high degree to 
which the growth and propagation constants are in- 
fluenced by the net difference in hole and electron densi- 
ties was not fully appreciated. It will be shown that 
using nothing more than a uniform bar of germanium at 
room temperature with a low-recombination surface, 
Ohmic contacts, and moderate applied electric and mag- 
netic fields, controlled amplification of as much as 35 
dB/cm can be achieved. In the limit of nearly equal 
hole and electron densities (intrinsic material or material 
with a high level of injected carriers) the helical wave 
becomes absolutely unstable, corresponding to the 
oscillistor situation mentioned above. 
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Fic. 1. Right-handed helical perturbation for Jm|=1. (a) 
Perturbed-electron distribution indicated by dashed lines and 
perturbed hole distribution by solid lines. [After Hoh and Lehnert 
(Ref. 21).] (b) Cross-sectional view showing charge separation and 
resulting perturbed electric fields £4. Directions of carrier flow 
due to the combined effect of E; and Bo are indicated by the 
vectors EjX Be. (c) and (d) Carrier density versus radius along 
a horizontal diameter through the cross section of (b) for the 
equilibrium-gradient mode and the surface-density mode, re- 
spectively. Solid lines correspond to unperturbed distributions and 
dashed lines to distributions altered by radial and azimuthal flows. 


II. PHYSICAL MODEL 


A physical model of the growth mechanism of the 
helical wave was developed by Hoh and Lehnert” 
for the gas-discharge case. We will include an additional 
part of the mechanism and modify the picture to apply 
to semiconductors. The argument is begun by assuming 
a small, quasineutral, screw-shaped perturbation of 
positive and negative carriers to be superimposed on the 
steady-state unperturbed distribution. The general 
form, in cylindrical coordinates, would be F(r) exp (tat 
—ikz—im) with F(r) determined by the boundary 
condition at the wall; however, for the present argument 
we consider only the lowest mode m=-+1, as the full 
‘mathematical treatment shows that this mode has the 
lowest threshold for growth. The perturbation is shown 
schematically in Fig. 1(a) for m and k having opposite 
signs (a right-handed helix). The longitudinal electric 
field Ep tends to separate the superimposed positive and 
negative screws axially, which is equivalent to a rotation 
of one screw relative to the other, as shown in Fig. 1(a) 
and in the cross-sectional view of Fig. 1(b). The result- 


C. Hoh and B. Lehnert, Phys. Rev. Letters 7, 75 (1961). 
C. Hoh, Phys. Fluids 5, 22 (1962). 
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ing charge separation creates both an azimuthal and a 
radial electric field, Z1, and Err, respectively. The radial 
field was neglected by Hoh and Lehnert but it is no less 
important than the azimuthal field in the description of 
the total growth mechanism. In the linear approxima- 
tion these fields, together with the longitudinal magnetic 
field Bo, act on the unperturbed distribution to produce 
a radial and azimuthal flow of particles, indicated in 
Fig. 1(b) by the vectors Ei. Bo and Ei-XBo. If the 
unperturbed distribution has a radial gradient, as it 
does in a gas discharge or in the usual form of the oscil- 
listor (due to the finite recombination of injected car- 
tiers at the walls), this flow is not divergenceless and 
hence can feed particles from the main distribution into 
the screw with the proper phase, thus producing a growth 
of the perturbation. This may be seen more clearly in 
Fig. 1(c), where the carrier density is plotted as a 
function of radius along a diameter horizontally through 
the cross section of Fig. 1(b). For the signs of Ey and 
By chosen, the flow adds carriers to the assumed excess 
in the region 70 and removes carriers from the as- 
sumed deficit in the region «0, thus enhancing the 
perturbation. 

When the influx of particles is sufficiently large to 
overcome the dissipative effects of diffusion and recom- 
bination the helix will grow. It may thus be expected 
that for a given value of electric field Eo there will be 
a threshold value of magnetic field Bo, above which 
growth is observed. There should also be an optimum 
wavelength, for if the wavelength is too long the charge 
separation and therefore the driving electric fields vill 
be too small, while for a very short wavelength the 
diffusive effects will take over and dissipate the per- 
turbation. If the sample or plasma column is long enough 
that end effects are negligible, the wavelength should be 
independent of the sample length. For the electric and 
magnetic fields as shown in Fig. 1(a) the screw of 
opposite sense will clearly be attenuated since the radial 
and azimuthal flows now reverse directions and tend to 
destroy the perturbation. In general it may be stated" 
that the helix which can exhibit growth is right- (left-) 
handed for the electric and magnetic fields parallel 
(antiparallel). 

In a semiconductor a radial gradient of the unper- 
turbed distribution can be produced in thermal equi 
librium by proper doping (e.g., by indiffusion or out- 
diffusion of impurities), eliminating the need for & 
steady-state injected or optically-generated plasm 
with all of its inherent complications. Of course; the 
radial gradients of the holes and electrons in this 
“equilibrium-gradient mode” will be in opposite direc- 
tions, producing a partial cancellation in the growt 
process, but as shown in a calculation of the threshold 
for this mode in Appendix A there can still be a net 
buildup of the perturbation. 

However, except for this calculation, we will be con- 
cerned with an even simpler mode of semiconductor 
operation requiring no unperturbed radial gradient at 
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all, merely a sufficiently low surface recombination. 
In this mode, called here the “surface-density mode,” 
growth occurs because the radial and azimuthal flows 
can pile up or deplete the carriers at the surface in the 
proper phase, as illustrated in Fig. 1(d). The role of the 
dc-electric and magnetic fields in producing the flow 
is the same as before. Experimentally the surface- 
density mode is easier to achieve than the equilibrium- 
gradient mode, and also the mathematical theory for 
the surface-density mode can be formulated with fewer 
approximations. 

In addition to the growth properties, the motion of 
the wave can also be ascertained from the physical 
model. As is well known, a quasineutral density per- 
turbation under the influence of an applied electric 
field Æo propagates in the direction of minority carrier 
drift with a velocity va— u,Eo, with pa, the ambipolar 
drift mobility, being given by 


Ea [ues (16— o) ]/ (touet pour), 


where zo and po are the equilibrium densities of elec- 
trons and holes, and pe and y, are their respective 
mobilities. Thus, the helix performs a bodily translation 
with the ambipolar drift velocity va. In addition, there 
is a rotation of the helix due to diffusion and conduction 
of carriers across the magnetic field. (Another mecha- 
nism for rotation, due to the radial ambipolar electric 
field, also exists when the unperturbed carrier distribu- 
tion has a radial gradient.) The total motion of the helix 
is composed of these two parts. For extrinsic semicon- 
ductors the translation will dominate, while for in- 
trinsic semiconductors the ambipolar mobility is zero 
and the motion is therefore purely rotational. Discus- 
sion of the direction of rotation will be postponed until 
the next section where the necessary mathematical 
expressions are derived. It should be noted that because 
of the nature of a helix both motions when viewed in any 
cross-sectional plane produce a rotation of the carrier 
density and field patterns about the axis.” ) 

The ambipolar drift of the helix also plays a crucial 
role in distinguishing instability from stable growth. A 
detailed discussion is presented in the next section, but 
to complete the physical picture the basic ideas are 
outlined here. Three fundamental processes govern the 
change in amplitude of the perturbation with mue i 
any point in space: the growth mechanism prev is. 
described, diffusion, and ambipolar drift down ex à 
Above threshold the growth is enough to overcame d i 
fusion, but for a sufficiently large drift velocity the dis 
turbance is swept away, growing as nes Me 
point in space the perturbation remams bounas: m 
time, and the result is simply a spatially grow lc. > 
However, if the ambipolar driît velocity is Eie e 
the perturbation may build up faster than it can ditiuse 
—_ 


` is rotation was first 
2 The contribution of the drift motion to this rotation w 
;. : (Ref. 6) 
Pointed out by Okamoto ef al. (Ref. 0). 
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and drift away. At a point in space it may then grow 
temporally, corresponding to an absolute instability. 
Clearly the first case is pertinent to extrinsic semicon- 
ductors, and the latter to semiconductors with nearly 
equal electron and hole densities and to gaseous plasmas. 

It should also be mentioned that Gurevich and Joffe” 
have proposed a very similar mechanism which can 
lead to growing density waves in thin semiconductor 
slabs. In their model a steady-state density gradient 
and electric fields are set up either by illumination of 
one surface or by the Hall effect. Then the addition of 
a wave-like density perturbation creates a perturbed 
electric field through charge separation in the steady- 
state electric and magnetic fields. As in the case of the 
helical wave, this perturbed field acts on the unper- 
turbed distribution to produce a carrier flow in the 
correct direction to enhance the perturbation. The ap- 
plied electric and magnetic fields perform different 
functions in the chain of physical argument, but the 
fundamental process of coupling an unperturbed density 
gradient with a perturbed electric field to achieve 
growth is basically the same as for the helical wave. 
Experimental observations of instability under condi- 
tions similar to those considered by Gurevich and Ioffe 
have been reported??* and may well be explained in 
terms of their model. 


II. MATHEMATICAL THEORY 


With the preceding physical picture in mind, we now 
formulate a quantitative mathematical description of 
the helical wave in the surface-density mode. The 
equilibrium-gradient mode is considered in Appendix 
A. Using the equations of motion and continuity for 
electrons and holes, the dispersion relation governing 
the growth and propagation of the helical wave will be 
derived. From this equation will then follow both the 
threshold conditions and the growth and phase con- 
stants for stable growth as well as the threshold for 
instability. 

Consider a cylindrical bar of semiconductor, long in 
the z direction so that end effects may be neglected. The 
bar is assumed to be uniform and in thermal equilibrium 
and therefore to contain a uniform density np of elec- 
trons and fo of holes with 15— p= N 5 — NV 4, where Np 
and N4 are the densities of ionized donor and acceptor 
impurities, respectively. Under the assumption that the 
times and distance of interest are much larger than the 
mean free times and paths of the free carriers, we may 
write the following equations to describe the behavior of 


3L. E. Gurevich and I. V. Ioíie, Fiz. Tverd. Tela 4, 2641, 
2064 (1902) [English transl.: Soviet Phys.—Solid State 4, 1938, 
2173 (1903 

= ^ Nakashima and Y. Miyai, J. Phys. Soc. Japan 18, 1219 
(1983). i ; 

28S, Nakashima and Y. Noguchi, Japan. J. Appl. Phys. 2, 307 
(1963). 
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the carriers under applied electric and magnetic fields: 


J.= nqu,E--qD,Vn— i J.X B , (1) 
J,— pgurE— gDr p+ urda X B, (2) 


àn/àl— (1/9) V -J.d-y , (3) 
8p/ot- — (1/9 V-Ju- y, (4) 
V-E=—g(n—$)/€. (5 


The subscripts e and /i refer to the electrons of density 

n and holes of density p, respectively ; u is the mobility, 

D is the diffusion constant, y is the net rate of bulk 
1 generation (which in the absence of trapping is the same 
E for both carriers), e is the dielectric constant of the 
semiconductor, q is the magnitude of the electronic 
i charge, J is the current density, and Æ and B are the 
; electric and magnetic fields, respectively. 
In order to effect a considerable simplification in the 
mathematics it will be assumed throughout that the 
square of the Hall angle for each carrier is small; i.e., 
that u,;2B*««1. Under this assumption and also assum- 
ing that the self-magnetic fields are negligible with 
respect to the applied uniform magnetic field Bo, Eqs. 
(1)-(4) may be combined to yield 


àn/ 0i — ny, V - E--j,E- Vnd- D, Vni 


+ueE- (VnXBo), (6) 
and 
9p/0t— — pu, N -E— uE- V$-- Di V*p 

+y2E-(VpXBo). (7) 


The bulk generation term has been omitted as it is a 
: simple matter to reinsert it if desired near the end of the 
z calculation. 
Equations (6) and (7) are now linearized for small 
perturbations about equilibrium by writing 
p=potpi, E-Ec-Wa, (8) 
where the subscripts 0 and 1 refer to the equilibrium 
quantities and their perturbations, respectively. As- 
suming quasineutrality (ie, mie: but V0), 
eliminating y; from the linearized equations, and re- 
stricting ourselves to the particular case of interest where 
Ev and Bo are both in the z direction, we have the famil- 
- jar ambipolar continuity equation for 71: 


àn3/0t— D, V*ny— ua Eo(011/02) , 
ka= [uan (1t0— po) ]/ Oone t poun) 


Da= (noue Drt pounDe)/ (nouet pour) 


re the ambipolar mobility and diffusion constant, 

SE espectively. 

— "po obtain the normal modes for the problem, the 

turbations n, and yı are Fourier analyzed in cylindri- 
coordinates (r,$,2) in the form 


ny Ni(r) exp(iot— ikz— im) , 
ya Wa(r) exp(iet— ikz— im) , 


n=n n, 


(9) 


- where 


(10) 


(11) 


A. Lb. MOWHO RTER 
and (9) is solved for N,(r), the radial part of nı, with 
the result 

Ni(r)= eus (Br) , (12) 
where 7, (Br) is the modified Bessel function of the first 
kind of order m, cı is an arbitrary constant, and 


Bi [(m— ua Evk)/ Da H- k. (13) 


The potential Vi(r) is obtained from the continuity 
equations (6) and (7), which after linearization and for 
Es and By in the z direction may be combined in the 
form 


Vyit A V?n31— 0, (14) 
where 
: Eok(ucA- un) IB — 10) 
A=-i m TOS) 
(nouet poun) (B.— E?) monet Porta 
"Therefore, 
V1— Ania solution of Laplace's equation. (16) 


The appropriate solution of Laplace's equation is a 
modified Bessel function J,,(kr), and as a result 


Wi (7) = Ac m(6r) +621 m(kr) 5 (17) 


To complete the solution, the boundary condition at 
the surface r— a of the cylinder must be applied. These 
conditions have the form 


— (1/9)J «(a) — (1/9)Ju« (2) — smi (2) , (18) 


where J«(a) and Ja.(zm) are respectively the radial 
components of the electron and the hole current at the 
surface, s is the surface recombination velocity and 
nı(a) is the excess carrier density at the surface. From 
(1) and (2), for Eo and Bo in the z direction and for 
Le e Bol, we have to first order in the perturbation 
quantities 


1 (= LeBo pi 
—-J r= Nope = ) 


q or r 0$ 
ôn eBo On 
»( pauses ) (19) 
ðr r dO 
and 
1 Ov1 urBoð 
Tu= pub : x 
q or č r do 
On Bo On 
=i A. (20) 
or r 0 


_ Substituting these expressions into (18) and introduc- 
ing the solutions (11), (12), and (17) into the resulting 
equations we obtain two linear, homogeneous equations 
for the coefficients cı and cs. The dispersion relation 
relating the frequency w and propagation constant k 
is obtained by setting the determinant of these equations 
equal to zero. The result, after inserting the value of A 
from (15) and performing some algebraic manipulation, 
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is the following equation: 


(mus EsBo/ D.) &/ (8 k?) ] 
X [Bl mn! (Ba), (ka)— bI, (Ba) Tp! (ka) | 
+ aBRI m (Ba)L,/ (ka) — tmp BoBI, (Ba) In (ka) 
-FLGa/D.) — imu; Bo]kI n (Ba)Im (ka) 
—imunBo(s/D.)Is (Ba), (ka)-0, (21) 


where ua, the ambipolar mobility, is as defined in (10), 
un Ruoo= (pom? —nou?)/ (nouet pour) — (22) 

is the mixed conduction Hall mobility, and 
Ma — Nopoueun (uet unr)?/ (nouet pour)? (23) 


is a quantity with the dimensions of mobility squared 
which appears also in the theory of transverse magneto- 
resistance. In obtaining (21) use has been made of the 
definition (10) of Da and, consistent with the earlier 
approximation u., BK1, a term in B has been 
dropped. 

This dispersion relation (21) is rather complex and 
extremely difficult to treat as it stands. Besides being 
transcendental, it contains both w and k in the argu- 
ments of the Bessel functions (through the quantity £). 
Fortunately, for much of the region of interest it is a 
reasonable approximation to expand the Bessel func- 
tions in Taylor series to obtain an algebraic equation 
for w and k. In the section on experimental results some 
numerical solutions of (21) will be presented which indi- 
cate the range of validity of these expansions. Further, 
since we are dealing with the surface-density mode, it is 
expected that surface recombination will have to be 
small in order to obtain growth. Therefore, we make the 
approximation that sD,/a and neglect the terms in- 
volving s. In addition, it is assumed that bulk recombi- 
nation is also sufficiently small so that ra?/D,, which 
makes it unnecessary to reinsert the bulk recombination 
term. In Appendix B the modifications due to recombi- 
nation are taken into account and it is found that for 
the experimental situation of interest the correction is 
negligible. 

For simplicity we consider here only the lowest 
order |m| — 1 mode as this mode has the lowest thresh- 
old for growth. In Appendix C the general m mode is 
considered. Expanding the Bessel functions, we keep 
only the first two terms of the Taylor series, which will 
be valid as long as 62a? and £*a*««20. Substituting the 
expansions in (21), neglecting terms in £?£?, and elimi- 
nating 6? by means of (13) we obtain for the simplified 
dispersion relation: 


Ka? 3-4-2i(u,— pn) Bo }+i(a2/2D.) 
X (o—waE ok) [3—iBputHe)Bo] 
+ (ux EyBo/ D) ka?--A[14- i (u4— us) Bo] E (24) 


Threshold Conditions 


As discussed in Sec. II it is expected that for a given 
electric field there should be a critical magnetic fi 
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and a critical frequency at which growth is first ob- 
served. At this point, which we shall call the threshold, 
the frequency w and propagation constant £ are both 
real, corresponding to neither growth nor attenuation. 
Therefore, at threshold the real and imaginary parts of 
(24) may be separated and w eliminated between the 
resulting equations. Neglecting terms of order p, nB 
compared with unity the result is 


S+ (uw EsBo/ D, )ka?--4— 0, (25) 


It is evident that only the product ÆoBo enters (25). 
Since we seck the minimum value of the applied fields 
for which growth is first observed we minimize the 
EsBs product with respect to k. Tn this manner we ob- 
tain from (25) the threshold value of k, denoted by ke: 


k2=4/3a?, (26) 


and also the threshold value of By. For ease in com- 
paring theory with experiment, the latter is written as 
an expression for the threshold magnetic field Boe, 
which is then a function of Ey: 


OR-Da 1 
Boe=— me (27) 
myu w Eo 


with |m| — 1. 

The critical frequency f, is obtained from (24). 
Since at threshold Ey-- Boc! from (27), we must con- 
sider two limiting cases in order to remain consistent 
with the assumption p,,2Bo?<1. For extrinsic material, 
such that | (no— po)/ (not po) | og, Bè, all termsexcept 
the one involving u, are neglected, so that 


fe= Q/2x)u Eok. (28) 


For intrinsic or nearly intrinsic material, where 
| (no— po)/ (no po) | & ui, Bo, only the terms in 1,4 Bo 
are retained, with the result 


f = (20mD,/9za?) (us— ua) Boe . (29) 


Equation (28) corresponds to the translation of the 
helix at the ambipolar drift velocity (u,E;), while (29) 
corresponds to the rotation of the helix due to diffusion 
and conduction across the magnetic field. Tn the inter- 
mediate case |(o— po)/ (not o) | n, 2Bi?, the total 
motion is composed of both translation and rotation; 
however, a correct expression for f, cannot be obtained 
in the approximation p,,;2By2K1. 

From (26) it is evident that just the magnitude of 
ke is determined. This is simply a consequence of the 
fact that for a wave of the form exp (iet— ikz— im) 
only the relative signs of £, m, and w are significant. 
We will choose the signs of £, and m to make f->0 in 
(28) or (29) and to satisfy (27). The resulting signs of w, 
k, and m for various directions of E; and By and for dif- 
ferent types of material are listed in Table I. We have 
assumed e> ya. The sense and motion of the helix may 
be deduced immediately from this table. 

In agreement with the predictions of the physical 
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Tarte I. Relative signs of w, k, and m for helical wave 
of proper sense to permit growth. 


Directions of n-type p-type Intrinsic 

Eo and Bo material material material 
Eo>0 Bo>0 w>0k>0 m<0 e»0k«0m20 w>0 k>0 m<O 
Eo»0 Bo <0 w>0k>0 m>0 w>0k<0 mO w >0 k>0 m>0 
Ex <0 Bo <0 e90k«0m»0 w>0k>0 m<0 w>0k<0 m 20 


Eo«0 Bo»0 e»0k«0m«0 e»20k»50m»0 w>0 k«0 m<0 


model, the helix that can exhibit growth is right- 
(left-) handed for Bo parallel (antiparallel) to Lo, 
irrespective of the type of material. In each case the 
screw of opposite sense has no threshold and is always 
attenuated, as will be seen in a subsequent calculation 
of the growth constant. In any cross-sectional plane, 
the direction in which the helix appears to revolve about 
its axis is independent of the direction of Eo and reverses 
sign with a reversal of Bo. Further, for a given Eo and 
B, this direction of apparent revolution is the same for 
n-type and intrinsic material and is in the opposite 
direction for p-type material. 


Growth and Phase Constants 


Having established that a threshold for growing 
helical waves does indeed exist and having calculated 
the critical fields, wave number, and frequency, we 
now examine in detail the functional behavior of the 
propagation constant of the waves both above and 
below threshold. In particular we will consider suf- 
ficiently extrinsic material that the waves exhibit only 
stable spatial growth. The question of instability will 
be taken up in detail later and it will be seen just how 
extrinsic the material must be to assure stable growth. 

We seek solutions of the dispersion relation in the 
form of traveling waves, excited at a specific frequency 
and varying spatially. Therefore, w is considered to bea 
real variable and & is allowed to become complex, i.e., 
k=k,+1k;, where k, is the phase constant and k; 
the growth constant. The wave will then travel with 
phase velocity Vpn=w/k, and will vary in amplitude as 
exp(Az). Equation (24) is now solved directly for k, 
and for material sufficiently extrinsic that 


Halio>D./4, (30a) 
| (110— po)/ (not Po) |>He,n Bo, (30b) 

and 
us Eo >eDa , (30c) 


the square root in the quadratic formula can be ex- 
panded in a Taylor series. In performing the expansion 
we retain real terms to first order and imaginary terms 
to second order. Further, we choose the root correspond- 
ing to the negative square root; the other choice leads 
to E;zpsEo/2D, which by (302) is much greater than 
unity and is thus not self-consistent with the earlier 
Bessel function expansion. Actually the full transcen- 
dental dispersion relation leads to an infinite number of 
-other roots, which presumably correspond to strongly 
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damped modes. For the correct root we have 


kæ (o /uaEo) —(8Da/ 30 ha Eo) [1 +4 (oa/ pa)" 
+ (waum Bo/4uaDa)] > (D 


A considerable simplification of this result may be 
achieved through the use of the previously derived 
expressions (27) and (28) for the threshold frequency 
and magnetic field. The propagation constant k=k,+ik; 
is then 


[o 8D, 1p fBo JN 
k= +i |; -( ) ZH (32) 
Bako 3Ha Eo feBoc ile 
Equation (32) corresponds to the helical wave of proper 
sense to permit growth, as given by Table I. The 
propagation constant for the wave of opposite sense is 
obtained by simply reversing the sign of the term 


(2f Bo/f Bo). 
Since k>k:, we have from (32) 


Vphase— Ügroup = Ia Eo , (33) 


confirming our earlier assertions that for extrinsic ma- 
terial the wave translates with the ambipolar drift 
velocity. 

We see further that the wave will grow spatially 
(k; 0 for waves traveling in the direction of increasing 
z and b; «0 for those propagating in the opposite direc- 


tion) when 
2By/ Bw (f/ f)9- (fe/ f) (34) 


and the wave of opposite sense is always attenuated. 

Actually the existence of a value of k; of the correct 
sign for growth is not sufficient to prove that the solu- 
tion corresponds to a growing wave; it might also be 
an evanescent wave, such as in a waveguide beyond 
cutoff. However, the experimental results confirm the 
fact that (32) is a true growing wave solution, and we 
need not investigate this point further. 


Instability 


In the earlier discussion of the physical model for 
the helical wave it was reasoned that for sufficiently 
small ambipolar drift velocity a disturbance could grow 
temporally rather than spatially, resulting in instability 
as opposed to stable growth. We shall now derive this 
result mathematically by developing a quantitative 
stability criterion for the surface density mode of 
operation. 

Stability criteria and the important distinction be- 
tween convective growth and absolute instability in 
systems with growing wave behavior have been dis- 
cussed by several authors.1*7? The problem is essentially 
one of determining the response of the system to a spa- 
tially and temporally bounded, but otherwise arbitrary, 
excitation. If the response (which may be any physical 
quantity of interest) is bounded as / — co then the system. 
is stable, and it is absolutely unstable otherwise. As- 
suming that only one dimension need be considered and 


iy 
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that end boundaries are sufficiently far removed so that 
reflected waves are of no significance, it can be shown 
that for the purposes of determining the stability it is 
sufficient to investigate the response ¢(z,f) to a 6-func- 
tion excitation in space and time: 


eg v: 
0- | eus | dk, (35) 
CL Cr D(k,w) 


where D(k,w)=0 is the dispersion relation for the sys- 
tem under consideration. The Laplace contour C; runs 
in the lower half of the w plane, below all singularities 
of the integrand, i.e., — «—i» «co « «—ip, where v is 
a large positive number. For convenience we have set 
$—i» in the conventional Laplace formulation. The 
Fourier contour Cr is as usual along the real k axis. For 
positive / the w-integration contour is closed by a semi- 
circle at infinity in the upper half-plane, and the £ 
contour is closed in the upper or lower half-plane when 
the region of interest is for z<0 or 270, respectively. 

In general the integrals in (35) will be difficult and 
several authors? have proposed techniques for ob- 
taining the stability information without explicitly 
evaluating these integrals. However, for the simplified 
helical-wave dispersion relation (24) for the surface- 
density mode, the integrations in (35) can be performed 
directly to investigate the behavior of $(z,!) for large 1. 
From the physical model presented in Sec. II it is ex- 
pected that instability will be possible only for nearly 
equal densities of positive and negative carriers, so for 
simplicity we set 79= fo in most terms of (24). How- 
ever, we do not set ua=0, and thereby retain the very 
important terms involving the quantity (uo— po). This 
turns out to be equivalent to neglecting terms of the 
order of (mo— $o)/ (nod- po) with respect to unity, but 
not with respect to quantities of order p, „Bo, which are 
much less than unity. This level of approximation will 
be maintained throughout the calculation. In several 
places we shall neglect to set no= po purely to preserve 
the form of particular terms and to aid in their identi- 
fication later; however, it is to be understood that the 
following calculations are correct only for 292 po. à 

Equation (24), after rationalization of the leading 
term, may then be written in the form (assuming 
Be,» By«&1 as usual): 


D (ko) = (k—ko)?—a(w—a0) =0, (36) 


where 


ko= "zx myeusEoBo/ 6Da--i| uo/ 4Da x 
uen (uo— un) Bé/9D« ]Eo, (37a) 


w= — (muun Eg Bo/6)| uo/ Da k 
— 5peunr (ue— pr) Bé / 3D H-:[8D«4/3a* . 
+ (u Fo? /8D,—uw EgBé/18D.]. (37b) 
ac m (u,— uj) By— 3i ]/6D.. (376) 


Focusing our attention on the Fourier integral of 
(35) in the & plane, we see that only contributions to 
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the integral will be at values of k=k(w) such that 
D(&,»)--0. As the notation implies, these values of £ 
depend parametrically on w, where w assumes all values 
on the Laplace contour Cy. Referring to (36) it is clear 
that there will be two lines of zeros of D(kyw) in the $ 
plane, corresponding to the two solutions 


E (o) = Essi [a (w—wyo) ]2, (38) 
where 


w=w(Cr). (39) 


Since C, is below all singularities of the integrand in the 
w plane, we may immediately conclude that the two 
lines of zeros do not cross the real £ axis and that all of 
the zeros are simple. Further, using (37c) and (38) in 
(36) and recalling that » may be an arbitrary large 
positive number, it is clear that k,.(w) corresponds to the 
line of zeros in the upper half & plane, and £..(») to that 
in the lower half-plane. 

Since the zeros of D(E,») are simple, the poles of the 
integrand at &— E, (o) are also simple and the integral 
over & can be evaluated using the theory of residues. 
For z>0 the contour is closed in the lower half-plane 
enclosing the line of zeros corresponding to k= E. (o). 
The solution for z«0 proceeds analogously and will 
not be detailed. We may thus write for the response: 


exp(iet-t- iLa(w—ay) ] 2 


$(z,l) = exp(—ik) | do. (40) 


cr [a —) J 


The integral in (40) may be evaluated directly from a 
table of Laplace transforms," with the result 


1 a 

$(2,t)= exp( inui ite ; (41) 
(at)? A 

Writing wo=wort+iwo;, we immediately conclude that 
the system will be stable or absolutely unstable de- 
pending on whether c»; is positive or negative. 

It therefore follows from (37b) that a long semicon- 
ductor bar with a low-recombination surface and with 
applied parallel electric and magnetic fields Ey and By 
will be stable (wo; 0) when 


(uEx-- (8Do/av3 > (Su w^ E,Bo)* (42) 


and will exhibit absolute instability when the inequality 
is reversed. 

We may now make a plausible identification of the 
terms of (42). The quantity u,Esis simply the ambipolar 
drift velocity. The term (8D,/av3) is in the form of a 
diffusion velocity and is a measure of the effective veloc- 
ity at which carriers diffuse out of the helix. To aid in 
the identification of the last term we note that the 
earlier results on the threshold for growth may be put 
in a form similar to (42). From (26) and (27) we may 
write that growth will obtain when 


(Sua EyBo > (8D./av3)*. (43) 


*: See, for example, R. V. Churchill, Operational Mathematics 
(McGraw-Hill Book Company, Inc., New York, 1958), p. 328, 
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The right-hand term is again the square of the effective 
diffusion velocity. Since it is diffusion which must be 
overcome to obtain growth, we may conclude that the 
left-hand term is the square of the effective velocity at 
which carriers are driven into the helix by the combined 
action of Æo and Bo, corresponding to the growth 
mechanism. 

With these identifications in mind we may write the 
three quantities in (42) as vau), var), and Urrowths; 
respectively. From (42) and (43) it may then be stated 
that the helical wave will exhibit stable traveling-wave 
amplification when 

Varit H ait > Vgrowth > diff » (44) 
and will be unstable when the first inequality is 
reversed. 

Although the above discussion is based on a plausible 

and not rigorously justified identification of terms in 

(42) and (43), we see that (44) leads us to essentially 

the same conclusions reached by physical reasoning. 

That is, when the flow of particles due to the growth 

| mechanism is sufficiently strong to overcome dif- 


r fusion, a disturbance will grow but will be swept away 
4 from its point of origin and spatial growth will result. 
E Further, when the flow is sufficient to overcome both 


diffusion and drift, the disturbance will build up tem- 
porally at its point of origin, resulting in instability. 
Tt is not clear, however, why the above expressions in- 
volve the squares of the various velocities. Although 
these velocities are actually complex vectors, and some 
components will therefore be in either or both time and 
space quadrature, no explanation of the form of (44) 
has yet been obtained on this basis. 

Several further interesting conclusions may be drawn 
from Eqs. (42)- (44). For no po the threshold for growth 
is lower than that for instability, and consequently as 
Bois increased for fixed Ey the system passes successively 
from attenuated waves to stable growing waves to 
instability. For no exactly equal to po, however, there is 
no region of stable gain and above threshold the system 
is immediately unstable. The instability threshold value 
of Eo decreases monotonically with increasing Bo, but 
there is a minimum value of Bo given by 


Bo,min= 3us/ 2u y? (45) 


below which there can be no instability regardless of the 
value of Æo. The instability threshold value of Bo also 
decreases monotonically with Æo and although (42) 
_ predicts no corresponding minimum Ep for instability, 
- there presumably is such a minimum due to saturation 
_ of the growth term when ye,Bo>1. The terms which 
]ead to this saturation have, of course, been neglected 
hroughout the calculations. 

Finally, we may calculate the frequency and wave 
umber at which instability is first predicted, ie., 
when wo;=0 and (42) becomes an equality. These 
ties are just wor and hor, respectively, and are 
(37a) and (37b), where Eo and B, are the in- 
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stability threshold values and are related by (42) with 
the inequality sign replaced by an equal sign. 


IV. EXPERIMENTAL PROCEDURE 


In order to provide quantitative verification of the 
theory presented in the preceding section, a long, uni- 
form cylindrical semiconductor sample was required 
with accurately known carrier densities and mobilities 
and with bulk and surface recombination sufficiently 
low that 7>>@2/D. and s«D,/a. Further, it was neces- 
sary that the longitudinal electric field, pulsed to avoid 
heating, be applied by means of truly Ohmic contacts in 
order not to alter the uniform equilibrium carrier dis- 
tribution. Finally, a means of exciting and detecting 
the helical wave without altering the number of minority 
carriers was required. 

'The samples were fabricated from high-quality, un- 
compensated n-type germanium with a room-tempera- 
ture resistivity of about 30 ohm-cm and a bulk lifetime 
of 1400 psec. Accurate values for the carrier densities at 
each temperature of interest (290-360°K) were com- 
puted from the value of n— p=Np—Na=3.16X 10", 
obtained by Hall measurements at 77°K, plus the 
reported value? of the np product. In order to ensure 
that the net donor density was given correctly by the 
usual expression Np—N a= — (Rug), where Ri is the 
Hall coefficient, the measurements were made with a 
magnetic field of 20 kG oriented in a (001) crystal- 
lographic direction? Further, the Hall sample had a 
length-to-width-ratio of 6 to prevent shorting of the 
Hall field by the end contacts. The uncertainty in the 
resulting value of V5 — 4 is estimated to be +3%. 

Carrier-drift mobilities were obtained by scaling the 
reported? lattice mobilities at 300°K according to the 
experimentally obtained? power law appropriate to each 
carrier for lattice scattering; i.e., e= (3900) (7/300) 1-59 
and p= (1900) (7/300)-25, This computation was felt 
to be fully valid in view of the fact that in high-quality 
uncompensated germanium, lattice scattering is com- 
pletely dominant in this temperature range.3^9? Further, 
the measured resistivities at each temperature (using 
voltage probes and electric fields of about 0.05 V/cm 
to avoid complications due to contacts) agreed well 
within experimental error with the values calculated 
from the above densities and mobilities. 

Samples of both square and circular cross section 
were used. The samples were first lapped and then 
etched to final size to remove all damaged surface layers. 
Dimensions after etching were approximately 25 mm 
longX1 mm diam for the cylindrical samples and 
25X1X1 mm for those of square cross section. To 
comply with the requirement of the theory, the samples 
should be much longer than the attenuation length of 

28 F, J. Morin i 

2 wA Bulle snd : W. E ie E ER (LPS) 

9 C. Dunlap, Ax Introduction to Semiconductors (John Wiley & 
Sons, Inc., New York, 1957), p. 186. 


3 M, B. Prince, Phys. Rev. 92, 681 (1953). 
3: F, J. Morin, Phys. Rev. 92, 62 (1954). 
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any reverse wave so that the presence of the ends can- 
not be communicated to a significant portion of the bar. 
Since it is not known theoretically if a reverse wave does 
exist and if so what its properties are, the sample 
length was arbitrarily made approximately ten times 
the threshold wavelength [see Eq. (26)]. Experi- 
mentally, no significant effects due to end boundaries 
were observed. 

The etching procedure was quite complex and has 
been described in detail elsewhere. Using this procedure, 
which was concluded by protecting the surfaces with 
silicone oil, it was possible to achieve and maintain 
surface recombination velocities which were regularly 
less than 50 cm/sec and often as low as 20 cm/sec. 
These values of s were computed?! from the lifetime of 
injected carriers, measured by decay of photoconduc- 
tivity for each sample, and the similarly measured bulk 
lifetime of the original crystal. 

The relatively high-resistivity material and strong 
electric fields (20-60 V/cm) precluded the use of the 
usual methods for making Ohmic contacts. Metallic 
soldered, plated, or alloyed contacts always gave ptn 
or ntn junctions which interfered with the flow of 
carriers. Depending on the direction of current flow a 
p*n contact injected or extracted holes while an n*n 
contact excluded or accumulated them. Sandblasting 
the surfaces near the contact to recombine an excess 
or make up a deficit of minority carriers failed because 
the electric field swept the carriers through any practical 
recombination-generation region much faster than they 
could diffuse to or from the surface. 

The problem was solved by use of the arrangement 
shown in Fig. 2. Only one end of the bar is labeled but 
the other end is identical. The p+ contact carries only 
the hole current and the parallel n* contacts carry only 
the electron current. The series resistors are much larger 
than the contact resistances. Thus, by adjusting the 
variable resistor in series with the p* contact, the rela- 
tive electron and hole currents can be set to precisely 
the values required to maintain the thermal-equilibrium 
carrier distribution. The large resistors make the setting 
independent of the electric field, but of course the vari- 
able resistor has to be readjusted for each temperature 
of operation. With respect to fabrication, excellent 
contacts were formed by alloying In+1/2% Ga and 
Sn+5% Sb spheres to the sample followed by electro- 
lytic etching.* 

Excitation and detection of the waves were accomp- 
lished by means of pairs of small n+ probes attached to 
the surface of the sample (see Fig. 2). These n+» junc- 
tions conduct majority carriers (electrons) freely wet 
are impervious to the flow of minority carriers (holes). 

Thus when a pair of probes (such as aa’) is driven 


9$ C. E, Hurwitz, MIT Lincoln Laboratory Technical Report 
No. 320, 1963 blished). ; - 7: 
“W. E DeC = Holes in Semiconductors (D. Van 
Nostrand Company, Inc., New York, 1950), pp- 318- 0 eration 
35 A. K. Jonscher, Principles of Semicondudar Device Op 
(G. Bell and Sons, London, 1960), pp: 122-144. 
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Fic. 2, Sketch of cylindrical sample, showing 
Ohmic end contacts and #* probes, 


with an ac signal source, the electric field produced 
causes the holes to move back and forth in synchro- 
nism, perturbing the carrier distribution and initiating 
the wave. Actually, the linearly polarized excitation 
just described excites both the m=-£1 modes, but as 
discussed in the last section, one mode will always decay 
while the other can grow. By using two sets of probes 
(aa^ and bb’, for example) which are 90° apart around 
the circumference and driving them with two signals 
90° out of time phase, one or the other of the circularly 
polarized modes may be excited preferentially. 

The use of the n* probes in detecting the helical wave 
makes use of the fact that the electron quasi-Fermi 
level (QFL) is constant through the * junctions. The 
open-circuit voltage between two probes then gives the 
difference between the electron QFL's just beneath the 
surface at the two points. However, for small signals 
the perturbed electron QFL is linearly related? to the 
perturbations in the density and potential, i.e., 


$acyi— (kT /q) (n/n). 


Therefore, all information pertaining to the amplitude 
and phase characteristics of the waves may be obtained 
by measuring $41 with z* probes. Of course, it is also 
necessary that the contacts be impervious to the flow 
of holes in order to prevent their generation or recombi- 
nation in the junctions. 

These probes were fabricated by bonding 0.002-in. 
Sb-doped Au wires to the sample with pulses of current. 
To assure that the two probes of a set (c and c’, for 
example) were aligned on the same cross-sectional plane 
of the sample, an electrical null technique was used to 
locate them.? All probes were checked for proper opera- 
tion by using them to measure known changes in the 
electron QFL produced by illuminating the sample with 
penetrating light.? This test was also correlated with the 
current-voltage characteristics of the probes. 

A diagram of the arrangement for the above-described 
excitation and detection of the waves is shown in Fig. 3. 
The phase shifter and probe driver provided signals 90? 
out of phase from low-impedance sources. The input 
impedance of the detecting and measuring circuits was 


35 W. Shockley, Electrons and Holes in Semiconductors (D. Van 
Nostrand Company, Inc., New York, 1950), p. 308 

9 M. C. Waltz, in Transistor Technology, edited by H. E. 
Bridges, J. H. Scaff, and J. N. Shive (D. Van Nostrand Company, 
Inc., Princeton, 1958), Vol. T, pp. 375-380. k 
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Fic. 3. Experimental arrangement for exciting and detecting 
helical waves. The differential amplifier may be connected to any 
desired set of probes. 


sufficiently high to provide “open circuit" conditions up 
to frequencies in excess of 500 kc. 

The completed sample, mounted in a jig, was at- 
tached to a holder which permitted rotation of the 
sample in a vertical plane for accurate alignment 
parallel to the magnetic field. This arrangement was 
mounted between the poles of an electromagnet which 
was rotatable in the horizontal plane to complete the 
alignment of sample and field. The sample and lower 
part of the holder were immersed in a bath of silicone 
oil which was maintained at a constant, controllable 
temperature. A thermocouple mounted on the jig next 
to the sample read the actual temperature. In this 
manner the temperature could be set and held to within 
-£0.1°C of the desired value. 


V. EXPERIMENTAL RESULTS AND COMPARISON 
WITH THEORY 


The basic experiment consisted of applying parallel 
dc-magnetic and pulsed-electric fields to the sample, 
exciting the wave with a signal source at an appropriate 
set of probes, and then observing the excitation in 
magnitude and phase as it traveled down the bar. 

Many samples of both square and circular cross sec- 
tion were fabricated and examined experimentally. All 
exhibited spatially growing waves whose properties 
agreed qualitatively with the predictions of the physical 
model and the mathematical theory. However, due to 
the involved fabrication procedure and the finite- 
failure rate at each step (a full set of properly operating 
n+ probes was particularly difficult to achieve), only 
two samples, one square and one circular, were judged 
to fully meet the requirements discussed in the previous 
section and therefore to be suitable for accurate quanti- 
tative measurements. As will become evident in the 

following discussion, the agreement between experiment 
and theory is sufficiently good that although results for 
only two samples are reported, the theory can be con- 


dp c completely verified in the region of stable growth. 
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The final dimensions of these two samples were 1.16 
mm diam by 25.4 mm long for the cylindrical sample 
and 1.02X1.0225.4 mm for the one of square cross 
section. 

The temperature of operation proved to be a particu- 
larly valuable parameter to vary as it provided a test 
of the functional dependence of the wave properties on 
the equilibrium carrier densities without the need for 
many samples of varying resistivities. In fact, by raising 
the temperature it was possible to pass from the ex- 
trinsic region in which stable spatial growth is observed 
to nearly intrinsic material where some of the considera- 
tions of instability and the oscillistor are of interest. 

The extremely low yield of good samples precluded 
the use of sample geometry as another variable. The 
dependence on sample radius seems well enough con- 
firmed by the general quantitative agreement of all 
measurements, but the question of how long is a "long" 
sample and the length dependence of the threshold in 
samples comparable in size to a wavelength would still 
be of interest. Except where otherwise stated, the data 
to be presented refer to the sample of circular cross 
section, for in this case there is no ambiguity as to the 
value of radius to be used in the theoretical expressions. 


Helical Waveshape 


'The helical nature of the wave was confirmed by 
exciting the sample at probes aa' and observing the 
relative phase of the resulting signals at probes fi^ and 
gg'. (See Fig. 2. The electric field is positive from left 
to right.) The resulting waveforms are shown in Fig. 4, 
and we see that for By parallel to Eo the signal at gg’ 
led that at ff’ by 90° and lagged by 90? when B, was 
reversed. The traces shown are for threshold, but the 
same phase relationship was found for all fields and 
frequencies. Adding to this result the fact that the phase 
and group velocities are in the direction of the electric 
field (see next section) it is clear that we do indeed have 
a helical wave, traveling in the direction of ambipolar 
drift, which as predicted is right- (left-) handed for 
By parallel (antiparallel) to Ep. 

Further, when a circularly polarized excitation was 
provided by driving probes aa’ and bb’ with signals 90° 
out of time phase, only the polarization direction cor- 


-—- Vac 

xx F16. 4. Oscilloscope 
traces showing 90° 
phase difference be- 
tween signals at 
probes gg’ and ff’. 
Set (a) corresponds 
to Bo parallel to Eo 
and (b) to Bo anti- 
parallel to Eo. Verti- 
cal scale: 5 mV/div. 
Horizontal scale: 10 
psec/div. 
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HELICAL DENSITY WAVES 
responding to the predicted screw sense produced a 
growing wave. The opposite excitation produced only 
a strongly attenuated wave. That is, for excitation in 
the sense a — b'— a/— b the wave grew (for fields 
above threshold) for B, parallel to £i; and always de- 
cayed when Bo was reversed. For excitation in the sense 
a — b — a' — b' the converse was true. These results 
are in full agreement with the predictions of the physical 
model, Eq. (32) and Table I. 


Phase Constant—Phase and Group Velocity 


l'or a wave of the form exp[iw!—i(k,+-ik;)si¢ | the 
phase difference Ad between two points differing in z 
by Az at the same instant of time and same value of ó 
is simply 
Ab=—k,Az. 


Thus the measurement of the relative phase of the signal 
at two sets of probes, such as dd’ and fi’, gives directly 
the phase constant &, In Fig. 5 we plot —A®, the 
amount by which the phase at ff’ lags that at dd’, and 
the resulting value of k,, as a function of f/ Es for several 
temperatures and electric fields. The probe spacing Az 
in this case was 0.8 cm. Since the probes were several 
wavelengths apart there was always an uncertainty of 
2nm in the absolute phase difference as related to that 
measured. However, for the extrinsic case we assume 
that we have a simple wave so that the extrapolated 
value of A® is zero for w=0. Therefore, to each set of 
experimental points (at a given temperature) we have 
added an appropriate multiple of 27. For each set of 
points the required translation was always 2: and n 
was just that integer required to make k,=2/av3 at 
threshold, in agreement with (26). 

The measured values of A and therefore k, were, 
within experimental error, independent of magnetic 
field over the range 0-11 kG for all temperatures except 
67°C and above, but in the latter range exhibited a very 
slight dependence at the lowest frequency and highest 
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Fic. 5. Phase constant A. (left-hand scale), qd 
measured phase difference between probes dd" an l En fields 
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Fic. 6. Critical magnetic field Bos versus 1/Ea for several 
temperatures, Points are experimental; curves are theoretical, 
calculated from Eq. (27). 


magnetic fields. The theoretical curves were calculated 
from (32) and agreement is excellent. The slopes of the 
straight lines are just 4./2m and therefore, as predicted, 
Uphase= Vgroup = Malo, the ambipolar drift velocity. Values 
of k, obtained by numerical computation from the full 
dispersion relation (21) were insignificantly different 
from those given by (32). 


Threshold Conditions 


The threshold, by definition, is the point at which the 
z variation of the wave first changes from attenuation to 
growth. At this point k;=0 and the signal at each set 
of detecting probes must have the same magnitude. For 
a fixed electric field, the threshold frequency f, and 
magnetic field By, were obtained by exciting the wave at 
probes aa’ and bb’ and then slowly raising the magnetic 
field while sweeping the frequency back and forth. The 
field and frequency at which the signals at probes dd’, 
and ee’, and fi’ first became equal were then the thresh- 
old values. For properly operating probes these signals 
became equal simultaneously. The signal at probes cc’ 
was usually slightly larger, presumably due to higher 
modes which had not decayed sufficiently that near to 
the excitation. 

In Figs. 6 and 7 we have plotted the measured values 
of fe and By, as a function of the applied electric field 
Es for various temperatures. The theoretical curves were 
calculated from (27) and (28). Agreement is again very 
good. The small deviations from theory of the values of 
f. at the higher temperatures, lowest electric fields, and 
highest magnetic fields are presumably due to the be- 
ginning influence of the rotational contribution to the 
frequency. At high magnetic fields, Bo. also begins to 
deviate from theory due to breakdown of the assump- 
tion ut, Bg«&«1. Note, however, that at moderate fields 
the theoretical and experimental values of By. agree 
very well at all temperatures. This is to be expected 
since no assumptions concerning the relative carrier 
densities were made in the theoretical calculation of 
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Fic. 7. Critical frequency f. versus Æo for several temperatures. 
Points are experimental; curves are theoretical, calculated from 


Eq. (28). 


Bo. As the temperature increases and (57o— fo) — 0, 
fe decreases rapidly, but Bo. quickly approaches a 
limiting value which is independent of the magnitude 
of no and po, as may be seen from Eq. (27). However, 
when the mobilities are decreasing functions of tempera- 
ture, as is the case here, Bo. reaches a minimum and 
then increases slightly with temperature. Thus, as we 
see in Fig. 6, the values of Bo, at 77°C are somewhat 
higher than at 47°C.38 


Growth Constant 


The exponential variation of the wave amplitude in 
the z direction was confirmed by measurement of the 
relative signal magnitudes at probes dd’, ee’, and fi’, 
with excitation provided at aa’ and/or bb’. More 
precisely, as long as the excitation was sufficiently small 
that the signal at any set of probes was less than 500 mV 
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G. 8. Growth constant k; versus Bo for two values of Eo, at 
7°C and f—f.. Points are experimental; solid curves are 
t theoretical solutions for pe „Bo <1, and dashed curves are 


Since all measurements were made with small signals, no 
esis effects such as described by B. Ancker-Johnson [ Appl. 
hys. Letters 3, 104 (1963) ] were observed. 
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peak-to-peak, the wave varied longitudinally as exp (&;z) 
where k; is the growth constant. When the signal rose 
above this level, saturation effects began to set in and 
the linearized theory no longer applied. More will be 
said about nonlinear effects in the next section; in the 
present discussion we limit ourselves to the small-signal] 
linear region. 

Figures 8 and 9 show the theoretical and experimenta] 
variation of the growth constant k; as a function of Bo 
for various temperatures and values of Æo. In all cases 
the frequency was fixed at the threshold value f, 
pertinent to the specific temperature and electric field. 
The dashed curves are calculated from (32); the full 
curves are the results of the exact (in the limit 
Le BeK) numerical solution of the full dispersion 
equation, (21). The approximate solution is clearly ac- 
curate only in the neighborhood of threshold (&;— 0), 
and deviates both above and below threshold where the 
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Fic. 9. Growth constant k; versus Bo for several temperatures 
at Eo— 50 V/cm and f= fe. Points are experimental; solid curves 
are exact theoretical solutions for ue 2 Bo°<1, and dashed curves 
are calculated from Eq. (32). 


Bessel function expansions are no longer correct. The 
experimental results agree very well with the theoretical 
curves except at high magnetic fields where the assump- 
tion ue,» Be««1 becomes a poor approximation. Numeri- 
cal solutions were not obtained for the higher tempera- 
tures, but the excellent quantitative agreement between 
experiment and theory at lower temperatures indicates 
that the agreement should be equally good. Intermediate 
values of electric field, omitted from Fig. 9 for clarity, 
also gave quantitatively correct results. 

Note the rapid increase of growth constant with tem- 
perature in Fig. 8. The highest measurable value of ki 
was about 4 cm™, corresponding to a gain of approxi- 
mately 35 dB/cm. For growth rates above this value 
the sample began to amplify appreciably its internal 
noise to values comparable with the maximum signal 
amplitudes obtainable under linear operation. 
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The frequency dependence of the growth constant is 
detailed in Figs. 10 and 11, where for E5— 50 V/cm the 
theoretically predicted and measured values of k; have 
been plotted as a function of frequency at T=27°C 
and T'— 37?C for three values of magnetic field : By < Boc, 
Box Do, and Bo> Bo. Again the dashed curves are 
calculated from (32) and the full curves are the solutions 
obtained by numerical computation. The inadequacy 
of the simplified theory, except in the neighborhood of 
threshold, is strikingly evident. Equally apparent is 
the extremely good agreement between the experimental 
points and the exact theory. 

In Figs. 12 and 13 similar measurements of k; as a 
function of By and f for the sample of square cross 
section are presented. An effective radius a—0.053 cm 
was chosen in plotting the theoretical curves from (32) 
in order to give the best fit to the experimental data. 
This choice is very nearly the radius of a cylinder with 
the same cross-sectional area as the sample. No numeri- 
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Fic. 10. Growth constant k; versus f for Bg Boe, Boc Bue, 
and Bo> Bo, at T —27?C and Ea=50 V/cm. Points are experi- 
mental; solid curves are exact theoretical solutions for pe, Bo <&1, 
and dashed curves are calculated from Eq. (32). 


cal solutions were obtained for this case but, in light 
of the results for the circular sample, the behavior at 
frequencies and fields above threshold would presum- 
ably be in agreement with the exact solution. The 
gross disagreement at low frequencies, which is more 
than could be accounted for by the exact theory, is only 
an apparent one. It is due to the fact that linearly 
polarized (two-probe) excitation was used rather than 
the circularly polarized (four-probe) excitation employed 
for all measurements on the cylindrical sample. Exami- 
nation of (32) shows that for f< f. and By « By. the 
right- and left-handed helices have comparable attenua- 
tion rates. Since both are excited approximately equally, 
the measured value of k; will be some combination of 
the two. This identical discrepancy was later noted on 
the cylindrical sample, but was eliminated by using 
four-probe excitation. Because the square samples were 
by then of little interest, no attempt was made to redo 
€ earlier measurements. 
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Fic. 11. Growth constant k; versus f for By <Boe, By Boe and 
Bo> Boe, at T=37°C and [550 V/cm. Points are experimental: 
solid curves are exact theoretical solutions for Le PB KA, and 
dashed curves are calculated from Eq. (32). 


An effort to measure directly the functional behavior 
of ki for the helix of screw-sense opposite to the one 
exhibiting growth was made by simply reversing the 
rotation direction of the four-probe excitation, while 
maintaining the fields in the same direction. However, 
due to the fact that four points cannot give a perfect 
exp(ip) distribution, a sufficient amount of the grow- 
ing wave was excited to completely obscure the 
measurements. 

Along this same line an attempt was made to excite 
the next higher |m|=2 mode by connecting a and 
a’ together and b and b’ together, and then driving 
the resulting two terminals with the signal generator. 
Such an arrangement should preferentially excite the 
|m|=2 mode. However, again due to the imperfect 
excitation, enough of the |m|=1 mode was excited to 
drive the sample into strong nonlinear behavior and 
saturation before the higher |m|=2 mode threshold 
could be reached. 

Finally, it should be mentioned here that the growth 
constant was found to be an extremely sensitive func- 
tion of the angle between B; and Æo. Figure 14 illustrates 
this behavior. The value of k; falls off so rapidly that 
at an angle greater than -E3? the strong attenuation 
made measurements impossible. The reason for such a 
striking sensitivity to a very small transverse compon- 
ent of magnetic field is not known. In terms of the physi- 
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cal model it would seem that the only effect of such a 
field is to exert a weak transverse force on the carriers, 
slightly altering their distribution and setting up a 
small Hall field. Mathematically, the introduction of a 
transverse component of B, complicates the problem 
considerably by destroying the azimuthal symmetry. 
It might be possible to effect some simplification by 
introducing the transverse component of field as a 
perturbation. However, such a calculation has not yet 
been attempted and the effect remains unexplained. 


Nonlinear Effects and Current Oscillations 


As stated earlier, when signals at the probes reached 
approximately 500 mV peak-to-peak, saturation began 
to set in, indicating the onset of nonlinear behavior. The 
probes farthest down the bar (in the direction of Ej) 
saturated first as expected, since the amplified signal 
is largest there. Further increase of the signal level by 
raising of the excitation or growth rate successively 
saturated the level at earlier probes, with the signal 
reaching a limiting value of about 1.5 V peak-to-peak 
at all probes. At temperatures of 47°C and above, the 
gain for large enough magnetic fields could be made 
sufficient to saturate at least the last probes of the sample 
on amplified internal noise. The spectral characteristics 
of this noise were quite unusual and willbe the sub- 
ject of further discussion in connection with current 
oscillations. 

Two important effects were noted in conjunction with 
the onset of strong saturation, namely a marked increase 
in the dc sample resistance and appearance of oscillations 
in the terminal sample current. In Fig. 15 the measured 
dc resistance of the sample is plotted as a function of 
increasing magnetic field (and hence increasing growth 
rate). The point at which the resistance begins to in- 
crease over its equilibrium value coincides with nearly 
full saturation at the farthest set of probes (fi^). Cur- 

rent oscillations, shown in the inset of Fig. 15 begin to 
come into evidence at the same point. 

The sample resistance change is a graphic manifesta- 
tion of the onset of nonlinear behavior. The perturbed 
carrier density 71 has now become comparable with the 

equilibrium densities zo and ġo, and the carriers are 
strongly bunched into the traveling helix. Although the 


total number of carriers in the sample has not changed, 
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this bunching has produced a net increase in the resist- 
ance. The expected order of magnitude of this resistance 
change may easily be calculated. When strongly 
bunched, the carriers will be concentrated near the 
surface of the sample in a helix with pitch 27/k and an 
effective radius approximately equal to the sample 
radius a. Most of the current will flow between the ter- 
minals in this helical path. Both the increased path 
length of the current and the reduced component of 
the electric field in the direction of current flow then 
act to increase the total sample resistance over the 
equilibrium value. It is a simple matter to show geo- 
metrically that the path length is increased by a factor 
of approximately (14-£a?)!?, and that the component 
of electric field along the helix is also smaller than the 
applied fied by the same factor. Thus, with the onset of 
strong bunching the sample resistance may be expected 
to increase until it is about a factor of (1-++%°a?) larger 
than the unperturbed value. From (26) k’a?~4/3, so 
the resistance should approach a value about 2.3 times 
larger than that for no bunching. The measured values 
shown in Fig. 15 are in qualitative agreement with this 
estimate. 

On the basis of the linear theory no terminal VI 
oscillations are predicted. Since all time-varying quanti- 
ties vary as exp[2ot—ikz—imó ], mz£0, the density 
integrated over a cross section is independent of time 
and hence there can be no oscillation of the total re- 
sistance. Glicksman's assertion that standing waves, 
set up by reflections at the contacts, can give rise to 
terminal oscillations? is clearly incorrect since the den- 
sity still varies sinusoidally with $ and ż and the inte- 
grated density again vanishes. One must therefore in- 
voke nonlinearities or nonuniformities as the basis of 
the observed oscillation. Misawa’ has proposed that 
mixing in of the m=0 mode due to deviations from 
circular symmetry, such as nonuniformities or trans- 
verse components of magnetic field, could account for 
the behavior, and this possibility cannot be ruled out. 
In fact Okamoto eż al9 have observed that certain 
oscillistor samples show current oscillations only in the 
presence of a small transverse component of magnetic 
field. However, the above experimental observations 
that oscillations are present in extremely uniform 
cylindrical samples with strictly longitudinal fields, and 
that the onset of current oscillation is always associated. 
directly with the onset of nonlinear behavior, provide 
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quite conclusive evidence that large-signal nonlinearity 


‘is in itself sufficient to account for terminal oscillations. 


The fact that current oscillations were observed in 
the region of s/able amplification, requiring only that the 
gain and excitation be sufficiently large to reach satura- 
tion, shows further that instability is not required to 
obtain such oscillations. In this region the oscillations 
were essentially sinusoidal at the frequency of excitation, 
until very strong saturation occurred and some higher 
harmonic content became apparent. At temperatures 
below 67°C when the gain was raised sufficiently high 
to saturate the sample on its own noise, the resultant 
small current oscillations exhibited a similarly noisy 
frequency spectrum. However, at 67°C and above, fur- 
ther increase in the magnetic field resulted in a rapid 
narrowing of the frequency spectrum of both the signals 
at the probes and the current oscillations, until the fre- 
quency was nearly monochromatic and roughly equal 
to the threshold frequency. One might conceive of a 
situation in which the strong excursion into nonlinearity 
might produce a substantial narrowing of the bandwidth 
for amplification; however, it seems more satisfying to 
picture the cause as the onset of an instability and the 
consequent buildup of oscillation at a specific frequency. 
Indeed just such an instability was predicted from the 
theory for nearly intrinsic material (i.e, at high 
temperatures). 

Unfortunately, as discussed earlier in connection with 
the instability theory, a region of stable growth pre- 
cedes the onset of instability in extrinsic material. The 
resulting amplification of noise in anything but very 
nearly or absolutely intrinsic material drives the sample 
into saturation before instability is attained, so that the 
linear theory is no longer valid. Such was the case in all 
of our measurements, even at the highest temperature 
(87°C). We may, however, make the following qualita- 
tive observation. For the 30 ohm-cm germanium used in 
the measurements, and for Egy— 60 V/cm and Bo=11 
kG, the maximum values available with the present 
arrangement, Eq. (42) predicts that instability will be 
attainable for temperatures above 60°C. We note also 
that the rapid narrowing of the frequency spectrum of 
the amplified noise with increasing fields was observed 
experimentally only at temperatures above about 67°C. 
"These two facts lead one to the plausible conclusion that 
this spectral narrowing is the result of the onset of 
instability and the resulting buildup of oscillations in 
a narrow band of frequencies; however, further quanti- 
tative verification of the theory would certainly be 
desirable. : 

Just how close to intrinsic the material must be to 
reach the instability threshold before nonlinearity oc- 
curs may be seen by returning to Eq. (37a) and savne 
for the growth constant ko; at the point of insonuit 3 
Neglecting the term peunBo which will almost always 


be negligible, the result is 
heir (us Es AD; e (QEV ART) L0 P9 (no Po) J: 
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Experimentally it was observed that saturation began 
to set in when £;~4 cm, and in order to keep pe rB 
so that the theory correctly applies, the minimum value 
of Es for instability is about 25 V/cm. Therefore we 
need to have 


(q/kT)| (to po)/ (not po) | <0.64 V, 


which for 30 ohm-cm germanium requires T> 100°C. 
An easier method would be to start with much purer 
material. For Vp—N4<10" cm the threshold could 
be reached while still in the linear region at room 
temperature. 


The Oscillistor 


The above prediction and probable observation of 
absolute instability in a semiconductor bar, with nearly 
equal densities of electrons and holes and subjected to 
sufficiently large parallel and magnetic fields, leads 
naturally to a discussion of the oscillistor, for it is under 
precisely these conditions that the effect is observed. 
It was shown experimentally that large signal nonlinear- 
ity could give rise to terminal VI oscillations and, of 
course, the instability can provide the mechanism for 
the rapid buildup of large signal behavior in the form 
of a strongly bunched helix of carriers. 

Unfortunately the oscillistor effect as it is usually 
observed and reported in the literature presents such 
a complicated experimental situation that quantitative 
comparison with theory is prevented. The attainment 
of nearly equal carrier densities by double injection 
into a relatively pure bar of semiconducting (long 
p-i-n structure) leads not only to radial gradients of the 
carrier distribution but also to substantial longitudinal 
variations in the steady-state densities and electric 
field. Such variations are not taken into account by 
the theory and to do so would hopelessly complicate 
the mathematics. Further, oscillistor samples are usually 
made with relatively low surface recombination, and 
as a result growth is probably due both to the steady- 
state density gradient and to some piling up of carriers 
at the surface. 

Notwithstanding these complications, the essential 
features of the instability mechanism remain the same, 
and one would predict the occurrence of an absolute 
helical instability under the conditions pertinent to the 


® M. A. Lampert and A. Rose, Phys. Rev. 121, 26 (1961). 
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oscillistor effect. This result coupled with experimental 
confirmation of the helical nature of the effect," 
provides conclusive verification of Glicksman's pro- 
posal of a helical instability as the basis of the 
phenomenon, 

Finally, it should be pointed out that for the gaseous 
plasma the unperturbed densities of electrons and ions 
are always equal and there is no ambipolar drift of 
the helical perturbation. There will therefore be no 

f region of stable growth and, as assumed by Kadomtsev 

| and Nedospasov! and Hoh and Lehnert,” a only helical 

instability is possible. 


e 


VI. CONCLUSION 


It is evident from the excellent agreement between 
experiment and theory that the treatment of Sec. III 
provides an accurate description of the properties of 
helical waves in the approximation p?B*«1. Further, 
all observations, except the unexpectedly strong de- 
pendence of the growth rate on the angle between the 
electric and magnetic fields, are consistent with the 
13 physical model presented in Sec. II. 
| Although the work was devoted almost entirely to 
id the surface-density mode of operation, it is clear that 
the general arguments and results may be carried over 
to more complicated situations, such as the equilib- 
rium-gradient mode or the case of an injected plasma. 
In addition, although germanium was the only material 
studied, other semiconductors and perhaps semimetals 
should exhibit growing helical waves for wide ranges of 
fields and. frequencies, and in fact, oscillistor behavior 
has been observed in silicon and indium antimonide.5:19 
Unfortunately, in most materials surface and bulk 
recombination will produce such strong damping of the 
waves that growth may be difficult to achieve. 
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APPENDIX A: THRESHOLD CONDITIONS FOR 
EQUILIBRIUM GRADIENT MODE 


We again consider a cylindrical sample of radius a, 
long in the z direction, but now assume that the equi- 

- librium carrier densities 7; and po are graded in the rad- 
al direction. These gradients could be produced, for 
ample, by the outdiffusion of impurities (consider 
onors to be specific) in which case zz; and po would be 
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of the form” 
nor) noJ (ar) , 
do(r) 2 n2/no(r) = po/ J (or) , 


where 7z is the intrinsic density. For simplicity we con- 
sider the case of weak gradients (o?a? <2); no and py 
may then be written as 


no(r) &no(1 —o7?/4) 
bor) = po(1-- 97/4) . 


We further assume that the surface recombination 
velocity s is large (s= co), in order to avoid considering 
an admixture of the equilibrium-gradient and surface- 
density modes. As usual it is assumed that the bulk 
lifetime 7,:2»a2/D, and that the squares of the Hall 
angles pe,7°Bo<1. 

In equilibrium the continuity equations (6) and (7) 
become, denoting equilibrium values by the sub- 
script 0, 


no(r)u; V: Eod- i,Eo: Vito(7) + D; V?no(r) 


(A.1) 


(A.2) 


+y2Eo- [Vi0(7) XBoJ= OF (A.3) 
2v Eo+urEo: Vpo(r) — D. V? po(7) 
— ue Eo [Vfo(r) XBo]-0. (A.4) 


The magnetic field Bo as before is in the z direction, 
whereas the electric field Eo, in addition to the applied 
z component Eo: now has a radial ambipolar com- 
ponent Eo, due to the density gradients. With equi- 
librium densities as given in (A.2), Eqs. (A.3) and (A.4) 
may be readily solved for Ey,, with the result (assuming 
T.=T,=T) 

Eo, (kT /2q)r. (A.5) 


Following the solution for the surface-density mode, 
the continuity equations are linearized about equi- 
librium and quasineutrality is assumed. We then have 


On, 19 
= —peno(r) VN un — (rEs;) 
dl r ôr 


Qni On; àno(r) Oy 
Fu Bo, + Eo: ) Be -ED.Vts 
r Oz Or Or 
1 ôn 1 dy ano(r) 
Amos =e ty, (UR 9) 
r do rôp Or 
and 
ae il @ 
— uupo(r) Vya— urn S 
d T Or 
Om — On 8po(7) oy 
(Bs + Eo; Ju A DA DN TAS 
or 02 ðr ðr 
1 ôm 1 3y: ð O E 
0 a pa (A.7) 


Her BoEor- —+p2B 
r dp r õp ðr 
“ The problem is analogous to that of heat flow from a cylindri- 


cal rod. See H. S. Carslaw and J. C. Jaeger, Conduction of Heat 
in. Solids (Oxford University Press, T 1959), Ch. VII. 
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Due to the presence of terms arising from the equi- 
librium gradients, the equation for nı may not be de- 
coupled as in the surface-mode solution (9). The coupled 
equations (A.6) and (A.7) are difficult to treat so, 
following the Kadomtsev and Nedospasov! treatment 
of the very similar gas-plasma case, we solve the prob- 
lem only approximately in the following manner. The 
perturbations are Fourier analvzed as in Eq. (11) and 
only the lowest |m| 21 mode is considered. It is now 
assumed that N,(r) and Wi(r) have the approximate 
form 
NiY(r) - aJi(fir) , 
Vs (r) — esJi (Bir) , 


where ĝa is the first zero of Jı, equal to 3.83, and the 


(A.8) 


c’s are constants. Note that nı(a)=0, as required by 


the assumption s= cc. Equations (A.2), (A.5), and (4.8) 
are substituted into (A.6) and (A.7) and the resulting 
equations are averaged over r by multiplying them by 
rJı(Bır) and integrating with respect to r.*! 

We thus obtain the following two algebraic equations 
for cı and cz: 


a(2D, (22-81) — o? (KT/q) (A+R) +i lotu. E E) 
Ha (kT/q) ue? Bol} + esngu (2 (22-822) (Q— 1) 
+a?(1—R)—ip,Bu®}=0, (A.9) 
and 


c( 2D? -8?)-- o? (ET /q) (14-R) +112 (o — us Ek) 
+a (&T/q)uit Bo) + copoun{2(#+62) (Q-I-1) 
+2(1—R)+ip,Bu?}=0, (A.10) 


where 


rao | HJ) dr | 


x| f eena] , (A11) 


pa a exu 
Q= ra] | eren | | rè (Brdr | . (A42) 

/0 0 
The integrals (A.11) and (A.12) are readily evaluated,* 
with the result R=0 and Q=07/126,’. 

Equations (A.9) and (A.10) are a pair of linear homo- 
genous equations in ci and cz. The dispersion relation is 
obtained by setting the determinant of the equations 
equal to zero. Under the assumptions eK BY and 
en BEK1, we then have, after some straightforward 


algebra, 
2D (gue A- poun) (BPF) + Eo: Bites (tui, — pons) 
=w Boni? pour) io Boats (RT/q)(8r-- E) 
X (nous Bou) bie? B(Bit-- E) nue Dat pu D e) 
-Fi2(82-- E) (nouet pua) (o — us Ex.) =U, iS) 
"See Refs, 1 and 22 for discussion of the validity of this 
approximate method, e Theory of Bessel Functions 


2G. N. Wat A Treatise on the 1 heory on 
(Cambridge cope Press, London, 1958), 2nd ed., pp. ES 
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where we have employed the definitions (10) of pa 
and Da. 

At threshold w and & are real and (A.13) may be 
separated into real and imaginary parts. Elimination 
of w from the resulting two equations yields 


& Eo, Bok nopu ui (2 — uy?) 
MM MEER. 
(nouet pour)? 


2D, 


(BPR)? (A. 14) 


Minimizing the product Ey,By with respect to k we ob- 
tain the threshold conditions: 


kè m BY/3 E 4.89/a* A (A. 15) 


and 


328 7k, D.wu,4- pour)? 1 
Bo, ———— — 
3mo? nospauus (ua — ui?) Eo 


(A.16) 


The result, as expected, is very similar to that for the 
surface-density mode. The fact that By, appears to 
become infinite when u,=y, is a consequence of the 
fact that the equilibrium gradients of mg and po are in 
opposite directions, producing a partial cancellation of 
the growth mechanism. In the simple approximation 
(A.2) used here these gradients are exactly equal and 
opposite, so that the cancellation appears to be com- 
plete when u,— u+. In an exact solution this cancellation 
would be substantial but not complete. 

The threshold frequency is obtained from (A.13) 
where, as discussed in the derivation of (28) and (29) 
for the surface-density mode, we take care to remain 
consistent with the assumption pe a P«1. We then 
have for | (s— po)/ (no+ po) | Pus i Bo, 


f. d 2z)u Ek. (A.17) 
which, as before, corresponds to the translation of the 
helix with the ambipolar drift velocity in extrinsic 
material. 

For intrinsic material, | (9— po)/ (tot po) | Kue i Bg, 


c 


T Bau (opt pote) 


2m Lq Noget fous 
Note Drt pokiDe 
Cela 


Hope Poth 


The first term is the rotation due tó the built-in ambi- 
polar field E», and the second term arises from the 
diffusion and drift of carriers across the magnetic field. 
Equation (A.19) may be simplified considerably by 
using the Einstein relation D,,,— (ET, ,/q)us,4, assum- 
ing T.=T,=T, and by setting mo= fo, since (A.19) 
applies to the nearly intrinsic case. Then 


ma? kT 
f= LIT wen Bac. (A.19). 


dr q 
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AND BULK RECOMBINATION 


Equation (9) with the inclusion of the bulk term is 


ðn ðm m 
=D. V n Halo Ser S 
ðt Ó2 Tb 


(B.1) 


where y has been replaced by (n—0)/75— 1/75. Re- 
combination has been assumed to take place without 
trapping so that a single bulk lifetime 7, may be used 
for both carriers. Such a treatment is fully satisfactory 
for germanium at and above room temperatures. In the 
Fourier analysis (11), n~e®t so that (B.1) can be 
written in the form 


(io--1/7))m— D.Vhi—pg4Eo(0i/0z). — (B.2) 


Bulk recombination may therefore be included simply 
by replacing e by (w—i/7,) in (24). 

'To include surface recombination we proceed from 
(21) exactly as in the derivation of Eq. (25), but now 
retain the terms involving s. As a result for small bulk 
and surface recombination, (25) is replaced by 


302+ (uy? EoBoka?/ Do) (1— sa/ D.) 
+4(1+-sa/3D.)+3a?/2Da7,=0, 


where second-order small terms involving (sa/ Da)? and 
(sa/Da)(@/Dar») and uen Bo have been dropped. Pro- 
ceeding exactly as in the derivation of (26) and (27) 
we have finally at threshold: 


(B.3) 


* 4 sa 3a? 
kj- (14 ar ): (B.4) 
3a? 3D, 8Dats 
6k.D, sa\ í 
Bo=— (5, (B.5) 
mum 3D, Eo 
and 
119— po 
fe (1/2m)uaEoke , Dhe,nBo? (B.6) 
not po 
20mD, 2sa 
== (ueu Bo 1+) 9 
Ora? 3D, 
No— fo 
«uei Boe. (B.7) 
Not fo 


As expected, recdmbination, being a loss mechanism, 
has raised the threshold fields. However, for the experi- 
mental situation of interest: s~30 cm/sec, ta~ 1400 
psec, 4— 0.058 cm, and Dz~60 cm?/sec, the correction 
to k; and f. is about 1.5% and that to Bo, is about 2.5%. 
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'These corrections are less than the experimental error 
and are therefore negligible. 


APPENDIX C: THRESHOLD CONDITIONS FOR 
GENERAL AZIMUTHAL MODE 


In the dispersion relation (21) we again expand the 
modified Bessel functions in Taylor series expansions 
except that now we consider a general m instead of 
m=+ 1. The case of m=0 can be treated and eliminated 
at the outset. In (21), setting m=0 we have simply 


aBkIo(Ba)Io(ka)=0. (C.1) 


The magnetic field does not enter this expression at all, 
so there is no threshold. The solutions therefore corre- 
spond to damped or evanescent waves and need not be 
considered further. 

For m0 modes we keep only the first two terms of 
the expansions and substitute them in (21). Proceeding 
as in Sec. III we arrive at the equivalent of (24) for 
general m: 


Ea mpm |-4-2)4-2m| m] 4-1) (ue ux) Bo] 
-Fi[ a? (o— ps Eck)/2D. ]L | m| (| m | 4-2) 
-Fim(|m|4-2)ugBs— im | m |usBo] 
mua EsBoka?/ DoA- 2m? (|m | +1) 
+2m|m| (|m|+1)(ue—un)Bo=0. (C.2) 
Separating the real and imaginary parts of (C.2) for 


real & and w and eliminating w from the resulting pair 
of equations, we then have 


[m | (|| 4-2) &2?4- (mu a EgBy/ Dz) ka? 


+2(|m|+1)=0. (C.3) 
Therefore, at threshold 
k2=2|m| (||+1)/(|m|+2)a? (C.4) 
and 
2 +2)Dak 1 
fec m(|m|+2) e 1 (C.3) 


|m| par Do 
From (C.5) it is evident that the m=1 mode has the 
lowest threshold for growth. 


Returning to (C.2) we have, following the derivation 
of (28) and (29), the threshold frequency: 


flg— 
fee (2) Es, Ae (CO) 
not fo 
2m(|m|--1) Q|m | 2-1) D; 
EAS ar B c3 
m (|m|4-2)a TRIERS 

no— 

| ua Kue Be. (C7) 

not fo 
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Two groups of copper single crystals of various orientations were irradiated with neutrons in the BNL 


r reactor between 50°C and 70°C; one group was exposed to 0.535€ 10" net (total epicadmium flux) and the 
other 1.1 10'8 nut (total epicadmium flux). An irradiated and a nonirradiated crystal of cach orientation 
were isothermally annealed in a specially constructed dilatometer at temperatures ranging from 75°C to 

& 350°C, and the irreversible differential changes in length were measured. In addition, electron microscope 


studies were made of the size distribution of irradiation-produced prismatic dislocations after several of the 
annealing treatments. These experiments revealed that (1) a direct relationship exists between the magnitude 
of shrinkage after annealing treatments and the total neutron exposure, (2) during annealing dislocation 
loops decrease in diameter until they disappear, (3) the decrease in dislocation density is related to the 
volume shrinkage, (4) the changes in loop distribution could be predicted by a line-tension-climb equation, 
d and (5) the activation energy for the annealing of irradiation-produced loops lies between 2.08 eV and 
2.10 eV. The results indicated that the annealing process in neutron-irradiated copper is not the same as 
observed in quenched metals although a self-diffusion mechanism appears to be the controlling factor. 


INTRODUCTION 


RRADIATION of copper with high-energy particles 

results in defects that produce an expansion in the 
lattice. Vook and Wert! bombarded copper foils with 
deuterons of an average energy of 8.5 MeV at 15°K and 
found that copper expanded; subsequent annealing to 
about 280°K resulted in a shrinkage curve which is the 
same form as the resistivity curve determined by other 
workers)? Although several experiments concerned 
with the dilatation of copper! were made, a detailed 
account of the effect of annealing after irradiation with 
neutrons at reactor temperature of approximately 50?C 
has not been reported. Experiments show, however, 
that the physical and mechanical properties of irradi- 
ated copper are affected by subsequent annealingí 
between 0 and 500°C. Moreover, Coltman, Klabunde, 
McDonald, and Redman? have shown that thermal 
neutrons as well as fast neutrons produce measurable 
damage at low temperatures; however, more investiga- 
tion is necessary to ascertain the relationship between 


» the effect of slow neutron bombardment at pile tem- 
peratures (50-70?C) and the recoverable damage. 

i Transmission electron microscopy investigations of 
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neutron-irradiated copper revealed dark spots in copper 
which are believed to be prismatic dislocation loops." 
These prismatic dislocation loops do not form in all 
cases, but their production is found to be dependent 
upon the material irradiated and the temperature of 
irradiation." In copper the density and size distribu- 
tion of these spots are related to the total radiation 
exposure and the neutron spectrum." As the irradiation 
exposure is increased, the density increases to a satura- 
tion value and the loops grow in size; subsequent 
annealing of this material will lower the density of the 
loops.!^.? Since the disappearance of these loops is 
equivalent to a decrease in dislocation density, the 
present experiments were designed to obtain data from 
which the change in volume of the copper could be 
correlated with the change in dislocation density during 
the annealing of irradiated copper. Conclusions can be 
made concerning (1) the amount of point defects 
remaining in the lattice after irradiation, (2) the volume 
occupied by the dislocation loops, and (3) the mode of 
diffusion during annealing treatments. 


EXPERIMENTAL PROCEDURE 


Single crystals, 3 mm in diameter by 140 mm long, 
of high-purity copper (99.999505) were grown in 
graphite molds in a vacuum. These crystals were 
mechanically cut into two equal parts, 70 mm long, in 
order to make pairs of crystals of identical orientations 
suitable for the precise length change measurements. 
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"TABLE T. Neutron spectrum for irradiation treatmer 


AND J. R. DEMOS 


its (temperature between 50-75*C). 


Flux Total : Total Total 
Thermal Epicadmium above Effective thermal epicadmium flux 
Exposure Specimens flux flux 1 MeV time flux flux 1 MeV 
^ p. (101: n) (10! nv) (10! nz) (10° sec) (102 nvt) (1015 nvt) (1017 not) 
A 70, 81, 82 1.0 3.8 2.7 3.0 3.0 1.14 8.10 
B 66 IE 2.8 1.9 2.47 0.532 e 


To reduce the possibility of spurious effects due to 

oxygen entering the lattice during the initial annealing 

treatments, the copper specimens were heated in an 
; evacuated quartz tube in the presence of a zirconium 

getter. After annealing for 12 h one crystal from each 
e orientation was sent to Brookhaven and irradiated in 
the BNL graphite reactor. Specimens 70, 81, and 82 
were irradiated together in the same hole with a total 
integrated epicadmium flux of 1.14 10/5 (nf), exposure 
A, whereas specimen 66 was exposed to about one-half 
of this dose, exposure B. Table I contains the pertinent 
data concerning the neutron bombardment. 

The dilatometer? used in these experiments was 
designed to compare irreversible differential changes in 
length resulting from the rearrangement of point 
defects between an irradiated and a nonirradiated 
crystal of the same orientation. The dilatometer was 
similar in construction to the one used by J. Takamura 
in the study of changes in length of quenched metals 
after annealing at temperature near 20°C." The present 
dilatometer can be operated in the temperature range 
from 75 to 400°C. The relative changes in length 
betweeu the irradiated and nonirradiated crystals were 
magnified by an optical level system and a telescope; 
when mecessary, the readings thus procured were 
recorded by a slow motion cine technique. In order to 
obtain precision measurements, the dilatometer was 
kept in a room in which the temperature did not vary 
more than 1.0°C over a 24-h period. It was necessary, 
however, to monitor the temperature of certain com- 
ponents of the dilatometer in order that accurate 
corrections could be applied to the dilatometer scale 


— D. 225°C (82) 
-—D—p—0— n— g—n 


X—— 
RR. 


ch —i1—$- 3 4 5 6 T 8 39 1| i 
TIME (105 sec) 


2 Fic. 1. Change in Al/I for isothermal annealing of irradiated 
copper crystals 70, 81, 82; irradiation exposure A. 


——. 

T. G. Greenfield and J. Takamura (to be published). 
J. Takamura, Acta Met. 9, 547 (1951). 
ahs 


readings. An analysis of the calibration data has shown 
that the sensitivity was about 200 A with the standard 
deviation of +500 A for the maximum variation of 
temperature of the components of the dilatometer. All 
annealing experiments were carried out in an atmos- 
phere of high-purity argon that was passed over hot 
zirconium chips. 

The size distribution and the density of dislocation 
loops were measured from micrographs in regions about 
2000 A thick. The thickness was measured, where 
possible, by calculating the perpendicular projection to 
the surface of the intersections of slip traces. Although 
dislocations producing these traces were active after 
long exposures to the electron beam, the dislocations 
were not always present in the regions where the density 
measurements were made. As a result some interpola- 
tion was necessary ; however, the accuracy falls within 
+500 A of the thickness measurement. 


RESULTS 


Length Changes Due to Annealing 


The effect of isothermal annealing on the change in 
length of single crystals of copper irradiated with 
neutron exposure A is shown in Fig. 1. In this graph of 
specific length change, Al/lo, is plotted against annealing 
times at the desired temperature. Consider first the 
results of annealing at 225°C (specimen 82) for 7.9 10° 
sec. The continuous record of the length changes has 
shown a high rate of shrinkage during the early part of 
the annealing treatment; however, after approximately 
6X105 sec, the rate of shrinkage became negligible. 
Precise measurements of the irreversible difference in 
length were then made at room temperature after the 
irradiated specimen and the dummy specimen were 
cooled to room temperature; the measurements agreed 
with those taken at the annealing temperature. Then 
the specimen was heated to 300°C and again a rapid 
shrinkage took place during the initial heating; how- 
ever, after about 0.5X 105 sec, Al/Iy became constant. 
The total value for the specific change in length for 
225°C was found to be —0.76X10-> and for 300°C, 
— 1.30X 10-5. 

The isothermal annealing treatment carried out at 
275°C (specimen 70) was also characterized by an 
initial rapid shrinkage rate that was greater than 
observed with the 225°C annealing treatment, and 
after about 7X 105 sec, Al/Ip reached a constant value. 
The total specific length change was not significantly 
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DISLOCATIONS AND 
different from the value of — 1.30X 10-5 found in the 
300°C annealing treatment discussed above. The irradi- 
ated crystal treated at 275°C was not annealed at a 
higher temperature since it was necessary to preserve 
the prismatic-dislocation-loop distribution in the speci- 
men for an electron microscope investigation that will be 
discussed later in this paper. Specimen 81 was annealed 
at 250, 275, and 300°C for lengths of time indicated 
in Fig. 1. The results of these dilatometric measure- 
ments agree with those described earlier. Annealing at 
higher temperatures, up to 350°C, did not produce 
significant additional shrinkage, and thus the maximum 
Al, lo for this neutron exposure was — 1.30% 10. 

Thus, for the neutron dose A no detectable shrinkage 
occurred below 200?C; above 225?C considerable 
shrinkage initially took place, and, as the annealing 
time was increased, the shrinkage rate approached zero. 
Additional shrinkage occurred when the temperature 
was raised higher than this annealing temperature; 
however, after being annealed in the range of 275°C 
for more than 7X10 sec or 300°C for a shorter time, 
no additional changes in Al/Ip are detected. The maxi- 
mum Al/I) was about equal to —1.30x 10-5 for this 
irradiation dose. 

For the lower irradiation exposure B, on the other 
hand, the shrinkage behavior was somewhat different. 
Specimen 66 was annealed at temperatures and times 
listed in Table II. The annealing treatment was ex- 
tended until it was certain the rate of change of Al/lo 
reached zero; then the specimens were cooled to room 
temperature before the next isothermal treatment. The 
times listed in Table II are not necessarily the minimum 
time for Al/lo to reach a constant value. Above 325°C 
additional shrinkage was not seen. If the specific 
shrinkages in Table II are compared with those shown 
in Fig. 1, it can be concluded that the cumulative 
Allo for the crystals exposed to the smaller neutron 
dose approached a constant value at about 325?C, 
whereas for the greater dose the maximum A///, is found 
at about 275°C. In addition, the maximum shrinkage 
for the lower dose is less than one-half that for the 
higher irradiation exposure. Thus, the range of tem- 
perature in which shrinkage of the specimen occurs and 
the total Al//) attainable depends upon the irradiation 
dose. 


TABLE II. Annealing treatments for specimen 66; 
neutron exposure B. 


Annealing Time at Cumulative 

era temperature Al/la 
T (105 sec) (1073) 

15, 100, 12 175, 200 13 0.24 
275 1.80 0.34 

300 3.42 0.75 

325 2.16 0.82 

350 0.82 


VOLUME CHANGE 


DURING ANNEALING 


IN Cu A1053 


) 


loops 
cm3 


[3 
a 


Fic. 2. Size dis- 
tribution of pris- 
matic dislocation 
loops in neutron irra- 
diated copper after 
annealing. 


wo“ 


/ 
onl- 


DENSITY OF PRISMATIC DISLOCATION LOOPS ( 


; 
i 


Density and Distribution of Dislocation 
Loops after Annealing 


The distributions of prismatic dislocation loops in 
neutron irradiated copper before and after annealing 
are shown in Fig. 2. The parameter r/5 is the radius of 
the loop divided by the distance of closest approach 
between atoms. The upper curve represents the con- 
figuration observed in the copper crystals following 
irradiation treatment A (see Table I). Figures 3(a) and 
3(b) are electron micrographs of specimens in the as- 
irradiated state and the annealed state. Small dark 
spots less than 25 À in diameter were not seen in 
abundance as reported by Makin et al." The geometry 
of the diffraction contrast in the small spots did not 
reveal, in general, the nature of the irradiation-produced 
defects, and consequently, it was not possible to infer 
whether these structures were tetrahedrons or small 
dislocation loops. Since the density and size distribution 
were measured in rather thick regions, the amount of 
inelastic scattering of the electrons has contributed to 
the limited image resolution. 

The density of dislocation loops were decreased more 
than a factor of 10 after annealing crystal 70 at 275°C 
see Fig. 2. Moreover, the average size of loops has only 
increased slightly, indicating that loop growth plays a 
minor role during the annealing of neutron irradiated 
copper. After treating specimen 82 at 300°C, the loop 
distribution was not altered greatly although the 
average loop radius was slightly higher. Figure 3 
illustrates the typical micrographs before and after 
annealing. It should be pointed out that loop size 
measurements are most accurate at the large-radii part 
of the distribution curve, and the density measurements 
are most accurate when the loop density is above 
105/cc. 

The most significant result due to annealing is the 
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Fic. 3. Electron micrographs of irradiated copper (exposure A) (a) before annealing 
and (b) after annealing at 275°C for 7X 105 sec. 


great decrease in density of dislocation loops with only 
a slight increase in average radii of the loops. 


ANALYSIS OF RESULTS 


Dependence of Prismatic Loop Size on 
Annealing Conditions 


Silcox and Whelan!® have demonstrated that the 
shrinkage of prismatic dislocation loops formed by 
quenching in aluminum can be described by an equation 
originally derived by Friedel.!® It was shown that the 
rate of change of the radius of a prismatic-dislocation 
loop is related to three temperature-dependent condi- 
tions: (1) diffusivity of the point defects, (2) the line 
tension of the dislocation loop and (3) the super- 
saturation of point defects in the vicinity of the loop. 

The rate of change of the radius is then 


dr/di- — Zvab exp(— E/RT) 
X[exp(FAP/RT)—exp(FJS/RT)]C., (1) 


where Z is the atomic coordination number, va is the 
atomic frequency, 6 is the Burgers vector, E is the 
activation energy for vacancy diffusion, Fe= Gb ln(r/5)/ 
[4(1—c)r/b], G is the shear modulus, c is Poisson's 
ratio, and F,— (kT/b?) In(c/co). The factor exp (F .b?/kT) 
is dependent upon the line tension of the loop, and the 
factor exp(F.b?/kT) is the supersaturation term which 
indicates whether the loop will accept or emit vacancies. 
The term C, is considered to be the concentration of 
jogs and in a prismatic dislocation loop can be con- 
sidered as equal to 4. Friedel” has pointed out that 
this analysis depends upon the assumption that the 
growth process is controlled by climb and not by the 
migration of point defects to sinks. It has been shown 
that when loops are considered in thin foils, Eq. (1) can 


15 J, Silcox and M. J. Whelan, Phil. Mag. 5, 1 (1960). 
16 T. Friedel, Les Dislocations (Gauthier-Villars, Paris, 1956). 
17 J, Friedel’s discussion at the end of Ref. 13. 
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be simplified by considering the surface as a sink for 
the vacancies. Then c/co is approximately equal to 
unity (ie. nearly an equilibrium concentration of 
vacancies), and Eq. (1) can be written as 


dr/di— —ÀZbr, exp(—E/kT)Lexp(F.b?/kT)—1]. (2) 


Since prismatic dislocation loops are produced in copper 
by neutron bombardment, it may be assumed (1) that 
shrinkage of these irradiation-produced loops is con- 
trolled by the same forces as the quenched-in loops, 
and (2) sinks and sources of point defects are available 
during annealing, and (3) the average distance between 
sinks and sources of point defects is about the same as 
the average distance between loops. In contrast to the 
annealing behavior of isolated loops which have been 
studied,'5 this investigation is concerned with a multi- 
tude of small loops in bulk material, however, if the 
assumption is made that the vacancies move to sinks 
faster than they are produced by the loops and clusters, 
then Eq. (2) can be used. 

For aluminum and copper which contain loops of 
sizes larger than 50 atomic diameters in radius, this 
equation may be simplified since the reciprocal of the 
bracketed term is merely a linear function of r/b. 

For loops of smaller radii, this relationship: is not 
linear; and, to determine the time required for a 
prismatic dislocation loop to shrink to a given size, it 
is necessary to consider the bracketed term in Eq. (2) 
for various values of 7/b (radius/Burgers vector). If 
Eq. (2) is rearranged into the following form and 
integrated between the limits indicated, 


I j 2 E 
t= — -Va ew(—) 
to Z RT 
7 FB? r 
LEE- © 
ro/b kT b 
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then the time necessary for a prismatic dislocation loop 
climbing from radius ry to radius r can be determined. 

Equation (3) can be integrated numerically after the 
values of the constants are substituted. It has been 
found that the appropriate value for the activation 
energy £ falls between 2.08 and 2.10 eV (the evidence 
for these values will be discussed later) and the rela- 
tionships between 7/b and time for various size loops 
are shown in Figs. 4 and 5. In Fig. 4 the solid lines 
represent the theoretical shrinkage of dislocation loops 
at 275° with a self-diffusion activation energy of 2.08 eV. 
The initial values for r/b are found at the ordinate when 
the annealing time is zero and the values of r/b decrease 
in a homologous manner during the annealing 
treatment. 

From this graph the change in loop sizes can be 
estimated for the irradiated copper single crystal 70 
after annealing at 275°C for 7.9X 105 sec. For example 
the loops represented by r/b=60 shrink to r/b=58; 
the loops of initial size r/b=30 shrink to r/b— 19; and 
loops of initial sizes below about r/b— 26 completely 
disappear after this annealing treatment. 


60 


20 25 


TIME (10° SEC) 


Fic. 4. Shrinkage of loops of size r/b when E=2.08 eV 
at 275 and 300°C. 


A similar analysis can be made from the data in 
Fig. 5 where Z= 2.10 eV. With this information and the 
original size distribution after irradiation, a new size 
distribution can be estimated after an annealing treat- 
ment; these results are shown in Fig. 6. The shaded 
area represents the expected values for the density 
distribution when activation energy between 2.08 and 
2.10 eV is used. 

For r/b greater than 25, the experimental curve falls 
in the shaded region of the estimated curves, however, 
for low values of r/b the estimated curve rises above the 
experimental curve and ends in a region where r/6™~15. 
The differences between the experimental and estimated 
values of size distribution in the region where r/b<25 
can be ascribed to the experimental difficulty in 
positively identifying all the small loops. Thus, the 
measured density values would tend to be below the 
theoretically estimated values which were derived from 
the distributions of larger loops. In addition, the 
theoretical description of the shrinkage od loops pre- 
dicts that all loops with initial sizes r/b<26 will 
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Fic. 5. Shrinkage of loops of size r/b when E-2.10 eV 
at 275 and 300°C. 


disappear during the annealing treatment; conse- 
quently, the estimated curve results in a sharp cutoff. 
It is more probable, however, that the curve does not 
abruptly end, and that a maximum would occur as the 
measured evidence indicates. Thus, a compromise 
between the measured data and the calculated data is 
reasonable. If the same analysis is made for specimen 
82 (first annealed at 225°C and then annealed at 
300°C), a new distribution can be estimated which is 
shown in Fig. 7. As in the previous case, the new distri- 
bution has the proper form and is a reasonable fit with 
the actual distribution. These estimated curves of size 
distribution after annealing are affected greatly by the 
value of the activation energy £, and the best fit has 
been found to be between 2.08 and 2.10 eV. Other 
investigators have found activation energies of this 
magnitude for annealing irradiated copper in this 
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Fic. 6. Estimate of distribution change for annealing treatment 
at 275°C for 7.9X 105 sec when 2.08 eV X E X2.10 eV. 
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Fic. 7. Estimate of distribution change for the combination of 
annealing treatments (1) at 225°C for 7.9X105 sec and (2) at 
300°C for 2.1105 sec when 2.08 eV S Ex 2.10. 


temperature range. Makin? has concluded from 
recent experiments that the annealing of small spots is 
associated with an activation energy of 2.0 eV and the 
annealing of the large spots is associated with an 
activation energy of 1.0-1.5 eV. His determination was 


EXPANSION 
EM SHRINKAGE 


Fic. 8. (a) Increase of volume as a result of plate of atoms 
ving to surface; (b) shrinkage after relaxation in the vicinity 
—— of loop; (c) decrease in volume as a result of plate of atoms moving 
.— — jinto loop; (d) transfer of plate of atoms from extrinsic loop and 
, ver-all shrinkage after relaxation. 


18A, W. McReynolds, W. Augustyniak, M. Mckeown, and 
senblatt, Phys. Rev. 98, 418 (1955). 
Eggleston, Acta Met. 1, 679 (1953). 
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not based on the line-tension-climb relationship 
described by Eq. (1). 


Relationship Between Volume Shrinkage 
and Dislocation Loops 


The specific volume change AV/Vo for isotropic 
shrinkage is three times the specific length change. 
Comparisons between the length changes after anneal- 
ing for the crystals of the orientations shown in Fig. 1 
give evidence that Al/l is isotropic; hence, the specific 
volume changes are readily computed. 

The relationships between volume change and the 
removal of dislocation loops from a crystal is illustrated 
by the schematic diagram in Figs. 8 and 9. In Fig. 8(a) 
the “plate of atoms" is transported from the dislocation 
loop to the surface, thus, increasing the volume of the 
crystal by the extra plate of atoms; a void equal to the 


Y ne 
DXX 


Fic. 9. (a) Change in volume due to the annihilation of inter- 
stitial loops by vacancies in the lattice; (b) dissolution of vacancy 
or interstitial loops. 


volume of the atoms transported plus the volume of 
dislocation of length 2zr remain in the lattice. After a 
readjustment of the lattice planes, the differential 
volume change corresponds to the strain field associated 
with 2mr length of dislocation; and, consequently, 4 
small shrinkage occurs [dark areas in Fig. 8(b)]. 4 
similar argument is applicable for a vacancy loop and is 
diagrammed in Fig. 8(c). Moreover, when both vacancy 
and interstitial loops co-exist, the “extra plate of atoms" 
can replace the “plate of vacancies” if the loops are the 
same size or can replace portions of the vacancy loops 
if the loops are of different sizes; the decrease in volume 
is again proportional to the decrease in dislocation 
length [see Fig. 8(d)]. The same relationship results if 
loops moved out of the material by slip along the glide 
cylinder. The volume change due to the annihilation of 
an interstitial loop by vacancies which are in super- 
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j DISLOCATIONS AND VOLUME CHANGE DURING ANNEALING IN Cu A1057 
‘as TABLE IIT. Specific volume changes based on the reduction of dislocation. 
Densities After Annealing 
(1) Oe (3) (4) 5 (6) (7) (3) 
, Prismatic Total 
Annealing dislocation Dislocation cumulative Dislocation 


temperature density 2rf density volurne change density. 
Bis (loops/cm*) 1015 — (10-5 cm) (10*/crn?) AV/Vo (1075) (105/cm?) AV/Vo (1)* AV/V« (2)* 
Room 2.90 wc" k 754 WAR DUE CN i o NCC NC SK 
275 0.226 7.37 —3.84 66.8 —0.62 —3.15 
300 0.144 4.04 — 3.93 70.2 —0.65 — 3.28 
a The value of AV/Vo (1) is ups NEN 
b The value of AV/Vo (2) is based on the summation of the specific volume change in column 7 and the specific volume change associated with the 


reduction of the supersaturation of point defects. 


saturation in the lattice is illustrated in Fig. 9(a). The 
over-all shrinkage is larger than shown in Fig. 8 and 
depends upon the area enclosed in the loop, the lattice 
relaxation around the vacancies or clusters of vacancies, 
and the volume associated with the dislocation loop. 
A similar analysis for vacancy loops and a super- 
saturation of interstitial atoms also indicates volume 
shrinkage. Another possible interaction between loops 
and point defects is shown in Fig. 9(b). In this case the 
loops dissolve into point defects which remain in the 
crystal, and thus, a volume expansion should be found. 
Experimentally only shrinkage was measured, and the 
latter process will not be considered. 

The dislocation density, pı, is equal to the total 
dislocation-loop density, po, times the circumference of 
an average loop, 2zr. If the change in p, is compared 
with the specific volume change after an isothermal 
annealing treatment, then the volume for a unit length 
of dislocation can be evaluated. These data are com- 
piled in Table III. Stehle and Seeger? have calculated 
the change in volume associated with a dislocation by 
applying a nonlinear model to describe the dilation. 
Their results indicate that the dilation is greatest in the 
region from one Burgers vector to the center of the 
core, and the volume associated with a dislocation of 
length b is about twice the atomic volume. These 
calculations were applied to straight dislocations. For 
dislocation loops, however, the volume per unit length 
of dislocation loop would be somewhat dependent upon 
the diameter of the loop. If the loops are considered to 
be prismatic, and if it is assumed that the volume per 
unit length is the same as in a straight edge dislocation, 
the specific volume change associated with the loop 
density change can be calculated for the various 
annealing treatments. The results are given in Table ITI. 
For the annealing treatments at 275 and 300°C the 
calculated shrinkage is about one-sixth the measure 
value [see columns (5) and (7) ]. , 

If the decrease in density of the loops is a result of 
the combination of point defects from defect clusters 
with the loops as shown in Fig. 9(a), then the specific 
volume change will be the sum of the volume of the 


% H, Stehle and A. Seeger, Z. Physik 146, 217 (1956). 


dislocation and the volume previously occupied by the 
clusters. Assuming that a cluster having n point defects 
has a volume of nXatomic volume (relaxation not 
considered), the AV/V derived from the experimental 
data of dislocation density is in good agreement with 
the experimentally measured values [see columns (5) 
and (8) in Table III]. 

The present and other annealing experiments?! 
have demonstrated that the dislocation loops in 
neutron-irradiated copper will decrease in diameter 
during annealing until they disappear. A similar loop 
behavior is noted in quenched metals in regions near 
sinks such as grain boundaries and dislocations that are 
able to climb." In contrast, when sinks are not available 
there is a tendency for loops to grow as evidenced in 
quenched A1-0.5 at.% Mg.” Since the loops decrease in 
size during annealing in irradiated copper, it is assumed 
that loops are emitting or accepting defects in such a 
way as to cause loops to shrink. It is expected that the 
rate-controlling process is diffusion; and, thus, the 
average distance a vacancy may move during the 
annealing treatments can be computed. For the 275°C 
anneal, the vacancy can travel about 1000 A in 7.9 108 
sec; this diffusion distance is too small for vacancies to 
migrate to the surface in bulk material. However, the 
dislocation loops are on the average 700 Á apart and 
can serve as sinks for point defects. A direct deter- 
mination of the intrinsic or extrinsic nature of the 
dislocation loops in neutron-irradiated copper has not 
appeared in the literature although loops produced by 
a-particle? bombardment have been found to be inter- 
stitial. A recent electron diffraction experiment con- 
cerning loops that appear in molybdenum after neutron 
irradiation at elevated temperatures has shown con- 
clusively that these loops are interstitial.“ Makin et al., 
on the bases of annealing experiments in which large 
and small spots are seen, have concluded that the large 
spots are interstitial loops and the small are vacancy 
clusters.” The nature of the dislocation loops cannot 
be determined from the present experiment, but the 


* J. Takamura, K. Okazaki, and I. G. Greenfield, Conf. J. Phys. 
Soc. Japan 18, Suppl. III, 78 (1963). 

2 R. S. Barns and D. J. Mazey, Phil. Mag. 5, 1247 (1960). 

=J. D. Meakin and I. G. Greenfield (to be published). 
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4 
| evidence indicates that both loops and submicroscopic (3) The change in size RS n prismatic 
| defect clusters exist in these neutron irradiated dislocation loops can be predicted by Eq. ( ) 
| specimens. (4) The activation energy for the annealing process 
l SUMMARY corresponds to the activation energy for self-diffusion. 
: : : DOS 50 (5) Dislocation loops and submicroscopic point- 
Copper crystals irradiated nb peurons M a defect clusters are present in neutron irradiated copper. 
à and 709C show the following characteristics when 
à isothermally annealed between 75 and 350°C: ACKNOWLEDGMENTS he 
; (1) A decrease in volume is detected and is propor- the interest and mm 
tional to the neutron exposure. The maximum specific 'The authors are grateful for t e interest an advice 
i volume changes for 1.1410'S (mu total epicadmium of Dr. H. G. F. Wilsdorf of the University of Virginia " 
i flux) and 0.532108 (nvt total epicadmium flux) are and Dr. J. Takamura of the University of Kyoto, and 
i —3.93X 10-* and —2.46X10~, respectively. for the stimulating discussions with Dr. C. E. Birchenall 
i] (2) Measurable shrinkage of the single crystals takes — of the University of Delaware and Dr. R. Vook of The 
i place between 225 and 325°C, and the temperature Franklin Institute Laboratories. We are indebted to 
i where the maximum shrinkage rate occurs appears to R. Larson who has taken care of the irradiation of the 4 
f be higher for the lower neutron dose. specimens in the Brookhaven reactor. 
i 
i 
i 
i PHYSICAL REVIEW VOLUME 134, NUMBER 4A 18 MAY 1964 | 
Thermal Conductivity of Silicon and Germanium from 3?K to the Melting Point* 
C. J. GLASSBRENNER| AND GLEN A. SLACK 
General Eleciric Research Laboratory, Schenectady, New York 
(Received 11 December 1963) 
The thermal conductivity K of single crystals of silicon has been measured from 3 to 1580°K and of single 
crystals of germanium from 3 to 1190°K. These measurements have been made using a steady-state, radial 
heat flow apparatus for T>300°K and a steady-state, longitudinal flow apparatus for T<300°K to give 
absolute K values. This radial flow technique eliminates thermal radiation losses at high temperatures. The 
accuracy of both the low-temperature apparatus and the high-temperature apparatus is approximately +5%. 
Some special experimental techniques in using the high-temperature apparatus are briefly considered. 
At all temperatures the major contribution to K in Si and Ge is produced by phonons. The phonon thermal 
conductivity has been calculated from a combination of the relaxation times for boundary, isotope, three- 
phonon, and four-phonon scattering, and was found to agree with the experimental measurements. Above | 
700°K for Ge and 1000°K for Si an electronic contribution to K occurs, which agrees quite well with the i 
theoretical estimates. At the respective melting points of Si and Ge, electrons and holes are responsible for | 
40% of the total K and phonons are responsible for 60%. The measured electronic K yields values for the | 
thermal band gap at the melting point of 0.62-0.1 eV for Si and 0.262-0.08 eV for Ge. | 
\ 
* 


INTRODUCTION longitudinal heat flow apparatus. Germanium and sili- 


N a semiconductor, various carriers can contribute 
to the thermal conductivity. These are phonons, 
photons, electron-hole pairs, and the separate electron 
and holes. The problem of sorting out the contributions 
from these processes has been handicapped by the lack 
of accurate high-temperature measurements. In this 
paper, we report ort measurements of the thermal con- 
ductivity K of single crystals of germanium and silicon 
between 300 and 1580°K taken with an improved cylin- 
drical heat flow apparatus. The measurements of K of 
silicon below 300°K have been taken with an existing 


* This work is part of a thesis submitted by C. J. Glassbrenner 
to the Physics Department of the University of Connecticut in 
partial fulfillment of the requirements for the degree of Doctor of 
Philosophy. 

_ [Present address: Controls for Radjation Inc., Cambridge, 


- Massachusetts. 
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con were measured because they are easily obtained 
with high purity and known electrical characteristics. 
As a consequence of this investigation, they could serve 
as standard materials for K measurements. 

"The purpose of this investigation was to measure and 
analyze the K of these materials up to the melting point 
and, in particular, to unravel the magnitude of the con- 
tributions from the various carriers of heat. It was found 
that the phonon or lattice thermal conductivity is domi- 
nant and decreases faster than 1/T at high tempera- 
tures. The bipolar thermal conductivity, i.e., from 
electron-hole pairs, is significant at high temperatures 
and agrees with a simple theory. The polar thermal 
conductivity, i.e., from the separate electrons and holes; 
is small, and the photon thermal conductivity is not 
detectable. 


1G, A, Slack, Phys, Rev, 122, 1451 (1961). 
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Fic. 1. The high-temperature radial 


heat flow apparatus showing the crys- 
tal sample, heater, thermocouples, and 
shields. 


CRYSTAL SPECIFICATIONS AND PREPARATIONS 
A. Low-Temperature Silicon Crystal 


The silicon single crystal used for the low-temperature 
measurements was grown by the pedestal method from 
a bar of high-purity silicon by W. C. Dash of this labo- 
ratory. The growth direction was [111]. It was made 
with care in order to be oxygen and dislocation free? 
Any vacancy clusters were less than one micron in di- 
ameter as determined by etchpit and copper decoration 
techniques. The pedestal method prevents the silicon 
from coming in contact with quartz, usually the chief 
source of oxygen. The crystal was 2 cm long with a 
geometrical average diameter of 0.44 cm. The diameter 
varied slightly along the length of the sample. The cross 
section could be best described as an equilateral triangle 
with rounded corners, typical of [111] oriented crystals. 
The room-temperature resistivity was approximately 

| 2000 Q-cm, and the sample was p type. 


| B. Crystals Used at High Temperatures 


The silicon and germanium samples were both single 
j crystals, cylindrically shaped, approximately 13 cm long 
Dg by 2.6 cm in diameter. They had on the order of 105 
| dislocations? per cm?. The Ge crystal was grown by the 
| Čzochralski method with a [100] axis from a charge of 
| zone-refined germanium. The room-temperature resis- 
tivity of the charge material was greater than 40 Q-cm 
which indicates that it was basically intrinsic. The domi- 
nant impurity in the charge was specified as antimony. 
The single crystal of silicon was produced by the floating 
zone process. The atmosphere was argon and the axis 
of the cylinder was [111]. The diameter of the “as- 
received” crystals varied slightly along the axis. Their 
crystal faces were clearly visible. 
? W. C. Dash, J. Appl. Phys. 29, 736 (1958). 
SW. C Dash! T. Appl, Phys 50, 49 (1950): 


* W. C. Dash, J. Appl. Phys. 31, 73 
5 W. C. Dash dst communication). 
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A rather elaborate machining operation was used on 
the crystals in order to insure that the axial heater was 
exactly in the center of the sample and that the thermo- 
couples were at exactly known positions along the radii. 
These physical conditions imposed on the heater and 
thermocouples were solved by cutting the sample into 
two exactly equal half-cylinders. For this purpose, the 
original single crystal was centerless, ground to auniform 
diameter and carefully cut lengthwise into two approxi- 
mately equal halves. The flat side of each half-cylinder 
was given an optically flat polish so that the halves 
would fit tightly together. Thermocouple grooves were 
then cut in one of the halves. The nominal radial dis- 
tances of the center line of the three thermocouple 
grooves from the axis were: 2.5 and 10 mm on one side 
of the heater groove, and 6.5 mm on the other side. A 
heater groove was cut along the axis of each half so that 
the heater would be in the center of the assembled cy- 
linder. After the grooves were cut, the two halves were 
temporarily cemented together, but were offset slightly 
along the longitudinal axis. They were centerless ground 
a second time so that the interfacial plane and heater 
wire passed through the center of the cylinder. 

After the machining, the sample was assembled and 
encapsulated in a molybdenum case with a high-purity 
Al:O; ceramic wafer placed at each end. See Fig, 1. 
Molybdenum was used because it does not form any 
low melting alloys with either Si or Ge. Neither the Mo 
nor the Al;O; contributes electrically active impurities 
to the Si or Ge in significant amounts during the lengthy, 
high-temperature runs. 

The intrinsic electronic K could be suppressed if, 
inadvertently, the material became highly doped to a 
level of about 10!5 carriers/cc. As a check on the im- 
purity concentration, the electrical conductivity and 
Hall coefficient of the germanium and silicon samples 
were measured before and after the high-temperature 
measurements. Before use the room-temperature resis- 
tivity of the germanium crystal was 46.6 Q-cm, and the 
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¥ic. 2. K versus T for Si showing the experimental points and 
the extrapolated lattice component K,. The Debye and melting 
temperatures are 0 and MP, respectively. 


net carrier concentration was 2X 10?/cm*. The crystal 
was so lightly doped that it could not be well character- 
ized as either z- or p-type. After the K had been meas- 
ured to near the melting point, the electrical resistivity 
dropped to 4.6 Q-cm and the crystal was now doped 
with 8X10! acceptors/cm?. These acceptors were iden- 
tified as copper. The copper was probably introduced 
during the machining of the germanium sample prior 
to the measurements. 

The room-temperature resistivity of the high-temper- 
ature silicon sample was 440 Q-cm with a carrier con- 
centration of 1.2710"/cm? and it was n type. After 
the K had been measured to near the melting point of 
silicon, the room-temperature resistivity dropped to 177 
Q-cm, the carrier concentration rose to 2.4610", and 
it was still z type. The specific impurities could not be 
identified. The important conclusion from these meas- 
urements is that neither the germanium nor silicon con- 
tained enough electrically active impurities to affect the 
intrinsic polar or bipolar thermal conductivity in any 
way. 


, APPARATUS 


The low-temperature measurements were made with 

a steady-state, longitudinal heat flow apparatus de- 
signed by Slack.! The final form of the high-temperature, 
_ cylindrical-flow thermal conductivity apparatus was ob- 
- tained after several modifications. Figure 1 shows the 
details of the sample in the final version of the high- 
emperature apparatus. A more detailed description of 
ntire method and apparatus will be published else- 
e later. The central section of the apparatus is 


hown in Fig. 1. The vacuum plumbing was designed 
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AND G, Ac SUACK 
so that the sample chamber could be evacuated to 10-5 
Torr and then filled with ultra-pure helium. The outer 
ceramic tube serves as the wall of the vacuum chamber 
for the sample. The external, 3-section electric heater 
is wound on this tube, and serves to control the average 
temperature of the sample. The split inner tube holds 
the sample and all of the removable parts, i.e., thermo- 
couples, heaters, shields, etc., in place. | 

The cylindrical geometry for measuring the high- 
temperature thermal conductivity was chosen for rea- 
sons previously stated.* In the present, more thorough 
measurements, certain experimental problems arose 
which will now be given a cursory examination. Ultra- 
pure helium gas was used for the atmosphere, because 
it contained a low oxygen content,’ and thus there were 
no oxidation problems at high temperatures. The gas 
served to provide good thermal contact between the 
sample and the heater and thermocouple wires. The 
gas also partially helped to suppress the vapor trans- 
port of Si and Ge, which would be appreciable at high 
temperatures in a vacuum. The molybdenum jacket 
which encased the germanium and silicon reduced the 
exposed surface area of the samples and hence further 
reduced the evaporation. This case also held the halves 
together. Molybdenum was employed for the jacket, 
axial heater wire, voltage leads, and heater leads because 
of its previously mentioned chemical stability. The 
temperature gradients were measured with platinum 
versus platinum-10% rhodium because these thermo- 
couples are very stable with time and possess well- 
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Fic. 3. K versus T for Ge showing the experimental points and 
the extrapolated lattice component K,. The Debye and melting 
temperatures are 0 and MP, respectively. 


(tS e. A. Slack and C. J. Glassbrenner, Phys. Rev. 120, 782 
). 
cel Young and N. R. Whetten, Rev. Sci. Instr. 32, 453 
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known calibration curves, However, they have a very 
small emf output and are subject to contamination 
problems unless carefully used. The thermocouples and 
axial crystal heater were insulated from the sample by 
Degussit® alumina tubing because it is a very pure 
ceramic, and is available in very small diameters. It 
does contain a small amount of iron and silicon, i.¢., 
about 300 ppm. The radiation shields were made from 
platinum —40% rhodium. These served to keep heat 
losses from the end of the sample low. They also served 
as vapor barriers for germanium and silicon vapor 
coming from the sample. The external heater winding 
was made in three sections so that both the temperature 
gradient along the axis of the apparatus near the ends 
of the sample and the average temperature of the sam- 
ple could both be controlled. A ground sheath was 
placed under the external heater to reduce stray elec- 
trical leakage currents. The ceramic end plugs were 
installed to hold the electrical wires in place and to re- 
duce axial heat conduction by the gas. They also helped 
to protect the thermocouple wires in the temperature 
gradient from the silicon and germanium vapors. 

The electrical heater power to the sample was fur- 
nished by either dc or ac. Direct current was used below 
about 750°K and ac was used (about 435 cycles/sec) 
at higher temperatures. Due to the lower electrical resis- 
tance of the Si, the Ge, and the Al.O; at high tempera- 
tures, a small leakage voltage from the crystal heater 
always appeared in the thermocouples above about 
850°K. This leakage was rendered harmless by use of 
ac since any small 435-cycle ac voltage was not recorded 
by the thermocouple galvanometer. To insure that the 
temperature gradient was measured accurately, a few 
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Frc. 4. The low- and high-temperature K versus T results for Si. 


3 Degussit A123, Liasson Company, New York, New York. 


AND Ge A1061 
S0r—r-r TE e NT ahai aaa X YE A ET 
- P 
Ge 1 
a r e LOW TEMP APPARATUS 
E © HIGH > 
8 OE = 
SE E 
du: 
SC 
rated 
= 
Æ E 
t2 
8! - 
S E E 
q 
B E 
ae 
um 
E 
0. 
p 10 100 000 


TEMPERATURE, °K 


Fic. 5. The low- and high-temperature K versus T results for Ge. 


special measuring techniques were employed. Since no 
two thermocouples are exactly alike, two of the three 
thermocouples inside the sample were always calibrated. 
in terms of the third when the sample was in an isother- 
mal condition. Checks on the presence of possible ther- 
mocouple contamination were made in two ways. First 
the high-temperature A data points were always fol- 
lowed by a reconfirmation of lower temperature 
points at about 400°K. Secondly, variations in the axial 
temperature gradients in the apparatus at and beyond 
the radiation shields were intentionally produced by the 
end windings on the external heater. These variations 
should not affect the measured K. Any apparent effect 
meant contaminated thermocouples. Discrepancies of 
these types resulted in rejection of the high-temperature 
data, and a reassembly of the entire apparatus with new 
thermocouples. The final results for both Si and Ge each 
involved several such reassemblies. 

Under steady-state operating conditions between 30 
and 50 W of electrical power was furnished to the crystal 
heater. This is equivalent to about 4 W/cm of sample 
length. The average temperature differences between 
the center and surface of the sample ran between 1 and 
3°K. The heater wire temperature usually operated be- 
tween 50 and 100°K above the sample temperature. 


RESULTS 


The results of the present investigation of the high- 
temperature A of Si and Ge are shown in Figs. 2 and 3. 
The composite results for single crystal Si are shown in 
Fig. 4. Notice that the low-temperature results (solid 
circles) join on smoothly to the high-temperature results 
(open circles). Both sets of data were obtained in their 
respective apparatuses as absolute thermal conductivity 
values, and neither set of data has been adjusted to 
match the other. The actual data points in the low- 
temperature set span the temperature range from 4.3 
to 304°K, while the high-temperature points cover the 
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Fic. 6. Comparison of present and previous low-temperature 
K versus T results for Si. 


418 to 1577°K range. The Debye temperature of Si at 
674°K is shown for scaling purposes. The melting point, 
MP, of silicon at 1681°K is only 104°K above the high- 
est point measured. 

The results for single crystal Ge are shown in Fig. 5. 
The points for temperatures between 3.2 and 300°K 
have been taken from Slack and Glassbrenner.® The 
high-temperature data points between 398 and 1194°K 
were measured in the present high-temperature appa- 
ratus. Neither set of data was adjusted, and again the 
agreement between the low-temperature and high- 


TABLE I. Representative values of the thermal conductivity K 
ur pono single crystals of silicon and germanium (accuracy 
+3%0). 


GLASSBRENNER AND G. A. 


K, W/cm deg K, W/cm deg 
T°K Si Ge T*K Si Ge 
20 15. 400 1.05 0.44 
30 10.5 500 0.80 0.338 
40 7.7 600 0.64 0.269 
50 26 59 700 0.52 0.219 
60 21 4.7 800 0.43 0.193 
70 17.0 3.7 900 0.356 0.177 
80 13.9 SENE 1000 0.310 0.171 
90 114 2.55» 1100 0.280 0.169 
100 9.5 2.25 1200 0.261 0.173 
125 6.0_ 1.66 1210 0.173 
150 4.20" 1.30 1300 0.248 
175 3.25 1.10 1400 0.237 
200 2.66 0.95 1500 0.227 
250 1.95 0.73 1600 0.219 
300 1.56 0.60 1681 0.216 


a The bar over the last digit indicates that it is rather uncertain. 
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temperature data is very good. The Debye temperature 
0 of Ge is 395°K. These results approach to within 
16°K of the melting point, MP, of Ge at 1210°K. The 
absolute accuracy of both the high- and low-temperature 
results is computed to be +5%, while the relative ac- 
curacy of any two points measured on the same sample 
is +2%. Table I gives smoothed values for the K of Si 
and Ge. The low-temperature region where K depends 
on sample size has been excluded. 


PREVIOUS DATA 
A. Silicon Below 300°K 


A comparison of the present and previous low- 
temperature K measurements of silicon below 300°K 
is shown in Fig. 6. Except for the measurements by 
Thompson and Younglove, all of the measurements 
agree quite well above 100°K. Some of the variation 
in K below 100°K is accounted for by boundary scat- 
tering in different size samples. However most of the 
variation in Fig. 6 is caused by phonon scattering from 
impurities and other crystalline defects. The earliest re- 
port of thermal conductivity measurements of silicon 
at low temperatures of which the authors are aware 
was made by Rosenberg.? He measured a polycrystalline 
silicon sample over the temperature range 2 to 100°K, 
and employed gas thermometers to measure the tem- 
perature gradient during steady-state heat flow. Scat- 
tering by the grain boundaries in his polycrystalline 
sample is probably the reason that Rosenberg’s K re- 
sults are much lower than any of the others. White and 
Woods" have measured a single crystal silicon sample 
(room-temperature resistivity 6.7 Q-cm) over a tempera- 
ture range of 2 to 100°K. The impurities present in this 
sample were not stated. Thompson and Younglove;" in 
the temperature interval 2 to 200°K, have measured 
several samples of silicon doped with boron or arsenic. 
Figure 6 shows their measurement of their most highly 
doped silicon sample (boron doped with a room- 
temperature resistivity of 2000 Q-cm). Carruthers et al.” 
have measured the thermal conductivity of a high- 
purity single crystal silicon sample, 7 type from 2 to 
300°K. The net carrier concentration was 5X10" at 
300°K. Holland? measured the thermal conductivity 
of several silicon samples containing different amounts 
of oxygen and has discussed the effect of oxygen as à 
scatterer of phonons to reduce the thermal conductivity- 
His measurements of an n-type sample with the least 
amount of oxygen (<10!® cm-3) and a room-tempera- 
ture resistivity of 260 Q-cm agree very closely with the 
present experiment. 


° H. M. Rosenberg, Proc. Phys. Soc. (London) A67, 837 (1954). 

10G. K. White and S. B. Woods, Phys. Rev. 103, 569 (1956). 

u J. C. Thompson and B. A. Younglove, Phys. Chem. Solids 
20, 146-149 (1961). 

2 J. A. Carruthers, T. H. Geballe, H. M. Rosenberg, and J. M. 
Ziman, Proc. Roy. Soc. (London) A238, 502 (1957). 

13M. G. Holland, Proceedings of the International Conference 
on Semiconductor Physics, Prague, 1960 (Academic Press Inc- 
New York, 1962), p. 633. 
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B. Above 300°K 


Various previous authors have measured the K of 
Si and Ge over part of the high-temperature range 
studied here. Consider Si first, Figure 7 shows how the 
present results on Si for T>300°K. compare with some 
of the better previous data. The earliest K data on Si 
that has been reported is that of Koenigsberger and 
Weiss" in 1911 who found 0.84 W/cm deg at 290°K for 
an impure sample. Later authors,” who have meas- 
ured A as a function of temperature for 77 300*K, are 
listed in Table II. The recent values of K determined by 
Shanks ef al? from thermal diffusivity measurements 
are about 10% lower than the present ones between 
300 and 900°K, and 20% higher at 1400°K. The abso- 
lute accuracy of the K values in the present experiment 
is +5%, and this same limit of error seems to be present 
in the data of Shanks ef al., as judged from the scatter 
of the points and the size of the samples used. Thus the 
10% difference is just within the combined limit of 
error. The almost temperature-independent K found by 
Shanks eż al. for T7 1000*K, however, is not confirmed. 
The K of Si continues to decrease with increasing tem- 
perature up to 1580°K, The high-temperature heat- 
capacity data used by Shanks ef al. is in reasonably 
good (3-295) agreement with that given by Kelley,” 
so the conversion from thermal diffusivity to K should 
be accurate. The reason for the discrepancy in K deter- 
mined by these two different methods is not as yet 
understood. The K results of Abeles et al.?!?2 from 310 
to 1200°K, also determined from thermal diffusivity 
measurements, fall between the present curve and that 
of Shanks et al. 

In reviewing the literature on the K of Ge it can be 
seen that the early measurements on Ge by Greico and 
Montgomery? gave K=0.59 W/cm deg at 298°K, in 
good agreement with the present value of 0.60 W/cm 
deg at 300°K. The temperature variation of K of Ge 
for temperatures near or above room temperature has 
been measured by many investigators. These are 


MJ. Koenigsberger and J. Weiss, Ann. Phys. 35, 1 (1911). 

15 B. B. Kuprovski and P. V. Geld, Fiz. Metal i Metalloved. 3, 
182 (1956). A : 

15 P, V. Geld, Zh. Tekhn. Fiz. 27, 113 (1957) [English transl.: 
Soviet Phys.—Tech. Phys. 2, 95 (1957) ]. mid 

17 W, D. Kingery, J. Am. Ceram. Soc. 42, 617 (1959). 

18 H, Mette, W. W. Gärtner, and C. Loscoe, Phys. Rev. 117, 
1491 (1960). S SE (190) 

19 A. D. Stuckes, Phil. Mag. 5, ; 

?' R. D. Morris and J. G. Hust, Phys. Rev. 124, 1426 (1961). 

?! B, Abeles, D. S. Beers, G. D. Cody, and J. P. Dismukes, 
Phys. Rev. 125, 44 (1962). e Zt 

2D. pot a Cody, and B. Abeles, Proceedings of the 
International Conference of the Physics of pa (The 
Institute of Physics and the Physical Society, London, 1962), p. AL. 

23 H. R. Shanks, P. D. Maycock, P. H. Sidles, and G. C. Daniel- 
son, Phys. Rev. 130, 1743 (1963). à 

ER. G Morris and J. L. Martin, J. Appl. Phys. 34, 2388 (1963). 

23 K, K. Kelley, U. S. Bureau of Mines Bulletin No. 584, 
Washington, D. C. (unpublished). 1 $ 

2 A. Greico and H. C. Montgomery, Phys. Rev. 86, 570 (1952). 
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Fic. 7. Comparison of present and previous high-temperature 
K versus T results for Si. 


listed5212225795 in Table IT. Some of these K versus T 
curves for Ge are shown along with the present results 
in Fig. 8. The agreement of the older results with the 
present ones is to within +20% at 300°K, but becomes 
increasingly poor at the higher temperatures. The very 
abrupt rise in K suggested by the authors’ in 1960 is 
not confirmed. The change in slope is much more grad- 
ual. The large range of values for K of Ge above 700°K 
found in the other curves in Fig. 8 can be attributed to 
experimental difficulties in measuring and controlling 
the thermal radiation losses. The K values deduced 
from thermal diffusivity measurements by Abeles 
et al." are not subject to this source of error, and the 
agreement of these results with the present ones be- 
tween 500 and 1070°K is very good. The 10% discrep- 
ancy between the present results and those of Abeles 
el al. at 300°K is about at the outer limits of accuracy 


“K. A. McCarthy and S. S. Ballard, Phys. Rev. 99, 1104 

1955). 
: 75 A. F. Ioffe, Can. J. Phys. 34, 1342 (1956). 

?? M. Shtenbeck and P. I. Baranskii, Zh. Tekhn. Fiz. 27, 233 
(1957) [English transl.: Soviet Phys.—Tech. Phys. 2, 205 (1957) ]. 

? P. I. Baranskii and N. S. Konopliasova, Zh. Tekhn. Fiz. i 
1621 (1958) [English transl.: Soviet Phys.—Tech. Phys. 3, 1493 

1958) ]. 
s E s Phys. Chem. Solids $, 340 (1959). 

= F, Kettel, Phys. Chem. Solids 10, 52 (1959). 

9 J. I. Pankove, Rev. Sci. Instr. 30, 495 (1959). 

* A. D. Stuckes, Phil. Mag. 5, 84 (1960). 

3E. D. Devyatkova and I. A. Smirnov, Fiz. Tverd. Tela 2 
561 (1960) [English transl: Soviet Phys.—Solid State 2507 
(1960) 


35 E. D. Devyatkova and L A. Smirnov, Fiz, Tverd. Tela 4. 
1669 (1962) [English transl.: Soviet Phys.—Solid State 4, 1227 
(1962) ]. 
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Fic. 8. Comparison of present and previous high-temperature 
K versus T results for Ge. 


of the two experiments. The reason for this probably 
real discrepancy is not completely understood. 


INTERPRETATION OF LOW-TEMPERATURE 
MEASUREMENTS 


In the temperature range below 300°K, the K results 
on Si and Ge can be interpreted using Callaway's forma- 
lism. The K due to heat transport by phonons is 


TABLE II. Literature references to the thermal conductivity 
of Si and Ge for T2 300*K. 


Temp. 
range 
Method ?K 


Mtrl Author Year Ref 
Si Koenigsberger and Weiss c 293 1911 14 
Kuprovski and Geld b 380-1190 1956 15, 16 
Kingery c 370-540 1959 17 
Mette el al. e 550-770 1960 18 
Stuckes c 300-580 1960 19 
ET Morris and Hust C 300-700 1961 20 
- Abeles et al. d 310-1200 1962 21, 22 
Shanks et al. d 300-1400 1963 23 
Morris and Martin c 680-1000 1963 24 
Ge  Greico and Montgomery c 298 1952 26 
McCarthy and Ballard © 280-370 1955 27 
Ioffe a 300-870 1956 28 
Shtenbeck and Baranskii a 200-370 1957 29 
Baranskii and Konopliasova a 80-370 1958 30 
Abeles a 320-1070 1959 31 
Kettel t a 390-1000 1959 32 
Pankove e 370-970 1959 33 
Stuck 7 c 310—680 1960 34 
Devyatkova and Smirnov a 80-440 1960, 62 35, 36 
Slack and Glassbrenner a,b 3-1020 1960 6 
Abeles et al. d 310-1070 1962 21, 22 


Absolute, steady-state, longitudinal heat flow. 
Absolute, steady-state, radial heat flow. 
Comparative, steady-state longitudinal heat flow. 
Variable state, thermal diffusivity. 


Callaway, Phys. Rev. 113, 1046 (1959). 
Callaway and H. C. von Baeyer, Phys. Rev. 120, 1149 


allaway, Phys. Rev. 122, 787 (1961). 
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PEIN xte” 
e VE a (n 
A h ) I (e7—1)? (1) 


where k= Boltzmann's constant, »— average sound ve- 
locity, T— absolute temperature, h= Planck's constant, 
9=Debye temperature, += (ftw/kT), w=phonon. fre- 
quency, and rc is a combined relaxation time. The 
complicating effects? of normal phonon scattering have 
been neglected. The combined relaxation time is thus 


taken as 


given by 


rg log rr Hro, (2) 


where rz, Tr, Tu are the relaxation times for boundary, 
isotope, and umklapp scattering of the phonons. 

The particular form of the relaxation times shall now 
be examined. Klemens‘?! has derived a relaxation time 
for scattering by point imperfections. His results are 
valid for isotope scattering. 


Tr (o) 3Volo/7v? , (3) 


where Vs average volume per atom in the crystal, and 
T — point impurity scattering parameter.” For umklapp 
scattering the following semiempirical expression” will 
be used: 

tu = Byw T. (4) 
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Fic. 9. Comparison of the low-temperature K results for Si 
with the theory in which various phonon scattering mechanisms 
are considered. 


4 P. G. Klemens, in Solid State Physics, edited by F. Seitz and 
D. Turnbull (Academic Press Inc., New York, 1958), Vol. 7, p- 1. 

^: P. G. Klemens, in Handbuch der Physik, edited by S. Flügge 
(Springer-Verlag, Berlin, 1956), Vol. 14, p. 198. 

42 G. A. Slack, Phys. Rev. 126, 427 (1962). 

53 G. A. Slack and S. Galginaitis, Phys. Rev. 133, A253 (1964). 
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Fic. 10. The function (W—W,)T— versus T for Si as a means of determining the extrapolated 
lattice thermal resistivity at high temperatures. 


Here By has the approximate value of 
Bu~ (hy?/2rmbv") exp(—6/3T) , (3) 


where y= Grüneisen's constant assumed to be equal to 
2, and m- average mass of a single atom. For boundary 
scattering the relaxation time for diffuse scattering of 
the phonons off the walls of the crystal is simply 


Tpl-— v/L ) (6) 


where Z is the average diameter of the crystal. For Si 
these various relaxation times have been included in the 
K expression given in Eq. (1) using the following values: 
7—6.4X10* cm/sec, 0—674?K, Vo=1.99X 10-3 cm?, 
T'—1.65x 10-5, and L— 0.44 cm. The results of the cal- 
culations for various combinations of relaxation times 
are shown in Fig. 9 for Si. It can be seen that all three 
relaxation times must be used in order to obtain agree- 


‘ment between theory and experiment. The theoretical 


curve with all three scattering processes included gives 
a maximum K of 56 W/cm deg at 28°K for Si compared 
to the observed maximum of 39 W/cm deg at 25 K. 
The fit at lower K values is rather better. In view of 
the simple model used for deriving Eq. (1) such agree- 
ment is considered reasonable. A similar sort of analysis 
has been carried out by other authors®*” for Ge at low 


temperatures, where the agreement is even better. Some 
improvement in the fitting of the theory to experiment 
can be made by including the effects of both the trans- 
verse and longitudinal phonon branches and the effects 
of phonon dispersion.“ These added complications will 
not be considered here. The primary conclusion to be 
drawn from the low-temperature results for T<300°K 
is that phonons are the only significant carriers of the 
thermal energy, and theyare scattered by other phonons, 
isotopes, and the crystal boundaries. Other carriers of 
thermal energy do, however, become important for 
T>300°K. 


INTERPRETATION OF HIGH-TEMPERATURE 
MEASUREMENTS 


Lattice Thermal Conductivity 


In the present context high temperature means 
T>300°K. In this region it is necessary to consider 
photons and electrons and holes as well as phonons as 
possible carriers of thermal energy. We shall assume that 
these various carriers interact only weakly so that their 
contributions to the thermal conductivity are additive. 


“ M. G. Holland, Phys. Rev. 132, 2461 (1963). 
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n 
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x 
+ 6.0 
$ 
‘N 
z 
T 
tis 
x 
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4.0 
o 200 600 
TE MPERATURE,*K 
Thus 
K=K,+K-+K,, (7) 


where K, is the phonon, K, is the electronic, and K, is 
the radiative or photon contribution to K. 

Let us first consider the lattice thermal conductivity 
K,. Athigh temperatures we have to consider the relaxa- 
tion times 7z and ry given in Eqs. (3) and (4), but we 
can neglect rs in Eq. (6). Equation (4) gives the relaxa- 
tion time for 3-phonon umklapp processes. However, 
for 300°K <T « 1681?K, we are in the range of T com- 
parable to or greater than the Debye temperature 0 for 
both Si and Ge. In this range, it may be necessary to 
consider the relaxation times for four-phonon processes, 
as Pomeranchuck*^-^ has pointed out. He gives a relaxa- 
tion time for these higher order (H) processes as: 


tH = Bpgo?T*?, (8) 


with By, a constant. K, can be evaluated for 770 from 
Eqs. (1), (3), (4), and (8) if c is taken as 


He = fa BO ad 


In the region where T>6 the quantity « in Eq. (1) is 
small, and the integral simplifies to 


b /RTNS pol? 
K= (S) | tox dx. (9) 
2r v\ h 0 


45 T. Pomeranchuk, Phys. Rev. 60, 820 (1941). 
46 T. Pomeranchuk, J. Phys. USSR 4, 259 (1941). 
47 T. Pomeranchuk, J. Phys. USSR 7, 197 (1943). 
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Also the exponential factor in By disappears to make 
By temperature-independent. Thus 


rc 1— (BuT+BuT)w*+ Ao*. (10) 


For T>6 the isotope scattering is much less important 
than the phonon-phonon scattering. In this limit Eqs. 
(9) and (10) can be reduced by the method used by 
Ambegaokar;? to 


Kj12W,—Wu-t-Wu-r-Wi, 
where 
Wo=avhByT /OR’ , 
Wy ah By T?/0R* , (11) 
Wr-4a?VfT/In? . 


This reduction requires W ,>>W z. This condition is ful- 
filled for Si and Ge at high temperatures. The only 
really unknown quantity in Eq. (11) is Bu. The quanti- 
ties By and By can be evaluated experimentally from à 
plot of (WW,—W;z)T- versus T. The quantity Wr i$, 
except for a difference of a factor of 12 in the definition 
of T, the same as that given by Ambegaokar.** For Si 
one obtains W,;=0.033 cm deg/W. For Ge the value I$ 
Wr—047 cm deg/W using Vo=2.26X10-% cm, 
6—395*K,9 T=4.90X 105,9? and v=3.94X 105 cm/sec. 


48 V. Ambegaokar, Phys. Rev. 114, 488 (1959). : 

59 P. Flubacher, A. J. Leadbetter, and J. A. Morrison, Phil. 
Mag. 4, 273 (1959). 

9D. Strominger, J. M. Hollander, and G. T. Seaborg, Rev. 
Mod. Phys. 30, 585 (1958). 
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The determination of W, is the next problem. Con- 
sider Eq. (7). As has been shown,* K, is negligible at all 
temperatures for Ge. The measurements of Ukhanoy?! 
on the optical absorption of Si and Ge at elevated tem- 
peratures were employed to calculate K,. This calcula- 
tion shows that K, is also negligible (i.e., less than 2% 
of total K) for Si at all temperatures. The K, term in 
Eq. (7) becomes important only at the very highest 
temperatures when the number of thermally generated 
carriers becomes very large. This term is responsible for 
the upturn in the K results shown in Figs. 2 and 3 above 
800°K. Thus K, is the only significant term in Eq. (7) 
for 7<800°K in Si and Ge. Therefore, the total meas- 
ured thermal conductivity is K= W~ for T «800*K. 
Figures 10 and 11 show the plots of (W —W ;)T- versus 
T for Si and Ge. The linear approximation predicted by 
Eq. (11) is satisfactory for 720.50 until at high temper- 
atures the K, term becomes important. The dashed ex- 
trapolations in Figs. 10 and 11 show what the behavior 
would be if only the lattice thermal conductivity K, 
were present. 

The linear portions of the curves in Figs. 10 and 11 
yield the following empirical expressions for W,: 


Ge: W,-(3.95X 10-7 --3.38» 10-97?-4-0.17) 
cm deg/W, 


Si: W= (1.56X 107+ 1.65X 10-*7?--0.03) 
cm deg/W. 


The first term in these equations yields an experimental 
value of By which agrees with that estimated from Eq. 
(S) to within a factor of two for both Si and Ge. Such 
agreement is quite satisfactory. If the very approximate 
theory used in obtaining Eq. (11) is taken at face value, 
the results in Eq. (12) for By show that the ratio ry/ry 
is 3 for Ge and 1 for Si at T=0. This ratio becomes 
smaller at higher temperatures. Notice that we have 
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Fic. 12. Comparison of the experimental and theoretical electronic 
thermal conductivity, Ke versus T, for Si. 


35 Yu. T. Ukhanov, Fiz. Tverd. Tela 3, 2103 (1961) [English 
transl.: Soviet Phys.—Solid State 3, 1529 (1962). 
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Fic. 13. Comparison of the experimental and theoretical electronic 
thermal conductivity, K, versus T, for Ge. 


chosen to interpret the observation that K,forSiand Ge 
at high temperatures decreases faster than T~ as a sign 
that 4-phonon processes are taking place. A similar con- 
clusion has been arrived at by Stuckes! and by Steig- 
meier and Kudman* for various IV and III-V semi- 
conductors. The result that the 3-phonon and 4-phonon 
relaxation times, rø and ry, are nearly equal at T=9 
should, however, be greeted with some caution. The 
main purpose of the plots in Figs. 10 and 11 is to allow 
one to make a reasonable extrapolation of K 9 in the 
temperature region where K, is appreciable. 

This extrapolation of K, from Figs. 10 and 11 has 
been replotted as the dashed curve in F igs. 2 and 3. The 
difference between this curve and the experimental re- 
sults is, from Eq. (7), just K,. These experimental 
values of K. for Si and Ge are plotted in Figs. 12 and 13, 


Electronic Thermal Conductivity 


The experimental results for K, can be compared with 
the theory that has been developed by Davydov and 
Shmushkevitch,? by Pricej4 and by Drabble and 
Goldsmid.** Their results for K, can be written as the 
sum of an electronic polar contribution K.p, and an 
electronic bipolar part Ky. 


K.= KeptKey E 
where 


2 


k£ 
Ka=-] cT, 
e 


b Ee Prk T 
Keo= [+4] B oT. 
(1+-b)*LkT e 


5 E. F. Steigmeier and I. Kudman, Phys. Rev. 132, 508 (1963). 

5 B. Davydov and I. Shmushkevitch, Uspekhi Fiz. Nauk 
USSR 24, 21 (1940). 3 E 

Sepe ach Phil. Mag. 46, 1252 (1955). 

55 J. R. Drabble and H. J. Goldsmid, Thermal Conduction in 
Semiconductors (Pergamon Press, Inc., London, 1961), pp. 104-119. 


(13) 
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TABLE ITI. The theoretical electronic thermal conductivity 
of Si and Ge versus temperature. 


GLASSBRENNER 


AND) (Gu NS SING i 
TARLE IV. The experimentally determined contributions to K in 
solid Si and Ge at their respective melting points. 


c : = 5 
ohm Kep Ken Ke 


T. ——— Ea 
Crystal °K  (1+b) eV cm mW/cm deg 
Si 800 0.21 0.92 3.9 00 14 14 
1000 — 021 0.85 22. 03 62 6.5 
1200 — 021 0.78 73. 13 164 17.7 
1400 — 021 0.71 170. 3.5 319 35.4 
1600 — 021 0.55 315. 7.5 50.5 58.0 
1681 0.21 0.52 385. 9.6 582 67.8 
Ge 400 0.20 0.61 0.9 00 03 0.3 
600 019 0.52 30. (X3) SO 59 
x 800 0.18 0.43 160. 19 18.0 19.9 
1000 0.16 0.34 440. 6.5 33.4 39.9 
1200 0.16 0.26 870. 15.5 51.9 67.4 
1210 0.16 0.25 890. 16.0 52.5 68.5 


The various quantities are defined as k= Boltzmann’s 
constant, e=electronic charge, o= total electrical con- 
ductivity at T, b= mobility ratio at T, and Ec— indirect 
energy gap at T. The polar part K.p is the usual 
Wiedemann-Franz-Lorenz contribution present in met- 
als. The bipolar part Ke» is a property of semiconduc- 
: tors, and is caused by electron-hole pairs with energy 
Eg diffusing down the temperature gradient. Equation 
(13) has been derived for a very idealized semiconductor 
which is assumed to be intrinsic, to possess simple para- 
bolic valence and conduction bands, and in which the 
charge carriers suffer only acoustic mode lattice scat- 
tering where the scattering probability varies with the 
carrier energy e as 4/e. Neither Si nor Ge can be said to 
fit this model very well since they have multiple minima 
conduction bands, multiple valence bands, and since 
both acoustic mode and intervalley scattering occur. An 
exact calculation of K, in which all of these effects are 
considered is not undertaken here. However, as will be 
seen, Eq. (13) yields predictions which are in reasonable 
agreement with the experiment. 

The experimental parameters that are used in Eq. 

(13) are v, b, and Ec. Representative values for these 
parameters and the calculated values for Kep and Ke» 
are given in Table III. For Si, the c (T) values have been 
taken from Morin and Maita*? and Mokrovskii and 
Regel? and the values of b versus T have been esti- 
mated from an extrapolation of the mobility curves of 
Morin and Maita.9* The Ec(T) values? have been 
corrected for more recent values of the effective mass, 
and are taken from Hannay.5* It has been assumed that 
Eg varies linearly with temperature and the value of 
Eg= (1.21—4.1X 10-57) eV was used. 

For Ge the c (T) values have been taken from Morin 
and Maita,? from Epstein,® and from the recent com- 
Tun 

s F, J. Morin and J. P. Maita, Phys. Rev. 96, 28 (1954). 

57 N. P. Mokrovskii and A. R. Regel, Zh. Tekhn. Fiz. 23, 779 
Ce M B. Hannay, Semiconductors (Reinhold Publishing Corpora- 
tion, New York, 1959), p. 332. 

9 F, J. Morin and J. P. Maita, Phys. Rev. 94, 1525 (1954). 

o DE Epstein, Ph.D. thesis, Purdue University, 1954 
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Si Ge 
Quantity 1681°K 1210°K 
Kep/K 5% 9% 
Keo/K 32% 32% 
K,/K 63% 50% 
K (W/cm deg) 0.22 0.17 


plication of Hamilton and Seidensticker.! The values 
of b versus T have been estimated from an extrapolation 
of the data given by Morin and Maita.? The Es(T) 
values? have been corrected for better effective mass 
values. The equation used for Ge was £g (0.785— 4.4 
10-7) eV. The computation of Kep depends only on 
c and is probably accurate to +5%. The computation 
of K.» depends similarly on ø and is also sensitive to 
errors in Eg and b. The over-all accuracy in the calcula- 
tion of K.» is estimated to be 4- 1597. Figures 12 and 13 
show how the calculated theoretical value of K, com- 
pares with the experimental determination, which is 
judged to be accurate to within +5 mW/cm deg. This 
uncertainty is indicated by the broad band in Figs. 12 
and 13. Since the theoretical values of K, are probably 
accurate to 4-1505, the agreement with theory is con- 
sidered quite adequate. In view of the uncertainties in 
both the calculated and experimental values of K, it 
does not seem worthwhile, at present, to undertake a 
more exact calculation of K, in which the details of the 
band structure of Si and Ge are carefully taken into 
account. 

The three contributions to the total thermal conduc- 
tivity, Kto at the melting point are K,, Kep, and Kes. 
Table IV shows the relative contributions of these three 
terms for Si and Ge. K, is estimated from the extra- 
polated dashed curve in Figs. 2 and 3, Kep is calculated 
form Eq. (13), and Ke» is taken to be equal to Ki— Ko 
— K,,. For both materials the lattice thermal conduc- 
tivity is still dominant at the melting point and is 61% 
of the total, the bipolar electron-hole pair diffusion ac- 
counts for 3295, while the usual Wiedemann-Franz- 
Lorenz contribution is only 7% of the total. 

The experimental value of K.» determined from 
Ktot—K—Kep can be used to estimate Eg by means of 
Eq. (13). The values of b and o at the melting point are 
taken from Table III, and the experimental values of 
K. are taken from Figs. 12 and 13. The result is that at 
the melting point the thermal band gap is Eg=0.6+0.1 
eV for Si and Eg=0.26+0.08 eV for Ge. These values 
are not very far from the linearly extrapolated values 
of 0.52 and 0.25 eV given in Table IIT. The K, results 
show that the band gap in the solid at the melting point 
is nonzero. The gap thus drops discontinuously to zero 
upon melting, since both liquid Si and Ge are metallic 
in character.*?.6! 


*: D. R. Hamilton and R. G. Seidensticker, J. ]. Phys. 34, 
2697 (1963). eidensticker, J. App y 
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THERMAL CONDUCTIVITY OF Si AND Ge 


MAGNETIC FIELD EFFECTS 


The electronic contribution to the K of Ge is about 
40% of the total at 1175°K, a temperature 35°K below 
the melting point. An attempt was made to alter this 
electronic contribution by applying a moderate mag- 
netic induction in a direction perpendicular to the axis 
of the K sample. Magnetic induction strengths (MKS 
units) up to 0.6 Wb/m? (6000 G) were applied, but no 
effect on K was observed to within zE 297. Steele found 
that values up to 1.15 Wb/m* had no effect on the K of 
Ge at 278°K. Here the estimated K.=Kept+K ep is only 
4X 10% of the total K for intrinsic Ge. His experiment 
was not sufficiently sensitive to see changes this small 
even if the magnetic induction had been sufficiently 
strong to suppress K.. Magnetic effects on K have been 
seen in HgSe,™ BisTe3,° and InSb.9*? The requirement 
on the magnetic induction is that it must be large enough 
so that the charge carriers will make about one cyclo- 
tron orbit before being scattered. This minimum value,” 
Bm, is approximately 


Bn=T/p, 


where B is in Wb/m?, and y is the carrier mobility in 
m?/V-sec. The electron mobility in intrinsic Ge at 
1175?K is about 0.03 m?/V-sec, the hole mobility is 
even lower. The minimum magnetic induction is thus 
about 100 Wb/m? (10* G). It is rather clear why no 
magnetic effects were seen in Ge. No experiments on 
Si were tried. 


CONCLUSIONS 


The thermal conductivity K of silicon has been meas- 
ured from 3 to 300°K using a steady-state, longitudinal 
heat flow apparatus. By using the relaxation times for 
umklapp scattering, isotope scattering and boundary 
scattering, and Callaway's simple model, a theoretical 
phonon thermal conductivity was calculated and was 
found to agree with the experimental measurements. 


& M. C. Stecle, Phys. Rev. 107, 81 (1957). 

& Kh. I. Amirkhanov, A. Z. Daibov, and V. P. Zhuze, Dokl. 
Akad. Nauk SSSR 98, 557 (1954). ; 

St A. E. Bowley, R. Delves, and H. J. Goldsmid, Proc. Phys. 
Soc. (London) 72, 401 (1958). s / k 

6 D, Kh. Amirkhanova and R. I. Bashirov, Fiz. Tverd. Tela 
2, 1597 (1960) [English transl: Soviet Phys.—Solid State 2, 
1447 (1961) ]. 3 

e e este and F. Hornstra, Rev. Sci. Instr. 33, 1067 
(1962). 
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A high-temperature thermal conductivity apparatus 
employing steady-state, radial heat flow in a cylindrical 
sample was designed and was found to give accurate 
(5%), absolute, measurements at temperatures from 
300 to 1600°K. The method does not require any cor- 
rection for heat loss by radiation. The K values so ob- 
tained agree with those found for both Si and Ge using 
the low-temperature, longitudinal heat flow apparatus. 

The results show that phonons are the dorninant 
carriers of heat in high purity, solid Si and Ge at all 
temperatures from 3°K to their melting points. The 
phonon or lattice thermal conductivity K, decreases 
faster than the T~ law predicted by three phonon urn- 
klapp scattering for T>6, where 0 is the Debye tempera- 
ture. For temperatures 776, an approximate theoret- 
ical formula has been developed for K, which agrees 
with experimental results between 0.50 « T — 1.60. This 
formula is based on the assumption that three-phonon, 
four-phonon, and isotope scattering are important at 
high temperatures, and it predicts that the thermal re- 
sistivities of these three processes are additive. It is sug- 
gested that the relaxation times for both three- and 
four-phonon processes are comparable for T29. 

For temperatures above about 1.69 an electronic ther- 
mal conductivity K, has been found in both Si and Ge. 
The theoretical and experimental values of K, agree 
quite well. The major part of K, comes from the bipolar 
term, with some small contribution from the polar term. 
At the melting point the phonons contribute an average 
of 61% to the total K while the bipolar and polar con- 
tributions are 32% and 7%, respectively. 

The thermal band gaps at the melting points are esti- 
mated from K, as 0.62-0.1 eV for Si and 0.26+0.08 eV 
for Ge. They are in reasonable agreement with the linear 
extrapolations from lower temperatures. 
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Soft X-Ray Production by 1.5-MeV Protons f 


W. T. Osr, G. J. Lucas,* J. S. Murray,* AND T. E. Horzer* 
Pomona College, Claremont, California 
(Received 31 December 1963) 


Thick-target yields of x rays with wavelengths from 1.5 to 44 À have been measured as functions of 
bombarding proton energy near 1.5 MeV. X-ray production cross sections computed from these yields are in 
satisfactory agreement with existing theoretical excitation cross-section calculations insofar as current 
estimates of fluorescence yields permit comparisons to be made. Yields at 1.5 MeV of the ultrasoft x rays 
measured were, in quanta per proton steradian : Alg, 3.5X 1075; Cuz, 9x 1075; Cx, 4.5X 10%. 


INTRODUCTION 


NTEREST in determination of x-ray yields from 
proton-bombarded targets revived with the advent 
of scintillation counting. Extensive early investigations 
by Lewis, Simmons, and Merzbacher! and by Bernstein 
and Lewis? were followed by a number of experimental 
and theoretical studies. The state of the field was then 
reviewed and analyzed in a satisfying way for proton 
energies above one MeV by Merzbacher and Lewis.? 
Subsequent interest has centered on energies below one 
MeV for which the Born approximation does not yield 
completely satisfactory results.^? The use of scintil- 
lation counters in these modern measurements served 
to limit observations to x rays with quantum energies 
of 4.5 keV or more. 

The availability of thin-window proportional counter 
techniques makes possible quantitative study of yields 
of x rays with quantum energies at least as low as 280 
eV.5 Reports of such proton-yield measurements have 
only recently begun to appear.' In this investigation 
yields of a number of K and L x-ray bands with quantum 
energies between 8000 and 280 eV (1.5 to 44.6 À) were 
measured for bombarding proton energies between 1.2 
and 1.6 MeV. These yield functions are used to calcu- 
late cross sections for x-ray production at 1.5 MeV. 
The ratio of production cross section to excitation cross 
section is the fluorescence yield. Current estimates of 
excitation cross section and fluorescence yield both in- 
volve uncertainties in the soft x-ray region. These re- 
sults do not resolve either uncertainty, therefore, but 
they fit well with reasonable estimates of both 
parameters. 


; * National Science Foundation undergraduate research par- 
"SI W. Lewis, B. E. S 

1H. W. Lewis, B. E. Simmons, and E. Merzbacher, Phys. 

91, gis (1953) erzbacher, Phys. Rev. 
. M. Bernstein and H. W. Lewis, Phys. Rev. 95, 83 (1954). 

3E. Merzbacher and H. W. Lewis, Handbuch der sk 
as by S. Fliigge (Springer-Verlag, Berlin, 1958), Vol. 34, p. 

4S. Messelt, Nucl. Phys. 5, 435 (1958). 

SR. C. Jopson, H. Mark, C. D. Swift, and M. A. Williamson, 
Phys. Rev. 131, 1165 (1963); 127, 1612 (1962) contains a recent 
bibliography. ; 

6 R. M. Dolby, Brit. J. Appl. Phys. 11, 64 (1960). 

7 J. M. Khan and D. L, Potter, Phys. Rev. 133, A890 (1964). 


EXPERIMENTAL PROCEDURE 


Targets of the metals investigated were bombarded 
inside the main tank of the 23-in. cyclotron. This cyclo- 
tron, a prototype model, employs radial sector Thomas 
focusing without the usual tapering of the gap at larger 
radii to maintain orbit synchronism. Since the sector 
focusing is strong in the vertical direction for eccentric 
orbits as well as for symmetric orbits, a target inside 
the tank is exposed to a considerable spectrum of ener- 
gies which can be limited somewhat by careful collimat- 
ing of the incident beam at the target position. Such 
collimation was provided, and since negligible currents 
are found in the symmetric orbits at large radii, the 
angle of collimation was set at approximately 83? with 
the radial direction in order to select the more energetic 
eccentric orbits. The proton energy was varied by mov- 
ing the target assembly radially. In the absence of 
computed energies, the yield of tantalum L x rays was 
used to calibrate the beam energy using the data of 
Bernstein and Lewis? extrapolated below 1.5 MeV with 
the aid of the calculations of Merzbacher and Lewis. 
The surface of the target was set optically in each case 
at the angle of specular reflection between the beam 
direction and the radial direction along which x rays 
were viewed. The proton and the resulting x ray thus 
traveled equal distances in the target material. 

'The x rays were counted outside the magnetic field 
at the end of a 129-cm evacuated drift path by means of 
a flow proportional counter of standard design using 
P-10 gas (10% methane, 90% argon). The window 
aperture was a single 0.063-in.-diam hole inside the 
vacuum, which thus subtended a solid angle at the 
target of 1.21 10-5 sr. The window material, 0.00025- 
in. aluminized Mylar, was bonded with epoxy cement 
over a slightly larger aperture set just behind the win- 
dow. One corner of the Mylar was folded under to 
ground the aluminized surface. This structure gave good 
counting rates with proton currents smaller than one 
microampere in all cases except that of tantalum L 
radiation. Because of the importance of the tantalum 
data for calibration, it was checked using an aluminum 
window of larger aperture. The two runs were in very 
close agreement. 


* C. F. Hendee, S. Fine, and W. B. Brown, Rev. Sci. Instr. 27; 
531 (1956). 
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SOFT X-RAY 

Yields were determined by counting the pulses cor- 
responding to the x-ray band desired with a single- 
channel pulse-height analyzer while proton current was 
being monitored with a calibrated galvanometer. The 
pulses corresponding to a particular band were well 
resolved, but the structure within bands was not visible. 
The bremsstrahlung background was apparently very 
small and was ignored. In particular, in the case of 
carbon-K radiation a clear minimum existed in the 
pulse-height spectrum between the counting pulses and 
the amplifier noise. The discrimination level was set 
just above the noise. The yields reported from the x rays 
with quantum energy greater than the ionization poten- 
tial of argon include the attenuated argon-K escape 
pulses. 

Counter efficiencies due to window and gas absorp- 
tion were computed from the measured geometry of 
the counter using the mass absorption-coefficient inter- 
polations of the data of Allen by Henke, White, and 
Lundberg.’ The path length in the argon was 2.26 cm. 
The computed efficiencies are recorded in the first 
column of Table I. The transmission factor of the Mylar 
for carbon-K radiation was thus estimated to be 0.04. 
This transmission was checked experimentally by plac- 
ing a second layer of the same material in the x-ray 
path in the vacuum. The measured transmission varied 
from place to place in the film. The value adopted was 
0.038+0.01. 


EXPERIMENTAL RESULTS 


The yields of x rays, Y,, in quanta per proton stera- 
dian at 1.5-MeV bombarding energy are given in the 
second column of Table I and the yields as functions of 
bombarding energy are presented graphically in Fig. 1. 
Considerable uncertainty lies in the determination of 
proton energy, since the collimated proton beam had a 
spread in energy of some 0.2 MeV and the energy 
calibration below 1.5 MeV is based on extrapolation. 
The curves for tantalum-L and iron-K radiation both 
fit well the results of Merzbacher and Lewis,’ but there 


"TABLE I. Data at 1.5 MeV. 


Y, 
uanta 4 Oz 
Counter 2 - (1.5 MeV) Yield 
efficiency proton-sr p cm? factor 
0.038 4.5X10 3000 3 X107 0.007 
A 0.45 3.5X10* 330 1.0X10 0.065 


114 11x10? 0 
3.8X103 0 
2, v w 
1-3» 0D: 
@ xr 
40X10? 0. 
27X108 0 


°B. L. Henke, R. White, and B. Lundberg, J. Appl. Phys. 28, 
98 (1957). 
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Fic. 1. Thick-tar- 
get characteristic x- 
ray yields, Y,, in 
quanta per proton 
sr as functions of 
proton bombarding 
energy. Proton and 
quantum traveled 
equal distances in 
the target material, 


(0-5 ed 


hl Proton Energy (MeV) 1.6 


is a sizable discrepancy in the data for titanium K. In 
the preliminary phases of this work the yields from 
various titanium targets were compared with those from 
tantalum and iron, always with essentially the results 
reported. 

The cross section for x-ray production ¢,(£) is com- 
puted from the yield curve by the familiar formula? 


(E =( : iE A (b 
ox(E)=—| uY, zr 1 
n dE dx 


where 7 is the number of atoms per unit volume and g 
the linear absorption coefficient. The stopping powers, 
dE/dx, are readily available,” but the self-absorption 
coefficients y are not nearly so easy to obtain, particu- 
larly for the Z x rays. The K self-absorption coefficients 
used are based on the Henke, White, and Lundberg 
interpolation formula? and the L coefficients on the 
Norelco extrapolation of the data of Allen and others. 
Even the data itself is open to question. The estimated 
values of u/p used are recorded in the third column of 
the table, and the resulting calculated x-ray production 
cross sections in the fourth column. 


DISCUSSION 


"Theoretical excitation cross sections were derived by 
Bethe and Walske"? on the basis of screened hydrogenic 


0 W. Whaling, Handbuch der Physik, edited by S. Flügge 
(Springer-Verlag, Berlin, 1958), Vol. 34, p. 193. 

LS. J. M. Allen et al., Norelco Reporter, May-June 1962 
(unpublished). 

2 B. A. Cooke, K. A. Pounds, P. C. Russell, and E. A. Steward- 
son, Proc. Phys. Soc. (London) 79, 883 (1962). 

3 M. C. Walske, Phys. Rev. 101, 940 (1956). 
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wave functions. The resulting integrals depend critically 
on the minimum energy which must be transferred to 
the atomic state in question in order to produce excita- 
tion. The effect of screening on this energy is expressed 
by 6, the ratio of the ionization potential of the state 
in question to the theoretical ionization potential in 
the absence of outer screening. Merzbacher and Lewis? 
have computed and plotted excitation cross section as 
a function of proton energy for various values of 0. 
Comparing their computed curves with experimental 
data, they found that the predicted cross sections were 
too large and that the data was consistent with empirical 
values of 0x and 6; of 0.85 and 0.7, respectively, both 
essentially independent of atomic number. Both values 
correspond to effective minimum energy transfers higher 
than the ionization potentials, particularly for the light 
elements. The extrapolation of this result from the 1.5-À 
wavelengths at which it was found to longer wavelengths 
is a doubtful procedure. Some support may be found in 
Observations of effective thresholds for electron excita- 
tion. For example, Dolby? observes effective bombard- 
ing energy thresholds for strong K x-ray production of 
about 2200 and 400 eV from aluminum and carbon, 
respectively. Both correspond roughly to unit values of 
8x. The ionization potential does not appear to be a 
valid effective minimum for rapid energy transfer. 

"The ratios of observed x-ray production cross sections 
to theoretical excitation cross sections obtained from 
the curves for 6x=0.85 and 075—0.7 plotted by Merz- 
bacher and Lewis? are tabulated as yield factors in the 
last column of the table. To the extent that confidence 
can be placed in the empirical values of 0 these factors 
are fluorescence yields. Current estimates of soft 
K-fluorescence yields are classifiable into a low set of 
values based on extrapolation of measurements with 
heavier metal targets!*!* and a considerably higher set 
of values based on data from argon!? and a number of 

“E. H. S. Burhop, J. Phys. Radium 16, 625 (1955). 

(1990) L. Hagedoorn and A. H. Wapstra, Nucl. Phys. 15, 146 


1T. Watanabe, H. W. Schnopper, and F. N. Cirillo, Phys. 
Rev. 127, 2055 (1962). 
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theoretical calculations including recently a screened 
hydrogenic wave function calculation by Callan.!” The 
K yield factors in the table from copper through alumi- 
num fit well slightly below Callan's curve. Very little 
data on L fluorescence yields is available.!5 Since the 
trajectory of a massive particle in a target is inherently 
easier to handle theoretically than that of an electron, 
measurements such as these are a potentially useful way 
of determining fluorescence yields. They would be worth 
repeating with more precise apparatus. 

The yield of aluminum-K x rays at 1.5 MeV corre- 
sponds to a production efficiency of 1.4X 10? quanta 
per joule sr, about the maximum efficiency obtainable. 
On the other hand, 1.5 MeV is about six times the proton 
energy required for maximum production efficiency from 
carbon. The maximum efficiency from carbon is readily 
estimated with the aid of the measured yield and the 
excitation cross section curve to be approximately 10! 
quanta per joule sr. The carbon efficiency is comparable 
to the efficiency attainable with electron bombardment,!? 
but the aluminum efficiency is smaller by a factor of 
four or more. The proton efficiencies can be raised some- 
what by irradiating with protons at grazing incidence. 
Since the resulting radiation is nearly free of bremsstrah- 
lung, it appears that proton excited sources might be 
superior to filtered electron excited sources in a few 
applications. 
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Self-Compensation Limited Conductivity in 
Binary Semiconductors. I. Theory* 


G. MANDEL 
IBM Watson Research Center, Yorktown Heights, New York 
(Received 23 December 1963) 


The techniques of Kroger and Vink and Brebrick are extended to allow a calculation of the minimum 
extent of self-compensation by simple vacancies or interstitial atoms in heavily doped binary semiconductors. 
The resulting equations are applied to a series of compounds, and it is found that the degree of self-compensa- 
tion by singly ionizable vacancies varies from essentially complete in KCI (all but ~10~ of the impurities 
compensated) to practically none in GaAs (only $10-4 of the impurities compensated). The II-VI com- 
pounds occupy an intermediate position with about ~99 and 99.9% self-compensation in CdTe and ZnTe, 
respectively. These theoretical conductivity limitations are not sufficient to account for the experimental 
limitations found in, for example, n-Zn'Te or p-CdS. The above results are extended to include multiply 
ionizable vacancies, the ionization levels of which fall within the bandgap. It is found that essentially 
complete self-compensation by a combination of singly and doubly ionized vacancies will occur in the higher 
bandgap II-VI compounds. As a consequence, for example, the Fermi level in ZnTe cannot be pushed closer 
to the bottom of the conduction band than half the energy separation between the second ionization level 
of the acceptor vacancy and the bottom of the conduction band, Some specific implications of the above 
calculations with respect to CdTe and GaAs are discussed. Finally, certain solubility effects (of impur- 
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ities) related to stoichiometry and the above calculations are discussed. 


I. INTRODUCTION 


T is well known that the introduction of electrically 
active impurities into a semiconductor host crystal 
induces the formation of electrically active ‘‘natural” 
defects (vacancies, interstitial atoms, etc.), which tend 
to at least partially compensate the electrical activity 
of the impurity. This phenomenon has been considered 
by a number of workers, particlarly in a review article 
by Kroger and Vink! and, somewhat more rigorously 
and recently, by Brebrick.?? This self-compensation 
may be analyzed simply in terms of an energy balance 
equation, i.e., energy must be supplied by the crystal 
to produce the “‘excess” concentration of defect centers 
while energy is gained by the crystal by the interaction 
of the defects with the free carriers produced by the 
added impurity centers. Clearly, if the energy of defect 
formation is large compared to the energy gained by 
compensation, very little self-compensation will take 
place. On the other hand, if the energy of defect forma- 
tion is small compared to the energy gained by compen- 
sation, all free carriers will be compensated by the 
formation of defects and only insulating crystals will 
be accessible by equilibrium processes. —— 

The purpose of this paper is the calculation, in terms 
of the energy parameters of the host crystals, of the 
minimum extent of self-compensation by simple vacan- 
cies in a series of binary compounds, i.e., KCl, ZnTe, 
Cd'Te, and GaAs. Qualitatively, we expect to observe 
a trend in this series due to the fact that the more 
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ionic compounds, e.g., KCl, have electronic bandgaps 
which are large compared to their cohesive energies 
while the reverse is true for the more covalent com- 
pounds, e.g, GaAs. Roughly speaking, the energy 
gained by the system upon “recombination” of a free 
carrier at a vacancy is expected to correlate with the 
bandgap, while the energy required to generate a va- 
cancy, i.e., the energy required to remove an atom from 
the lattice, is expected to correlate with the cohesive 
energy. 

The possibility of obtaining appreciable n- and p-type 
conductivity in the [I-VI compounds, e.g., CdTe and 
ZnTe, is a question of increasing technological im- 
portance because of possible application toward an 
injection luminescence device operating in the visible 
region of the spectrum. These compounds appear to be 
an intermediate case requiring somewhat more careful 
analysis. 

In Sec. II, we calculate the degree of self-compensa- 
tion by simple singly ionizable acceptor vacancies in a 
binary semiconductor MN, into which we have intro- 
duced a large concentration of shallow donors, D. (A 
precisely analogous situation will exist for acceptors.) 
This calculation is based essentially upon the equation 
for charge neutrality and Fermi statistics as applied to 
a nondegenerate semiconductor.!-? 

In Sec. III, we apply the resulting equations to the 
series of compounds under consideration and find the 
expected trend in the results. Of particular interest are 
the results for CdTe and ZnTe, which indicate that the 
degree of self-compensation by singly ionizable vacancies 
calculated theoretically is xot sufficient to account for 
the observed conductivity limitations in the II-VI 
compounds, ZnTe being a particularly appropriate 
example. 

In Sec. IV, we generalize the results of Sec. IT to in- 
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clude multiply ionizable vacancies, in particular, doubly 
ionizable vacancies, such as might be expected to be 
important in the II-VI compounds. It is demonstrated 
that multiply ionizable vacancies can be of critical 
importance in determining the degree of self-compensa- 
tion under certain circumstances, these circumstances 
being most probably appropriate to the II-VI com- 
pounds. In particular, under these circumstances, we 
find that the Fermi level cannot be pushed appreciably 
closer to the bottom of the conduction band than half 
the energy separation between the second ionization 
level of a simple doubly ionizable acceptor vacancy and 
the bottom of the conduction band. 

Some specific implications of Secs. II and IV are 
discussed with respect to GaAs and CdTe in Sec. V. In 
particular, it is predicted that the concentration of 
simple vacancies in GaAs must be small, and certain 
experimental evidence supporting this view is cited. 
With respect to CdTe, it is pointed out that the assign- 
ment of a level much below the conduction band to the 
second ionization of a Cd vacancy cannot be correct. 

Finally, in Sec. VI, we discuss certain effects of vacan- 
cies on the solubility of impurities, in view of the above 
calculations. 


Il. SINGLY IONIZABLE VACANCIES 


Let us consider a binary compound semiconductor, 
MN, into which we have introduced a large concentra- 
tion of donors, D, at the temperature T. The pressures 
of the various species in the vapor phase are fixed, par- 
ticularly the pressures of Mq) and Nco), par and py. 
The only defects in the host crystal which we consider 
are simple singly ionizable vacancies, an M atom vac- 
ancy being assumed to be an acceptor while an V atom 
vacancy is assumed to be a donor. For simplicity, we 
will analyze only the case of //eavy doping in which the 
concentration of ionized donors, Dt, is large compared 
to the concentration of ionized JV vacancies, Vyt, or 
holes in the valence band, p. [The calculations can easily 
be extended to include Vyt by simply writing 
(D*--Vy*) in place of D+ throughout. ] 

Tt will be shown in Sec. III that self-compensation 
by interstitial atoms rather than vacancies is essentially 
equivalent and leads to similar results. 

The charge conservation equation, which is sufficient 
to describe the system’? is given by 


D*-n4- Var, (1) 


where v is the concentration of electrons in the conduc- 
tion band and Va, is the concentration of ionized M 
vacancies. The competition between n and Vy in 
maintaining charge neutrality is clearly the question 
at hand. 

The interactions between defects in equilibrium in a 
nondegenerate semiconductor are given by the equa- 
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tions of Fermi statistics 
n=N, exp(Ep— E-/kT) ; (2) 
Vyu-/Vu- gu exp(Er— Ea/RT), (3) 


where WV, is the density of states near the bottom of 
the conduction band (given in terms of the electronic 
effective mass and T), Er is the Fermi level, E, is the 
energy at the bottom of the conduction band, Vj is 
the concentration of neutral M vacancies, Æa is the 
acceptor energy level associated with an M vacancy, 
and gx is a degeneracy factor (generally we assume 
gu=2 for a singly ionizable level). An equivalent 
description is given by the mass action formalism.1-? 


nV y/ NV y-—1/ga exp(Ea—E./kT) , (4) 


where the right-hand side of Eq. (4) is an equilibrium 
constant describing the ionization of an ionized M 
vacancy to yield a neutral M vacancy and an electron 
in the conduction band. 

Combining Eqs. (1) and (4), we obtain immediately 


n/D*-1/14Q, (5) 


Q- guV y/N. exp(E.— E4)/RT — guV u/N., 
Xexp(E,—E4)/kT, (6) 


where E, is the electronic bandgap and £4’ is the ac- 
ceptor level of the M vacancy relative to the valence 
band. 

Equations (5) and (6) are our central result. All the 
quantities in Eq. (6) except Var, the concentration of 
neutral M vacancies," are generally known with suf- 
ficient accuracy (withTthe possible exception of E4' in 
some cases). We note that, except for the temperature, 
T, only V y is under experimental control. Vj; will be 
determined by the pressure of Mq) in the system, pm, 
which can take on a range of values. The connection 
between Va; and fy is written! 


PuVu/Nau-—expASa/kexp(—AHy/kT), — (7) 


where JV y is the concentration of M sites in the lattice, 
and where the right-hand side of Eq. (7) is an equilib- 
rium constant referring to the process in which an atom 
of M is removed from the MN lattice and placed into 
the vapor, the pressure of M(, being maintained at 
1 atm (standard state), leaving behind an M vacancy 
in the lattice. The quantities ASj; and AH y are defined 
as the entropy and enthalpy of M vacancy formation, 
respectively. Note that we have not assumed that Mco) 
is the only species in the vapor phase but are simply 
using $a as a convenient measure of the chemical 
potential of M in the system. 

Inasmuch as we wish to calculate the minimum extent 
of self-compensation, we must estimate the minimum 
value of Vy and, therefore, the maximum value of pm: 
Clearly par must always be less than the value associated 
with the presence of a pure condensed phase of M 
(generally Mq in cases of interest). For temperatures 
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several hundred degrees centigrade below the maximum 
melting point of MN, or lower, the maximum achievable 
value of par is fairly well approximated by that associ- 
ated with M, pau, for most cases of interest. We 
will confine our calculations to such temperatures that 
pu=pm** and simply point out that the results will 
represent a lower bound to the degree of self-compensa- 
tion at other temperatures where fs « pu, e.g., the 
maximum melting point. We write, 


pu?—expASyavP/b exp(— AH yvP/RT), — (8) 
and obtain, finally, from Eqs. (6)- (8), 


kTinQrinz E,— EA = AH y+ AH yy¥*? 
+ T(ASm = AS yp) --RT In (gu N w/ N.) 7 (9) 


Thus, with Eqs. (5) and (9) we can calculate the 
maximum relative concentration of free carriers that we 
can introduce into MAN qs) at the temperature T. If T 
is sufficiently large, these carriers will be equal in 
concentration to the “uncompensated” donor impuri- 
ties present. If the temperature at which we measure 
the conductivity is much less than 7, some of these 
carriers may “freeze out.” The extent of this ‘‘freeze- 
out" is determined largely by the energy level associated 
with the donor impurity (for the case of singly ionizable 
vacancies only). We will not be concerned with this 
"impurity limitation" on the accessable carrier con- 
centration and conductivity but merely state that Eqs. 
(5) and (9) allow us to calculate the maximum relative 
concentration of uncompensated shallow impurities 
(virtually all ionized at temperature 7) that can be 
introduced into MN qs). 


III. CALCULATION OF RESULTS; KCl, 
ZnTe, CdTe, AND GaAs 


We now apply Eqs. (5) and (9) to p-KCl, n-ZnTe, 
n- and p-CdTe, and n-GaAs. The energy bandgaps, 
E,, are available in the literature.*-? The energy levels 
of the compensating vacancies, E4' are available for 
p-KCl (F center), n-ZnTe," n-CdTe,* and 5-CdTe.*/* 
No data are available on E4' in GaAs and we will 
simply assume E4'— 0 in this case to put an upper bound 
on the extent of self-compensation. 

The enthalpy of compensating vacancy formation, 
AH y, is available as such only for n-CdTe. The en- 
thalpy of formation of interstitial Cd atoms in p-CdTe 
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is reported,* however, according to the equation 
Tea! N capca= exp— ASca!/k expAHfca!/RT , (10) 


where Ica is the concentration of interstitial Cd atoms. 
It is, in practice, impossible to distinguish between 
interstitial Cd atoms and Te vacancies by electrical 
measurements.! If we reinterpret the data in terms of 
Te vacancies and use Eq. (7) (with M=Te) in place of 
Eq. (10), it is easy to demonstrate that the “equivalent” 
enthalpy of Te vacancy formation is given as 


Allq.— AH cate— AH ea! ; (11) 


where AZfcar, is the standard molar enthalpy change 
for the reaction 


CdTe = Cd ;4- Tec. (12) 


AHcate is essentially twice the cohesive energy per 
gram-atom of CdTe(,; and is available in the litera- 
ture. On the basis of such reinterpretation, we have 
obtained AZ, for the calculation appropriate to 
p-CdTe. It is apparent that, for our purposes, no signi- 
ficant difference exists between interstitial atoms and 
vacancies and that the available data can always be 
interpreted without error in terms of vacancies, as 
above. 

The enthalpy of vacancy (F center) formation in 
KCl has not been reported. Data are available, however, 
as to the concentration of F centers as a function of 
potassium pressure, px, at one temperature (697°C). 
In particular, the concentration of F centers is reported 
to be proportional to px°-*®, rather than the expected 
linear dependence. We may handle these data in several 
ways, with slight variation in the calculated results. 
Firstly, we arrive at the minimum value of Vci by 
extrapolating the reported data to a value of px 
consistent with the highest possible value of fci 
(pc), an extrapolation of perhaps a factor of 10”, 
and use Eq. (6). Secondly, we may assume a linear de- 
pendence and carry out the same extrapolation with a 
resultant difference of a factor of 10? (fewer vacancies). 
Finally, in a fashion equivalent to the linear extrapola- 
tion, we may estimate the standard molar entropy 
change appropriate to the removal of a Cl atom to the 
vapor as half the standard molar entropy change of the 


reaction. 
KCl) = Key + Clo), (13) 


plus a small contribution due to*the presence of the 
vacancies,*?!5 which we somewhat arbitrarily estimate 
as ~4 e.u.!® On the basis of our estimate of the total 
entropy change, we use Eq. (7) and the data mentioned 
above to calculate AHc;. Use of this calculated value in 
Eq. (9) leads, as expected, to essentially the same result 
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as the linear extrapolation, and is probably the most 
reliable method. 

No thermodynamic data on vacancy formation is 
available for either ZnTe or GaAs. We may, however, 
follow Swalin'® and write 


AH y= AH ux — E2 E.on E, , (14) 


where AZ yx is the standard molar enthalpy change for 
the reaction 


MN ww =M +N (t. (15) 


Eon is the cohesive energy per gram-atom and Æ, is a 
“relaxation” energy associated with the vacancy. Eq. 
(14) is based on the assumption of constancy of bond 
energy and may be expected to apply to covalent 
crystals. An equation of the same form can actually 
also be derived for ionic crystals." If we put the data 
for - Cd Te? into the form of Eq. (14), we find E,/AH yx 
—0.24. Similarly, for Ge, E,/AH yx = E;/2 Ec 7 0.19. 
We will assume that Eq. (14) applies to both Zn Te and 
GaAs. Further, we will assume that E,/AH yx has the 
same value for ZnTe as for CdTe. For GaAs, we will 
assume a value of E,/AH yy midway between that of 
CdTe and Ge, i.e., E/AH yxy—0.21, and use the data 
of AH Gaas.!8 

The enthalpies and entropies of vaporization [Eq. 
(8)], are available for all the elements involved." 

For each case, we take the temperature, T, to be 
several hundred degrees centigrade below the maximum 
melting point of the host crystal in order that the 
maximum value of py, be reasonably accurately de- 
scribed by Du. 

The entropies of vacancy formation, AS;r, are gener- 
ally unknown. The over-all entropy term in Eq. (9), 
(ASy;—AS3,“*”), is clearly the standard molar entropy 
change that occurs upon transfer of one mole of M atoms 
from MN (4) to M (5, leaving M vacancies in the crystal. 
This must be given by approximately minus half the 
standard molar entropy of formation of MN.) from 
the solid elements (~2 e.u.) plus the standard molar 
entropy of fusion of Mgs) (~3 e.u. for most elements) 
plus a contribution due to the “excess” entropy of the 
vacancies”?! (which is ~4 e.u.).5 We therefore take 
(AS — AS?) as ~9 e.u. in each case. 

The degeneracy factor, gr, is taken as 2 for all cases 
and the concentration of available M sites, Na, is 
calculated from the densities of the host crystals, which 
are easily available. < 

The density of states, N., is calculated from the tem- 
perature, T, and the appropriate effective mass, m*, 
which is available for m-CdTe,9295 p-CdTe® (with 
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some uncertainty) and n-GaAs.” We will take m* for 
n-ZnTe as identical to that of n-CdTe and that for 
p-KCl as equal to the free-electron mass. 

l'inally, the only implicit temperature dependence in 
Eq. (9) that we consider is that of the bandgap, E,. 
Estimates of this dependence are available for CdTe’.2 
and CdAs.? The value for ZnTe is interpolated from 
data for Cd Te, CdSe, and ZnSe.” The value of (9E,/9T) 
for KCI is arbitrarily estimated as — 107? eV/?C. 

The inputs for Eq. (9) are given in Table I, excluding 
(ASar— AS) =9 €.u.=3.9X 107* eV/?C and gy 2. 

The results of the calculation based on Eq. (9) and 
the data in Table I are presented in Table II, along with 
data on £, and Feon. It is clear from the above discussion 
relating to the data in Table I that the results in the 
second and third columns in Table II are reliable only 
as to order of magnitude. Nevertheless, we may draw 
certain firm conclusions from these results: 

1. There does exist a self-compensation “boundary” 
such that certain compounds, falling beyond the 
“boundary,” cannot be doped by any equilibrium pro- 
cess to have appreciable electronic conductivity, e.g., 
the alkali halides. A rough rule of thumb for determining 
the position of the self-compensation “boundary” is 
given by the ratio of the bandgap, E,, to the cohesive 
energy per gram-atom, Eeon; essentially complete self- 
compensation occurring for values of this ratio much 
above unity while little self-compensation occurs for 
values below 0.5. The expected trend from ionic to 
covalent compounds is observed. 

2. The II-VI compounds constitute a class in which 
considerable self-compensation may be expected to 
occur (£5/ Ez 71). Nevertheless, the calculated degree 
of self-compensation by singly ionizable vacancies for 
these compounds is mo sufficient to explain the well- 
known difficulty of obtaining, for example, appreciable 
conductivity in z-ZnTe or p-CdS. Similar results are 
expected for compensation by simple singly ionizable 
interstitial atoms [see discussion of data in Table I for 
p-CdTe and Eq. (14)]. We conclude, therefore, that 
some other limitation on the accessible carrier concen- 
tration and conductivity in II-VI compounds must be 
involved. There may, of course, be impurity limitations 
related to solubility or position of energy levels within 
the electronic bandgap. There may also be other 
"natural defects" involved, e.g., impurity-vacancy 
pairs, etc. A particular possibility that we will consider 
in more detail is the presence of doubly ionizable va- 
cancies, of which both ionization levels fall within the 
electronic bandgap. 


IV. MULTIPLY IONIZABLE VACANCIES 


In the general case, we designate the concentration 
of M vacancies, which have been ionized to a charge of 
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: 'TABLE I. 1 iui for Eq. oa 
i f(T) E AH xy AH (P 
COTS EGS, (eV) (eV) (cV)  (298°K) (eV) Nu (emo?) XN, (cm) 
p-KCl 970 809 240 6.20 130^  138X10?  L50x109» 
n-ZnTe 1500 1 361e. 121m 1835€ 10% 15 
n-CdTe 1250 1.02f 1.077» 1.55% 10” 7.76% 1018 í 
p-CdTe 1250 1.02! 1.86m 1.55% 102 3.495€ 1019 14.1 
Nr n-GaAs 1250 URE 2.76" 1 OVX 108 4.12% 1075 
a See text in Sec. III. ee DNA i NENNEN C 
foe IM above the critical temperature of Cla so that Clas) is the condensed phase of interest rather than Clic; We will neglect the small effect of this 
* See Ref. 4. e e i See Ref. 11. k See m Se 
dSee Refs, 5-7, feces  bSee Rel 10, ISe Refs 102. TSE REL a DEDE 
Taere II. Results from Eq. (9). 
n \ max pN 
> ( T or ES. I5 (298°K) Allyn =2 E 
Compound Qi min D* AT) calc (eV) (eV) Ey / Ect 
?-KCl 2.12X105. 1.71X 107? 8.708 EU E 
n-ZnTe 3.09X 10? 3.22K 107? 2.20" 4.51^ 0.97 
n-CdTe 3.23X10 3.00 10% 1.50* 4.20* 0.72 
p-CdTe 8.70 10? 1.15x 1073 1.50* 4,20° 0.71 
n-GaAs 1.70 107? 0.998 1.431 7.35t 0.39 
CdS 0.94 
ZnS 1.22 
a See Ref. 4. * See Ref. 8. * See Refs, 12 and 13. EIN ie p 
b See Refs. 5-7. d See Ref. 9, f Sce Refs. 16 and 13. 


—i as Var and follow Sec. II in writing a charge 
neutrality equation, 


N 
n+ iV =D", (16) 


i=] 


where N is the highest charge state to fall within the 

bandgap. We also write an equilibrium constant for 

ionization of a vacancy in the ith charge state, 
nV y H/N V w= gia/giexp(Ea;—Ec/ RT), (17) 


where g; is a degeneracy factor and E4; is an energy 


‘level describing the ith charge state of the vacancy. As 


before, E, is the energy at the bottom of the conduction 
band. Tt follows from use of Eq. (17) in Eq. (16) that 
Eq. (5) still applies, with Q given by 


0-0. [1:66 AX) 


i=2 


xepl(/ENE (E—Ea)]f, (8) 


where 


Qi nVa/N.e exp(E,— Eaj)/kT , (19) 


as in Eq. (6). : 4 
For the II-VI compounds, it is believed that IN 


and we write, for these compounds, 


Q- Q.(1-- Qg/g) (/N9 exp — Ea,/kT)}. Q0) 


Certain general features of Eqs. (18) and (20) should 
be noted. We see that the effect of the second ionization 
level of the vacancy only becomes significant in Eq. (20) 
if the doping level is such that the Fermi level rises above 
the second ionization energy level [see Eq. (2) ]. If we 
attempt to dope the host crystal so heavily that the 
Fermi level does rise above the second ionization level 
of the acceptor vacancy, we see from Eqs. (1) and (20) 
that the situation is soon reached in which 


n/N — exp(Er— E)/RT = (gD*/2g4QiN ,) ? 


Xexp(£aA,—E./2kT). (21) 


It is clear that Q1 for the higher bandgap II-VI com- 
pounds (see Table II). Further (g:D*/2g,0.N ) ^ S 1. 
It follows that the Fermi level in the II-VI compounds 
cannot be pushed closer to the bottom of the conduction 
band than half the energy separation between the 
second ionization level of the acceptor vacancy and the 
bottom of the conduction band. In other words, shallow 
impurities will be completely compensated by a combi- 
nation of singly and doubly ionized vacancies in a 
II-VI compound if the second ionization level of the 
compensating vacancy falls within the bandgap. 

Even if the Fermi level does not rise above the 
second ionization level of the acceptor vacancies, these 
levels will still contribute to the temperature dependence 
of the carrier concentration. If the donor impurities 
are sufficiently shallow, the position of these second 
ionization levels will entirely determine this tempera- 
ture dependence. However, as long as Qr»1, “freeze- 
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out” of carriers onto these levels cannot push the Fermi 
level higher than the maximum value discussed above. 
Complete compensation of impurities by a combination 
of singly and doubly ionized vacancies will then always 
occur as long as the energy level of the added donor 
impurities is shallower (closer to the conduction band) 
than the second ionization level of the vacancies. If the 
energy level of the donor impurity is sufficiently deep, 
the donors will not be completely compensated, but 
cannot then contribute appreciable conductivity. 


V. SPECIFIC IMPLICATIONS: GaAs, CdTe 


The magnitude of the concentrations of vacancies in 
GaAs is of interest because it has been suggested that 
the concentration of some natural defects can be quite 
high (~10!°/cm!).?5 The estimated value of the energy 
of vacancy formation used in Sec. III, however, leads 
to the conclusion that the total concentration of simple 
vacancies, neutral or ionized, must be low. In particular, 
the maximum concentration of neutral Ga vacancies 
(under the maximum pressure of As) is calculated to be 
7» 104 /cm*. If the Ga vacancies were sallow acceptors, 
the total concentration of Ga vacancies could hardly 
exceed 10!5/cm? in a crystal of GaAs grown from an 
As-rich melt and doped to be degenerately n-type (at 
the growth or equilibration temperature). This view is 
supported by the well-known fact that GaAs can be 
prepared either z- or p-type at the 10!7/cm? level 
without appreciable compensation. Thus under *nor- 
mal" preparation conditions, the concentrations of 
electrically active natural defects must be low, i.e., 
S 105/cm*. 

As regards CdTe, it has been suggested that the 
second ionization level of the acceptor Cd vacancy is 
0.6 eV below the conduction band.*-” It is, however, well 
known that CdTe can be doped degenerately z-type.520 
From the results of Sec. IV, it is clear that this contra- 
dicts the assignment of the 0.6-eV level to the second 
lonization level of the isolated Cd vacancy. This con- 
clusion does not depend on the results of Sec. III but 
simply requires that Q; not be much smaller than unity. 
The experimental observation that substantial self- 
compensation is observed in n-CdTe® demonstrates that 
this requirement is met. There is, in fact, some support 
for the estimated energy of vacancy formation in CdTe 
used in Sec. III in that large concentrations of singly 
ionized Cd vacancies can be introduced into p-CdTe.” 

It would thus appear that the 0.6-eV level must be 
assigned to some other center, possibly some complex 
involving a doubly ionized Cd vacancy.? The second 
ionization level of the isolated Cd vacancy in CdTe 
must, in fact, be much closer to the conduction band 

and may be the level recently observed by Lorenz 
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et a1.525 in CdTe and some other II-VI compounds (all 
of which can be doped heavily z type). The exclusion of 
ZnTe (which apparently cannot be doped z type) from 
this group is significant. 


VI. SOLUBILITY EFFECTS 


Inasmuch as an impurity must reside on either an M 
site or an JV site in MN.) we expect that the solubility 
of impurities in binary compounds must, in general, be a 
function of the stoichiometry of the system, i.e., the 
concentration of vacancies. In Sec. II, we have a situa- 
tion in which the concentration of added donor impuri- 
ties in MN.) is held fixed. Under these circumstances, 
it was pointed out, the maximum relative amount of 
uncompensated donor impurities will occur in a system 
in which the concentration of compensating acceptor 
vacancies is a minimum. What is generally held constant 
in the laboratory, however, is, to a first approximation, 
the thermodynamic activity of the added donor impuri- 
ties. As a consequence of the dependence of the solu- 
bility of the impurities on the vacancy concentration, 
it is not necessarily true, then, that the maximum total 
concentration of uncompensated impurities will occur 
in a system in which the concentration of acceptor 
vacancies is a minimum. 

For, example, let us consider a compound semicon- 
ductor MN gs), which is brought into equilibrium at the 
temperature T with a total constant activity of added 
donor impurities, i.e., the partial pressure of atomic D; 
is held constant at a value pp. These impurities are 
presumed to reside on M sites in the host crystal for 
this system. The pressure of M (;; is again designated 
par and we assume, as in Sec. II, that only singly ioniz- 
able acceptor M vacancies need be taken into account. 

The concentration of neutral donors, D, in the host 
crystal is determined by the equilibrium constant.'? 


D/poVu=K(T), (22) 


which refers to the process in which an atom of Dig) 
“fills” an M vacancy to yield a neutral donor impurity 
in the lattice. 

The concentration of ionized impurity atoms in the 
crystal, D*, is determined by the equilibrium constant 
for the ionization of a neutral impurity to yield a free 
electron. 


nD*/N.D—(1/gp) exp(Ep—E./kT), (23) 


where gp is a degeneracy factor and Ep is the energy 
level associated with the donor impurity. 
It follows, from Eqs. (5) and (23), that 


w=[1/(1+Q) \(N-D/gp) exp(Ep—E.)/kT). (24) 


At constant pp and T, D is proportional to Vir ac- 

cording to Eq. (22). On the other hand, according to 
“4M. R. Lorenz and H. H. Wood . Rev. Letters 10, 

215 (1963). z an oodbury, Phys. Rev. Letters 


23 M. R. Lorenz, M. Aven, and H. H. Woodb Phys. Rev. 
132, 143 (1963). > oe a aaa 
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Eq. (6), Q is also proportional to Vj; at constant T. We 
see, therefore, that the total concentration of free 
carriers, 7, is a maximum when Vy is a maximum, 
despite the fact that the relative amount of uncompen- 
sated donors, 1/ D*, is a minimum under the same condi- 
tions. When Q>>1, as in the II-VI compounds (see Sec. 
III), the zotal concentration of free carriers is essentially 
independent of V yy. Thus, as p» is varied in these sys- 
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tems, only the concentration of compensated donors 
varies. 

It should be noted that the situation is entirely dif- 
ferent if the donor impurities reside on V sites rather 
than M sites. Under such circumstances, the solubility 
of the impurities is maximized and the degree of self- 
compensation minimized when V, js minimized, i.e., 
when pm is maximized. 
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Theory of the Optical and Magnetic Properties of the Self-Trapped 
Hole in Lithium Fluoride* 
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Using a semiphenomenological method, the energy and wave functions of a self-trapped hole (V, center) 
in Lif are obtained as a function of the separation between the two F~ ions at which the hole is assumed 
trapped. The lattice distortion energy due to the changes in Madelung, repulsive, and polarization energies 
is calculated as a function of the totally symmetric displacement of the two participating F~ ions and six 
positive ions adjacent to the F- ions. This lattice energy is combined with the calculated energy for the 
F;- molecule to obtain the total energy as a function of the distance between the participating F` ions for 
both the symmetric (2,) and antisymmetric (~,,) states of the hole on the Vg center, Only the energy curve 
for the ground (Xu) state exhibits a minimum in the expected region of F~-ion separation. From the resulting 
configurational coordinate curves, the optical absorption energy and width are computed and found to be 
in order-of-magnitude agreement with experiment. Computed values of the experimentally known isotropic 
and anisotropic hyperfine constants are used to assess the validity of our molecular wave functions, which 


were obtained in a one-electron approximation. 


I. INTRODUCTION 


N recent years, a fairly detailed understanding of the 

F center in alkali halides has been achieved by con- 
certed theoretical and experimental studies of its elec- 
tronic structure.! The F center consists of an electron 
bound to a negative-ion vacancy, and is the simplest 
of several “electron” color centers such as the M, R, 
and F’ centers? A series of “hole” color centers also 
exists (V1,H,V.); these are characterized by optical 
absorption bands lying at somewhat higher energies 
than those of electron centers, are generally stable only 
at low temperatures, and must be formed by high-energy 
irradiation. The best understood of these, a center 
which consists of a self-trapped hole, has been studied 


i i i i din 
* Supported in part by the National Science Foundation an 
part p Grant’ (62-145) from the U. S. Air Force Office of 


Scientific Research. 3 
1 See, ee onan the review article by B. S. Gourary and 


? $ 3 x d D. 
F. J. Adrian in Solid State Physics edited by F. Seitz an 
dH (Academic Press Inc., New York, 1960), Vol. 10. as 
2See, for example, J. H. Schulman and W. D. CoA olor 
Centers in Solids (Pergamon Press, Inc., New York, 1963). 


elaborately by Castner and Kinzig.’ Electron resonance 
data have conclusively shown that this center is not 
associated with a vacancy but instead resembles a 
negative halogen molecule-ion with a hole shared be- 
tween two adjacent negative ions. Two previous theo- 
retical attempts'® have been made to explain the 
stability of this self-trapped holes. Yamashita* tried to 
calculate the energy of the hole in KCI as a function 
of the displacement of the CI- ions which trap it. He 
could not obtain a minimum in the energy as a function 
of this displacement but suggested that the repulsion 
between the core electrons of the«Cl- ions, which he 
had neglected, might provide a minimum in the energy 
curve at half the usual distance between the ions in the 
crystal. A subsequent attempt by Nettel® gave the 
result that the energy of a hole trapped as a Clz- ion 
in the crystal had higher energy than a chlorine atom 


3T. G. Castner and W. Kanzig, Phys. Chem. Solids 3, 178 
(1957); Nuovo Cimento Suppl. 7, 612 (1958). 

4J. Yamashita, work at the University of Illinois, 1958 
(unpublished). a 

5S. J. Nettel, Phys. Rev. 121, 425 (1961). 
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at a cubic site. It appears that Nettel had not employed 
adequate variational functions for the hole to take 
account of the relatively strong distortion of the wave 
function of one of the trapping ions by the other. 

In the present paper we attempt to obtain the energy 
of the LiF crystal containing a hole relative to that of 
the perfect crystal. The hole is assumed to be shared 
between two F- ions so that we have in effect a Fy 
molecule. The energies of the three c electrons are 
calculated as a function of the distance between the 
ions in the two cases corresponding to the unpaired 
electron being in a symmetric or antisymmetric state. 
For each distance between the ions, the repulsive, 
Madelung, and polarization energies are also calculated 
allowing for relaxation and electronic polarization of 
six positive ions in the plane of the F;- molecule, which 
are the nearest neighbors to the two F- ions forming 
the molecule. The minimum in the total energy curve 
plotted as a function of the distance between the 
fluorine nuclei of the Fy molecule then gives the equi- 
librium configuration of the crystal. 

In Sec. IT, the various contributions to the energy are 
discussed qualitatively. In Sec. III, the calculation of 
the lattice energy change is described and in Sec. IV, 
the computations for obtaining the electronic energies 

and wave function of the Fy molecule are described. 
In Sec. V, the configuration coordinate curves for the 
crystal are presented and the optical absorption energy 
and width and hyperfine constants are calculated and 
compared with experiment. The nature of the agree- 
ment between theoretical and experimental results is 
then analyzed. In the last section, limitations of the 
present work are discussed and suggestions for improve- 
ment are listed. 


II. ANALYSIS OF THE VARIOUS TERMS IN 
THE ENERGY OF THE CRYSTAL 


Our calculation is based on the variation principle. 
The energy of the crystal containing the hole is calcu- 
lated as a function of several variation parameters. 
Some of these parameters may be regarded as equiva- 
lent to the configuration coordinates in calculations on 
luminescent centers, while others are variation pa- 
rameters that occur in the electronic wave functions. 
With respect to the former, our calculation resembles 
that of Williams, with at least one major exception. 
He was able to use experimental values of the excitation 
energy of free 'T]* ion in constructing his excited state 
configurational coordinate curves, whereas we must rely 
‘on computations of this quantity. 

Before we describe our variation parameters in detail, 


we will list and explain the various terms in the total 


crystal energy Æ when a hole is present 


: : EE Est (ame?/a)+EetAEy+AEr+AEp. (1) 


_E. Williams, J. Chem. Phys. 19, 457 (1951). 


"d 


pk 
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These terms may be understood in terms of a sequence 
of events occurring during the removal of an electron 
from the crystal. Ey is the energy of the perfect crystal; 
the next two terms represent the energy required to 
remove an electron from an F- ion and place it at in- 
finity; Ea is the electron affinity (work done in taking 
the electron from F~ itself) and ayre?/a is the Madelung 
energy (work done taking the electron out of the electro- 
static potential of the crystal). Here a is the nearest- 
neighbor distance and ay, the corresponding Madelung 
constant. Up to this point we have regarded the lattice 
and remaining electrons as stationary; now let us as- 
sume that the hole becomes shared with a nearest 
neighboring F- ion so that a molecule Fy is formed 
(see ions A and B in Fig. 1). This will “require” an 
energy Er, a quantity which is negative and which 
will certainly depend on R, the nuclear separation of 
A and B. The redistribution of charge on the two F- 
ions, as well as their relaxation toward each other, will 
cause a further change in electrostatic energy of the 
crystal which is included in AZy. In this term is also 
included the change of electrostatic energy due to the 
relaxation of neighboring positive ions which will now 
occur as explained in detail below. Finally, AZ and 
AEp are the analogous changes in repulsive and po- 
larization energies, respectively, occasioned by the re- 
distribution of charge and motion of the ions. 

Of the six terms in (1) which represent the change in 
the energy of the crystal due to the creation and capture 
of a hole, the last three and a,e2/a may be considered as 
constituting a change in lattice energy AE,=AEm 
-EAEp-- AE gJ- (aye?/a). These three contributions are 
calculated as a function of the fractional displacements 
(in terms of nearest-neighbor distance in the lattice) 
x, y, and z of the ions of types A, C, and E shown in 
Fig. 1. The type-A ions are the two F~ ions (A,B) which 
share the hole, type C are the two Li* ions (C,D) which 
are the nearest neighbors of 4 and B. The type E ions 
are the four Lit ions (E,F,G,H) which are the nearest 
neighbors of the A and B ions. They lie on the same 
plane as the A- and C-type ions. The relaxation of the 
four Lit ions, which are nearest neighbors of the A an 
B ions but which do not lie on the plane of the A- and 
C-type ions, is neglected. Because of the geometry of 


a 


ee a 


— 
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the location of these ions, they are an order of magni- 
tude less effective in influencing the lattice energy than 
the ions C, D, E, F, G, and H. In calculating the re- 
pulsive energy change AE, only nearest-neighbor inter- 
actions are considered. The polarization energy of the 
rest of the lattice as well as of the ions A,- F is 
calculated following Mott and Littleton’s procedure.’ 
The net lattice energy is thus obtained as a function 
AEz (x,y,z). Since x represents the major relaxation, 
AE, (x,y,z) is minimized with respect to y and z to 
obtain AF, (x,yozo) which is a function of x alone. The 
details and results of the lattice energy calculation are 
presented in Sec. IIT. 

The terms £4 and Ex represent the electronic energy 
changes associated with the creation and capture of the 
hole. The electron-affinity E4 of the F- ion is taken as 
0.363 eV, from Hartree-Fock calculations? on the F- 
ion. The Fy binding energy £z is obtained by calculat- 
ing the energy of three c electrons in the molecule (two 
electrons in e, state and one in o, when the hole is in 
the antisymmetric state E,, and two electrons in c, 
state and one in c, state for the hole in the symmetric 
state Z,) and taking the difference between this energy 
and the energy of three 25 electrons on the F- ion. The 
details of the calculation of Eg as a function of the dis- 
tance between A and B, that is, Zz(x), as well as the 
potential and the variational form of the wave function 
employed will be discussed in Sec. IV. Ezg(x) is then 
combined with AZr(x) to obtain the variation of the 
total energy of the crystal with x; the position of its 
minimum gives the equilibrium configuration of the 
crystal. It will be seen later that a minimum is found 
only when the hole is in the Z, and not in the state X,. 


IIl. CALCULATION OF LATTICE ENERGY CHANGES 


We shall first consider the change in the Madelung 
energy of the lattice due to the presence of the hole. 
For this calculation, we shall assume that the hole is 
equally divided between the two F- ions. This is a 
fairly good approximation if the bonding between 4 
and B is not too strong so that there is not much 
migration of charge to the region between the two ions. 
The general expression for the difference in Madelung 
energies of the crystal between the two situations in 
which the ions 4, B, C, ---, H are displaced, Ew (7,92), 
and undisplaced, Ew (0,0,0), can be shown to be given by 


Ey (2,3) Es Eu (0,0,0) 
H x 1 1 
=! Ss ni (—- 


L2A MzA,--*. H TLMo T LoMo 
M-1 H 1 1 " 
qa Sy AE mos . Q) 
L=A M=A TLM TLoMo 


? N. F. Mott and M. J. Littleton, Trans. Faraday Soc. 34 485, 


1938). i 
í : e Froese, Proc. Cambridge Phil. Soc. 53, 206 (1957). 


SELF-TRAPPED HOLE IN LiF ATOSI 
Here €; and £r are the charges on the lattice points L 
and M, and Lo and L represent, respectively, the un- 
displaced and displaced positions of the point L. Thus, 
for example, rrar, represents the distance between the 
displaced position of the point L and undisplaced 
position of M. If we are interested in an expansion of 
(2) in powers of x, y, and z correct to second order, we 
may take 


v» Sibu = fil 
T (3) 
MAL Tit, MALT host, 


provided that all € are equal to +e and are situated 
on a regular cubic lattice. Equation (3) is a result of 
the fact that in regular cubic crystals, the next multi- 
pole interaction between ions after the monopole corres- 
ponds to /=4, Hence only quartic and higher terms in 
x, y, or z are to be expected from the summation in (3). 
It follows that 


o Coba 2 CROACIE SUM 


—- Y — yp ae 


, 
MyA--H Tp, MALT Lost Gym TOE 


which may be used to simplify Eq. (2) to the following 
form: 
Ey (x,y,z) Fe Ey (0,0,0) 
1 Hv 1 2 1 
mem DU UMD stu —-—4—). (5) 


2M=AL=A TLM TLMy) fLsM, 
MAL : pei 


If the charges on the 4 and B ions are —ge and the 
charges on the rest of the lattice points are the same as 
in the perfect lattice, then the above calculation must 
be modified in an obvious way, and one obtains 


Eu (2,y,2) zen Eu { 0,0,0) 


Mek i 1 1 1 1 
-E E itu +) 


L-A M=A TLM YioMo TLMy Tio 


1 1 
+2(¢-9(—-—) 


TABy TAgBy 


H 1 1 
-24-95 ( -—), (6) 


L=C\PLAg  TLoAg 


In the present case we are assuming that g=4. Also, 
(ris +e for a positive ion and —e for a negative ion. 
With the help of Fig. 1, one can calculate the various 
distances involved in Eq. (6) correct to terms quadratic 
in x, y, and z. Thus, for example, 


1 1 1 1 v2 


TAE TAgEg TAE} TAE a 


Equation (6) includes the repulsive energv between 
charges —e/2 on the ions 4 and B. This is a spurious 
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term introduced by our assumption that the hole is 
equally distributed between the two F^ ions, and must 
be excluded because all such intramolecular interactions 
are contained in Ez. We therefore obtain the following 


expression for Ay: 


AE (x,y,z) x Eu (x,y,z) TD Ey (0,0,0) 
—pe/AVZa(1—V2x)]. (8) 


Using equations like (7), Eqs. (6) and (8) may be ex- 
panded in powers of x, y, and z, and after a lengthy 
calculation one obtains 


AEy — (&/a)(—0.177 —0.500x— 0.3542? 
— 1.4145 -- 1.207 y?4-2.3582-1- 3.4267? 
+3.000xy+2.500zx— 1.7072). (9) 


It should be emphasized that this equation has been 
derived for a particular (fcc) lattice. 

Next, we consider AEg. In keeping with our earlier 
assumptions for the Madelung energy change we shall 
not consider the repulsion between the ions A and B. 
"This latter repulsive energy, when it becomes appreci- 
able, will be included in the electronic energy terms. 
Also, we shall consider repulsion between nearest neigh- 
bors alone. There are several choices that one can make 
for the repulsive interaction energy between the positive 
and negative ions. One possible choice is the Born- 
Mayer potential? which has the form 


V (r)— Be-rl^, (10) 


with B and p obtained from the observed compressi- 
bility and lattice constant of the crystal. In the deter- 
mination of B and p in the Born-Mayer formula from 
the perfect crystal, Van der Waals forces were neglected, 
i.e., their effect is assumed to be adequately included in 
the exponential parameters. Therefore in using this 
formula for the imperfect crystal, we should ignore 
Van der Waals forces for the sake of consistency. 
Huggins and Mayer" have also obtained B and p from 
data on the perfect crystal, but including Van der 
Waals forces separately. We shall also make use of these 
constants to test the sensitiveness of the calculated 
lattice energy to the choice of force constants. Another 
form for the repulsive energy has been proposed by 
Pauling," namely 


V (r) «0.0485e7,1/7^ , (11) 


where ro is the sumxof the ionic radii of the positive and 
negative ions and z depends on the nature of the posi- 
tive and negative ion pair. For LiF one can take n=6 
so that for the repulsive potential between Lit and F- 


9M. Born and J. E. Mayer, Z. Physik. 75, 1 (1932); we will use 
the simple theory given by M. Born and K. Huang, Dynamics of 
Crystal Lattices (Oxford University Press, New York, 1955), p. 26. 
1 M. L. Huggins and J. E. Mayer, J. Chem. Phys. 1, 643 (1933). 
uL, Pauling, Proc. Natl. Acad. Sci. India Sec. A, Pt. I 25, 

E an . See also Milne and Cubiccoti, J. Chem. Phys. 29, 846 
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jon we can rewrite (11) as 
V()- Aen, (12) 


where A=0.0485r0°. Using the form in Eq. (10) for 
the repulsive energy, the expression for AE r(x,y,z) is 
given by Eq. (13): 


a xy 


AEg(2,y,2) - 4Be {= (22-6297) 
p 2 


a 
r ia a voa) | ] x18) 
p 


In obtaining Eq. (13), the energy differences between a 
pair of ions L and M is calculated using the relation 


AEn(L,M)- Ble l0— eMe], (14) 


where rrj; and 7,1, have the same meaning as in Eq. 
(6). The distances rz; between the displaced ions and 
their nearest neighbors are calculated up to quadratic 
terms in xv, y, and z as before. 

Using the Pauling expression (12) for the force law, 
Eq. (15) is obtained for the change in the repulsive 
energy as a function of x, y, 2: 


AEg(x,y,z) = (A&/a*)[602?-- 60y?-+ 1202? 
—96xy—84V2xz]. (15) 


In Table I are tabulated the constants B and p both 
in the Born-Mayer and Huggins-Mayer approximations 
as well as the value of A in the Pauling approximation. 
In obtaining A, a value of ro— 1.95 A was used in Eq. 
(12), this value of ro being the sum of the ionic radii 
of Li* and F- ions. The lattice constant a for LiF is 
2.01 A, only slightly different from ro. On introducing 
the values of B, p, and A in Eqs. (14) and (15), we get 
the following equations for AE (x,y,z) in the three cases. 
Huggins-Mayer: 


2 
AER(a,y,2) — —[1.6643?--1.6645? 
a 


--3.32822— 3.0312:y — 3.642xz ], (16a) 
Born-Mayer: 
^ e 
AEn(x,y,2) — —[2.4822?4-2.4825? 
a 
+4.96422—3.675xy—4.635xz], (16b) 


Pauling: 
e 
AEg(x,y,2) =—[2.910x2+-2.910y? 
a 
+5.82022—4.656xy—5.762x2]. (160) 


The difference between Eqs. (16a) and (16b) which 
use the same force law but different force constants 

and p could partly be ascribed to the neglect of the 
effect of the Van der Waals forces in using the Huggins- 
Mayer formula. In order to get an expression for 


p 
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Paste I. Table of constants in repulsive energy formulas, 


T a B (ergs) p (A) A 
Born-Mayer 105 x10 roc CM 
Huggins-Mayer 1.3541 10-19 0.345 


Pauling 1.368% 10719 


AEn(x,y,2) which would be comparable to (16b) one 
would have to include in AEp(x,y,z) the contribution 
from the change in Van der Waals energy due to the 
displacement of the lattice points. We shall not do this 
but instead use the differences between the lattice 
energy changes with Huggins-Mayer and Born-Mayer 
formulas as a measure of the importance of Van der 
Waals forces. Equations (16a) and (16b) indicate that 
while the Van der Waals contribution to the lattice 
energy is significant, it is definitely less important than 
the repulsive energy change. The difference between 
the energy expressions in (16b) and (16c) is due to the 
different formulas [ Eqs. (10) and (12) ] employed. It is 
encouraging that the difference between Eqs. (16b) and 
(16c) is less than that between (16a) and (16b) since 
both Born-Mayer and Pauling formulas include the 
effect of the Van der Waals forces indirectly. One last 
comment about the repulsive energy calculation. We 
have used Eqs. (10) and (12) together with the con- 
stants in Table I which refer to the energy of repulsion 
between Li* and F- ions. Actually ions A and B are 
somewhat different from the regular F- ion because 
they each contain a total charge —e/2, which is less 
than that of a regular F- ion. Our belief that the use 
of Eqs. (10) and (12) does not lead to significant error 
is based on two considerations. First, that the charge 
distribution of five and one-half 2p electrons is probably 
not too different from what it would be for six. Secondly, 
at least in the £, state, the hole probably affects only 
the charge distribution in the region between the two 
ions A and B. Hence, to the nearest-neighbor positive 
ions, C, D, ---, H, the A and B ions do not “look” very 
different from perfect F- ions. 

We shall finally consider AE», the energy associated 
with the polarization of the lattice due both to the two 
holes on ions A and B and the displacements of the 
neighboring points C, D, -::, H. The polarization en- 
ergy due to a point charge at a lattice site has been 
considered in detail by Mott and Littleton.” In the 
zeroth-order approximation, where one considers the 
crystal as a. dielectric continuum in calculating local 
fields at the ions, the dipole moments on the positive 


and negative ions are given by 


M &a*e 


N E) 
== 2 


r 


(17) 


with the dipoles directed outwards in the direction of 


12 See Ref. 7; B. G. Dick (private communication). 
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r, the radius vector joining the impurity charge ¢; and 
the ion under consideration. 


AIK le A a4) 
pm MEM 
dl (e/A)+4 (a,+a_) | 


where a, and a. are the electronic polarizabilities of 
the positive and negative ions on the lattice, Ks is the 
static dielectric constant and A=6a/@ is the spring 
constant with the compressibility 1/0 given by 


E Urin 
eee (19) 
0 3aLp? ap 


We however follow the first-order approximation of 
Mott and Littleton in that we permit the nearest- 
neighbor ions (B, C, ---, H in the case of the hole on A) 
to move discretely and determine their displacements 
by a detailed consideration of the various terms in the 
lattice energy. So we have to allow for this difference in 
nature between the ions A, ---, H and the ions in the 
rest of the crystal. We shall do this by taking into 
account only the electronic polarization of ions A to H, 
Since their ionic polarizations have already been con- 
sidered by calculating their electrostatic and repulsive 
interactions with other ions in the crystal in their dis- 
placed positions. 

Our polarization energy calculation is also somewhat 
more involved than the calculation of the polarization 
energy due to single point charges such as single va- 
cancies and interstitials in a lattice. The ions 4 and B 
carry only charges —e/2 and are in displaced positions 
as compared to their original positions Ay and Ba in 
the lattice. Therefore, in the calculation of polarization 
effects we have in effect to consider four charges, 
namely — 1e each at A and B, and +e each at the points 
Ao and Bo. The latter two represent the effective charges 
the vacancies would have if the charges —}e were 
entirely removed. Each of these four charges polarizes 
the ions in the rest of the lattice and the dipole moment 
on the polarized ion not only interacts with the polariz- 
ing charge (direct polarization energy) but also with 
the other three charges (indirect polarization energy). 
We shall consider the four ions in six pairs, namely 
(4 0Bo), (A B), (A By) (A B), (AA o), and (BB). To 
avoid counting the direct polarization energy for each 
of the four charges twice, we shall include the direct 
polarization energies only while considering the first 
two pairs (AoBo) and (AB). Strictly speaking, one 
should consider similar pairs for the ions C, D, E, F, G, H 
because they are also displaced from their original 
equilibrium positions. A consideration of such effects is 

equivalent to considering the polarization of the lattice 
due to the dipoles produced by the displacements of 
C, --:, H from their original positions. Since the dis- 
placements of the ions C, ---, H are expected to be less 
important than those of 4 and B, we shall neglect the 
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- effects of the dipoles at C, --+, Æ on the polarization energy. Thus we have | 
AE p= Alis (AoBo) AE, (AB) AE, (4Bo)- AE, (A49B) AES (A40) AE, (BB9). (20) 

N f : 2 " e 1 COS Asi Bs COS A sj Bo Ms Mi My | 

AE,(AoBo = [ane he hin ay n- x Ae j 

a i Tao j TA i Agi T Bow j TAgjTBoj TA0Bo JAgCo TAC | 


3 " r (~ COSÓ A Co By My e) ( Mi My ) ét 


3 2 4 4 

TAgCol Bole Y AC TBC T AgEo TAgE | 

" $: ^ = 

Mi My Mı COSÎAoEoBo Mi cosbagrny 
h 2 — —4 E PS S E , (21) [ 

Ss Gian TAa T A0Eo T BoEo T AoE T BoE | 
d | 
| 


DOKLAD] y e 
4/4 ea] ^ e 


(22) 
 D-0/KSYX(/ 4e] |e 
am[(8/4)-(m-o-)] ^ @ 


The summation in i in the first and third terms of (21) runs over the positive ions in the lattice while the summa- 
y tion over j in the second and fourth terms runs over the negative ions except that in the fourth term the points 
«j= Ay and j= Bo are excluded. The fifth term arises because (when considering the effect of the charge +e at Ao) 
we do not have an ion at the B site and vice versa for the charge +e at Bo. The sixth, seventh, eighth, ninth, and 
th terms in (21) take account of the correction to be introduced because we have to consider only the electronic 
rization of the displaced ions C, D, E, F, G, and H. The angles like @.4,;2, stand for the angle subtended by the 
ts Ap and Bp at the lattice point j. 


cos64;B cosh A; 


Y e 1 1 
an--—nr—unr Ex ; -n-My) 


i ray 7 quu Ararte j TAPTB; rap’ 


{2 =) (= COSÜAC,B My er) ( Mi E Mi; My | 
74 a b. E " ^ F ( ia; ) | 
Aco TAC* TACQT BO. TACT BC” TNE a STAVES Face’ TAG! 


Mi COSÜ A EoB MY cOSOAEB i 
—( | O 


TAESTBEQ. TAE BE! 


M, M5, and M; are defined in Eq. (22) and 
ou! f £ 

M” =—. (24) 

a3 


tity a’ represents the polarizability of the r electrons on the F- ion. The fifth term is introduced to take 
he fact that only the polarization of the v electrons on A and B ions has to be considered; the polariza- E. 
g electrons will be taken care of indirectly in the electronic energy. The summations in the first and 
ain run over the positive ions alone and those in the second and fourth terms over the negative ions - 
points B are again excluded from the fourth summation. The rest of the terms are introduced to 
ionic potere of the ions C, D, ---, H in AEp. d 


Mey 


i T Aoi ira? 


COSÜ A,;B 23 € 1€0s04,c,5 Mi Tr) 
T Aco T BCS T As T BC? 


ozo Mi =) i 


ae 
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spectively, the ion B being left out of the summation in j. Finally, we have 


: ; e 
Abl AA) HAE, (BB) = — 


COSI Aaj A 


M13, ——--Mi;- 


? 5 
i TAg TAV 


| COSA Agi A 


a 


; Ly 
Í TAg TAj 


4 (= 1COB 4,c,4 My! at) 


- 
T AoC T A Co" Y AC T AC. 


MicosÜ4,g, Mi cosfagA Mi cosOaya.a My! cosOsgoa 
-2 : x aie 3] (26) 


9 9 
TAGES TAES 


We can combine Eqs. (21), (23), (25), and (26) to 
obtain the net polarization energy of the lattice given 
by Eq. (20). Since we require the energy only up to 
quadratic terms we have to express the polarization 
energy in terms of linear and quadratic terms in x, y 
and z only. This can be done with the help of Fig. 1 
and expressing the distances and angles involving the 
displaced ions A, B, - --, H in terms of linear and quad- 
ratic terms in x, y, and z. The detailed evaluation of the 
coefficients of these terms requires several lattice sum- 
mations, some of them about the single center 4 and 
some involving the two centers 4 and B. A few of these 
summations were available from the work of earlier 
authors," namely 


1 
—= 10.1977, 

i T Aid 

1 
Y —- 6.3346, 
i 7 Agj* 

(21) 
D» = = 5.036, 
i 7 Agi Boi? i 7 Agi T Boi 
1 1 

> ——— ————24432. 


jo fagfB! i fag Bo) 
These summations are the least convergent of the 
various sums needed. The rest of the sums have the 
general form 

9 Agi Boi Y Asi y Boi? 


(28) 


TA TB 


with similar expressions involving j. «A i, YAoi and Zagi 
represent coordinates of the ith ion in terms of the 
crystal axes with respect to the undisplaced position 
of A. These were performed on the Illiac at the Uni- 
versity of Illinois computing center and checked on the 
IBM-1620 at the computing center of the Riverside 
campus of the University of California. Their values 
are presented in Table IT. The sums were carried out 
in all cases up to the eleventh neighbors of the relevant 
type (positive or negative ion), and the convergence 


. Chem. Phys. 19, 894 (1951); 


"LR. Reitz and Gammel, T Rev. 105, 819 (1955). 


F. Bassani and N. Inchauspé, Phys 
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was tested by finding the additional contribution in 
going to the thirteenth neighbors. The convergence in 
all cases was better than 0.1%. 

Next we have to consider the values of the quantities 
Mı, Ma, My’, M7" to be used in the calculation. The 
expressions for these are given in terms of Ks, o, o, a’, 
and the spring constant A in Eqs. (22) and (24). In 
Table III the values of these quantities are tabulated 
together with the source from which they were obtained. 


TanLE II. Lattice sums involved in the calculation of AE;. The 
sums and notation are defined in the text [see Eq. (28) ]. 


a B y 6 c r Sumovert Sum over j 
0 0 0 0 1 5 4.62961 1.32584 
0 0 0 0 3 5 2.79582 0.41540 
0 1 0 0 3 3 1.73987 0.54498 
0 1 0 0 1 S 0.73710 0.13719 
0 1 0 0 3 5 1.11161 0.14987 
0 2 0 0 3 5 1.42513 0.34914 
0 1 0 1 3 5 0.18422 0.00136 
1 1 0 0 3 5 0.31351 0.19927 
1 0 0 1 3 5 — 0.92679 —0.14850 
1 1 0 0 5 5 0.08321 0.04716 
0 1 0 1 1 7 0.02760 — 0.05133 
0 2 0 0 3 7 1.17244 0.12574 
0 1 0 1 3 7 0.05051 —0.01262 
0 2 0 0 1 7 1.64155 0.41656 
0 2 0 0 0 8 2.19260 0.59693 
1 0 0 1 5 5 —0.97343 —0.07417 


i 
I 

i 

i 
| 
| 
i 


TABLE III. Values of the quantities involved in expressions for 
My, M2, My’, and M4". 


Quan- 
tity Value Source 
Ks 9.27 M. Born and K. Huang, Dynamical 
Theory of Crystal Lattices (Oxford 
University Press, New York, 1954), 
Table 17. 
a, 0.03 X10cm* J. R. Tessman, A. H. Kahn and W. 
Shockley, Phys. Rev. 92, 890 (1953). 
a- 0.652K10*cm? C. Kittel, Solid Slate Physics, (John 


Wiley and Sons Inc, New York, 
1959), p. 165. 

Represents polarizability of r electrons 
which is ? of a [see R. M. Stern- 
heimer, Phys. Rev. 96, 951 (1954) ]. 


a’ 0.391X 10^?! cm? 


A 147 X10-8 M. Born and K. Huang, 
Theory of Crystal Lattices (Oxford 
University Press, New York, 1954), 
Table 9. 
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Using these values, we obtain 
M,=0.0624, 
M —0.0036, (29) 
M3 0.0796, 
M —0.0482. 


The result of our expansion and substitution is finally, 


AEp(x,y,z)— — (e/4a)[1.416— 0.505:x-4-0.702a? 
—0.010y—0.0075?+-0.089y+-0.0372 


+0.08722+-0.106%2]. (30) 


It is understandable why in the expression for the 
polarization energy the coefficients of the terms that 
do not involve x are small. These terms arise out of the 
electronic polarization of the positive ions which have 
rather small polarizability. Combining Eqs. (9), (16), 
and (30) we get the following equation for the net 
change in the lattice energy, namely, 


AEz (x,y,z) — (eare’/a)-+ (e/a) —0.538—0.374x 
—1416y4-2.348z-l-aa*--By*-- yz? 
-FAwy-Fuyzd-vxz], (31) 


where the coefficients a, £, Y, ^, y, and v depend on our 
choice of the repulsive energy formula. In Table IV, 
the values of these coefficients are tabulated when the 
Born-Mayer, Huggins-Mayer, and Pauling repulsive 
energy formulas are used. The coefficients in the lattice- 
energy expression are seen to be sensitive to the choice 
of the repulsive energy formula, as discussed above. 
From Eq. (31), one obtains the energy of the lattice 
as a function of v alone by minimizing the lattice energy 
with respect to y and z. This leads to the equations 


28y--uz—1.416—2x, 


(32) 
py+2yz= —2.348— yx. 


One can now solve Eqs. (32) in terms of « and substi- 
tute the following expressions (33) obtained for y(«) 
and z(x) in Eqs. (31) and hence express the lattice 
energy in terms of x alone, as in (34): 


yo(u)=b+cx, 


zo(x) — d--ex, S) 


and 
AE1(x)=A+Ba+Cx?. (34) 


From Eq. (34), one can obtain a plot of the lattice 
energy as a function of x or of R, the distance between 
the ions A and B, the relation between R and x being 
given by 

R=v2a(1—v2z). (35) 


In Table V, the values of 5, c, d, and e in Eq. (33) and 
A, B, and C in Eq. (34) are tabulated for our three 
different repulsive force laws. In Fig. 2, we have plotted 
AE;(R) as a function of R and as expected there is 
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TABLE IV. Values of coefficients in lattice energy expressions, 


Coefhi- 


cient Born-Mayer Huggins-Mayer Pauling 
a 1.953 1.135 2.381 
B 3.091 2.873 4.119 
y 8.368 6.732 9.224 
A — 0.698 — 0.053 —1.678 
u —1.707 — 1.707 — 1.707 
y —2.162 — 1.169 —3.288 


great similarity between the Born-Mayer and Pauling 
curves, there being a minimum in the lattice energy 
curve in each case. The positions of the minima can be 
expressed either in terms of x or in terms of A, using 
Eq. (35). In Table VI, we have tabulated for the cases 
of the three repulsive formulas employed, the values 
of xo Ro, and AE; (R;), the lattice energy at the posi- 
tion of the minimum in AZz(R). The minimum in the 
lattice energy curve is rather significant and arises out 
of a compromise between the energy of interaction be- 
tween the ions A and B with the ions C, D, E, F, G, H. 
Thus, the distance between the ions A and C decreases 
as x increases while the distances between ion 4 and 
ions E and F increases as x increases. The fact that we 
get a minimum in the lattice energy curve indicates 
that there is self-trapping of the hole even in the ab- 
sence of appreciable electronic conjugation between the 
ions A and B leading to the Fy molecule ion. As we 
shall see in the next section, there is also a stable state 
for the free Fy ion. The relative depths and shapes of 
the lattice energy and electronic energy curves are im- 
portant in determining the nature of the total energy 
curve for the crystal. We shall discuss this point further 
in Secs. IV and V after describing the electronic energy 
calculation. 


IV. ELECTRONIC ENERGY CALCULATIONS 


The next quantities that we have to calculate are the 
energies and wave functions for a Fy ion as a function 
of internuclear separation. There are various standard 
methods in molecular physics to accomplish this, 
broadly divided into molecular orbital and valence- 


TABLE V. Coefficients b, c, d, e, A, B, and C in Eqs. (34) and 
(35) for the different choices of repulsive energy formula. A, P, 
and C are in Ry (e?/2a0). 


Coeffi- 

cient Born-Mayer Huggins-Mayer Pauling 
b 0.163 0.202 0.148 
c 0.127 0.036 0.245 
d —0.124 —0.149 —0.114 
e 0.142 0.091 0.201 
A 0.503 0.473 0.515 
B —0.116 —0.111 —0.131 
C 0.923 0.569 0.971 


M 
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TAPLE VI. Positions Ro of the*minima and minimum values 
AEr(xo) of the lattice energy for the various approximations. 
AFL(xo) is expressed in Ry, and Rs is in units of as. For x=0 
that is, for the undisplaced positions of A and B, R=5.371ap. 


Quantity 


Born-Mayer 


Huggins-Mayer Pauling 
E 0.063 0.0907 . 0.068 
Ro 4,893 4.680 4.855 
AE (xo) 0.502 0.470 0.513 


bond methods." Both these methods in their improved 
forms, including configuration interaction in the mo- 
lecular orbital approach and ionic character in the va- 
lence bond approach, are essentially equivalent. In their 
improved forms they are comparable to variational 
approaches in the self-consistent procedure for atoms 
including configuration interaction. In our calculations, 
we have not attempted to include any configuration 
interaction or self-consistency requirements, but have 
adopted a simple molecular orbital procedure. In the 
usual molecular orbital method, as in the Hartree-Fock 
approach for atoms, one generally starts from the 
following many-electron Hamiltonian and computes 
from it a one-electron Hamiltonian for each state: 


K=- VP +E A. (36) 
i Ts if. 


For simplicity, we have adopted the following approxi- 
mate one-electron Hamiltonian for the three 2p elec- 
trons which take part in the o bond for the F; molecule: 


= — V23- V A (ra;) 3- V elre). (37) 


The potentials V4(r4;) and Vg(rg:) both represent the 
one-electron potentials seen by a 2p electron on a F- 
ion. The justification for choosing (37) is the following. 
When the unpaired electron in the molecule is in the 
neighborhood of ion 4, then as far as the ion A is con- 
cerned, it sees the potential that a 25 electron on an 
F- ion would see. In the same situation, as far as ion 
B is concerned, the charge distribution on it looks like 
a neutral atom to the unpaired electron on 4 and there- 
fore the electron again experiences the same potential 
from ion B as a 25 electron on an F- ion would. These 
arguments are valid when the cores of the F — are not 
too distorted by the presence of the hole. This would be 
true if the distance between the fluorine nuclei in the 
molecule is not very greatly different from the distance 
in the perfect crystal. , 

As basis functions for our determination of the o, 
and c, functions for the Fs molecule, we have selected 


4C. Coulson, Valence (Oxford University Press, New York, 
1953). 
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_ Fic. 2. Curve showing lattice energy Ar as a function of the 

distance R between the F- ions A and B. Curves for all three 


approximations for the repulsive energy, namely Born-Mayer, 
Pauling, and Huggins-Mayer are given. 


the following linear combinations: 


qu Cio, 


(38 
Do diis, ; ) 
where 
$197914—6012,  óiu.— $iat- dun, 
$29=G21—G22, Pru= $24-- $n, (39) 
$397 psa Fpa, au $3A— dan, 
i= 41a -QiB, Qiu= $14— pis, 
with 
Pap(ra) Pap(ra) |. 
$14 — — ——Y (84,04) , ae (4,04) , 
TA TA" 
(40) 
Pop(ra) 
$24=Pop(ra)VO4,04), $414=———V (04,04). 
TA 


The function P»,(r4) is taken to be the radial part of 
the 2p wave function of the F- ion. Ideally one should 
have used in addition to $14 and ¢24, functions whose 
radial parts are 74P2p(r4), ra?Pass(r4), --- to include a 
complete set. We have chosen only: two functions out 
of the complete set as a compromise to reduce the work 
involved. The same remark applies to the choice of 
the functions ¢34 and $44. Taking the radial parts of 
all the functions as being related to P;,(r4) enables us 
to make use of the known integral and differential 
properties [ Eq. (43) ] of this function, especially as they 
relate to the effective potentials V4 and Vg and the 
atomic eigenvalues 65;. 

To set up secular equations to obtain the coefficients 
c; and d; in Eq. (38), we have to compute matrix ele- 
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(bial — Velia) (bial Valdina), and 
(óia|Vnlón), (Qil -V loia) (ial Valdis). 
Of these integrals only the first two are one-center 


integrals, the other three are two-center integrals. The 
one-center integrals can be obtained by direct numerical 


that have to be calculated are | 
| 


integration. For example, we have id 
| 

* /d Pap 2 © Pore 

(14| -visay- | ( ) dr--2 a » 

0 dr 9 7? | 


(41) 

i) JE | 

(b14| Valoa) = -f — —2Z (r)dr , | 
0 T 


where the potential V 4(r) has been expressed as | 
Va(n=—2Z(r)/r. (42) 


We obtain V 4 (7) directly from its definition by the one- 
electron Hartree-Fock equation: 


Val) =+ 6 ————- (43) 


n 
o 2 4 6 8 10 2 2p. 


where 6», is the one-electron energy for the 2p state 
Fio. 3. Plot of 2Z (r) versus r. and P'"-dg?P/dr. The function P», and the energy 


ments 365 (6;|3€|d;) for both « and g states. These parameter 65, are taken from Froese’s calculations.’ In 
‘matrix elements I€; involve integrals over the orbitals Fig. 3 we have plotted the values of 2Z (r) as a function 


DEC REP ehe various types of inteeral of r. For the two-center integrals we employ Lówdin's 
i 389) P vous types OL mntegta"s  a-function method.5 The a functions and the integrals 


involving them that were required [to determine the 
matrix elements of the Hamiltonian 3€, Eq. (37) ], were 
obtained at the computing centers of the University of 
Rochester and Argonne National Laboratory. Since we 
require the electronic energy as a function of the dis- 
tance between the ions A and B we have to compute | 
the matrix elements for different values of R. These | 
computations were done for six values of R, namely 
4.0a0, 4.4ae, 4.8ao, 5.000, 5.2a9, and 5.4ao. Since the 
functions ¢; do not form an orthonormal set, we must 
take account of the integrals S;;=(¢:|¢;) for both the 
c, and c, states and the eigenvalue secular equation 
then takes the form: 


Det|3¢,;—S:;;E| =0 (44) 


for both To and c, states. The eigenvalues 6, and Su 
for the six values of R considered are tabulated in 
Table VII. Also tabulated in Table VII are the 
quantities 


5.0 5.4 


Eu= 644-26,4- 2/ R) 365; (45) | 


and i 
1:96 4 GTI) GER (46) 


The quantity E, represents the decrease in ener, 


the three o electrons due to ponding between a 102 
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TABLE VII. Energy values in rydbergs for electronic and hole states for different values of 
the internuclear distance of R. 
Energy NR 
value 4.0 44 4.8 5.0 52 54 
Eo —0.5997 —0.5666 -05400 —0522 03179 05086 
Su — 0.4103 — 0.4352 — 0.4498 —0.4534 — 0.4558 — 0.4552 
Eu-H2 80+ (2/R) —1.1097 —1.1137 —1.1131 — 1.1098 —1.1070 —1.1020 
&j--28.-- (2/R) — 0.9203 — 0.9823 — 1.0229 — 1.0350 —1.0414 — 1.0486 
Eu — 0.0207 — 0.0247 — 0.0241 —0.0208 —0.0180 —0.0130 
Eg --0.1687 +0,1067 +0.0661 +0.0540 -+0.0376 -++0.0304 


A and B, when the hole is in the Z, state; £g represents 
the corresponding energy decrease of the three c elec- 
trons for the Z, state of the hole. The term 2/R repre- 
sents the Coulomb energy of repulsion between the 
effective unit positive charges at the A and B ions 
about which the three c electrons of the Fy molecule 
can be considered as moving. Thus E, and £, give the 
values of the term Eg in Eq. (1) for the Z, and Z, 
states of the hole, respectively. The variations of E, 
and E, with R are shown in Fig. 4. It is seen that while 
E, has a minimum at R—4.56ao, E, has no such mini- 
mum. This result indicates that while the Z, state for 
the hole in the free F; molecule is stable, the Z, state 
is not, as expected. Finally, to obtain the variation in 
total energies of the hole states Z, and Z, in the crystal 
with the distance R between A and B, we have to 
combine the energies plotted in Figs. 2 and 4 which will 
give us the variation with R of the total energy of the 
crystal in the presence of the hole. 


V. TOTAL ENERGY OF CRYSTAL AND INTERPRETA- 
TION OF VARIOUS EXPERIMENTALLY 
STUDIED PROPERTIES OF THE HOLE 


In Table VIII, we tabulate E— Eo of Eq. (1) as a 
function of R. In addition to the lattice energy calcu- 
lated from Eq. (34) and the electronic energy tabulated 
in Table VIII we have added the electronic affinity £4 
which we have taken as 0.363 Ry from Froese's calcula- 
tions.® For the lattice energy, in view of the inaccuracy 
in principle in the use of the Huggins-Mayer expression, 
as mentioned earlier, we have only used the Born- 
Mayer and Pauling expressions. Also tabulated is 0E 


TABLE VIII. Energy in Ry of the crystal due to the presence of the hole. Š 


which is the difference between £ and the quantity 
Ome” 
——+E,4=1.283 Ry, (47) 
a 


representing the increase in energy of a crystal when an 
electron leaves the crystal (or a hole is produced in the 
crystal) excluding the energy change due to relaxation 
of the ions of the lattice and the electronic binding be- 
tween the ions A and B. The suffixes g and u to 8E and 
E are added in Table VIII to indicate the corresponding 
states 2, and Z, of the hole. The negative values of 
6, and 6£, indicate that the combined effects of the 
lattice distortion and electronic conjugation between A 
and B ions is to stabilize the hole center. In the case of 
the 2, state both the lattice distortion and electronic 
conjugation lead to stabilization. For the Z, state, the 
lattice distortion leads to stabilization while the pres- 
ence of two antibonding c, electrons and only one 
bonding c; electron leads to destabilization. In Fig. 5 
the energies 0E, and ôE, are plotted as functions of R 
for both the Pauling and Born-Mayer approximations. 
The zero ordinate on this plot would represent the 
energy of the undisturbed crystal after the hole has 
been removed, while the energy of the perfect crystal 
Ep lies at — 1.283 Ry. As was found in Fig. 4 with the 
electronic energy, again we have a minimum only in 
the Z, state and none in the Z, state suggesting that 
only the Z, state for the hole has a stable minimum. 
This result cannot be taken too literally, however, be- 
cause we must realize that when R corresponds to the 
normal lattice distance or greater distances, our de- 


d 4.0 44 48 5.0 5.2 5.4 

i 0.869 0.855 0.852 0.855 0.860 0.868 

zi eee er) 0.858 0.845 0.841 0.844 0.848 0.856 
v Borne 1.058 0.987 0.942 0.930 0.915 0.911 
Eee en) 1.047 0.976 0.931 0.919 0.904 0.899 
S nu cn 0415 —0.428 —0.431 —0,428 —0.424 —0416 
r Be Mayer) —0.425 = 0.439 —0.442 =0.439 —0.435 —0.428 
Ae aa 0.225 —0,297 Em -0355 -0368 —0.372 
BE, (RE —0.236 —0.307 —0.352 —0. —0. —0.384 
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Fic. 5. Curves for the total energies 6, and 8E, for the Zu 
and Z, states, respectively, of the hole as a function of R. 


scription is not a complete one. In that case, there is 
high probability of the hole hopping to a new pair of 
F- ions. 

A comparison of Figs. 2, 4, and 5 indicates that the 
depths and rates of variation around the minimum of 
the Born-Mayer and Pauling curves are comparable. 
Also it is seen that the minima of the lattice energy 
curves for the Pauling and Born-Mayer approximations 
are at R=4.89a and 4.92a,, respectively, while the 
minimum in the E, curve for the electronic energy 
occurs at about 4.5629. In comparison, the minimum in 
the 9E, curves for the total energy occurs at about 
4./2as for both the Pauling and Born-Mayer approxi- 
mations. These considerations indicate that according 
to our calculations, the lattice distortion around the 
V, center and the electronic binding within the F;- 
molecule are comparable in importance in contributing 
to the stabilization.of the V; center. 

From Fig. 4 one can also obtain the absorption energy 
for transitions between the 2, and Z, hole states. By 
the Franck-Condon principle, we expect the transition 
energy to be obtained vertically from the minimum in 
the E, curve. From Fig. 5, the absorption energy for 
both the Born-Mayer and Pauling approximations is 
0.097 Ry, which corresponds to 1.32 eV. This is to be 
compared with the experimentally observed!5 position 


19 C. J. Delbecq, W. Hayes, and P. H. Yuster, Phys. Rev. 121, 


1043 (1961). 
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of the absorption peak at 3.54 eV. We thus obtain only 
order-of-magnitude agreement with experiment. There 
are two possible sources of error which might be re- 
sponsible for the disagreement between theoretical and 
experimental absorption energies. Our molecular calcu- 
lations are approximate and perhaps a better calcula- 
tion would lead to a stronger binding that is, a deeper 
energy curve for the X, state of the hole and move the 
>, state correspondingly upwards in energy. The 
other possibility is the lack of self-consistency in our 
calculation which is implied by the direct addition of 
Eg and AEL. 

From the ground state and excited state curves for 
the energy as a function of R, we can also obtain the 
width of the absorption band. Assuming a linear de- 
pendence on the excited state configuration coordinate 
(R in our case)" one obtains the following expression 
for the width at an intensity equal to one half of that 
at the maximum, namely: 


AW —27A (uK) eV , (48) 


where A is the excited state slope in Ry/ao, u is the 
effective mass of the oscillator in units of the electron 
mass, and K is the ground-state force constant in 
Ry/ag. Properly, one should use not only the reduced 
mass of the two F atoms A and B but include a small 
correction from the ions C,D,---,H which relax. 
However this last correction is rather difficult to calcu- 
late and is not warranted because of the various ap- 
proximations that we have made so far in our calcula- 
tions. So we take 
n7 9.5m,X 1837, 


where the electron mass, me=9.11X 10-8 g. From the 
Born-Mayer curve in Fig. 5, we get K=0.1 Ry/av’, 
A=0.1 Ry/ae. Substituting these numbers in (48) we 
have 

AW =0.58 eV (49) 


as compared to the experimental value of 0.95 eV 
found by Delbecq et al.!6 So the calculated value of the 
half-width is about a factor of 2 less than experiment. 
An improved electronic energy calculation would lead 
to a deeper potential well for the Z, state implying an 
increase in K, while the 3, state would get steeper 
leading to an increase in A. These two consequences are 
opposite in their influence on AW but perhaps their 
combined effect would lead to better agreement with 
experiment. Another quantity that can be derived from 
Fig. 5 from the ground-state curve is the half-width of 
the lowest vibrational state. One can deduce the follow- 
ing expression for this latter half-width, namely 


(AR)1/2= (0.996) (uK) "a0. (50) 
Using the values of » and K for the Born-Mayer curve 
17 See, for example, D. L. Dexter in Solid State Physics, edited 


by F. Seitz and D. Turnbull (Academic Press Inc., New York, 
1958), Vol. 6, p. 353. 
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we get (AR)15—0.015a,. The Pauling curves in Fig. 5 
yield almost exactly the same values for AW and 
(AR);y» as the Born-Mayer curves. This small value 
of (AR)1/2 assures us that the actual inter-nuclear dis- 
tance in the Fs molecule is always well within the re- 
gion in which our calculation is valid. 

With the wave functions that we have obtained for 
the V, center, we can make an approximate calculation 
of the hyperfine coupling constants between the un- 
paired electron spin and the F" nucleus. The Hamil- 
tonian describing the magnetic hyperfine interaction 
between a nucleus and the spins of the orbital electrons 
is given by 


$6 — 3CaipT- 9C , (51) 
where 
2uxunp S-I 3(S;rj)(Irj) 
Kaip = —— z| -H =| (52a) 
rà rj 
and 


gÜp- (1627/3) (u vug/ 1) Du S; Ió(r;) x 


In Eqs. (52), uy and I are, respectively, the magnetic 
moment and spin of the nucleus, ug is the Bohr magne- 
ton, S; and r; are, respectively, the spin and position 
vectors of the ith electron. The summation in i runs 


(52b) 
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over all the electron spins in the atomic or molecular 
system under consideration. When we have a system 
with one unpaired electron and the wave functions for 
the various electronic states of the system are orthogonal 
we can drop the summation over i in Eqs. (52a) and 
(52b) and take expectation values over the orbital 
corresponding to the single unpaired electron. However, 
when the wave function for the orbital of the unpaired 
electron is not orthogonal to the wave functions for 
other states one has to take the expectation values of 
the summations in (52a) and (52b) over the entire 
determinantal wave function involving both the outer 
unpaired electronic state and all other paired states of 
the molecule. It can be shown!’ however, that in such 
cases, one can simplify the calculation by making the 
unpaired electron wave function orthogonal to the wave 
functions for the paired states by the Schmidt or- 
thogonalization procedure. One then has to take the 
expectation value of only one term out of the summa- 
tions in (52a) and (52b) over the orthogonalized wave 
function. For the ground state Z, of the hole, the un- 
paired electron will be ina o,, state. The orthogonalized 
wave function for the unpaired electron will therefore 
be of the form: 


Vu—5$i (Yisa Vis) —So (Yasa -rVasn) 


V pU P P LM 
N[12-281 (Visa |Visn)-- 282 (fosa |Vasp) +45 1S2(V 8A |Vasp) ]* 


where 


Si= Wu | Yisa)= as (Vu | Vise) ; 
So WulPesa)=— (Yul Voss), 
N- (WulWu)- 


The overlap integrals (Wisalvise), (VisalVess), and 
(Vss4|Vssn) are all one order of magnitude less than the 
overlap integrals Sı and S», so we shall neglect them. 
Using Eq. (53) for the wave function y of the unpaired 
electron and Eq. (52a) for 5Caip, we obtain Eq. (54) for’ 
the hyperfine constant “b”. Since we want to express 
“b” in Oersteds we have divided the hyperfine constant 
in energy units (ergs) by 2us. 


b= (Sun /2INÐ Eal 0| Yu) +5214 | O| V4) 
ES Vasa |0|V2s4)— 2S1 Wull visa) 
—2So(WulO|W2sa)+2S1S2~is4|0|Yesa)], (54) 
with 
0— (3 cos?04— 1)/r4* 


where r4 and 04 refer to the polar coordinates of the 
electron with respect to A and with the 4B direction 
as the Z axis. Equation (54) is approximate because a 
number of cross terms, involving 1S and 2S orbitals 
on atom B have been dropped. These terms are three 
orders of magnitude less than the leading term (Wal Olya). 
Furthermore it is clear that the second, third, and sixth 


(53) 


terms in (54) are identically zero. The fourth and fifth 
terms are two orders of magnitude smaller than the 
leading term. Therefore we can write, using (38) and 
(39), 


b= Gus/2IN*) 27 did { (pia | 0| pja) 


-Féói4016;2)--(6:510| 0;4)-- (0i210|6;5)) ]- (55) 


In Eq. (55) the fourth term in brackets is two orders 
of magnitude smaller than the contribution from the 
first term while the second and third terms are each 
more than one order of magnitude smaller. Also, when 
either į or 7 is three or four, the contribution to the 
first term is identically zero. We thus have finally, 


b= (6ux/ SIN?) [d (oia lra pia) tdl ra | Goa) 
2ddx(óix|r4-3|ó24)]. (56) 


Using Eq. (39) for $14 and za we then obtain for b 
the value 747.3 Oe for both the Pauling and Born- 
Mayer approximations. More than 99% of the contribu- 
tion to b comes from the first term in (56). Since our 
wave functions were tabulated for the points listed in 
Table VIII, we had to interpolate the coefficients d, in 
Eq. (55) at R equal to 4.72as. The calculated value 


18 See, for example, B. Gourary and F. J. Adrian, Phys. Rev. 
um (1957); W. Blumberg and T. P. Das, ibid. 110, 647 
1959). 
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TABLE IX. Contributions from the various terms in Eq. (57) 
to the isotropic hyperfine interaction constant for F?? nucleus in 
the V, center. The wave function densities are in units of aj ? 
and the contributions to ar in Oc. 


Term in Wave function Contribution to hyper- 
Eq. (57) density fine constant ar 
First 0.00012 0.175 
Second 0.00684. 10.317 
Third 0.00604 9.099 
Fourth —0.00178 —2.679 
Fifth —0.00167 —2.516 
Sixth 0.01288 19.384 
Total 0.02243 33.780 


of the hyperfine constant 5 is in fair agreement, cer- 
tainly in much better agreement, with experiment than 
the optical absorption frequency and width. The ex- 
perimental value of b is not known unambiguously? 
because it depends on the sign of a which is not 
known. If the sign of a is assumed negative, the ex- 
perimental value of b would be 946 Oe while for a 
positive the value would be 828 Oe. The hyperfine con- 
stant a is given by 


a=ar— (0/3), (57) 


where 


r= (Sxuy/ SIN?) [y (A) +S Vasa (4) 
"FSsVosa (4)— 283/« (A)as(4)— 282 (A) os (A) 
2518 wAs (A)Vss (4) ]. 


The term ap arises from the Fermi contact term 3Cr 
in the Hamiltonian and —5/3 from SCaip. In Eq. (57) 
we have neglected all the terms that involve yısg(4) 
and ysss(A). We have done this because the terms in- 
volving V1ss (4) and 255 (4) are an order of magnitude 
smaller than the smallest of the terms in Eq. (54). 
This approximation is in line with our assumption con- 
cerning the smallness of the overlaps between the 15 
and 2S orbitals about the two centers. To emphasize 
the importance of the overlap terms which may also 
be referred to as the Pauli correlation terms,!° we have 
tabulated in Table LX the values of the various terms 
in the brackets of Eq. (57) in units of ag? as well as 
the value of ap in Oersteds for R—4.72ae, the equi- 
librium distances for both the Pauling and Born- 
Mayer cases. 

Combining ag and —6/3 we get for a the value 
—215.3 Oe. It appears that the constant 6 is in much 
better agreement with experiment than the costant a. 
The reason for this can be gauged by examining 
Table IX. From Table IX it appears that the major 
contribution to @ comes from the second, third, and 
sixth terms in Eq. (57). All these terms depend bi- 
linearly on the overlap integrals S; and S2. A stronger 


— conjugation between the two ions A and B, as expected 


from earlier considerations of the optical data, would 
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reduce the distance corresponding to the minimum in 
the total energy curve in Fig. 4 and would therefore 
increase Sı and Sz significantly. Such an increase would 
enhance ap and therefore cause a greater cancellation 
between ap and —b/3 leading to a smaller theoretical 
value of a. The value of b on the other hand, as seen 
from Eq. (56) does not depend sensitively on R and 
should therefore be relatively unaffected by a stronger 
conjugation. This stronger conjugation would also 
make the electronic energy curve deeper than the lattice 
energy curve and therefore make the properties of the 
V, center less sensitively independent on the lattice 
potential Evidence that this is the case has been ob- 
tained by Twidell and Hayes? in recent work on CaF», 
It should be mentioned here that in addition to the 
Pauli correlation effect there will be a contribution to 
ap from the exchange polarization effect?! which arises 
out of the difference in the potentials experienced by 
electrons in the two spin states of the 15 and 2S shells. 
For an order of magnitude estimate of the exchange 
polarization one could use as a criterion the results from 
the nitrogen atom. Thus, noting that there are three 
unpaired 2? electrons in the nitrogen atom as compared 
to approximately one-half of a 25 electron per fluorine 
atom in the present case, one would expect the ex- 
change polarization contribution to a to be about a 
sixth of the 10 Mc result found for the nitrogen atom. 
This would correspond to only about 0.25 Oe. While 
this result is probably an underestimate because the 
1S and 2S wave functions in fluorine atom are more 
strongly bound than in nitrogen, it is however clear 
that a consideration of exchange polarization effect 
alone cannot explain the difference between our calcu- 
lated values of a and experiment. 


VI. CONCLUSION 


From the calculations in the previous sections it 
appears that both the distortion of the lattice around 
the V, center and the electronic binding of the c elec- 
trons play important roles in determining the properties 
of the V; center. The relative importance of the lattice 
and electronic energies would of course determine how 
the properties of the V; center will vary from one 
crystal to another. Recent work of Twidell and Hayes,” 
however, indicates that the electronic properties of the 
Vx center in CaF, are almost the same as in LiF. This 
could be explained either if the electronic binding 
energy of the F7 molecule and the lattice energy are as 
comparable with each other in importance as in LiF or 
if the electronic binding energy has been underestimated 
by us and really involves a curve with a steeper mini- 
mum than we have calculated and the lattice effect is 
of minor importance in determining the equilibrium 


% A. Twidell and W. Hayes, Proc. Phys. Soc. (London) AT9, 
1295 (1962). 

?! V. Heine, Phys. Rev. 107, 1002 (1957); T. P. Das and A. 
Mukherjee, J. Chem. Phys. 33, 1808 (1960). 
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configuration. The latter possibility seems somewhat 
more probable, from our considerations of optical and 
hyperfine data in the last section. Also, we have made 
an approximate calculation of the energy of a trapped 
fluorine atom at a F- site using the Born-Mayer 
approximation and allowing for displacements of the 
nearest neighbor Li* ions. The procedure employed in 
the calculations is analogous to that used by Dick? 
in recent work on alkali halide solid solutions. The 
energy of the trapped fluorine atom relative to the un- 
disturbed crystal after the hole is removed is then 
found to be —0.441 Ry. Comparing this with the corre- 
sponding energy, —0.443 Ry for the V; center for the 
Born-Mayer approximation, it is noticed that they are 
almost equal, although one would expect the V; center 
to be lower in energy from a consideration of relative 
stability. This result seems to substantiate our conclu- 
sion that we have underestimated the electronic binding 
energy of the V, center and should have a curve with a 
steeper minimum for the ground state in Figs. 4 and 5. 

It would be useful to combine the results of a careful 
recalculation”? of the electronic energy of the Fs 
molecule with the results of our lattice energy calcula- 
tions in Sec. III to obtain a better total energy curve 
for the V, center. It would also be interesting to find 
out if the recalculated wave function for the electronic 
c, State gives better agreement with the experimental 
value of the hyperfine constant a. It is encouraging 
that the results of the lattice energy calculations in 
Sec. III with two widely different repulsive energy 
formulas, such as the Born-Mayer and Pauling expres- 
sions (10) and (12), are not very different from one 
another and especially that the curves in Figs. 2 and 5 
are so similar in shape. This is a very significant result 
because the effects of such widely different force laws 
have not been extensively compared in imperfect crys- 
tals. In future calculations one could therefore use 
either the Born-Mayer or Pauling repulsive formula 
with equal confidence. 

One possible source of error in the calculation of the 
absorption energy should be mentioned here. When we 
consider the Hartree-Fock approximation for an atomic 

ick rs. Rev. 127, 1053 (1962). 
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Phys. 32, 219 (1960); B. Ransil, Rev. Mod. Phys. 32, 245 (1962). 


Such calculations are already in progress at Argonne National 
Laboratory (A. C. Wahl and T. L. Gilbert, private communi- 


cation). 
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or molecular system, the net energy of the system is not 
just the sum of the one-electron energies for the in- 
dividual states because then the Coulomb and exchange 
interaction energies get counted twice. One has there- 
fore to subtract off the Coulomb and exchange energies 
from the sum of one-electron energies for the various 
states. The difference in E, and Æp, which are tabulated 
in the last column of Table VII, is therefore likely to 
differ from the Z, — X, excitation energy for the hole 
if there is appreciable difference in the exchange and 
Coulomb integrals occurring in the energy expressions 
for the c, and c, states of the electrons. We have not 
made any estimates of the correction due to this source 
of error because such a refinement would be inconsistent 
with the approximations we have already made in our 
electronic energy calculations in Sec. IV. 

There is one other correction that has to be con- 
sidered in any improved calculations in future. In Sec. 
IV and in Fig. 5 we have directly added the electronic 
and lattice energies associated with the V, center 
which amounts to a neglect of self-consistency require- 
ments. To be entirely self-consistent, one should con- 
sider the change in the potential seen by the electrons 
due to the distortion and polarization of the lattice. 
Correspondingly, one has also to consider the change in 
lattice energy produced by the redistribution of the 
electron density on the F7 molecule due to electronic 
binding. However, if it turns out in a refined calculation 
that the electronic binding is much more important 
than the lattice distortion in determining the equi- 
librium situation for the V; center, one would expect 
the consistency condition involving interaction be- 
tween lattice and electronic potentials to be correspond- 
ingly less important. 
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The thin-film Al-AI;0;-Al system is studied. The current density as a function of temperature and voltage 
through aluminum oxide less than 30 A thick is analyzed using the theory of Stratton. The charge transport 
mechanism is established as tunneling by the functional form of the small temperature dependence. By 
fitting the current density data to the temperature dependence for 79°K < T «300*K and OX V € 1.0 V, the 
integrals arising from the WKB approximation are evaluated for an arbitrary barrier shape. The region of 
applied voltage for which quadratic power series expansions in V of these integrals are valid is established 
as 0.0 V «0.3 V. Assuming a trapezoidal barrier shape, approximate values for the barrier heights of 0.89 
and 0.78 eV and an electron effective mass of m*/m=1 are deduced. Comparisons are drawn between the 
more general forms of Stratton's and Simmons’ theories. 


I. INTRODUCTION 


HE flow of electrons through a sufficiently thin 

insulating film will occur by the quantum mechani- 
cal mechanism of tunneling.! Barrier layers thicker than 
a few tens of angstroms are nearly impenetrable to tun- 
neling electrons, as the transmission probability falls 
off exponentially with thickness, except where the 
tunnel distance is shortened by the application of a 
sufficiently large voltage (ie. Fowler-Nordheim?? 
region). For diminishingly small applied voltages, the 
electronic conductivity of films a few tens of angstroms 
thick may arise from thermionic or Schottky emission 
over the barrier,* tunneling through the barrier, space- 
charge limited current, or a combination of these 
mechanisms with traps in the forbidden gap of the 
insulator participating as an intermediary.* 

At temperautres below the superconducting transi- 
tion for the aluminum films in the Al-Al,0;-Al system, 
the electron current through an aluminum oxide film 
a few tens of angstroms thick has been shown to result 
from tunneling.’ The thermionic or Schottky contri- 
bution to the current is clearly negligible at these 
temperatures. 

Emtage and Tantraporn‘ have reported that for the 
AI-Al,03-Al system the entire current through a 50-A- 
thick oxide film at room temperature appears to come 
from Schottky emission. Whether thermionic emission 
or tunneling dominates the zero-voltage conductivity 
depends very strongly on the thickness of the oxide 
film. For sufficiently thin oxide films tunneling will be 
the dominant conduction mechanism. It is easy to show 
for the Al-Al,0;-Al system, assuming a rectangular bar- 
rier 0.85 eV high afid an effective mass m*/m=1, that 


* Presented in part at the Pasadena meeting of the American 
Physical Society, December 1963. 


1 D Holm, Electric Contacts (Hugo Gerer, Stockholm, Sweden, 


* R. Fowler and L. Nordheim, Proc. Roy. Soc. (London) À119, 
173 (1928). 

3 C. A. Mead, J. Appl. Phys. 32, 646 (1961). 

4P. R. Emtage and W. Tantraporn, Phys. Rev. Letters 8, 
267 (1962). 

5 D. V. Geppert, J. Appl. Phys. 33, 2993 (1962). 

$ C. A. Mead, Phys. Rev. 128, 2088 (1962). 

* T. Giaever and K. Megerle, Phys. Rev. 122, 1101 (1961). 


at room temperature the critical thickness is 35 A 
(or thinner), tunneling will dominate almost exclusively. 
For oxide films thicker than 35 À, thermionic or 
Schottky emision will dominate and the conductivity 
will be highly temperature-dependent. 

The current density through aluminum oxide films 
less than 30 À thick, sandwiched between two aluminum 
films, was measured as a function of temperature and 
voltage. The conduction mechanism, even for diminish- 
ingly small applied voltages, is shown to be tunneling 
by the functional form of relatively small temperature 
dependence. The data are analyzed using the theory 
of Stratton. Using the barrier height and thickness 
deduced from the data the Schottky current was cal- 
culated and found to be more than two orders of magni- 
tude less than the measured current at room tempera- 
ture. The integrals arising from the WKB approximation 
in the theory are evaluated for an arbitrary potential 
barrier shape by fitting the current density data to the 
temperature dependence. Finally, the theory of Stratton 
is modified to take more accurate account of the back 
current, which is the current that flows in the direction 
opposite to the current which is aided by the applied 
potential. The resulting equation can be reduced to 
either Stratton's or Simmons’ theory depending on the 
approximations employed. 


IIl. EXPERIMENTAL TECHNIQUE 
A. Sample Preparation 


The configuration of the aluminum-aluminum oxide- 
aluminum thin-film sandwiches employed in these ex- 
periments is shown schematically in Fig. 1. The sub- 
strate is a glass microscope slide cleaned by a technique 
similar to that of Lowe.’ The slide is subsequently ion 
bombarded in the manner described by Holland.” 

The metal layers are deposited on an unheated sub- 
strate in a vacuum-evaporation system at a pressure 
of less than 1X 10-8 Torr. The composition of the resi- 

8 R. Stratton, Phys. Chem. Solids 23, 1177 (1962). 

? L. Holland, Vacuum Deposition of Thin Films (J. Wiley & Sons 
Inc., New York, 1958), p. 71. 


? L. Holland, Vacuum Deposition of Thin Films (J. Wiley & 
Sons Inc., New York, 1958), p. 84. 


A1094 


CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar ji 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


ELECTRON 


FABLE I. Residual gas analysis.* 


Partial pressure, % of 
Gas or vapor total pressure 


Water vapor 89 
Carbon monoxide 8 
Nitrogen 1 
Oxygen 1 
dc 704 (pump oil) «1 


^ The residual gas analysis was 
to the one described by W. D. Davis and T. A. Van 
posium Transactions, edited by C. R. Meissner (Perga 
don, 1960), p. 417. 


rslice 


mon Press, [nc., Lon- 


dual gasses in the system at its ultimate pressure of 
3X 10^ Torr are shown in Table I. The silver used for 
contacts and interferometry gauges is vaporized from a 
molybdenum Knudsen surface emitting source! and 
the 99.999+-% pure aluminum is vaporized from a 
tungsten hairpin filament source.? 

First the lower aluminum layer is deposited on 
aluminum contacts. The slide is then removed from the 
vacuum system, and placed in a preheated furnace of 
dry air at atmospheric pressure and a temperature of 
300°C for two to four hours.? The slide is then vapor 
degreased in isopropyl alcohol, replaced in the vacuum 
system, ion bombarded, and the remaining metal layers 
are deposited. 

The metal films are cach approximately 1000 A 
thick. The junction area is 107? cm?, and is coated with a 
protective layer of silicon monoxide 2000 to 3000 À 
thick (not shown in Fig. 1) using a tantalum perforated 
chimney source similar to the one described by Drum- 
heller." The samples are stored in a vacuum desiccator 
at room temperature. 


TUNNEL JUNCTION 


CURRENT 


aa ae 
INTERFEROMETER: 


THICKNESS GAUGE 


RESISTANCE 
THICKNESS 
MONITORS 


VOLTAGE CONTACTS 


Fic. 1. Sample configuration. 


uL, Holland, Vacuum Deponi of Thin Films (John Wiley & 
Y k, 1958), p. 119. - j 
E d satim, of V acuum OE ee) 
edited by J, M. Lafferty (John Wiley & Sons, Inc., New York, 1962), 
. 706 


h idati i films in very 
a ate of oxidation of evaporated aluminum 
dry E ee air or oxygen is observed to be much slower 
than when water vapor is present m small und Oo 
t wi sults of P. E. Blackbum and E. A. n, 
qun soe 107, RU and I. Giaever and K. 
: . 122, 1101 Ses K j 

MIU. Vacuum Symposium Tr arr PO 
by L. E. Preuss (Pergamon Press, Inc., London, , P- j 
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Frc. 2. Electrical 
characteristics of a 
typical sample as a 
function of time. (a) 
Initial and final plot 
of the current den- 
sity as a function of 
the applied voltage. 
(b) Plot of the zero 
bias junction resis- 
tance as a function 
of time. 


j (Acn?) 
8 


B. Sample Characteristics 


For applied voltages less than 90 mV, the electrical 
characteristics of the samples are Ohmic (see Fig. 7). 
The resistance and capacitance of the samples are 
measured using a low-frequency-transformer ratio arm 
bridge which impresses 90 mV rms (or less) upon the 
sample, hence the values obtained correspond essen- 
tially to the zero-applied-voltage condition. All other 
electrical measurements are made using four-terminal 
methods, so that lead and contact resistance errors are 
avoided. The resistance of the samples range from 100 
to 400 ohms and their capacitance is about 0.022 to 
0.028 uF. The zero-applied-voltage resistance R and 
the j versus V characteristics for a typical sample are 
shown in Fig. 2, where j is the current density, V is the 
applied voltage, and / is the time (¢=0 is the time when 
the sample fabrication was completed). 

An increase in the junction resistance of the same 
general form as that reported by Handy” is observed. 
As shown in Fig. 2(b) the junction resistance increases 
approximately as the logarithm of the time. Extrapo- 
lating this plot the junction resistance would be expected 
to be double its initial (i.e., =20 min) value at /— 1.6 
X10? min (or in about three years). The slope of the 
junction resistance versus log / is similar to that reported 
by Handy for unprotected junctions within 2 to 3 min 
after deposition of the second aluminum layer. Handy 


15 R. M. Handy, Phys. Rev. 126, 1968 (1962). 
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Fic. 3. Schematic diagram of electrical measurement 
and temperature controlling apparatus. 


reported that the slope is pressure sensitive and in a 
vacuum of 5X10 Torr the resistance doubled in less 
than 3000 min (i.e., approximately two days). These 
results suggest that coating the junction with silicon 
monoxide immediately after its formation tends to 
preserve the initial junction characteristics." 

'The measurement techniques described below are 
designed to yield 0.1% precision, which is required for 
the analysis of the data in the following section. The 
stability of the junction is consequently of prime im- 
portance. After the sample has been aged for four months 
the junction resistance changes 0.1% in about 3000 min 
which is approximately the time required to take the 
data for the volt-current characteristics analyzed in 
Sec. IV. 

The change in the zero-applied-voltage resistance is 
accompanied by a shift in the volt-current character- 
istics. After storing the samples for four months, the 
electrical characteristics up to 1.0 V are quite stable 
with time. The initial plot of log 7 versus V and the plot 
after aging for four months are shown in Fig. 2(a). The 
final plot is reproducible to within 0.1% at a constant 
temperature even when the sample has been cycled to 
liquid-nitrogen temperature between measurements. No 
shorting,!” hysteresis;!" long time constant, forming,5:19 
negative dynamic resistance??? or change in the direc- 
tion or rectification with temperature’ effects were 
observed. A small rectification effect is observed and is 
discussed in Sec. IV. 

Aluminum oxide films formed by a number of different 
techniques have been investigated in this laboratory. 
Only the oxides gfown thermally, as described above, 


16 This result is in disagreement with the results of J. Penley, 
J. Appl. Phys. 33, 1623 (1962), but suggests that the pressure 
sensitive resistance change may be due to diffusion through the 
unprotected top metal layer, see J. J. Wortman and R. M. 
Burger, Second Annual Symposium on the Physics of Failure in 
Electronics, 1963 (unpublished). 

565) Meyerhoffer and S. A. Ochs, J. Appl. Phys. 34, 2535 

1 5 
3 18 T. W. Hickmott, J. Appl. Phys. 33, 2669 (1962). 
19S, R. Pollack, J. Appl. Phys. 34, 877 (1963). 


= 3H. Kanter and W. A. Feibelman, J. Appl. Phys. 33, 3580 
- (1962). 
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were used in the experiments reported here. However, 
the results are considered to apply to almost all alu- 
minum oxide films of less than 30 À thickness grown on 
the surface of vacuum deposited aluminum films, pro- 
vided the applied voltage is less than 0.2 V. The shape 
of the log j versus V plot can be correlated with 
the oxidation technique.” However, the characteristic 
shapes for each oxidation technique are nearly identical 
for applied voltage less than 0.2 V, aside from a multi- 
plicative constant which arises from small differences in 
the oxide film thickness. 


C. Measurement Techniques 


A schematic diagram of the electrical measurement 
and temperature controling apparatus is shown in 
Fig. 3. The apparatus is designed to maintain the sample 
inside a temperature-controlled light-tight chamber at 
any temperature between the boiling point of liquid 
nitrogen and room temperature. Since the substrate is a 
relatively poor heat conductor, it is necessary to enclose 
the sample in a constant temperature chamber, because 
the sample tends to come to radiation equilibrium with 
the temperature of its surroundings. The apparatus is 
similar to that of Ure? with the addition of a propor- 
tional temperature controller which regulates the dc 
heater power. With this apparatus the temperature can 
be maintained within +0.2°K during the time required 
to measure the volt-current characteristics. The con- 
troled temperature is measured with a copper-con- 
stantan thermocouple. 

The dc volt-current characteristics at constant tem- 
perature are measured by setting a predetermined volt- 
age across the junction, as compared with a volt poten- 
tiometer, and measuring the voltage drop across a 
100 ohm (20.0592) resistance in series with the sample 
with a K-3 potentiometer to determine the current. 


II. THEORY 


The net current density 7 in the direction normal to 
the insulating film in a metal-insulator-metal system, 
from a conducting region 1 to a conducting region 2, 
through the potential barrier formed by the forbidden 
energy gap of the insulator, is given by Stratton’s® 
general equation 


T TkT ] 1, @) 
J= — — —— lexp(—63)[1— exp(— eigV) J, 
lana eee 
where 
721 a T21 
bi=a (o—t1)! dx; a= | (g=) Pdz, 


71 Tu 


which are functions of the applied voltage V that arise 


21 The correlation of the shape of the log j versus V curves with 
the oxidation technique is the subject of another paper by one 9 
the authors (T.E.H.) (to be published). 

? R. W. Ure, Jr., Rev. Sci. Instr. 28, 836 (1957). 
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from the WKB approximation and can be computed 
for a given barrier e(x,V). The limits x1, and Xo, are 
the classical turning points given by the values of x for 
which e(x,V)— fi, t is the Fermi energy, x is the direc- 
tion normal to the insulating film, q is the charge and 
m the mass of the electron, / is Planck’s constant, T is 
the absolute temperature and ac 4g (2m)! ?/ f. 

Equation (1) is derived assuming a parabolic energy- 
momentum relation with the same electron mass in 
each of the three regions. Stratton* has shown that the 
equation is still valid when the effective mass in the 
insulator is less than the mass in either conducting 
region. It is found that the more general form of 
Stratton's equation for a nonparabolic energy-momen- 
tum relation in the insulator is not required to explain 
the results in Sec. IV. This presumably arises because 
the barrier heights deduced in Sec. IV are a small frac- 
tion of the energy gap. Other conditions on the range of 
application of Eq. (1) are discussed by Stratton. 

The last term in brackets in Eq. (1) represents the 
back current, or electrons tunneling from the conducting 
region 2 to conducting region 1 against the applied 
electric field. This term diminishes in importance rapidly 
with increasing applied voltage. The voltage dependence 
of the forward current results predominantly from bi(V) 
in the exponential. 

In deriving the equation it is assumed that the cur- 
rent is largely due to electrons whose energies are close 
to the Fermi energy in the conducting region 1. As a 
result an expansion with respect to energy near (i is 
employed. The result of this procedure is to assume that 
most of the current is forward current, and that even 
the back current arises from electrons whose energies 
are close to (1. This is a very good approximation for 
all but the very small applied voltages. 

The back current is more accurately represented by 
an expansion about the Fermi energy in the conducting 
region 2 given by (»—£1—4V.? From the symmetry it 
follows that the current density is more accurately 
given by 

Eee. 
4 kW lePLsin(reakT)t c? Lsin(rcskT) 
From this more accurate expression both Stratton's and 
Simmons” expressions for the current density, assum- 
ing an arbitrary barrier shape ex, V), can be derived 
depending on the particular assumptions employed.” 
Both Eq. (2) and Simmons” Eq. (20) are intractable 
to experimental analysis. ; 

The difficulty in arriving at any straightforward 


solution for the current density, where an arbitrary (or 
unknown) barrier shape e(x, V) is involved, is the evalu- 


23 The authors are indebted to R. Stratton for pointing out 


this correction to the theory. nd 
aA i . Appl. Phys. 34, 1793 (1963). — s 
25 To aa E 24) Eq. (20) some gocce. en 
the 8 factors from those of Simmons arise; however, in practic 


B*«1 over the range of interest. 
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ation of the integrals arising from the WKB approxi- 
mation. Stratton expands the functions around energies 
near the Fermi level to avoid the difficulty, whereas 
Simmons employs an approximate value for the 
integrals. 

The advantage of Stratton's theory is that it holds 
for nonzero temperatures and in the form given in 
Eq. (1) it is tractable to experimental analysis. The 
unknown functions 6,(V) and &(V) can be deduced by 
fitting the experimental current-density data, as func- 
tions of temperature and voltage, to Eq. (1) as shown 
in the following section. The disadvantage is the inac- 
curacy in the back current, which is not serious as will 
be indicated. 

The disadvantages to Simmons! theory are the restric- 
tion to absolute zero temperature and the experimental 
intractability. The advantage is that since electrons of 
all energies are taken into account the back current is 
more accurate. By combining the two approaches, some 
of the advantages of both can be retained. In particular, 
the average barrier height as defined by Simmons and 
the tunnel path length can be given by 


$—61=b1/ (26), (3) 
and 
(Ax)? 2be1/ad, (4) 


respectively, in terms of Stratton's functions which can 
be deduced from experiment. 

Finally, it is easy to show from Eq. (2) (using the 
values of the functions 6; and c; deduced in the follow- 
ing section) that at a given temperature the current 
docs not rise as rapidly with increasing voltage as 
implied by Eq. (1). The error decreases monotoni- 
cally with increasing voltage and at 0.10 V the correction 
attenuates the tunnel current only about 3%. For 
diminishingly small applied voltages Simmons’ and 
Stratton’s expressions (at T—0) are equivalent. At a 
constant voltage, the current is less than that predicted 
by Eq. (1). The error decreases monotonically from a 
maximum of about 2% to about 1% at room tempera- 
ture. The corrections to Eq. (1) stemming from the more 
accurate account of the back current are seen to be 
small. 


IV. EXPERIMENTAL RESULTS 
A. Temperature Dependence 


It will now be shown that the experimental current 
density data as a function of temperature and voltage 
can be fitted to the functional form of the temperature 
dependence in Eq. (1). The temperature dependence is 
given by 

A(T) € [xe £T /sin(re&T) ]. (5) 


The functional form of Eq. (5) arises from the Fermi 
distribution of electrons in the metal layers and will not 
fit unless the electrons tunnel directly from the metal on 
one side to the metal on the other side (since voltages: 
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The quality of the fit of the experimental current- r 

density data, as functions of temperature and voltage, 

to the functional form of the theoretical temperature 

dependence establishes that tunneling directly between 

the two aluminum layers is the predominant mechanism 

for the electronic conduction through the aluminum 

oxide film. 


A typical example of the fit of the data to the expres- y 
sion (5) for V «0.10 V is shown in Fig. 4(b). The devia- if / 
tions may arise from small thermal emf's which can 
produce internal voltages, but do not appear to be due ¥ 


to the back current [1.e., fit the transcendental tempera- 

ture dependence given in Eq. (2)]. The form of the 

experimental curve, shown as the dashed curve in 

Fig. 4(b), is characteristic of all the samples investi- 

gated at very small applied voltages. ^ 
The temperature dependence in Eq. (1) can be ex- 

panded for small temperatures in a binomial series with 

respect to mcıkT which gives? 


M aui c Ee 
m 


A aar 


50 100 150 200 300 400 


TEE - 
Vy a XU 


j 0) 
Aj jT) jO - — (web 
7(0) 
360 


(acik D e (6) 


Chow? has analyzed the temperature dependence of 
the current density in the Be-BeO-Au system by fitting 
the data to the quadratic dependence given in Eq. (6) 
and neglecting the higher order terms. The 7(0) was 
calculated, assuming the quadratic dependence, using 


T (°K) 


(b) 

Fic. 4. The dependence of the current density on the tempera- 
ture at (a) 0.50 V and (b) 0.05 V. The points are experimental 
data, and the curve, which is theoretical, corresponds to Eq. (1) 
[or the expression in (5) ]. 


two of the data points. The result of plotting logio 
Aj=logiol ;(T)— j(0)] versus logio T is a straight line 
of slope 2.04 up to 300°K. Similar results for the 
AI-Al;05-AI system have also be reported." Analyzing 
the temperature dependence data in this way is subject 


less than those required for Fowler-Nordhiem emission 
are employed). 
The coefficients bı and cı are determined, at any 


particular voltage, by fitting the experimental plot of d 
login 2(T) versus logio T to the theoretical plot of 
Eg. (5) which is logio [rcıkT/sin(rcıkT)] versus logio Fic. 5. The dependence ^ 


of the change in the cur- 
rent density at 0.50 V on 
the temperature. The 
points are experimental 
data, the dashed line is 
the T? approximation, and 
the curve, which is theo- 
retical, corresponds to Eq. 
(6) and includes the 7* 
term. 


(reikT). A typical example is shown in Fig. (4a). The 
value of c; is deduced from the temperature correspond- 
ing to mckT'—1. Using Eq. (1), bı can be calculated 
from j(0), which is deduced from the current density 
extraploated to abselute zero temperature correspond- 
ing to [mckT/sin(zaGkT)]-1. The quality of the 
fit shown in Fig. 4(a) for 0.50 V applied potential is 
characteristic for all voltages between 0.10 and 0.90 V. 
It should be noted that the scale of the ordinate axis 
has been expanded greatly compared to the abscissa 
in order that the relatively small (e.g., compared to 
Schottky) temperature dependence (between 30 to 50% 
change in going from liquid nitrogen to room tempera- 
ture) can be compared accurately with the theoretical 


j(T)-J(0). (Acm °) 


Aj 


50 100 200 
T(*K) 


400 600 


25 C. K. Chow, J. Appl. Phys. 34, 2919 (1963). 
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to considerable error. In order that the quadratic 
approximation hold, j(0) must be determined from the 
data over a small range of temperatures usually between 
78 and 150°K. This may result in some inaccuracy in 
j(0) and considerable deviation of AJ(T) from the 7? 
line over the range of temperatures where the 7? approxi- 
mation is valid (cf. Chow's?e Fig. 1). However, (0) can 
be determined with good accuracy using the technique 
described above. Using the j(0) determined from Fig. 
4(a), a plot of logioA j versus logioT is shown in Fig. 5 
for the experimental data shown in Fig. 4(a). Results 
quite similar to those of Chow are obtained except that 
good agreement with the T? linear approximation 
(shown as the dashed line in Fig. 5) is obtained for tem- 
peratures less than 100°K. The solid curve is a plot of 
the theoretical temperature dependence as given by 
Eq. (6) including the T! term, using the value of 
£17 16.0 eV determined from Fig. 4(a). A straight line 
of slope slightly greater than two can be fitted to the 
data in Fig. 5, but in fact it is seen that the deviation of 
the experimental data from the 7? line is due to the T* 
term in Eq. (6). 


B. Functions b,(V) and c(V) 


The coefficients bı and c; are functions of voltage and 
arise from the WKB approximation. The specific volt- 
current characteristic for a given barrier potential 
€(x, V) can be given from Eq. (1) by evaluating bı and 
cı as functions of V. Stratton? has developed Eq. (1) 
further for small applied voltages, while maintaining 
an arbitrary barrier shape, by expanding bı and cı as 
the following quadratic power series in qV 


bi bio— b (qV)-- bis (qV)?, (7) 


18.0 


LA ssec [^] 


16.0 


X 19.0 
a 


2 
«^ EXPERIMENTAL bj N 


a 
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v(v) i 
cients bı and cı on voltage. 
oo ae RM data by fitting the 
heoretical curves are quadratic 
to Eqs. (7) and (8). 


Frc. 6. The dependen 
"The points are deduced from 
temperature dependence. The the 
power-series expansions corresponding 
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Fic. 7. Low-voltage Ohmic region. The current as a function 
of small applied voltage at T=78.9°K. 


and 
&7 60—61(qV)-Fe(qV). (8) 


By fitting the experimental current density data to 
the theoretical temperature dependence for a set of 
applied voltages in the range O<V<1.0 V, b and c 
can be deduced as functions of V. The results are shown 
in Fig. 6. The experimental curves are plots of by and cı 
deduced from the experimental data. The theoretical 
curves are the quadratic power-series expansions given 
in Eqs. (7) and (8) which have been fitted to the experi- 
mental curves for small applied voltages. The coeffi- 
cients for the power series are listed in Table IT. The 
region of applied voltage for which the quadratic power- 
series expansions in V, of the integrals arising from 
the WKB approximation, are valid, is seen to be 
0.0<V<0.3 V. 

Although c; can be represented approximately by the 
low-voltage quadratic over a substantially larger range 
of applied voltage, 5; cannot. A plot of the current den- 
sity as a function of voltage [cf. Fig. 2(a)] suggests a 
nearly linear approximation for b; in the range 0.2 V 
«1.0 V, preceded by an Ohmic region (cf. Fig. 7). 
Stratton,? in analyzing the earlier data of Fischer and 
Giaever” and Pimbley,” fitted the quadratic power 
series for b; out to about 1.4 V. This procedure is equiva- 
lent essentially to fitting a parabola to the lower part 
of the experimental curve for 5; shown in Fig. 6, which 
does not converge to the experimental curve for 5; for 


"TABLE II. Quadratic power-series coefficients. 


b ci (eV)! 


by) = 22.37 Cig = 15.38 (eV)71 
5177.56 (eV) 611.32 (eV)? 
bis 1.4 (eV)? £135.29 (eV)? 


* J. C. Fisher and I. Giaever, J. Appl. Phys. 32, 172 (1961). 
23 W. T. Pimbley (unpublished). 
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Fic. 8. The experimental dependence of the current density 
on the voltage at room, liquid nitrogen and extrapolated to ab- 
solute zero temperature. The theoretical curve was calculated, 
assuming the quadratic power-series approximations for b; and 
c, using Eq. (1). 


small V. The coefficients of the quadratic expansions 
are related to physical properties of the potential 
barrier.? The numerical values for the physical proper- 
ties reported by Stratton, therefore, are in error. 
The low-voltage quadratic expansions given in Eqs. 
(7) and (8) using the coefficients in Table II were used 
to calculate the theoretical volt-current characteristic 
for T=0°K shown in Fig. 8. The experimental character- 
istics for room, liquid nitrogen and extrapolated to 
- absolute zero temperature are also shown. It is apparent 
that the theoretical curve is a good representation for 
-0<V<0.3 V, which is the range of voltage for which 
— the quadratic power-series expansion for bı is valid, 
and that the divergence between the theoretical and 
xperimental curve for T=0°K results from the func- 
tional form of bı shown in Fig. 6. 
he functional form of 6; and c1 shown in Fig. 6 for 
.3 V is not expected to be universal. In particular, 
functional form shown here pertains to an oxide 
by the technique described in Sec. ITA. 


€. Physical Properties of the Barrier Layer 


e numerical values of the coefficients in Table II 
elated to the physical properties of the barrier 


* 
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when a particular barrier shape is assumed. Using 
Simmons’ expression for the tunnel path length given 
in Eq. (4), bı and cı given by Eqs. (7) and (8), and the 
coefficients from Table II, it is easy to show that Ax is 
approximately constant over the range 0.2« V «0.6 V 
which is consistent with a trapezoidal (or rectangular) 
barrier shape. 

Using the asymmetry in the experimental current 
density with respect to the polarity of the applied poten- 
tial, and the relation for asymmetrical barriers that? 


birt b21= C10 


a value of 031— 7.82 (eV)~ is obtained. 

Assuming a trapezoidal barrier shape for e(x,V), 
integrating, carrying out the Taylor expansions with 
respect to energy and the binomial expansion in Eq. (7) 
(and assuming a corresponding series for b2) leads to the 
following relationships between the physical parameters 
and the coefficients: 


Qa x32 — x,'tl? 
b19— (= ) , 


> 


3 X3 — Xr 
11/2 7 , (9) 
bii (aLxy!? —b))/ (X? — Xj), 
ba = — (o.LX,? — b19)/ (x»' — Xy) " 
where 
X47 X4 - AX ; n=1,2. (10) 


Xy and Xy are the effective heights of the trapezoidal 
barrier above the Fermi level on the side adjacent to 
metal 1 and 2, respectively, which differ from the true 
metal-insulator work functions X; and X» by an amount 
AX. It will be assumed that the barrier is a result of 
two unequal metal-insulator work functions and the 
image force. In Simmons"? evaluation of b in terms of an 
average barrier height and an effective tunnel path, the 
image force barrier essentially is replaced by a rectan- 
gular barrier which is lower in height than the image 
force barrier by [cf. Simmons"? Eqs. (33) and (40)] 


Ax — [145XL/ (x5—23).] In[gs(L—23)/21(5—22)] (11) 
for zero applied voltage, where 
A= (¢?/8mxL) In2, 


and L is the thickness of the insulating film. The quan- 
tity Ax—0.11 eV for the parameters deduced from the 
experimental data. Using this value of Ax as an approxi- 
mate correction term in Eq. (10), solving Eqs. (9) 
simultaneously and substituting the values for b10, 015 
and bz yields 


X,—0.78 eV; X2=0.89 eV 
which will represent a slight underestimate, and 
LEPSIE 


2J. G. Simmons, J. Appl. Phys. 34, 2581 (1963). 
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assuming an effective electron mass in the insulator 
m*/m=1. Meyerhoffer and Ochs have pointed out 
that the effective mass theoretically should be near to 
one.?? Assuming a dielectric constant of 8.0/5 the 
thickness of the insulating layer calculated from capaci- 
tance is 25.7 Á, and the tunnel path length, calculated 
from Eq. (4), is 25.6 A. The aluminum film correspond- 
ing to x1 is the one on which the oxide was grown. 
Greater currents are passed when this layer is positive, 
which is in agreement with other observations." 

A plot of the experimental data for a thicker aluminum 
oxide (formed in an oxygen glow discharge) and larger 
applied voltages is shown in Fig. 9. The data are plotted 
as log I/V? versus 1/V for comparison with the Fowler- 
Nordheim theory? Including the image force correction 
the slope (for large V) results in a value of X5— 0.85 eV, 
which is in good agreement with the value deduced 
from the low-voltage measurements. 

[Note added in proof. Recent investigations on the 
thicker aluminum oxides at larger applied voltages 
(similar to specimens used to obtain the data shown in 


0.4 0.6 0.8 1.0 12 
Vv. (7!) 

Fic. 9. A Fowler-Nordheim plot for a thick aluminum oxide 
film. The slope leads to an approximate metal-insulator work 
function of x2=0.85 eV. 

i 245 (1957). 

a i ER Ce EDIT eR Rom 23, 1423 

(1962). 
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Fig. 9) indicate considerably larger effective barrier 
heights and smaller effective thicknesses than are ob- 
tained from the analysis of low-voltage data. The 
barrier heights given above are based on the assump- 
tion of a trapezoidal barrier shape which does not 
appear to be an adequate representation for all 
voltages. ] 


V. CONCLUSIONS 


The current density through aluminum oxide films 
less than 30 Å thick, sandwiched between two alu- 
minum films, is presented as a function of temperature 
and voltage. The analysis of the temperature depend- 
ence of the data established that tunneling directly 
between the two aluminum layers is the predominant 
mechanism for electron conduction through the alu- 
minum oxide film. 

The data are analyzed using the theory of Stratton. 
The theory is modified to take more accurate account 
of the back current and compared with the theory of 
Simmons. Both the modified and Simmons expressions 
are intractable to experimental analysis. It is indicated 
that the errors involved in using the simplified version 
of Stratton's theory are small, therefore, the original 
form given in Eq. (1), is employed. 

The region of validity is established, for quadratic 
power-series expansions in V, of the integrals arising 
from the WKB approximation. Fitting the quadratics 
to the experimental curves for these integrals results 
in approximate values for the heights of the potential 
barrier above the Fermi level and the barrier thickness, 
when a trapezoidal barrier shape (modified by image 
forces) is assumed. 

The actual shape of the potential barrier cannot be 
extracted with any degree of confidence using only the 
low-voltage current-density data as a function of tem- 
perature. Therefore, even though the preliminary 
Fowler-Nordheim analysis results in a barrier height 
in agreement with the value obtained for an image force 
barrier, a more elaborate investigation is required. 
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Interaction of Electron on the Donor Center with the Crystal Lattice 
in Germanium and Silicon 
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'The diagonal matrix element of interaction of the bound electron with longitudinal acoustic phonon, 
squared and averaged over all directions of the phonon wave vector, was calculated for n-type Ge and Si 
assuming a single-deformation-potential constant. The donor wave functions have been taken in the elipsoidal 
form. Some simple integrals containing the squared and averaged matrix element are given and are used, as 
an example, to get the one-phonon transition rate in the impurity conduction by “hopping” and the re- 
laxation energy of the lattice deformed by interaction with the donor electron. Appreciable differences are 
found between the case of elipsoidal and spherical donor wave functions. 


INTRODUCTION 


gp electron bound to the donor center in the 
homopolar crystal produces a deformation (a 
change of the equilibrium positions of the crystal 
atoms) in the vicinity of the center because of the 
interaction with acoustic phonons. This deformation 
influences several physical processes in the crystal. It 
changes the one-phonon transition probability and 
gives the possibility of the multiphonon processes in 
the case of the impurity conduction by “hopping. ”!=4 
A similar effect occurs in the case of the thermal re- 
combination of donors and electrons.’ The deformation 
of the crystal lattice gives also the difference between 
the energies of thermoionization and photoionization 
of donors.5 In the process of absorption of the electro- 
magnetic wave by the bound electron this deformation 


Kx UG 


Fic. 1. The dependence of the function U (7) on 1— [&/ (e4-1) ]? 
[1+ (4/277) ] 12. For | — 1 the asymptotic behavior is (1—/?)=3/ 
24 and (1—7)-3/18 for Ge and Si, respectively. 


1 J. Mycielski, Phys. Rev. 125, 46 (1962). 

2 J. Mycielski, Phys. Rev. 125, 1975 (1962). 

3]. Yamashita and T. Kurosawa, J. Phys. Soc. Japan 15, 
802 (1960). 

4N, F. Mott, and W. D. Twose, Advan. Phys. 10, 107 (1961). 

5 H. Gummel and M. Lax, Ann. Phys. (N. Y.) 2, 28 (1957). 

$M. F. Deigen, Zh. Eksperim. i Teor. Fiz. 31, 504 (1956) 
[English transl.: Soviet Phys.—JETP 4, 424 (1957) ]. 


gives the possibility of the simultaneous absorption of 
photon and emission of phonons and so has an influence 
on the shape of the absorption lines.’ In all papers 
referred to above!” taking into account lattice-defor- 
mation effects requires computation of the diagonal 
matrix element of deformation potential of the phonon 
in the ground state of the electron on the isolated 
donor. This matrix element squared must then be 
averaged over all directions of the wave vector « of the 
phonon. The result in the case of the isotropic sound 
velocity is proportional to the function U(r) which is 
by definition 


UC x) Í | iE Fw eperonunex| da. a 


if we assume a single deformation-potential constant 
and we are interested only in the interaction of electron 
with longitudinal acoustic phonons. By u(r) we denote 
the ground-state wave function of electron on impurity 
center in nondeformed crystal lattice and we define 
z—TQ. 

The function U(r) was obtained in the referred 
papers-7 assuming in most cases the isotropic con- 
duction band (the hydrogen-like donors). The purpose 
of the present work is to obtain the function U(r) in 
the case of donors in Ge and Si. We simplify the prob- 
lem assuming a single, isotropic deformation-potential 
constant and the isotropic velocity of the sound and 
taking into account only longitudinal acoustic phonons. 
The results of this paper can be easily used in another 
materials with similar shape of the conduction-band 
minima. The paper contains also same simple integrals 
with the function U(r) for Ge and Si. We used them, 
as an example, to get the corrections to the results of 
the Ref. 2 which deal with the one-phonon transition 
rate in the impurityc onduction by “hopping.” We 
obtain also corrections to the lattice relaxation energy- 


CALCULATION OF THE FUNCTION U(^) 


In Ge and Si we have in the presence of the chemical 
shift the donor ground-state wave functions of the 


* M. Lax and E. Burstein, Phys. Rev. 100, 592 (1955). 
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form? 


ur) - n1? Z F,(r)o,(r). (2) 
q71 


The sum is performed over the n minima of the con- 
duction band (z equals 4 and 6 in Ge and Si, respec- 
tively). 4 is the Bloch function for the gth minimum 
and 7, is of the form 


F(2,3,2) — (rab)! 
Xexp(—[(4-5)/e--z/0 y, (3) 


where the z axis is parallel to the axis containing the 
qth minimum. The constants a and b are the transverse 
and longitudinal radii of the orbit, respectively. 

With (2) we have 


IE (r) exp(irQr)u(r)d?x 


== | 2 FpFpp*p, exp(irQr)d’x. (4) 


p.4=1 


Neglecting terms with pq because of the rapid oscil- 
lation of exp[r(x,—x:5) ], where x, is the wave vector 
for the gth minimum, and performing the integration 
we obtain 


A1103 


10? 10? io! [ 
T (a’) 


Fic. 2, The function U (7) versus wave number r of the phonon. 
For r — æ the asymptotic behavior is 367.4% K— and 14.81» K^* 
for Ge and Si, respectively. 


For arbitrary 7 we have 0X P « 1. 
From (1) and (5) we obtain 


n 


[uo exp(irQr)u(r)d!x — n » du (5) U(r)=n> P Ip (7) , (10) 
= zu 
3 where 
where A 
f," K-[1— P(Qp,?]**. (6) palT) = Gn [ fof (11) 
Meidenotein D Performing the integral in (11) and denoting p,p, 
K-14 ig, (7) (5,9) we obtain 
P— [o/ (e4- 1) [12- (4/2272) ]H , (8) Up= (A7K*) (ML (p,q) MU, —(5,)]), (12) 
a= (a/b)?—1. (9) where 
"eem dt (7—P)[1— (p) ]— 2) 6? (p,qL1— (6,9)"] 
TI T 2 
HE [1— (5,9* 1 —5EQ — Cpa) 
(C18 PF ODT- (9,9) H-3)2-P-P (9) HSI (Qu) E1- (7 
+ (x/16) [1— (p,q) P^[2— 2 — P (5,9) e - 
(12-D(L1— (5,9) ]EL2— P— 7 (5,9) ] ^3-IE1— (6,9) 7] + EG —P)E1— (6,9) 17 Fh (13) 
X DNI; TID E- E61 (69: ]H-La BEI (9 T2) 
In the case of (5,9) — -E1, pa reduces to 
3—40P+15 5 1+ 
AAE n e 7). (14) 


48(1—P) 32 


* W. Kohn, 


1—1 


in Solid-State Physics, edited by F. Seitz and D. Turnbull (Academic Press Inc., New York, 1957), Vol. 5, p. 257. 


CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


| A1104 A. MYSZKOWSKI AND 5S. GÓMULKA 
i 
For 7 0(1— 0), we have from (12) and (13) u5,— 1 for all p and q and so U(r)=1. For z c, (l> [o/ (e4- 1) 2) l 
we have upa — 0 and U(r) — 0. - S 
i In the isotropic case (a=b, a=0, 1—0) we have simply p= K^* and U(7) =R j 
If we expand f, and f, in powers of / and perform the integral in (11) we get instead of (12) 
A 


CJ T 5 2 
per East +x E( ‘Vogt (p,q)? J= Or—2s+21—1)!!(2s—21—1)!!. (15) 
r=0 (2r+1) !! s=0 1=0\ 2l 


GERMANIUM AND SILICON | / 
In the case of germanium n= 4, (5,9) 74-1, +3. From (10), (12) and (13) we obtain , 
Ue (erue LU 10 ul 27L(3— 22) (3— 2) (19— 41) 4- 6I* ] 
P 3(1—P)3 io ile (1—P) 3—22} (3—1) 
UO aret (1+){(3—DL(3—2P)'?+1J+3(1—P)} , 


MOABE "ID G--)EG-2Py2—7H-30—P) 
9L(65—2P) 8- +6] (1 +DLGHDE2 (8 — P) *4-21)4-6(1—2)) 
2s] (3— Bere " (—D(G-—D[2/ (3 —5)5—274-6(1—2)) 


(16) 
or from (10) and (15) 
Uee(r) = K-1(1.0004- 1.3332-4- 1.644/44- 1.943/9-1-2.24713-1-2.567194-3.191/24-3.285144-3.608/54- -.-). (17) 
In the case of silicon n=6, (p,g)=-1, 0. From (10), (12) and (13) we obtain 
33—40P+154 15 141 24(5— I) 
U si(7) Eu Ka +— In 
aP) 2% 1-1 (1—B)Q—DP) 
3[ (22—P) Gtalk (1+)((2—P)'!?+1+1—-P] 


=F n à 
1(2—P)5/2 =| Opie Esp 
2d or from (10) and (15) ( ) (1—DL( ) Hr ] 


Usi(7)-— K—(1.0004-1.3337-1- 1.644/5-- 1.943/8-1- 2.26318-1- 2.628/19-1- 3.048/2-1- 3.528/4--4.069]8.----). (19) 


Dependence of K*U (7) on / for Ge- and Si-type materials is plotted on the Fig. 1. Taking a=18.8, a= 70.8 A 
for Ge and a— 4.2, a—22.1 A for Si we obtain the dependences of U(r) on r given on the Fig. 2. 


(18) 


PARTICULAR INTEGRALS | 


We calculate here the integrals of the type Jo” U(r)r*dr for x=0, 1, 2, 3. We use formula (10) and upg in the Us 
form (14) for (5,9) —-E1 or in the form (15) for (p,q)4+1. 4 
In the case of germanium we obtain 


Z 15r(ip o T! a qu a a Ê 
| Us (ode — 1 — | arc sin] — | + 1.000+-0.1667——+-0.05722] — | 
0 64013 a+ a+1 a+1 a+1 


+0.02560[a/ (+1) P+ 0.01327 [0/ (+1) +0.007581[0/ (a-1- 1) ]$ 
+0.005640[e/(a+1) ]92-0.002991[a/ (0+1) 4+- - - | , Q0) 


; 1 1 1/2 ute: s 
f Datri Ld AE 
0 2a? L6La--1 1— [o (a4-1) ]? 


+0.08190[a/ (a+ 1) F+0.04855[a/ (a-1-1) ]:2-0.03081 [a (a-I- 1) ]* 


a a P 
SF 1,000-+0.8333-—+0.1526] — | 
a--1 


a-F1 


-F0.02500[/ (a--1):]*-- 0.01428 [a/ (o-1-1)-- | EO 
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a 2 a 3 
LE — soi] —— | 
a--1 a+ a--1 


+0.1195[a/ (a4- 1) ]*--0.08340[/ (a+ 1) P+0.07332[a/ (a1) ]* 


ouium 


-F0.04487[«/ (a--1) Jj +0.03187[a/(a-+1) P+ - - - | , Q2) 


$ 1 1 -112 
| a fare arene | a ] 
2 l6 12La+1 


e E c 


a 2 
-+1 000+-0.6667-—--0.4578| — | 
a+ 1 


+0.3276[a/ (a-- 1) P+0.2427[a/ (a+ 1) ]2-0.1848[a/ (a-I-1) ]*2-0.1750[a/ (a--1) 


In the case of silicon we obtain 


+0.1142[a/ («4- 1) F+ 0.08591[a/ (a--1) P+ 


h. (23) 


e Sr (if o "r1 a 7" a a T a P 
| Usi(r)dr 2 — E | | arc si ] 10004-01667 00559] —— | +0022] — ] 
0 24a\2La+1 atl a--1 a+1 atl 


4 (ip a T 
f Usi(t)rdr= Fl ] In 
0 9a? \4La+1 


+0.01133Le/(a-+1)}!+0.006138fa/(a+1) ]92- 0.003602[a/ (a-1-1) +- - - | , Q4 
1--[o/ (a--1) ]V? a a F 
+1.000+0.3333—+ 0. 1467] | 
1—[a/(a+1) }!? a+l a-r1 
+0.07429[a/(a+1) P+ 0.04145[a/ (a-l- 1) ]12- 0.02494 (a-I-1) ])H-0.01597[a/ (a-1- 1) +- - 1, (25) 


2 T 
| Usi(s)rtr=— 5 
0 6a? 


2o 8 a que 
| Us (eedem (a+1 )+- {= —] In 
0 8La+1 


a+1 


Taking a=18.8, a=70.8A for Ge and a= 
=22.1 A for Si we obtain values given in the Table I. 


2 3 
(a+1)!/2-++1.000-+0. 5000——+0. nis] —— ] +0.1625| — | 
a+1 a1-1 


+0.1019[e/(a+1) ]14-0.06752[o/ (a-1- 1) ]*2-0.04683[a/ (a-1- 1) ]-.- - - - | , (26) 
1+[o/(a+1)}* a 7 
————— xr Les: 6667 ——+0. ai — ] 
1—[a/(a+1) ]? atl a1 
4-0.2971[o/ (a-1- 1) P+0.2073[a/ (a-1- 1) ]21- 0.1497 [0 (a-1- 1) ]*2-0.1118[a/ (a+1) f+- - | n GR 
4.2, a donor b is 
W oai*°*= W oa! (1— A) exp(—a), (28) 


CORRECTIONS TO THE ONE-PHONON “HOPPING” 
PROBABILITY AND LATTICE 
RELAXATION ENERGY 


According to Ref. 2 the total one-phonon transition 
rate for the electron transfer from the donor a to the 


TABLE I. Values of the integrals of the form 
Ja? U(r)r"dr for n-type Ge and 


k=0 k=1 k=2 k=3 


Material (cm™) (cm?) (cm?) (cm) 
= i 105 2.4 102 6.1 1015 2.5 105 
S ES 105 2.05x 1015 1.4x 10» 1.4x 1077 


ee eee 


where W;; is the one-phonon transition rate if we 
neglect the dependence of the equilibrium positions of 
the lattice atoms on the state of the carrier. a and A 
are the quantities described in Ret. 2. In particular for 
low absolute temperature T 


Ej iB U (red 29 
TUE zaJ, T)TOT , ( ) 
hE? o d 
Aquam ————— Í U(r)rdr. (30) 
Ja vA Eus d Jo 


E, is the deformation-potential constant, AEs the A 
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"TABLE IT. Corrections to the one-phonon “hopping” probability 
and lattice relaxation energy for n-type Ge and Si. 


(W pattot/ 3 
Material aT40 Ato Wa!) 7-0 Err (eV) 
n-Ge 0.059(0.08) 0.067 (0.08) 0.82 (0.78) 2.5 X1075(3.3 X1075) 
n-Si 0.43 (0.56) 0.39 (0.59) 0.24(0.10) 0.80 X1073(1.1 X 107) 


difference of the electrostatic energy of the carrier in 
the two donors, v the velocity of sound in the crystal, 
and d the density of the crystal. 

In the Table II we give the values of arso, Aro 
and (Wie/Wa)p,e obtained from (28)-(30) and 
from data given in Table I for n-type Ge and Si samples 
studied in Ref. 2. We give also the values of relaxation 


PHYSICAL REVIEW VOLUME 


INTRODUCTION 


N this paper! we are concerned with the determina- 
tion of the conductivity effective mass, which is un- 
derstood to be the effective mass entering in the free- 


carrier mobility 
u= (e/mP)(7), (1) 


where e is the electronic charge, me is the conductivity 
effective mass, and (7) is the relaxation time averaged 
over all of the free current carriers. The determination 
of this parameter hás received increasing attention in 
recent years. The earliest and the most extensive in- 
vestigations to date have utilized cyclotron resonance 


*Presently with Energy Conversion, 
Massachusetts. 

1 This paper is based upon a dissertation by the author sub- 
mitted in partial fulfillment of the degree of Doctor of Science to 
the Electrical Engineering Department of the Massachusetts 
Institute of Technology and published as Scientific Report No. 7 
under Contract No. AF 19(604)-4153 (Jan. 1962). Air Force 
nbridge Research Laboratory document number AFCRL- 


Inc, Cambridge, 
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AND S. GÓMULKA 
energy Erg of the lattice deformed by interaction with 
the donor electron, which is? 


E r W ; 
] U(r)rdr. 
4rd Jo 


In Table II we give in brackets for comparison the 
values obtained previously? in approximation of hy- 
drogen-like, spherical donor. 


Err= 


(31) 
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Measurement of the Conductivity Effective Mass in 
Semiconductors Using Infrared Reflection 
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There exist various approximate relationships between the conductivity effective mass of free carriers mte 
and the angular frequency wo of the reflectivity minimum in the infrared arising from the free-carrier dis- 
persion. A detailed analysis of the reflectivity equation shows that it is possible to obtain a relationship 
between zr and wo when in addition one has a knowledge of the free-carrier concentration, the drift mobility, 
and the dielectric constant of the semiconductor at very high frequencies. This relationship is shown to 
yield a value of m.*=0.145 for n-type germanium at room temperature using data presented in the literature. 
This value compares well with the value of m.*=0.15 obtained using a combination of reflection and trans- 
mission measurements. The experimental work required to obtain accurate values for m using this relation- 
ship is considerably less than that required by other techniques, such as cyclotron resonance, magnetoplasma, 
and combined infrared reflection and transmission measurements, which are presently used. Moreover, 
this procedure can be applied over very broad ranges of temperature and of free-carrier concentration. 


techniques. For example, the conductivity effective 
mass has been measured at liquid-helium temperature, 
where the relaxation time of free carriers is sufficiently 
long for both germanium and silicon? Recently in- 
frared techniques also have been employed for this de- 
termination. In one instance, the conductivity effective 
masses in germanium, silicon, and indium antimonide 
were obtained from a combination of reflection and 
transmission measurements conducted at room tempera- 
ture.* In another instance only reflection measurements 
were required, in conjunction with varying intensities 
of magnetic field, to determine the conductivity effective 
masses of mercury selenide and indium antimonide.* 
We are proposing in this paper that it is possible to 
determine m, very accurately simply from a measure- 


(195) Dresselhaus, A. Kip, and C. Kittel, Phys. Rev. 98, 368 
* R. Dexter, H. Zeiger, and B. Lax, Phys. Rev. 104, 637 (1956): 
* W. Spitzer and H. Fan, Phys. Rev. 106, 882 (1957). 

5 B. Lax and G. Wright, Phys. Rev. Letters 4, 16 (1960). 
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CONDUCTIVITY EFF ECTIVE 
ment of the frequency of the reflectivity minimum in the 
far infrared. Through a straightforward differentiation 
of the reflectivity equation, utilizing the dispersion rela- 
tions, and the evaluation of this derivative at the mini- 
mum, a relationship is obtained which can be solved for 
me using an iterative procedure. 


FREE-CARRIER DISPERSION 


In the far infrared, at wavelengths longer than that 
of the intrinsic absorption edge, electromagnetic radia- 
tion interacts with the free carriers of the material and 
undergoes dispersion. The theory of this free-carrier dis- 
persion is developed in detail in a recent book by Moss.’ 
He has shown that both classical and quantum mechani- 
cal analyses lead to the following relations for the real 
refractive index 2 and the absorption index (extinction 
coefficient) & of free carriers of one type only (i.e., holes 
or electrons): 


Ne 7 $ 
n?— k — en = —— ——— (2) 
meg 1+a*7? 
and 
Ne T 
2nko = — ——— (3) 


? 
meg 1--o?7? 


where e, is the relative dielectric constant of the medium 
in the absence of any contribution from free carriers at 
very high frequencies, W is the free-carrier concentra- 
tion, eo is the permittivity of free space, m is the effec- 
tive mass of the free carriers, and w is the angular fre- 
quency of the infrared radiation. 

Several points should be noted with regard to Eqs. 
(2) and (3). It has been shown for the energy-band 
model in which ellipsoidal surfaces of constant energy 
lie along rotation axes of the cubic point group, that the 
effective mass which enters in these expressions is the 
conductivity effective mass, me.” In this case, nte is de- 
fined by the relation 1/71,— 3 (1/mid-2/m), where my is 
the value of the effective-mass tensor associated with 
the longitudinal axis of the ellipsoidal energy surface 
and m is that associated with each of the two trans- 
verse axes of the surface. Secondly, we note that the 
relaxation time 7 is assumed to be independent of fre- 
quency and that the total frequency dependence is ex- 
hibited explicitly by the quantity e. This assumption 1s 
not restrictive since the relaxation time is determined by 
the temperature of the crystal, its energy-band struc- 
ture, and the density of imperfections in the lattice. 


it should be noted that the relaxation time 7 


Finally, : i 
is necessarily assumed to be energy-independent. This 


follows from the fact that the terms containing T have 
not been averaged, ( ), with respect to the distribution 
function. This last assumption requires additional con- 


erlies of Semiconductors —a Semicon- 


6T. Moss, Optical Prop bs Scientific Publications, London, 


ductor Monograph (Butterwort 
1959), Chap. 2, p. 29. 
7 Reference 6, p. 31. 
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sideration. For example, we wish to establish the con- 
ditions under which + may be replaced by (7) in Eqs. 
(2) and (3). 'The details of the analysis for establishing 
these conditions are presented in Appendix A. How- 
ever, in summary if the relaxation time is assumed to be 
proportional to a power of the free-carrier kinetic 
energy r- roE', it is possible to compare the exact 
terms (7?/(1+w*r*)) and (7/(1--o?72)) with the more 
convenient substitute terms (7)?/(1-Fo*(7)) and ¢7)/ 
(1+-w%7)*). Such a comparison shows that the latter 
terms can be substituted for the former under the follow- 
ing conditions with no introduction of error: 

(i) when r is independent of energy, s=0, as in the 
case of neutral impurity scattering, 

(ii) when there is total degeneracy and the kinetic 
energy of all carriers may be considered to be the 
Fermi energy £y. 

The substitution can be effected under the following 
conditions with an introduction of error «1397: 

(iii) when s= —3, as in the case of scattering arising 
from acoustical modes and from dislocations, for all 
degrees of degeneracy where wr21 while the degree of 
degeneracy required increases from 73 —4 to 7=0 as 
wr decreases below unity, 

(iv) when s=4, as in the case of optical-mode scat- 
tering above the Debye temperature, for all degrees of 
degeneracy and for all values of wr, and 

(v) when s—$, as in the case of scattering by ionized 
impurities, only under the condition of total degeneracy. 
These conditions are not considered to be overly restric- 
tive but, in fact, to be rather generally satisfied. This 
consideration follows particularly in view of the fact 
that in order to observe the free-carrier reflectivity 
minimum at convenient wavelengths (230 u), it is 
often necessary to utilize free-carrier concentrations 
which result in at least partial, if not total, degeneracy. 
For these reasons, in our further consideration of Eqs. 
(2) and (3), we will assume that one of the conditions 
(i)-(v) is fulfilled and that + may be replaced by (7) 
with an introduction of error <13%. 

At this point, it must be recognized that the total 
dispersion of incident radiation in the far infrared arises 
from interactions with the free carriers and with the 
lattice as well. This latter dispersion is important in 
polar crystals. The complete dispersion relations, which 
include the contributions of both the free carriers and 
the lattice of a polar crystal AB, may be expressed as 
follows*?: 3 


Ne wg 
MOM Rt 


= PONO? (cà —o)r? ( ) 
3 ] Mey zur 


8 Reference 6, Chaps. 2 and 16. 
9B. Szigeti, Trans. Faraday Soc. 45, 155 (1949). 
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and 


Ne er) 


Mego? (1-++-w*(r)?) 
WT] 


En t2 ? VO? k 
+| | Ee = ©) 
3 Meo Lw’ (w — a)r? 


In both Eqs. (4) and (5) the last term is the contribu- 
tion made by the ions of the crystal. The parameters in 
the other terms have been defined following Eqs. (2) 
and (3). The new parameters are as follows: N, is the 
concentration of atom pairs AB; Q is an effective charge 
defined by Q=SZe, where e is the electronic charge, 
Z is the valence of the compound, and S is a measure of 
the extent to which the ions of charge Ze are capable of 
being polarized; M is the reduced atomic mass 1/M 
—1/M4--1/Mp; w: is the angular frequency of the 
transverse lattice vibrations (Restrahlen); and 7; is a 
relaxation time related to the damping of the lattice 
vibrations. 

The relative contribution of the lattice to the total 
dispersion varies from one semiconductor to another. 
Moreover, it even varies for a single semiconductor de- 
pending upon the íree-carrier concentration. Conse- 
quently, this contribution must be evaluated for each 
individual case in order to determine if its effect is 
significant in the over-all dispersion. It is clear, however, 
that in several instances the lattice polarization may be 
neglected when analyzing the dispersion in the far in- 
frared. For example, lattice polarization does not enter 
in connection with the elemental semiconductors, ger- 
manium and silicon. Moreover, a combination of several 
conditions, such as, a small value for e,, a large re- 
duced mass M, a slight polarizability (S1), and a 
large free-carrier concentration (NZ 10? carriers/cm?), 
can serve to make the effect of the lattice polarization 
completely negligible. 

Because various combinations of these conditions are 
quite often satisfied, the total dispersion in the far in- 
frared can oftentimes be considered to be described by 
the simpler Eqs. (2) and (3). In the subsequent analysis 
we will make this assumption, namely, that the lattice 
contribution to the total dispersion is negligible in the 
frequency range of the reflectivity minimum arising 
from free-carrier absorption. 


2nk- 


REFLECTIVITY MINIMUM 


. The reflectivity of a material for normally incident 
idiation upon the surface of an infinitely thick sample 


: R-[(—)--E /Lo-- 0*8], (6) 


where n and k are described by Eqs. (2) and (3). This 
tion has been differentiated with respect to the 
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form: 
n?—k?= e— B/ (14-9?) (7) 
and 
nk= B/20(14-Q?), (8) 
where 


B-Ne(ry/m.s and Q=w(r) 


and assuming that (7) is not a function of frequency, 
The condition of zero slope at the reflectivity mini- 
mum then yields an equation for the effective mass in 
terms of the frequency of the reflectivity minimum and 
other measurable parameters. The equation that re- 
sults is the following: 


3«—1 (1/02)4-54-80? e 
mee 3— VIS 
4e,(e,—1) (13%) 2€5(e,—1)? 
(14-29?) C? 
a OE =0, (9) 
(14-39?) 4€, (e, — 1)? (14-302) 


where m-*=m,/mo, mo is the free-electron mass, 
C-—Ne'/meoexog, Q=wo(r), and wo is the angular fre- 
quency of the reflectivity minimum. [ The derivation of 
this equation is outlined in Appendix B. It is based en- 
tirely on Eqs. (6)- (8), and therefore depends on their 
applicability to a particular material system for its 
own validity. ] 

This equation is quite general in that no approxima- 
tions are made with respect to the magnitude of Q nor 
with respect to the magnitude of e. [ However, it is 
true that Q must staisfy the conditions under which the 
substitution of (7) into Eqs. (2) and (3) was effected. ] 

We conclude that there are very broad conditions for 
which Eq. (9) represents an exact relationship, or very 
nearly so, between the conductivity effective mass and 
the frequency of minimum reflectivity and other meas- 
urable parameters. Equation (9) can therefore be used 
for the determination of me. This can be illustrated by 
considering the reflection curve for n-type germanium 
reported by Spitzer and Fan (S-F) and which is re- 
produced here as Fig. 1. The sample investigated by S-F 
had a carrier concentration of 3.9X( 1018 electrons/cm’, 
U&7520 cm?/V-sec," e,—16 and a reflectivity mini- 
mum at Ao=23 u. By making an initial estimate for the 
conductivity effective mass, say m,*=0.12 as deter- 
mined by cyclotron resonance, one can obtain an initial 
estimate for Q. Using this value of Q and the values for 
N, e,, and wo listed above, Eq. (9) can be solved for 
m,*. This enables one to calculate a revised estimate 
for € and the procedure is repeated until the solu- 
tion of Eq. (9) yields the same value for m as 


1 Tt is recognized that we are here employing a Hall mobility 
when actually the drift mobility is called for. However, according 
to the S-F report, the Hall coefficient was measured at 77°K 
where the Fermi level is 244.5 kT above the conduction band edge 
Íor this free-carrier concentration. It is reasonable to conclude 
that the mobility deduced from Ro is very nearly the drift 
mobility. 
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before. In actual fact, because 9? is large with re- 
spect to unity (0*—13) and therefore Eq. (9) is al- 
most independent of Q, this procedure converges very 
quickly. The value obtained for the conductivity effec- 
tive mass using this procedure for the above example is 
m.—0.145. This compares very well with the value 
m.* —0.15 calculated by S-F using their data from both 
reflection and transmission measurements, 

As an additional check on the theory, for example, to 
show that the free-carrier dispersion can completely 
account for the observed reflectivity, a. theoretical re- 
flection curve can be generated using the effective mass 
calculated from Eq. (9). When this is done, one finds 
that the reflectivity curve, reported by S-F for the 
n-type sample of germanium considered above, is re- 
produced as exactly as it is possible to determine the 
values from their report. (The values calculated to 
achieve this fit are shown in Fig. 1.) Moreover, such a 
precise fit indicates that we are apparently justified in 
the substitution of (7) into Eqs. (2) and (3) and that in 
this case at least such a substitution produces negligi- 
ble error. [This result should be compared with Fig. 
4a (Appendix A) for 0—3.6, n= —2, and s= —4.] 

Before concluding this discussion, it is of interest to 
consider the effect of various simplifying assumptions 
upon Eq. (9). For example, the plasma frequency is 
oftentimes obtained approximately by assuming that 
n=1 [see Eq. (6) ], that k is negligibly small compared 
with unity, and that Q is very much greater than unity. 
Under such conditions Eq. (7) can be solved directly to 
give the result 

me=Ne?/w eoleo — 1). (10) 


This expression is sometimes even further simplified by 
neglecting unity with respect to e, which yields 


me =N ewo eoes . (11) 


Now, if the conductivity effective mass is calculated, 
for the -type sample of germanium considered here, 
by means of Eqs. (10) and (11), the resulting values are 
m,*=0.123 and 0.115, respectively. These values differ 
significantly from the value m.*=0.15 obtained by 
Spitzer and Fan. This difference serves as justification 
for the derivation of the more exact Eq. (9). 

Finally, it should be noted that Eq. (9) can be very 
well approximated by a quadratic equation since the 
constant term is oftentimes negligibly small. 


3e—1 (1/0):5--89* 
Crn 


mč ?— tee 
4«(«—1) (14-30) 
—2 14-20? 
3e. Tit (12) 


^ 2e. (e,— 1)? (12-308) 


olved directly, with the result 
y two solutions. In actual fact, 
le solution. The root with the 
because it is the root which 


This equation can be s 
that there are apparentl 
there is only one allowab 
plus sign must be chosen; 
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Fic. 1. Reflectivity and refractive index as functions of wave- 
length for n-type germanium having 3.9 1015 cm- as reported 
in Ref. 4. In addition, theoretical values for the reflectivity, having 
been calculated using an effective mass me=0.145 mo, are shown. 


in the limit of Q7»1 reduces to the known result ex- 
pressed in Eq. (10). 


COMPARISON WITH OTHER TECHNIQUES 


Cyclotron resonance techniques for the determination 
of the components of the conductivity effective mass 
have yielded a very great insight into the band struc- 
ture of germanium and silicon (Ref. 12 gives a good re- 
view of this development). However, the condition for 
the observation of this resonance phenomenon is 79,7 1, 
where w, is the natural cyclotron frequency of rotation 
of a moving electron in a magnetic field about an axis 
parallel to the field, and classically this frequency is 
given by «e, —eB/m.. Consequently, experiments in the 
microwave region, using reasonable magnetic field in- 
tensities, require that r be 10-! sec or longer. This 
condition is only fulfilled for high-purity semiconductors 
at low temperatures. 

This limitation severely restricts the usefulness of 
cyclotron resonance for the determination of me. For 
example, some semiconductors have not as yet been pre- 
pared in a sufficiently pure form to enable cyclotron 
resonance measurements to be made. In this group, for 
example, is the very interesting family of semiconductors 
PbS, PbSe, and PbTe. Furthermore, there is reason to 
believe that the value of the conductivity effective mass 
in some semiconductors may be temperature-dependent. 
Evidence of this possibility has been reported in con- 
nection with investigations of both germanium? and 
lead sulfide.* However, because of the stringent condi- 


2T, H. Geballe, Semiconductors an American Chemical Society 
Monograph Series, edited by N. B. Hannay (Reinhold Publishing 
Corporation, New York, 1959), Chap. 8, pp. 321-6. 

13 G, MacFarlane, F. McLean, J. Quarrington, and V. Roberts, 
Phys. Rev. 108, 1377 (1957). 

Ó I. Smirnov, B. Moizhes, and E. Nensberg, Fiz. Tverd. Tela 2, 
n (1960) [English transl.: Soviet Phys.—Solid State 2, 1793 
(1961 
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Fic. 2. Absorption index 
as a function of wavelength 
for the z-type germanium 
sample of Fig. 1 as reported 
in Ref. 4. 
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tion 79,1, cyclotron resonance measurements are 
generally restricted to liquid-helium temperatures and 
so they cannot be utilized for the determination of the 
temperature dependence of me. 

More recently, techniques have been developed which 
have utilized measurements in the far infrared. As men- 
tioned earlier (S-F) obtained values of m, from a com- 
bination of reflection and transmission measurements,‘ 
and Lax and Wright (L-W) obtained their values from 
the reflectivity of polarized radiation for several values 
of magnetic field intensities? (magnetoplasma effect). 
In both of their analyses (S-F and L-W), the simplifica- 
tion of neglecting the contribution of the lattice to the 
total dispersion has been inferred. 

In connection with the S-F procedure, one notes that 
if the condition (w7)?>>1 is fulfilled, the dispersion rela- 
tion of Eq. (2) is independent of r. Under this con- 
straint, the conditions placed upon the energy de- 
pendence of 7, as noted earlier, do not apply. That is, 
under the constraint that (v7)?»1, the relaxation time 
may have an arbitrary energy dependence without 
having any effect on the resultant dispersion relation- 
ship, which is 


n —k—e-Ne/m.eg. 


(13) 
In addition to the simplified dispersion relation of 
Eq. (13) and the reflectivity relationship expressed in 
Eq. (6), the S-F procedure utilizes the fact that the 
transmission through a sample of material of thickness 
x in the absence of interference fringes and for <n? 
is given by!$ 
T= (1—R)*e-*7/ (1— R%e~27) , (14) 
where a is the linear absorption coefficient which is re- 
lated to the absorption index & by the relation 
a — Avk/A. 


15 Reference 6, pp. 13-14. 


(15) 
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The linear absorption coefficient æ equals the reciproca] 
of the distance within which the incident radiation js 
attenuated by a factor (exp)™. The procedure outlined 
by S-F for the determination of m, is readily deduced 
from Eqs. (6), (13)- (15). 

This method for obtaining the conductivity effective 
mass of free carriers is relatively simple compared with 
cyclotron resonance techniques. It does require that 
(wr)*>1. However, since the frequencies are of the order 
of 105-104 rad/sec, 7 may be of the order of 10-3 sec 
compared with 107! sec for cyclotron resonance. Con- 
sequently, measurements can be made on less pure 
materials and at higher temperatures. Furthermore, it 
may or may not require transmission measurements, 
Under certain conditions where & increases very rapidly 
with wavelength (see Fig. 2), it is possible to consider 
the reflection region where &*«&€(n— 1)? and thus obtain 
me directly from Eqs. (6) and (13). However, one must 
beware that going to wavelengths much smaller than 
^o means that 7? — es and small errors in the absolute 
reflectivity can cause very large errors in m™* calculated 
from Eqs. (6) and (13). Consequently, in general, the 
S-F procedure requires transmission measurements 
which in turn require the preparation of very thin 
samples. For their transmission measurements, S-F 
used samples whose thickness was of the order of 10 u. 
Such thicknesses represent a few thousand atomic layers 
and are very often difficult to achieve without producing 
extensive damage to the remaining material. For ex- 
ample, it has been possible to achieve such thicknesses in 
lead telluride only with very careful grinding and polish- 
ing techniques and then only very small surface areas 
(=600X50 u) were obtained so that special optical 
techniques must be utilized.!9 

The other procedure mentioned above, the measure- 
ment of the magnetoplasma reflection as proposed by 
L-W,* eliminates the necessity of making transmission 
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Tic. 3. Magnetoplasma effect in n-type InSb having 1.810" 
cm™ as reported in Ref. 5. 


16 W, W, Scanlon, Phys. Chem. Solids 8, 423 (1959). 
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measurements, This method utilizes the fact that, in the 
presence of properly polarized radiation, a magnetic 
field shifts the minimum in the reflectivity to both sides 
of the zero-field minimum, This results in the appearance 
of two minima. The displacement of these minima to 
either side of the zero-field minimum is directly propor- 
tional to the cyclotron resonance frequency w,=eB/me. 
In their investigation, L-W have determined the shift 
as a function of magnetic field strength and have deter- 
mined me from the slope of this dependence. Their data 
for the magnetoplasma effect in n-type indium anti- 
monide are shown in Fig. 3. 

This procedure, even though it offers the advantage 
of eliminating transmission measurements, does con- 
tain several inherent drawbacks. For example, magnetic 
field strengths of 25-kG or less would produce a shift in 
the minimum of approximately +4 y for the germanium 
sample of Fig. 1. Consequently, for zero-field reflection 
minima which are shallow and which have a width of a 
couple of microns, as for example the reflectivity mini- 
mum observed by S-F for germanium at room tempera- 
ture in Fig. 1, the actual shift of the minimum is very 
difficult to measure. In fact, in their original article Lax 
and Wright found that the positions of the shifted 
minima were sufficiently indistinct so that it was ex- 
pedient to determine the shift from that of the reflec- 
tion edge on the long-wavelength side of the reflection 
minimum at an isoreflection point. This shift is that 
indicated by the arrows in Fig. 3. However, this latter 
method for obtaining the shift of the reflection minimum 
is also subject to certain disadvantages. First of all, it 
is not clear that the edge should shift the same amount 
as the minimum since the latter shift was determined 
assuming that losses are negligible. This is certainly not 
the case in the region of the reflection edge on the long- 
wavelength side of the reflection minimum because in 
this region we have the condition k>n. Furthermore, 
because k increases very rapidly in this region of the 
reflectivity, the depth of penetration of the incident 
radiation becomes increasingly shallow. For example, 
from Eq. (15) we see that the penetration depth (1/o) 
for \=30 u and &=2 is only of the order of 1 u. Conse- 
quently, the surface preparation becomes increasingly 
important in this region of the reflectivity curve. Sur- 
face damage can therefore distort the reflectivity in 
this wavelength range. Such surface effects make it even 
more difficult to determine the relationship of the ob- 
served shift in the reflection edge to the cyclotron reso- 


nance frequency. 
CONCLUSIONS 


The relationship between 7 and wo presented in Eq. 
(9) offers some unique advantages 1n the determina- 
tion of me. It has rather widespread applicability since 
it is valid for totally degenerate statistics and for dunes 
all degrees of degeneracy when |s| X3, where s ire 
power of the energy dependence of r. Equation (9) n 
not limited by any assumption that er 1. It is muc 
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more applicable than the simplified relation of Eq. (11). 
It eliminates the necessity of determining the transmis- 
sion through very thin samples of material on the one 
hand and the necessity of applying large magnetic 
fields perpendicular to the direction of propagation of 
polarized infrared radiation on the other. What is re- 
quired is the measurement of the infrared reflectivity in 
order to determine the frequency of minimum reflection. 
It is not even necessary to obtain absolute reflectivity 
data. Relative values can serve equally well. That is, a 
percentage of the incident and/or of the reflected radia- 
tion can be lost through experimental error. Although 
this will result in a decrease in the resolution of the 
minimum, if the minimum is nevertheless discernible, it 
is still possible to determine m, from Eq. (9). More- 
over, this procedure is not overly sensitive to the effects 
of surface damage on the long-wavelength side of the 
reflectivity minimum. It has been noted above that the 
surface preparation becomes increasingly important in 
this region because of the decreasing depth of penetra 
tion into the sample. A damaged sample surface has the 
tendency of dispersing the radiation thereby reducing 
the reflectivity in this long wavelength region and of pre- 
senting a (7) which is different from that of the bulk 
material. The reflectivity in the region of the minimum 
is also subject to distortion from this mechanism only 
less so than in the region of the absorption edge because 
of the increased depth of penetration of the incident 
radiation. 

Consequently, m, can be obtained using Eq. (9) over 
the large temperature range in which the reflectivity 
minimum is measurable. The free-carrier concentration 
of the semiconductor material of interest is limited only 
by the requirement of single-carrier conduction, by the 
requirement that the lattice contribution to the dis- 
persion be relatively unimportant, and by the require- 
ment that the reflectivity apparatus be capable of 
analyzing the reflection near the free-carrier dispersion 
minimum. However, these limitations on the applica- 
bility of this procedure are also common to the other 
procedures which utilize infrared radiation. 
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APPENDIX A 


It is convenient to substitute terms expressed as func- 
tions of (7) into Eqs. (2) and (3). Such a substitution is 
only approximately equivalent to the exact terms. The 
nature of the approximation involved can be appreci- 
ated by examining the following ratios: 


(7/0 r) (z/ er) 
——————— — — — (16,17) 
(7}/ über) ()/ (Fer?) 
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The asymptotic limits for various scattering mecha. 
nisms are shown in Fig. 4 for reduced Fermi potentials 
of n<—4, =0, and >4. 
Appendix B 
The condition 0R/dQ=0 yields the equation 
(n2—1—?)n’+2nkk' — 0, (18) 


where the prime signifies 0/02. Solving for k and }’ 
from Eq. (8) and substituting into (18) yields 


(n1—12— 38?)n/4- 28n8' 20, (19) 


where B= B/[29(1--9?) ]. Combining Eqs. (7) and (8) 
yields 


cc poe x n— etnias E 


nt= en?-+8?, (20) 


where e=¢.—[B/ (12-9?) ], and upon substituting (20) 
into (19), we obtain 


[ (e—1)n?— 28? Jn 4-268! —0. (21) 


By solving (20) for n? (choosing the positive root to 
keep 7 real), by differentiating this result with respect 
to Q, and by substituting (u2)'/ (2n) for n’ in Eq. (21), 


E P oc o we obtain 
(c) s» 3/7? CORRESPONDING TO SCATTERING BY IONIZED IM- 
-— PURTESS [(e— 1)n?— 28*]E— 2n°0?-+ 8 (12-302)] 
G. 4. Asymptotic limits of the ratios (16,17) plotted as func- 2 Q2? pA 
f wr for various values of the scattering parameter s and Hne (14-30%) (2n?—«)=0. (22) 


ae ese deeenctacy 7 <— 4, —0, 24. Equation (22) can be rewritten in the form 
nder the conditions of wr®1 and wr1. Under these n‘=G (O) +H (0), (23) 
tions the above ratios approach the following 
ting values: 

for wr®1: 1and (r3)(7) 

foreor«1: (72)(r) and 1. 
these limiting conditions, it is apparent that for 
ete degeneracy these ratios are unity. In the limit 
nondegeneracy these limiting values become B= (e—G) (He—G)/ (H—1)?. (25) 


oT 1: fandT(S/2—s)T (5/2--5)[r (5/2)? After much simplification (25) reduces to Eq. (9) of the — | 
er«1: T(5/24-2s)r (5/2)LT (5/24-s) ]-? and 1 text. ^ 


where G(9) and HZ (Q9) are complicated functions of Q. 
Equating (23) and (20) yields 


»?—8*(H —1)/ (e—G). (24) 


Dividing Eq. (20) by n? and substituting for n? from | 
Eq. (24) yields the relationship 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


PHYSICAL REVIEW 


VOLUME 134, NUMBER 4A 


13 MAY 1964 


Pressure Effect on Resistivity of Ga(As,_,P,) 


G. E. FENNER 
General Electric Research Laboratory, Schenectady, New York 
(Received 2 January 1964) 


The change of resistance of n-type Ga(Asi_zPz) samples under hydrostatic pressure up to 15 000 atm 
was measured between 190 and 363°K. Large pressure-dependent resistance changes were observed for 
samples with 2220.35. These observations are consistent with the presence of another conduction band edge 
of much lower mobility presumably along £= (100) into which electrons are transferred when the relative 
energy separation between the two edges is lowered by pressure. The measured results are compared with 
calculations based on the model proposed by Ehrenreich. The agreement between theory and experiment 
is found to be good if certain parameters are varied in a systematic way. For lightly doped material the 
mobility ratio of electrons in the light mass band to those in the heavy mass band is found to be large and to 
decrease rapidly with increasing temperature. At higher carrier concentrations both the magnitude of the 
mobility ratio as well as its variation with temperature are substantially reduced. At 300°K the (000) and 
(100) minima are found to cross at x20.44 if we assume Vegard's law to be valid. The experimental data are 
best explained if it is assumed that the separation between the two sets of minima changes at a rate of 


>1X10~4 eV/°K. 


INTRODUCTION 


N the last few years considerable progress has been 
made in the understanding of compound semicon- 
ductors, in particular those consisting of elements from 
the III-V columns in the periodic table. A comprehen- 
sive review of the transport properties of a number of 
these has been given by Ehrenreich,! who also deduced 
the band structure of Ga(As; .,P;) as a function of the 
composition variable x. The details have been exhaus- 
tively covered by that author in an earlier publication? 
and therefore we shall only review those points of his 
model that are of direct concern to us here. Unlike the 
majority of semiconductors investigated to date, 
Ga (As, P4) displays two sets of conduction band edges 
which are close enough in energy over an appreciable 
portion of the composition range so that their combined 
influence on the electrical behavior of the semiconductor 
has to be considered. The separation in energy between 
the two sets of minima depends upon xv and it vanishes 
for x«0.5. For x«0.5 the (000) minimum is lowest in 
energy, while for 120.5 the (100) valleys are believed 
to be lowest. Because of the large differences in the 
mobilities and density of states between them, the bulk 
properties of the material can be expected to change 
appreciably as the region +=0.5 is approached. It is the 
purpose of this paper to report measurements made in 
that transition region under the application of hydro- 
static pressure and to derive values for parameters that 
so far have been speculative or not known at all. The 
influence of hydrostatic pressure on the band structure 
of semiconductors has been discussed by Paul? He also 
touches upon some of the discrepancies in the experi- 
mental results in GaAs, which have a direct bearing 
on the model of both that compound and that of 


Ga (As; P.) itself. 


Our approach in this paper is to assume the correct- 


E i . Phys. 32, 2155 (1961). 
1H. Ehrenreich, J. Appl. P ee Bet oath 


* H. ich, Phys. Rev. 
SW. Ea | Appl. Phys. 32, 2082 (1961). 


ness of the model proposed in Ref. 2. We hope to deter- 
mine several band-structure parameters by fitting our 
results to this model. As will become apparent shortly, 
the biggest difficulties in accornplishing our task arise 
from the lack of homogeneity in the crystals themselves. 


EXPERIMENTAL DETAILS 


The Ga(As;_,P,) samples used for the measurements 
were obtained from various sources. The most homo- 
geneous samples were cut from crystals grown by 
halogen-assisted vapor transport in a sealed capsule 
somewhat similar to the method reported by Piz- 
zarello.* Others were grown from excess gallium solution 
and one was an epitaxially grown sample purchased 
from the Merck Company. All crystals were doped with 
Te. X-ray determination of the lattice constant in 
conjunction with Vegard's law was used to establish the 
composition most accurately. All the crystals grown by 
vapor transport were prepared to yield 1220.35. When- 
ever possible the specimens were cut into rectangular 
bars about 10 mm long and 1 mm square. Ohmic con- 
tacts were made by alloying tin dots to the semicon- 
ductor. The pressure was generated in a cylindrical 
multiple wall pressure cell of conventional design.* The 
pressure generating piston was driven directly by the 
platen of a 50-ton press. At room temperature the 
measurements were carried out inside this cylinder, 
while at the other temperatures the samples were en- 
closed in a small pressure cell similar to the one reported 
by Fitchen.* The large pressure cylinder then was used 
as the pump and connected to the small cell by j-in. 
o.d., 0.024-in. i.d. hard-drawn stainless-steel tubing 
manufactured by the Harwood Engineering Company. 
The electrical connections were made by the use of 


5 F, A. Pizzarello, J. Electrochem. Soc. 109, 226 (1962). 

5 C. A. Swenson in Solid State Physics, edited by F. Seitz and 
D. Turnbill (Academic Press Inc., New York, 1960}, Vol. 11, p. 41; 
P. W. Bridgman, The Physics of High Pressure (G. Bell and Sons, 
London, 1949). 

5 D. B. Fitchen, Rev. Sci. Instr. 34, 673 (1963). 
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0.062-in. swaged metallic sheathed thermocouple wire.” 
At temperatures below 300°K a mixture of n pentane 
and isopentane served as the pressure fluid while at 
temperatures above that » pentane alone was used. The 
pressure was measured by a seasoned manganin wire 
gauge’ mounted inside the pressure cylinder. The gauge 
calibration was established using a recent determination 
of the freezing pressure of Hg at 0?C.5 The accuracy of 
the pressure measurements is estimated to be better 
than 1%. 


RESULTS 


The application of hydrostatic pressure on Ga- 
(As;L.P;) increases the energy gap between the con- 
duction band minimum at k= (000) and the top of the 
valence band at the rate of about 1.1X(10-5 eV/atm. 
The energy separation between the (100) minima and 
the valence band has a small negative pressure co- 
efficient and therefore the net result is a decrease of the 
relative energy separation AZ between the (000) and 
(100) conduction band valleys under pressure. As long 
as AE>>kT the number of electrons transferred to the 
energetically higher lying minima per unit dilatation of 
the crystal is small. Once AZ has been reduced to a few 
kT either by very high pressures in the case of *GaAs or 
by the proper choice of composition in Ga(AsP), the 
rate of transfer of electrons to the higher valleys in- 
creases rapidly. Because of the much lower electron 
mobility in the (100) valleys, the bulk resistance of the 
material increases many fold until all the current is 
carried by electrons in the (100) valleys. After that a 
further increase in pressure does not change the resist- 
ance appreciably. This general characteristic is demon- 
strated by the measurements shown in Figs. 1, 3, and 4. 

As mentioned already our plan is to compare experi- 
ment and theory by choosing certain parameters of 
Ehrenreich's model until a best fit with the measured 
values is obtained. With the assumptions set down in 
the Appendix we find for the relative resistance at 
pressure P 


p(P) b+ y 1+ ver? /éT 
p(0) 14r b+ veaP ler’ 


(Al) 


where b= u/n is the mobility ratio between valleys 1 
and 2, v—b,e-^70)/£7— 5,(0)/51(0) the ratio of the 
number of electrons in valley 2 to the number in valley 
1 at atmospheric pressure, and a the pressure coefficient 
of the energy separation between valleys 1 and 2. 
Valley 1 refers to the minimum at k= (000) while 2 
refers to the set of equivalent minima along k= (100). 
In trying to fit Eq. (A1) to the experimental data, the 
value for the pressure coefficient a was accepted from 
earlier work. It was deduced directly from pressure 


* R. H. Cornish and A. L. Ruoff, Rev. Sci. Instr. 32, 639 (1961). 

5D. H. Newhall, L. H. Abbot, and R. A. Dunn, American 
Society of Mechanical Engineers, New York Meeting, 1962 
(unpublished). 
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measurements on *GaAs and probably can be accepted 
with some confidence.’ Should a small correction for 
this value become necessary it would merely lead to a 
minor correction in b as seen by Eq. (A4) of the Appen. 
dix. The density of states ratio b, is numerically quite 
uncertain? and may be subject to correction. Fortu. 
nately for most of our considerations b, is absorbed into 
the factor v and will only become important when the 
band structure at a particular value of x is discussed, 

Figure 1 shows the results of measurements made on 
sample 21a-11, the crystal with the smallest carrier 
concentration of all our samples. This single-crystal 
specimen obtained from a crystal grown by vapor 
transport also had a small variance in composition and 
therefore we will base our conclusions largely on that 
particular sample. Other bars cut from adjacent parts 
of the same crystal gave results very close to 21a-11, so 
that our confidence seems justified. Except for the high- 
pressure region, i.e., the region after the bands have 
crossed, the fit is considered to be very good. The out- 
standing feature of the curves plotted in Fig. 1 is the 
behavior of b= ui/us as a function of temperature. It has 
a very large value at 200°K but decreases markedly as 
the temperature is increased. For the deviations be- 
tween the expected and the actual measured values at 


Ga (As, ,,P,] *2Io-11 


I | | i 
2 4 8 10 12 14 


6 
PRESSURE [10° aim] —> 


Fic. 1. Relative resistance versus pressure: Ga(As;_zPz), No. 21 
2-11, 2—0.359. Curve 1: y—7.2, b=55; curve 2: v=5.6, b=15; 
curve 3: v=4.8, b=320. 


? For GaP this coefficient has been found to be larger by about 
25%. See Ref. 3. p 
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Fic. 2. Hall constant versus 1/T at atmospheric pressure. 


high pressures, we offer the following explanation: The 
inspection of the Hall coefficient (Fig. 2) for this sample 
reveals that freeze out of carriers occurs below 400?K. 
This implies that an appreciable number of carriers is 
occupying one or more deep levels at the temperatures 
at which our experiments are performed. If we suppose 
that these levels have a pressure shift equal to that of 
the (000) minimum, then the number of electrons in 
that band will not change under pressure as long as the 
other minima are sufficiently far removed in energy. As 
the (000) minimum approaches the (100) minima, some 
of the carriers in the impurity levels can now be therm- 
ally excited into the (100) minima. Now our assumption 
that the total number of electrons in the conduction 
band remains constant is no longer valid. Because of the 
large mobility of electrons in the (000) minimum, this 
effect will not be noticeable until the latter valley is 
almost empty ; i.e., until the bands have crossed. Then, 
as carriers are transferred from the bound impurity 
states to the low but finite mobility (100) band, the 
resistance decreases in agreement with the observations. 

The effect should become more pronounced as the 
temperature is decreased, because the ratio of electrons 
in the (000) minimum to the number in bound states is 

decreased even before the pressure 1s applied. For the 

samples that do not show the freeze out, this decrease in 


resistance does not occur, giving further support to ihe 


correctness of our reasoning. 
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Figure 3 shows the results of measurements made on 
sample 17a. This specimen is also a single crystal and are 
similar in composition to 21a-11. The major difference 
between the two crystals is the much larger carrier 
concentration of 17a. This difference shows up in the 
reduction of the mobility ratio 5 which can be traced to 
the increased influence of ionized impurity scattering in 
the heavier doped crystal. Because this scattering proc- 
ess is most important at low temperatures, it reduces 
the temperature dependence of b. 

The results of measurements on additional samples 
are shown in Fig. 4. Specimen 48a is a polycrystalline 
sample which shows a pressure dependence similar to 
17a. However, the factor v which results in the best fit 
with the experimental data has a different temperature 
dependence for the two samples. It is probable that the 
polycrystalline sample 48a has a larger variation in 
composition which would explain the different behavior. 
A very pronounced effect of this variation can be seen 
in sample 267 A with x=0.24-£0.05. This epitaxially 
grown single crystal shows such large variations in 
composition that no agreement between our model and 
experiment can be accomplished. 

Finally, we have measured a sample with a composi- 
tion of x2:0.61 and find the change in resistance linear 
up to 10 000 atm at room temperature with Ap/(pP) 
=—14.541X10~- atm“. This agrees in sign with the 


100 

80 Ga (As, P.) "17a 
7 T= 363 'K 

E o T «300 °K 
a T«193*K 

4i 

"20 


Fic. 3. Relative resistance versus pressure: Ga(As; ,P), No. 
17a, x— 0.372. Curve 1: » 5.3, b=22; curve 2: v=3.3, b=22; 
curve 3: y —0.85, 6 —23. 
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Fic. 4. Relative resistance versus pressure: Ga(As;..P) No. 
48a, x=0.367, and No. 2674, x=0.24. Curve 1: v=6.1, 5-25; 
curve 2: y —4, b —26; curve 3: y 20.85, b—23. 


measurements made on” silicon and" GaP, but is larger 
in magnitude than either one of these. 

We shall now attempt to gain additional information 
from the measurements made on sample 21a-11. From 
the pressure data we find y=5.6-41. (See Table I.) If 
we accept the value of 5,— 70(2-40, —20) determined 
by Ehrenreich;? we find the energy separation between 
valleys 1 and 2 at zero pressure given by AE(0) — 0.065 
eV (4-0:017, —0.013). X-ray lattice-parameter evalu- 
ation resulted in a value of a=5.5806-0.0002 or, 
assuming the validity of Vegard's law, x— 0.359. Linear 
extrapolation of our energy separation AZ to the termi- 
nal compounds GaAs and GaP shows that we cannot 


TABLE I. Experimentally determined values of v and b as func- 
tion of temperature for several samples. œo=lattice parameter 
from x-ray measurements. Composition parameter x deduced from 
ao assuming validity of Vegard’s law. AE(0) is nominal value 
based on 5,— 70. 


Sample ao T AE(0) 

No. [i] = DK] » EV] 5-u/m 
21a-11 5.5806 0.359 193 44 +1 0.045 300-450 
2::0.0002- 300 5.6 +1 0.065 . 754-15 

363 7.2 21.2 0.075 55415 

17a 5.5780 0.372 193 0.85+0.2 0.073 2343 

210.0001 300 3.3 +1 0.079 2243 

363 5+1 0.087 2143 

48a 5.5789 0.367 193 0.85--0.2 0.073 23-3 

210.0001 300 4+1 0.074 26+5 

363 6.1 +1.1 0.08 2544 

NH1No.4 5.5738 0.39 300 10+2 0.05 3027 


+0.0004 


1 C. S. Smith, Phys. Rev. 94, 42 (1954). 
11 A, Sagar and R. C. Miller, J. Appl. Phys. 32, 2073 (1961). 
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match both values given in Ref. 2 simultaneously. The 
pressure data of Howard and Paul” rule against an 
energy separation between the direct and indirect banq 
edges in GaAs of less than 0.3 eV. Ehrenreich found 
0.36 eV as the most consistent value based on Hall and 
pressure measurements. In Fig. 5 we have plotted the 
energy separation AZ as deduced from our measure- 
ments. If we accept 0.36 eV as the energy separation 
between the (000) and (100) minima for GaAs, the 
separation for GaP turns out to be 0.46 eV if we assume 
a linear dependence of AE on x. This is 0.1 eV larger 
than reported earlier! From an analysis of optical 
absorption and reflectivity data Zallen and Paul" find 
2.75-2.8 eV for the direct gap energy in GaP. With a 
value of about 2.2 eV for the indirect gap energy Eg this 
leads to an energy separation AE of about —0.5 to 
— 0.6 eV. Photothreshold measurements by Spitzer and 
Mead" indicate that AE~—0.45 eV. Engeler,!? of this 
laboratory, concludes that a value of 0.5 eV for the 
separation of the (100) and (000) band is consistent 
with Spitzer and co-workers"? optical absorption data. 
Figure 5 indicates the two sets of minima to cross in 
energy at x,2:0.44, somewhat smaller than the earlier 
deduced value of 4,440.53. The assumption that AE 


depends linearly on x is undoubtedly only an approxi- ` 


mation, but the deviations are not expected to be large 
enough to affect our determination of x, materially. 

The main difficulties in the interpretation of our 
results show up when we try to determine the effect 
temperature has on the energy separation AE. Again 
accepting the value b,= 70 independent of temperature, 
we can find AZ from v. This is done in Table I for three 
samples discussed earlier. In order to arrive at the 
probable error of these determinations, curves calcu- 
lated by varying both b and v were compared with the 
actual measurements. The limit of agreement was 
defined as that point at which the deviations of the 
calculated curves from the measured have passed out- 
side the experimental tolerances. The discrepancies in 
the temperature coefficient of AE for the three samples 
are larger than the experimental error. In view of the 
difficulties arising from crystal inhomogeneities, as 
exemplied by results from sample 267 A, this is not too 
surprising. It must be concluded, though, that 9AE/0T 
= (8/8T)(Eiw— Eoo) is larger than 1X10-* eV/°K. 

While the changes of resistance under pressure can 
readily be understood in terms of the adopted model 
the interpretation of the Hall data derived from the 
same samples is not at all certain. The Hall coefficient 
for the two-band model is given by 


Ra [-E»)/en JE (*4-)/ (64-*], 


2 W. E. Howard and W. Paul, quoted in Ref. 3. 

13 R. Zallen and W. Paul (to be published). 

iW. G. Spitzer and C. A. Mead (to be published). 

15 W. E. Engeler (private communication). “bb 

16 W. G. Spitzer, M. Gershenzon, C. J. Frosh, and D. F. Gibbs; 
Phys. Chem. Solids 11, 339 (1959). 
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Fic. 5. Energy separation AZ(0) versus composition at atmos- 
pheric pressure and 300°K. The uncertainty in the experimental 
points 1s due to the experimental errors as well as the large un- 
certainty in the knowledge of bp. 


where scattering factors of the order of 1 have been set 
equal to 1. Under the condition z14-775— 11— constant, 
Ry has a maximum value of (1--5)?/4b when n2/1;— b. 
The temperature at which this maximum occurs in- 
creases with b and AE. To be consistent with the 
parameters deduced from the pressure experiments, the 
Hall coefficient should have increased appreciably at 
room temperature over its value at lower temperatures 
assuming AE to be temperature-independent. This, 
however, is contrary to the experimental evidence. 
Furthermore, the maximum of Ry occurs at much 
higher temperatures than expected from the energy 
separation of the (000) and (100) valleys at room 
temperature. 1 

For the lightly doped sample 21a-11, the rapid freeze 
out below 400°K invalidates our assumption that the 
total number of electrons in the conduction band re- 
mains constant. This fact and the rapid decrease of b 
with increasing temperature could be advanced as a 
reason for the poor agreement with the measured Hall 
coefficient. Both arguments are not supported by the 
measurements made on the rest of the samples and, 
therefore, we are forced to look for other possible 
explanations. As mentioned earlier, the pressure meas- 
urements at various temperatures are best explained if 
we assume AE to have a positive temperature coefficient 
in excess of 1X 10-4 eV/°K. An increase of p nan 
separation AE at a rate of about 2X107* eV/*E = ; 
decreasing b would suffice to keep Ry approximately 


RESISTIVITY OF 


Ga (Asrab 2) A1117 
constant to higher temperatures, except that this is 
contrary to the experience with the other group IV and 
III-V semiconductors as well as the terminal com- 
pounds themselves. It should be noted, however, that 
very little is known about transport properties of 
ternary compounds and some of the extrapolations 
might well be in error. 


CONCLUSIONS 


The close agreement between the measured values 
and the calculations based on Ehrenreich's model of the 
Ga(As;...P) conduction band must be considered strong 
evidence for the correctness of the essential points of 
that model. While we have not directly proved that the 
energetically higher lying minima are indeed along the 
(100) directions, the self-consistency of our results gives 
strong support to this identification. 

The large mobility ratio (6>50) deduced from our 
room temperature measurements on the lightly doped 
sample is significant, since this is necessary for the 
interpretation of the large pressure-dependent resistance 
changes in GaAs observed by Howard and Paul. The 
rapid decrease of the mobility ratio b with increasing 
temperature could explain the smallness of 6 derived 
from high-temperature Hall data. Extended to GaAs, it 
suggests that this is the reason for the apparent differ- 
ence in 6=10 reported by Aukerman and Willardson'? 
and the b based on room-temperature pressure measure- 
ments. But in view of the discrepancies between our 
pressure and Hall data, this interpretation must be 
considered with caution. The composition x, at which 
the (000) and (100) minima are equal in energy is 
smaller than earlier estimates, excepting the recent 
determination of Spitzer and Mead." The extrapolation 
of their results to GaAs leads to an energy separation 
which is too small. Their results can be brought into 
agreement with ours if we assume some variation in 
composition in their samples. This variation would have 
the effect of lowering the apparent energy of the (100) 
band edge for a given value of x explaining the difference 
in their results. 
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APPENDIX 


We are assuming that the conduction band of 
Ga(As,P) exhibits two sets of minima, one minimum at 


7 L. W. Aukerman and R. K. Willardson, J. Appl. Phys. 31, 
939 (1960). 
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k=[000] (valley 1) and a set of minima along k=[100] 
in the Brillouin zone (valley 2). The separation in 
energy between the minima at the different points in the 
Brillouin zone is supposed to be a linear function of the 
composition of the alloy as shown in Fig. 5. For the 
purpose of the analysis we make these further simplify- 
ing assumptions: 


1. Boltzman statistics apply. 

2. Intervalley scattering is neglected. 

3. The total number of carriers in the conduction 
band is constant. 

4. The mobility ratio b=y1/y2 is independent of 
pressure. (Subscript 1 refers to the low-mass band at 
k=[000] throughout this analysis. Furthermore, sym- 
bols used are those of Ehrenreich.?) 

5. b,— (mj/m1)?? — 70 is also pressure- and tempera- 
ture-independent; m,= density of state mass in the 
valleys at &— [100 ]. 

6. 'The energy separation between different minima 
is AE— AE(0) —aP where a= 1.1X 10-5 eV/atm. 


With these assumptions we find for the relative 
resistance at pressure P, 


plp) bv 1+ vexP/*? 


p(0) 14-» b+ vexP/47 i 


n(0) /m,3!? 
v= -( g-AEQUET 
ni(0) my 


The logarithmic derivative of the expression (A1) is 
given by 


(A1) 


where 


yea PIKT 


d Inp INg 
=(1 ) . (A2) 
aP b/ T [1+ (v/b)e?!*T (3 4-vesPIT) 


Since v can directly be converted into composition by 
the use of Fig. 5, this last equation gives essentially a 
universal curve in which composition and pressure can 
be interchanged. In principle, it should therefore be 
possible to plot the experimental values of the logarithm 
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of resistance versus pressure, differentiate this curve and 
fit the resulting curve to (A2). Attempts to accomplish 
that in general were not satisfactory, primarily because 
b was different in every crystal and the graphica] 
differentiation lacked accuracy. 

Specific values of (A1) and (A2), however, are useful 
in finding the unknown parameters v and b. 

For the limit of high pressures, we get from (A1) 


5 p(P) bcr 
im = 5 A3 
Po^p(0) tl s 
and for the maximum value of (A2) we find 
d lnp a 1—1/b 
z : (A4) 
dP max kT 14-2/4/b 


The influence of assumptions 4 and 5 on the result 
(A1) can be judged if we let 


(P) /m(0)\” 
ME. 
ui (0) mi(P) 
From Ref. 2, 
mi(P) Ec(P) 1-- (1/2) E(OLEG(0)J- A ? 
(A6) 


mi(0) Ec(0) 1-- (1/2) E((P)EEG(P) +A] 


where Ee(0) and Eg¢(P) are the band-gap energy at 
atmospheric pressure and at pressure P respectively and 
A the spin-orbit splitting energy. If polar scattering is 
the dominant scattering mechanism, z in equation (A5) 
equals $ and (A1) can be corrected by replacing the one 
in the numerator by the right hand side of (A6) raised 
to the $ power. Since A«&Eg (A6) is approximately 


1 /0Eg 
tz) 
Eg(0)N dP 


This factor has to be considered only for values of v and 


P where the electron transfer to the higher band edge is . 


insignificant. 


3) 
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Oxygen and nitrogen as interstitial solutes in niobium cause the metal in transverse field to exhibit 
anomalous resistance minima or critical current maxima (peak effect). The anomaly appears at Ha, the 
upper critical field of these Type II superconductors. The resistance in the mixed state is decreased by ex- 
ceeding the solubility limit, by cold working and by strain aging at 170°C, masking out the anomaly. Rapidly 
quenching from the solution temperature decreases the anomaly. The results suggest that the anomaly is 
strongest at some intermediate stage of the process of segregation and precipitation at dislocations. Resis- 
tance in the mixed state is increased by introducing more solute, by quenching, and by etching. Structural 
changes brought about by cold working, strain aging, and rapid quenching responsible for large changes in 
J«(H) or R(H) in the mixed state also cause changes in the region He <H <H a. In this latter region, field 
orientation effects and observed H3/H-2 ratios are roughly consistent with the concept of a superconducting 
suríace sheath as recently proposed by Saint-James and de Gennes. Pure niobium, however, tends to ex- 


hibit consistently higher ratios. 


I. INTRODUCTION 


UPERCONDUCTING transition metals in Group 
V of the periodic table have been noted for their 
anomalous behavior which is usually attributed to the 
presence of chemical and/or structural impurities. An 
important metal of this group is niobium, which serves 
as a base metal in the fabrication of many high-field, 
high-current-carrying superconductors. The influence 
of substitutional! and interstitial? solutes on some super- 
conducting properties of noibium has been recently 
summarized. Other investigations have also appeared, 
revealing the importance of small amounts of inter- 
stitial impurities on the substructures of the metal 
(e.g., see Ref. 3). (us 
Unusual superconducting behavior occurring in 
niobium has been reported in both the resistance-field* 
and critical current-field characteristics. Recently, 


* i were presented in part before the 
The contents of this paper eeu UE 


i ical Society meetings, s 
EEO al Philadelphia, Pennsylvania, 23-26 March 1964. 

1 W. DeSorbo, Phys. Rev. 130, 2177 (1963). 

2 W. DeSorbo, Phys. Rev. 132, 107 (1963). 

3 J. O. Stiegler, C. K. H. Dubose, R. E. Reed, Sr., and C. J. 
McHargue, Acta. Met. 11, 851 (1963). - 

4 T. G. Berlincourt, Phys. Rev. 114, 969 (1959). : 

5M. A. R. LeBlanc and W. A. Little, in Proceedings, of the 
Seventh International Conference on Low-Temperature Physics, 


some striking resistance decreases following an initial 
rise were observed? during the superconducting transi- 
tion of the metal: (a) when the current / through the 
wire, and temperature T' were kept constant and a 
transverse magnetic field was increased continuously; 
or (b) when the current and field were held constant and 
T varied. The field at which the resistance minimum 
takes place was found to be independent of J, occurring 
at a field comparable to the upper critical field, Hez 
of a Type II superconductor." It is pointed out in 
this report that this anomalous behavior is readily 
observed in niobium containing :small amounts of 
interstitial solutes such as oxygen or nitrogen but is not 
observed in outgassed niobium. The results are com- 
pared with the magnetic behavior of the solutions which 
have already been described? as Tvpe II superconduc- 
tors. The influence of solute concentration, field orienta- 
tion, cold working, strain aging, quenching, and etching 
on the resistance-field transition and on the current 


edited by G. M. Graham and A. C. Hollis-Hallet (University of 
"Toronto Press, Toronto, 1961), p. 362. 

5S. H. Autler, E. S. Rosenblum, and K. H. Gooen, Phys. Rev. 
Letters 9, 489 (1962). 

7 A. A. Abrikosov, Zh. Eksperim. i Teor. Fiz. 32, 1442 (1959). 
[English transl. Soviet Phys.—JETP 5, 1174 (1957) ]. 

3 B. B. Goodman, IBM J. Res. Develop. 6, 63 (1962). 
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density-field characteristics both in the mixed state 
below H.z and in the region H.<@H<H,3 are sum- 
marized. The dependence of H,3—the field at which the 
specimen is completely normal—on these factors is 
cited. 


II. EXPERIMENTAL 


The niobium wire specimens (0.030-in. diam) were 
made from an electron-beam melted stock obtained 
from the Stauffer-Temescal Company, Richmond, 
California, hereafter referred to as Nb(S-T) (see Ref. 
2). Rods, about 0.250-in. diam were machined from 
the ingot then swaged and drawn to wire form. The 
resistance ratio, Roos°x/Rio°x, of the wire “as received" 
was about 110. The quantitative addition of the inter- 
Stitial solute was carried out by a procedure developed 
earlier by Powers and Doyle?. The interstitial solute 
was introduced at temperatures between 1000-1100°C. 
The specimen was then homogenized by heating at 
about 1200-1300°C for approximately 1 h. Prior to 
the introduction of the gas, the specimen had been 
annealed and outgassed at about 1800-2000°C for 
eral hours at a residual pressure less than 1X 10-7 
mm Hg. After this treatment and prior to the introduc- 
n of the interstitial, the resistance ratio of the wire 
about 500. An analysis of a similarly prepared 


Powers and Margaret V. Doyle, J. Appl. Phys. 28, 255 


‘J. Metals 9, 1285 (1957). 
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specimen reported by Seybolt gave oxygen content 
6+3 ppm and nitrogen content 52-3 ppm. 

The resistance transition as a function of field at 
constant current density was measured at 4.2°K 
potentiometrically with a noise level less than 0.1 uV. 
Referred to in terms of R/R,, where R is the resistance 
in the transition and R,, the normal resistance, the 
noise level is less than 1 10~ except for data obtained 
with low current densities (J<125 A/cm?) for out- 
gassed niobium. The critical current of the wire speci- 
men in a given field was also measured potentiometric- 
ally and the value taken was that which produced 
approximately 0.1 „V across the potential leads. The 
fields were produced by a small niobium wire (0.004- 
in. diam) wound iron-core magnet (0.040-in. slot) 
designed and built by H. R. Hart, Jr., following an 
original design due to Autler.!! Field direction was 12 
all cases transverse to the wire and to current flow 
Field orientation for ribbon samples, either perpen- 
dicular (H.Lw.s.) or parallel (H||w.s.) to the wide 
surface were obtained by mounting the ribbon in 4 
form permanently fixed between the pole pieces. 1 

The magnetic induction measurements were carried 
out by a technique used earlier by Bean.? A typical 


1? A. U. Seybolt (private communication). J 
1 S. H. Autler, D. B. Montgomery, and G. Ajootian, Quartet y 
Progress Report on Solid State Research, Lincoln Laboratory, 
MIT, July 1960, p. 63 (unpublished). y 
12 C. P. Bean, Marget V. Doyle, and A. G. Pincus, Phys. Rev- 
Letters 9, 93 (1962). 
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sample. prepared for these measurements consisted of 
7-10 wire segments, approximately 1.2 cm long, bundled 
together into a cylindrical form in a paper “straw.” 
The ends of each wire were etched (50% HF, 50% 
HNO;) to remove any strains. The sample cylinder 
was immersed in liquid helium at zero field. Magnetiza- 
tion measurements were made at 4.2°K by flipping it 
in and out of a copper coil (about 2000 turns) coaxial 
with an external steady field. The coil was connected 
to a flux meter whose deflection was proportional to 
the magnetization of the specimen. The magnetic field 
measurements were accurate to +10 Oe; while —4r M 
values were accurate to +5 Oe. 


III. EXPERIMENTAL RESULTS AND DISCUSSION 


A. Influence of Oxygen and 
Nitrogen in Niobium 


1. Resistive Transition at Constant Current Density 

The resistive transition R/R, as a function of trans- 
verse field H and current density J at 4.2°K for four 
wire specimens of niobium containing (a) 0.70 at.% 
oxygen, (b) 1.52 at.% oxygen, (c) 0.23 at.% nitrogen, 
and (d) niobium outgassed and annealed is shown in 
Fig. 1. The resistance decrease in the transition, after 
an initial rise, is apparent for each of the interstitial 
solid solutions of niobium containing oxygen or nitrogen 
at current densities below a maximum value Jmax and 
above a minimum value Jmin. The drop in resistance is 
most pronounced at current densities near Jmin. In- 
creasing the interstitial concentration in the homogen- 
eous solid solution tends to decrease both Jmin and 
Tmax. For example, for specimen Nb-+0.70 at.% 
oxygen, Jmax~ 1000 A/cm? and Jmin~90 A/cm?; while 
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Fic. 3. The resistive transition R/R, as a function of applied 
field at 4.2°K for a niobium wire containing 200 ppm oxygen (a) 
before cold worked diam=0.030 in.), Hes evaluated at J=10 
A/cm? and (b) after cold worked by hammering (diam ~0.027 in.). 
Hes =8.0 kOe- 1.9 Hee at J —200 A/cm’. 


for specimen Nb+1.52 at.% oxygen, Jmax~800 A/cm? 
and Jmin~12 A/cm?. For a given specimen, the field 
corresponding to the maximum decrease in R/Rn, after 
an initial rise, appears to be almost independent of J 
and coincides with Hes the upper field required for 
—4xM —0 (compare Fig. 1 and Fig. 2). 

For niobium outgassed and annealed, no apparent 
discontinuities in the transition are seen even at low 
current densities [Fig. 1(d)]. At these small J values 
the noise level in R/R, is greater than 1X10~. How- 
ever, the anomaly can still be observed in niobium 
containing as little as 200 ppm oxygen [Fig. 3(a) and 
(b)]. When the oxygen is beyond the solubility limit. 
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Fic. 4. The resistive characteristics R/R, as a function of cur- 
rent density and applied tranverse field at 4.2°K for a niobium 
wire (0.030-in. diam) containing oxygen concentration [6.4 at.% 
oxygen plus some nitride contamination revealed by micrograph 
and x-ray studies (Ref. 2)] beyond the solubility limit at the 
temperature of introduction. 


at the temperature of introduction,*™ no resistance 
minima are apparent (Fig. 4). 

The magnetization behavior of these interstitial 
solid solutions? and Nb,!* is that of a Type II super- 
conductor. In such a superconductor, the total sur- 
face energy becomes negative at H,; and a “mixed 
state" ensues. As the field is increased beyond ZH, the 
flux penetration increases until at H.z the transforma- 
tion to the normal state is achieved. The model is based 
on the material being homogeneous and defect-free, 
exhibiting reversible magnetization characteristics. It 
is believed that the resistance R(H) in the ‘mixed 
State" is associated with flux motion or creep of the 
kind discussed by Kim, Hempstead, and Strnad,!° i.e., 
resistance is determined from the rate at which flux 
filaments are moving. Based on this model, increasing 
the interstitial concentration seems to increase the flux 
motion for a given J since R increases [compare 
Fig. 1(a) and 1(b)]. When the solubility limit of the 
interstitial is exceeded, no resistance appears below 


33 A, U. Seybolt, Trans. AIME 200, 774 (1954). 

M E, Gebhardt and R. Rothenbacher, Z. Metalk. 54, 623 (1963). 

16 T. F. Stromberg and C. A. Swenson, Phys. Rev. Letters 9, 
370 (1962). 

15 Y, B. Kim, C. F. Hempstead, and A. R. Strnad, Phys. Rev. 
131, 2486 (1963). 
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H. within the limits of the measurements (compare 
Fig. 4 and Fig. 9, Ref. 2). In the “mixed state" the 
interaction of defects, inhomogeneities, or precipitates 
with flux filaments controls the hysteresis of the type 
described by Bean!^5 and the equivalent critica] 
current.16.19 

Anderson and Kim? have suggested a possible ex. 
planation of the anomalous resistance drop after an 
initial rise based on the flux-creep model. As Hes is 
approached, the lattice flux lines are expected to be- 
come more rigid, slowing down the rate of flux motion, 
Oxygen and nitrogen present in small concentrations 
would tend then to favor this process. 

Above He the variations in resistivity appear at 
smaller current densities dependent on the measuring 
currents. R approaches R, at Hs, the field at which the 
material is completely normal. There are two possible 
explanations for the variation of the R(H) curve in this 
region. One is that recently proposed by Saint-James 
and de Gennes? envisioning a superconducting sheath 
along parts of the sample which disappears at Ha. 
Recently, experimental evidence” has appeared show- 
ing that this remanent superconductivity is basically 
a surface effect in agreement with this point of view. 
Some supporting evidence for this is also presented in 
this work (see below). An alternative explanation may 
be that above H. a real resistance occurs; and that the 
shape of this portion of the R(H) curve manifests the 
destruction of the final traces of superconductivity due 
to lattice irregularities such as dislocations, grain 
boundaries, and inhomogeneities having a higher critical 
field than the matrix. The connected regions would 
play an important role near and just above H.», while 
the discontinuous superconducting regions would be 
responsible for the shape of the curve at the top of the 
transition. 


2. Influence of the Interstitial on H' 


The field at which the resistivity first becomes 
detectable H’ decreases for these specimens as the 
current in the wire increases (Fig. 5). It appears to 
reach a limiting value corresponding to the value Ha, 
obtained from the magnetization curve (Fig. 2). In 
this curve, He is the field at which the flux first pene 
trates the specimen. In this comparison the applied 


11 C, P. Bean, Phys. Rev. Letters 8, 250 (1962). 
18 J. Silcox and R. W. Rollins, Appl. Phys. Letters 2, 231 (1963). 
19 J. Friedel, P. G. de Gennes, and J. Matricon, Appl. Phys: 
Letters 2, 119 (1963). 9 
? pP. W. Anderson and Y. B. Kim, Rev. Mod. Phys. 36, 3 
(1964). m 
(1968) Saint-James and P. G. de Gennes, Phys. Letters 7, 3 
2G. Bon Mardion, B. B. Goodman, and A. Lacaze, Phys 
Letters 8, 15 (1964). 2. 145 
(re 55 F. Hempstead and Y. B. Kim, Phys. Rev. Letters 12, 
ee J. Tomasch and A. S. Joseph, Phys. Rev. Letters 12, 148 
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field value does not contain the small correction due to 
the magnetic field generated by the current in the 
wire. H’ decreases as the interstitial solute concentra- 
tion increases. The resistive transition occurring be- 
tween H’ and the field at which the resistance minimum 
occurs, 7.2, for a given J becomes broader as the solute 
concentration increases [compare Figs. 1(a) and 1(b) ]. 
More quantitative comparisons should be done at the 
same reduced temperature since oxygen present in- 
terstitially lowers the transition temperature.? 


3. Critical Current Density 


The critical current density J, as a function of 
transverse field, of wire specimens of annealed niobium 
and annealed niobium containing various concentrations 
of oxygen is shown in Fig. 6. J. extends to higher fields 
as the oxygen concentration increases. This is no longer 
true when the solubility limit is exceeded. A comparison 
of the curves with their respective magnetization data 
(Fig. 2) clearly defines three portions: (1) below Ha, 
a sharp decrease in J.(H) takes place analogous to the 
behavior in a “soft”? superconductor; (2) the “mixed 
state" region between He and Hs, where the anomalous 
behavior in the threshold-field curve (peak effect) re- 
lated to the resistance minimum discussed above is 
evident in the niobium containing 0.70 at.% oxygen 
and (3) a region above ZH. 

Below H.1, one assumes that the transport current 
J is carried in a thin shell (London penetration depth). 
Between H. and Hes it is believed that the transport 
current is controlled by the interaction of defects and 
flux filaments and that the net flow of current is de- 
pendent on the concentration gradient of flux filaments 
resulting from this interaction. Je is determined as soon 
as the motion of the flux filaments is detected. 
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Above ZH», evidence cited below seems to indicate 
that current could be carried by a superconducting 
layer as suggested by Saint-James and de Gennes.” 
An alternative explanation could be that starting at 
H.» the continuous supercurrent flux filaments in the 
main body of the matrix no longer exist and the current 
carrying mechanism changes to that associated with 
extended defects consisting of higher critical field super- 
conducting structural defects or physical filaments. 
Similar suggestions have been made earlier (e.g., see 


Nb* 0.70 at.%0 
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Fic. 6. Threshold current density-field behavior of some 
annealed niobium wires (0.030-in. diam) containing various 
amounts of oxygen. Annealed and outgassed niobium is also 
shown for comparison (T =4.2°K). 
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F Fic. 7. The resistance characteristics R/R, as a function of 


plied transverse field at 4.2°K for an annealed ribbon of niobium 

:032X(0.007 in.) containing 0.70 at.% oxygen (a) with field 

perpendicular to the wide side (H 1.w.s.); (b) with field parallel 
the wide side (Z||w.s.). Hes values are evaluated at J~10 
‘cm. 
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discussion Ref. 25). J.(H) drops off rapidly in this 
region since the superconducting paths would, pre. 
sumably, not be continuous nor necessarily be char- 
acterized by a uniform critical field. One would expect 
the rate of decrease of J-(H) to depend on the con. 
centration and nature of the defects. 

For the specimens containing oxygen in concentra. 
tions exceeding the solubility limit at the temperature 
of introduction, the critical current has a relatively 
lower value at zero field, remaining almost constant as 
the field is increased, decreases monotonically and then 
sharply at a field approximately corresponding to that 
where the magnetization becomes zero (B—H) (see 
Fig. 9, Ref. 2). For this specimen Zi and Hez cannot 
be clearly identified from the magnetic behavior since 
it exhibits pronounced magnetic hysteresis. At inter- 
mediate field the current density is somewhat higher 
than that of Nb and dilute Nb-O solutions indicating 
perhaps some more effective interaction of the pre- 
cipitates with flux filaments in the Nb matrix. 


B. Influence of Field Orientation 


1. Resistive Transition at Constant Current Density 


a. Nb+0.70 at.% Oxygen-Annealed Ribbon. Figures 
7(a) and 7(b) show the resistance transition of an 
annealed niobium ribbon containing 0.70 at.% oxygen 
for H1w.s. and H||w.s., respectively. R(H) in the 
mixed state for a given J is lower than for the wire 
specimen [see Fig. 1(a) ]. Of the two field orientations, 
it is smaller when H||w.s. The anomalous resistance 
minimum is also smaller for this field direction. While 
H à is independent of field direction, Hes has a higher 
value when H||w.s. In either case, H.3 tends to decrease 
as J increases. If H4; is evaluated at the smallest J, 
namely J=10 A/cm?, the ratio H.3/H.2=1.68, when 
H is parallel to the wide side. This value is in good 
agreement with the Saint-James and de Gennes analy- 
sis?! which predicts H.3/H.2= 1.695, when H is parallel 
to the surface. For H perpendicular to the surface, the 
Saint-James and de Gennes analysis reduces to that 
of Abrikosov with no remanent superconductivity 
expected beyond Hs. In this study when the field is 
perpendicular to the wide side of the annealed ribbon 
[Fig. 7(a)] some remanent superconductivity does 
exist and H.3/H.2=1.37 (J=10 A/cm?). The ratio 15 
smaller than the theoretical value, although the edge 
of the ribbon specimen provide some surface paralle 
to the field. 

b. Nb+0.70 at.% Oxygen-Cold-Worked Ribbon: 
When a ribbon, the same geometry as that above, 5 
made by cold-working niobium wire containing 0: 
at.% oxygen (ie., wire uniformly compressed to i 
flat ribbon) there occurs a decrease in the resistance m 
the mixed state and an enhancement of the resistente 


25 W. DeSorbo, Rev. Mod. Phys. 36, 90 (1964). 
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minimum for either of the two field directions when 
compared with the results obtained on the anncaled 
SPECIMEN [compare Fig. 7(a) with 8(a) and 7(b) with 
8(b)]. The resistance decrease is more pronounced 
when Zi||w.s. of the ribbon. This decrease in R(H) in 
the mixed state upon cold working is consistent with 
the point of view!^ that R is determined from the rate 
at which flux filaments are moving. Increasing disloca- 
tion (or defects) and/or segregation and precipitate 
concentration impedes the motion decreasing R. The 
Nb-O solutions, as prepared, are all supersaturated at 
room temperature (equilibrium solubility at room tem- 
perature is vanishingly small.)?:* Upon cold working, 
the dislocations can help take oxygen out of solution 
since they can serve as sites for segregation and pre- 
cipitation and provide paths for enhanced diffusion. 

The resistive anomaly and its enhancement by cold 
work may be related to the relative orientation between 
the flux filaments in the mixed state and defects with 
that of the field.” On the basis of the Anderson and 
Kim? suggestion, flux line rigidity would not only in- 
crease near H.z with cold work, but it would also be 
anisotropic. With the type of deformation used in this 
work, this relative orientation would be expected to be a 
complex one. The demagnetization coefficient itself 
seems to be important in the low-field region. The small 
change, if any, of H.z upon cold work (see Fig. 9) 
implies, from the Abrikosov model, that the mean free 
path of the bulk specimen has not been altered appreci- 
ably and that possibly only a small fraction of the total 
solute atom segregating to defects are responsible for the 
large changes in the resistive characteristics observed 
in the mixed state. The small changes in Hes with cold 
work accompanied by large changes in magnetization 
peak and hysteresis loop are consistent with earlier 
observations made on some lead-base alloys.” 

For cold-worked ribbon, Hes is larger than that ob- 
served on the annealed ribbon, and the ratio H3/H<e 
equals 1.91 (J=111 A/cm?) for either field direction 
[see Figs. 8(a) and (b)] and is expected to be even 
larger for smaller J. The increase in Hes is anticipated 
if defects participate in the nucleation to the super- 
conducting sheath as suggested by Saint-James and 
de Gennes. The distinction in R(H) between parallel 
and perpendicular field orientations for the cold worked 
ribbon is less apparent in the region Hez to Hoa [com- 
pare Figs. 8(a) and (b)]. These observations are in 
agreement with similar results recently reported by 
Hempstead and Kim” on some substitutional solid 


solutions of Type II superconductors. 
c. Dependence of H'—Nb4-0.70 at.% Oxygen (An- 


i i i ity-field studies 
25 Anisotropic effects observed in current density-fie : 
of some einereacondl alloys [J. J. Hauser and Rod: TUE 
Phys. Chem. Solids 24, 371 (1963) have been related to 
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eae and eiee to changes in the demagnetization 


coefficient. 
2 J. D. Livingston, Phys. Rev. 129, 1943 (1963). 
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Fic. 8. The resistance characteristics R/R, as a function of 
applied transverse field at 4.2°K for a cold-worked ribbon 
(0.0320.007 in.) containing 0.70 at.% oxygen (a) with field 
perpendicular to wide side; (b) with field parallel to the wide side. 
H. values were evaluated at J — 111 A/cm?. 
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Fic. 9. The influence of cold work 
on the magnetization behavior at 
4.2*K of niobium +0.70 at.% oxygen 
(a) 0.030-in. diam wire, (b) same wire 
with diameter reduced to ~0.027 in, 
by hammering, and (c) wire after 
uniformly compressed to flat ribbons 


7000 


0.032 X 0.007 in. 


A1126 WARREN 
Nb+ 0.70 At. % 0 
1000 
Eod (a) 
0 f " i Ji H x ò 
1000 2000 3000 4000 5000 6000 
-500^ 
=a 
> 1000F (b) 
! — 500 SLIGHTLY GOLD WORKED 
_ moon Sis == —— = = 
Y 1000 2000 3000 4000 5000 6000 
-500 
1000 (c) 
HEAVILY GOLD WORKED 
500 
o£ Ce OOOO oot L À 
d 1000 2000 3000 4000 5000 6000 
-500 


APPLIED FIELD (0e) 


nealed and Cold-Worked Ribbons). Changes in H' as a 
function of sample current corresponding to cold 
worked and to annealed specimens for the two-field 
orientations mentioned above are presented in Fig. 10. 

d. Niobium Ribbon—Outgassed and Annealed. An- 
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Fic. 10. The transverse field H^ required for the initial appear- 
ance of resistance (R/R,<1X107%) at 4.2°K as a function of 
ecimen current, J, (amperes) for (a) annealed and (b) cold- 
rked ribbon of Figs. 7 and 8. 
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nealed niobium ribbon, whose interstitial concentration 
is less than 5 ppm, does not exhibit the resistance 
anomaly when the field orientation is either perpendi- 
cular or parallel to the wide side [Figs. 11(a) and (b)]. 
The resistance in the mixed state below H.» (see also 
Fig. 12 for H. evaluated from the magnetization data) 
is larger when the field is perpendicular to the wide side. 

Above H., the resistance is smaller when H||w.s. of 
the ribbon. This could be due to the existence of a 
superconducting sheath predicted for this field orienta- 


Taste I. The influence of oxygen concentration in niobium 
wires (0.030-in. diam) on the ratio Hes/Hez at 4.2°K (see note 
added in proof). Magnetic field perpendicular to wire and to 
flow of current. 


H, a/ H. c? 
T.(°K) Hz Gis 10 
(Ref. 2) (kOe) Ke A/cm’) 
Nb 
wire 9.4, 2.70 1.24 2.3: 
ribbon 
H|lw.s. 9.45 2.70 1.24 2A 
ribbon 
Hiws. 9.45 2.70 1.24 1.90 
Nb-+200 ppm O 
wire 2.00 
Nb+0.7 at.%O 
wire Qs 70 3.64 1.60 
ribbon 
Ellas: ES 70 1.68 
ribbon 
Hiwss. 8.75 7.0 15 
Nb--1.52 at.950 7 
wire 8.0, 9.6 6.08 121 
Nb+6.4 at.%O 2 
wire ~9.0 «t 
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= Determined magnetically. 
b For value of He used to evaluate K, see Ref. 2. 
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Fic. 12. The influence of cold work on the magnetization char- 
acteristics of niobium containing less than 5 ppm oxygen content 
for an annealed specimen (0.030-in. diam wire) and the same 
specimen cold worked by compression (ribbon 0.032X0.007 in.) 
T=4.2°K. 


tion?! However, the ratio Hs/He2 is equal to 2.44 
(J=10 A/cm?) which is larger than the theoretical 
value (see note added in proof). It is interesting to note 
that for pure niobium K(=H.2/V2H.) is small com- 
pared to the values observed in the Nb-O solutions 
(see Table I). For Hw, Ha/H;a-1.85 (J=10 
A/cm?) smaller than that observed for the parallel 
field orientation but still larger than the theoretical 
value (see note added in proof). 

e. Niobium Ribbon—Outgassed and Cold Worked. For 
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Fic. 13. The field H’ at which the resistivity first appears as a 
function of current is summarized for Nb ribbons shown in Fig. 11. 
The results for Nb+6.4 at.% O (wire 0.032-in. diam) are also 
shown (T =4.2°K). 


the cold-worked ribbon, whose interstitial concentration 
is less than 5 ppm, R(H) [see Fig. 11(c) and (d) ] could 
not be detected below H.z (Fig. 11) within the limits 
of the measurements. Again, such results are in agree- 
ment with the notion that in the mixed state, R(H) is 
dependent on the rate of motion of flux filaments.!^ In 
niobium with low interstitial concentration the dis- 


Fic. 14. Threshold current: 
density field behavior of severa 
specimens of niobium wire 
(0.030-in. diam) and ribbons 

RIBBON (01032X0.007 in.) containing 
(COLD WORKED) 0.70 at.% oxygen at 4.2 K. 
Ha and Hee determined from 
magnetization data. 
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locations are effective in reducing this rate. In these 
cold-worked ribbons H., and the ratio /1,,/H,s are 
larger than the corresponding values observed in an- 
nealed ribbons. Again, the distinction between the field 
orientation of the resistance transition is less significant, 
similar to the behavior in cold-worked Nb-O ribbons. 

f. H' Dependence on Field Orientation in Niobium 
(Ribbon). The field H’ at which the resistivity first 
appears as a function of sample current is summarized 
in Fig. 13 for annealed and cold-worked niobium. The 
limiting value of H’ at high current approaches H for 
the annealed wire. For the annealed ribbon, the limiting 
value may be smaller than H4 for either field direction, 
while for cold-worked ribbon it is larger than H4. For 
comparison, the results are also shown for niobium 
containing oxygen beyond the solubility limit at the 
temperature of introduction. H’ for this material is 
large as in cold-worked Nb. 


2. Critical Current Density—Annealed and 
Cold-W orked Nb and Nb--0.70 at.% Oxygen 


The threshold current-field data J.(H) for cold- 
worked ribbons of both Nb-++0.70 at.% oxygen and Nb 
are summarized in Figs. 14 and 15, respectively, where 
they are compared with the data obtained on similar 
but annealed ribbon specimens. The data on annealed 
wire specimens are also shown for comparison. The 
values of H «ıı and H «2 were obtained from the magnetiza- 
tion data. Cold working increases the critical current 
carrying capacity in the mixed state in both materials 
by introducing dislocations. These play a role in 
stabilizing the flux filaments in the mixed state re- 


10 
10 1 
Fic. 15. Threshold current density- e è 
field behavior of several wire (0.030-in. Em 
diam) and ribbon (compressed to £ 
0.030Xx0.007 in.) specimens of ni- = 10 
obium, each containing less than “2 
ppm interstitial concentration, 
showing the influence of cold work and 
field direction (T=4.2°K). 10! 
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ducing R(H). In Nb-O solutions, some segregation and 
precipitation at these dislocations may be participating 
in this process. 

The anomolous increase in J.(/7) curve (peak effect) 
related to the resistance minimum, discussed above, is 
evident in the interstitial solid solution even before cold 
work (Fig. 14). This is in contrast to the behavior in 
substitutional solid solutions where it is usually neces- 
sary to cold work the material to bring out the anomaly 
where it also occurs at H.. However, upon cold 
working these interstitial solutions (ribbons), the peak 
is enhanced when the field is perpendicular to the wide 
side, always coincident with smaller J, values in the 
mixed state below H.z. For this same material the peak 
virtually disappears when the field is parallel to the 
flat surface, but J.(H) is now larger in the mixed state. 
Mendelssohn (see discussion of Ref. 25) attributes the 
peak effect, occurring only in the transverse field, to a 
more favorable relative orientation assumed by the 
direction of the superconducting paths and the field 
as H. is approached (see also Hauser and Treuting, 
Ref. 26). An alternative explanation is that proposed by. 
Anderson and Kim? (see above). In either outgassed 
or cold-worked niobium no anomalous variation is 
evident in the J,(H) curve for either of the two field 
orientations used to examine the ribbons (Fig. 15). 

Above H. field orientation effects on J,(H) are 
quite evident for both Nb-4-0.70 at.% oxygen and Nb 
after each has been annealed (see Fig. 14 and 15, 
respectively). The ribbon specimens show a greater 
ability to carry superconducting current when the 
field is parallel to the surface which would be expected 
from the Saint-James and de Gennes model.” In cold- 
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Fic. 16. Resistance characteristics 
R/Rn of a wire specimen (0.030-in. 
diam) of Nb+0.70 at.% oxygen; (a) 
quenched in helium exchange gas 
from an homogenization tempera- 
ture 1100°C to room temperature; 
(b) quenched and cold worked 
by uniform compression roughly 
to a rectangular form approximately 
0.028 0.032 in.; (c) after (b) the 
same wire was given a low tem- 
perature anneal, 3 h at 170°C 
(72220 A/cm?) (T—42?K). At 
this current density 7/:5—10.9— 1,56 
Hes. 


2 3 4 5 6 if 8 9 
H (kOe, L) 


worked ribbons the field orientation effect on J.(H) is 
smaller and for Nb ribbon it is nonexistent just below 
H. with the deviations going over into the opposite 
direction for small /.(H) values. Similar results have 
been observed in cold-worked Nb+3 at.% Ti.” These 
results corroborate the dependence of resistance on 
field orientation above H.z described above. As the 
defect concentration increases, the distinction between 
R(H) for | and .L field orientation becomes less 
significant. 
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Fic. 17. Threshold current-density field behavior of the same 
K specimens shown in Fig. 16 (T=4.2°K). 
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C. Ratio H.3/H.2 and Oxygen Concentration 
in Niobium 

The ratio H;3/H,» observed on annealed niobium 
wires containing various concentrations of oxygen are 
summarized in Table I. The values are approximately 
equal to those observed on ribbon samples when H||w.s. 
(e.g, Nb and Nb--0.70 at.% oxygen). The ratio is 
largest for pure niobium which also has the lowest K. 
It decreases with increasing K and increasing solute 
concentration (see note added in proof). 


D. Strain Aging Nb+0.70 at.% Oxygen 


Some preliminary experiments have been carried 
out to vary the degree of interstitial segregation to 
dislocations. This was done by studying both R(Z) and 
J-(H) on a wire specimen, Nb-+0.70 at.% oxygen, 
(a) quenched, (b) then cold worked, and (c) annealed 
at 170°C for 3 h in a pressure less than 1X 1077 mm Hg 
R(H) data are summarized in Fig. 16 for J — 220 A/cm. 
The resistance in the mixed state is approximately the 
same before and after cold working. However, the re- 
sistance anomaly in the latter is much more pronouncec- 
Strain aging at 170°C decreases the resistance in the 
mixed state by a significant amount, masking out the 
resistance minimum. : 

Above H.;, the data show little, if any, change P 
resistance upon quenching and cold work for the current 
density shown. The strain aging process does, howevel 
tend to decrease R. H.3/H.2 for the J shown is appro: 
mately 1.42 for these wire specimens, both before an 
after quenching; increasing to approximately 1.6 after 
strain aging. 


The process of homogenizing, cold working and low 


———— 
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temperature anneal on J.(H) and on the magnetization 
loop are summarized in Figs. 17 and 18, respectively. 
It is interesting to note that strain aging brings J.(H) 
in the mixed state up to where it was for slow-cooled 
specimens (compare Fig. 17 with Fig. 14). This result 
cannot be deduced from the corresponding magnetiza- 
tion loops (compare Fig. 16 with Fig. 8). It seems that 
structural changes brought about by cold working and 
by strain aging responsible for changes in J,(H) in the 
mixed state also cause changes above H.z (see Fig. 17). 
The results indicate that in these specimens, if a 
superconducting sheath is carrying the current above 
H. this sheath appears to be dependent on structural 
changes. Below H.1, the current, carried in the penetra- 
tion depth, appears to be independent of these changes. 

The results, both R(H) and J.(H), suggest that the 
anomalous peak effect, or resistance minimum, is 
strongest at some intermediate stage of the process of 
segregation and precipitation at dislocations. This con- 
clusion is in agreement with the suggestion made 
earlier? that a solute or impurity atom is probably 
necessary for the appearance of the anomaly in both 
interstitial as well as substitutional solid solutions. The 
more wide-spread appearance of the anomaly in the 
former solutions has been attributed” to the far greater 
mobility (e.g., see Ref. 28) of the interstitial atom. This 
makes the interstitial solute atom segregate more easily 
to structural irregularities, such as dislocations. à 

As mentioned above, the anomaly is readily evident 
in niobium containing as little as 200 ppm oxygen; but 
not in Nb containing less then 5 ppm oxygen. Recent 


23 G, B. Gibbs, D. Graham, and D. H. Tomlin, Phil. Mag. 8, 


1269 (1963). 
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transmission electron microscopy studies? have shown 
that the presence of relatively small amounts of inter- 
stitial impurity (probably as little as 150 ppm) has a 
strong influence on the annealing structuresof the metal. 
Dislocations were reported to act as sinks for inter- 
stitials present as interstitial atmospheres or pre- 
cipitates. These would also tend to retard the rate of 
the recrystallization process and probably account for 
a wide variety of structures seen after high-temperature 
anneals. The appearance of the anomaly in niobium re- 
ported earlierf-* was probably due to the presence of 
small but significant amounts of interstitial impurities 
in the metal. 

At present, the experimental results outline the 
dependence of the peak effect on various structural 
features, but a satisfactory theory has not been de- 
veloped. Some proposed alternative explanations have 
been cited above. Hauser and Treuting* have suggested 
that anisotropy in defect structure is probably responsi- 
ble for the minimum (“valley effect") in the J.(H) 
curve but reported no evidence for the presence of a 
second phase. Preliminary optical microscopy studies 
on some of the specimens reported here revealed pre- 
cipitate particles only in cold-worked specimens 
(ribbons) occurring at a high density in certain areas 
presumably regions of greater dislocation density. 


E. Quenching and Aging Nb 4-0.70 at.% Oxygen 


Since strain aging at temperatures as low as 170°C 
has a marked effect on the mixed state, some additional 
experiments were carried out to study aging of quenched 
specimens. Some preliminary work on varying the 
homogenization or solution temperature, T, was also 
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attempted. In these experiments T, was decreased to 
liquid nitrogen temperature in less than 20 sec. 

The resistive transition of a sample Nb-4-0.70 at.% 
oxygen quenched from 7,=1000°C is shown in Fig. 
19(a). When compared to a similar specimen cooled 
from the same temperature to room temperature at a 
slower rate (approximately 1 h) [Fig. 1(a) ], itis evident 
that increasing the Supersaturation, as well as the 
degree of disperseness of the solute, increases R(H) in 
the mixed state. In the rapidly cooled specimen, a 
secondary dip is evident below the primary one occur- 
ring at Hes Upon aging at room temperature for 23 


Nb * 0,70 a1.% 
QUENCHED 


R Hey [1.O kOe L6 Hs 
days, R (II) remains essentially unaffected, although the "Rn 

secondary dip has disappeared [ Fig. 19(b)]. A low tem- 

perature anneal, 3 h at 170?C, reveals a small decrease 103 


in R(H), a reappearance of the secondary peak, and 
definite changes above H.z at low current density 
LFig. 19(c) ]. Upon cold working the specimen and then 
annealing (3 h at 170?C), pronounced decreases are 
evident in the mixed state [ Fig. 19(d)] similar to the 
strain aging results shown in Fig. 16. 

In the region H,2< H€ H. the process of quenching, 
aging, annealing, and strain aging decreases R(H). 
Structural changes responsible for resistance changes in 
the mixed state are also responsible for changes ob- 
served above H.» The results again emphasize the 
importance of segregation and precipitation at dis- 
locations on reducing resistance both in the mixed-state 
region as well as in the region above H.». If the super- 
conducting sheath proposed for Type II supercon- 
ductors” is responsible for the variations of R(H) above 
H. then the sheath is influenced by the segregation 
and precipitation at dislocations. However, for these 
wire specimens, H. (J=4.4 A/cm?) appears to be un- 
affected by the various structural changes introduced. 
The ratio H.3/H,2 remains constant with a value 
approximately equal to 1.6. 

Upon increasing the homogenization temperature 
from 1000 to 1200°C and quenching rapidly, R(H) in 
the mixed state tends to remain unchanged but the 
resistance minimum is now larger [compare Fig. 20(a) 
with Fig. 19(a)] Increasing the homogenization 
temperature does not appear to change the ratio, 
H./H,,—1.6. However, in this case the resistive 
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. is di truc- 
relatively small amount. This different a this (0.030-in, diam) containing 0.70 at.% oxygen (T=4.2°K); (s) 
ture may also have a higher Ho. Upon heating quenched in helium exchange gas from an homogenization tem 
specimen at 170°C for 3 h, the minima above H.s perature of approximately 1200°C; (b) then heated 3 h at 170°C. 


disappear [ Fig. 20(b)]. A relatively larger decrease in — Ha evaluated at J=10 A/cm’. 


Fic. 20. The resistance characteristics R/R, of niobium wire 
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R(H) in the mixed state, probably due to strain aging, 
takes place over that observed for an identical treatment 
given a similar specimen originally quenched from a 
lower T, [compare Fig. 20(b) with Fig. 19(c)]. Note 
also that the value of 1,3 or H-3/H-2 remains unchanged. 


F. Etched Specimens 


The anomalous dip in the resistance transition of 
niobium reported earlier® has been known to be in- 
fluenced by etching (see discussion Ref. 25). Nb-O or 
Nb=N specimens prepared in this work when etched 
with a solution consisting of 8 HNOs, 2 H:O, 1 HF 
showed only little changes in the dip characteristics 


a even when the diameter was reduced by one half 
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Fic. 21. The resistive characteristics R/R, as a function of 
applied transverse field at 4.2°K; (a) before etching specimen 
of niobium +0.70 at.% O, wire (0.030-in. diam); (b) after 
etching diameter reduced from 0.030 to 0.015 in. diam; (c) the 
field H' required for the initial appearance of resistance 
(R/RnS1X10%) at 4.2°K as a function of specimen current 
I, (amperes) before and after etching. 


[compare Figs. 21(a) and (b). For example for J=110 
A/cm?, the field at which the minimum occurs, changes 
Írom 6.95 kOe before etching to 6.75 kOe after etching. 
This small decrease indicates only a small change in the 
average solute concentration. Assuming that changes in 
He» reflect average changes in solute concentration, the 
experiment could be useful in investigating diffusion of 
nonmetallic solutes in superconducting transition 
metals. After etching a significant increase in R(H) in 
the mixed state, however, does occur, indicating less 
effective pinning of flux filaments. Above ZH», the 
influence of etching on R(H) is small or negligible. 
H.s/ H o remains unchanged and equal to approximately 
1.6. 
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H' as a function of specimen current exhibits a de- 
creasing trend upon reducing the wire diameter by 
etching [ Fig. 21(b)]. Its limiting value may be smaller 
than Ha. 


SUMMARY 


1. Oxygen and nitrogen as interstitial solutes in 
niobium cause the metal in transverse field to exhibit 
anomalous resistance minima or critical current maxima 
(peak effect). The anomaly is evident in niobium con- 
taining as little as 200 ppm oxygen but not in niobium 
whose interstitial concentration is approximately 5 ppm. 

2. The anomaly appears at H.z, the upper critical 
field of these Type II superconductors determined 
magnetically. 

3. The anomaly is increased by cold working but 
decreased by strain aging (3 h, 170?C) and by rapid 
quenching. These results suggest that the anomaly is 
strongest at some intermediate state of the process of 
segregation and precipitation at dislocations. 

4. The resistance (R) in the mixed state is decreased 
by exceeding the solute solubility limit, by cold working 
and by strain aging at 170?C; it is increased in the 
mixed state by introducing more interstitial solute, by 
quenching and by etching. 

5. Small changes, if any, of H.z upon cold work seem 
to indicate that only a small fraction of the total solute 
atom segregating to defects are responsible for large 
changes in both the resistive and critical current 
characteristics. 

6. For an annealed ribbon of niobium containing 0.70 
at.% oxygen (K —3.6) and for field orientation parallel 
to the wide side, H.3/H-2 gradually increases for de- 
creasing current density reaching a value 1.68 for 
J=10 A/cm?, in good agreement with the predicted 
value of Saint-James and de Gennes. For transverse 
field, H.3/He2 is equal to 1.37. 

7. For an annealed niobium ribbon containing less 
than 5 ppm oxygen (K=1.2s) for field orientation 
parallel to the wide side, Hes/H2=2.44 (J=10 A/cm’), 
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larger than the predicted value. The value is approxi- 
mately equal to the ratio observed for an annealed 
niobium wire in a transverse field. Niobium wires in this 
field orientation show a dependence of H.3/H.2 on 
oxygen concentration, varying from 2.44 for pure 
niobium (K — 1.24) to 1.21 for niobium containing 1.52 
at.% oxygen (K=6.03). It is possible that inhomo- 
gencities in the sample where the surface is characterized 
by one K and the interior by another could artifically 
force this ratio in either direction (see note added in 
proof). 

8. In cold-worked ribbons of niobium +0.70 at.% 
oxygen, or cold-worked niobium, the distinction, above 
Heo, in RT) or J.(H) for field orientations parallel or 
perpendicular to the wide side becomes less significant 
than the corresponding values observed for annealed 
ribbons. Cold working increases H.3/H.2 for either 
field orientation. 

9. Structural changes brought about by rapid quench- 
ing, aging, strain aging responsible for changes in J,(H) 
or R(H) in the mixed state also cause changes in the 
region H.a<H <H. but do not seem to affect the value 
of H.3=1.6 H. for wire specimens (Nb-F-0.70 at.% 
oxygen) in transverse fields. 

Note added in proof. Subsequent results carried out 
by the author on both pure niobium and niobium con- 
taining oxygen in concentrations below the solubility 
limit at the temperature of introducing oxygen show 
the value 4 5/H.; is independent of x. The value is in 
good agreement with the Saint-James and de Gennes 
theoretical value 1.695.” 
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The wave-packet nature of an experimental beam is discussed. The theory of wave-packet scattering is 
applied to wave packets with exponential time dependence scattered from resonances in both the isolated- 
resonance and overlapping-resonance regions. It is shown how a sequential description of scattering de- 
pends on some of the resonance parameters in the scattering amplitude. The meaning and usefulness of 
some experimentally possible wave-packet experiments is discussed. 


1. INTRODUCTION 


CATTERING of a particle from a many-body 

system can be divided into two types with respect 
to a time interval 7 and a corresponding energy interval 
6(~i/7).1 If the scattering amplitude as a function of 
energy varies rapidly over the energy interval ô, the 
Scattering is said to be compound scattering; if the 
amplitude varies slowly over this interval, we have 
direct scattering. 7 is several orders of magnitude 
greater than the time it takes the incident particle to 
traverse a distance the size of the scatterer in free 
space. 

In most actual scattering experiments time is not 
resolved. The incident beam has an energy spread. The 
cross section can be regarded as the weighted sum of 
cross sections for independent experiments, each with 
definite incident energy. This will be discussed in Sec. 2. 
It is still possible to define a time delay Aż according to 
the definition of Wigner,? 


At=—ihi(d/dE) InS(E) , (1) 


where S (E) is the S-matrix element for the scattering 
problem. This time delay may be thought of as the 
time it takes for the phase of one incident wave to 
catch up with that of another whose energy differs from 
it by an infinitesimal amount, and whose phase shift 
in the scatterer is therefore infinitesimally different. 
If the S-matrix element is divided according to some 
physical prescription into a rapidly and a slowly 
varying component, this definition is difficult to apply 
and it certainly does not tell us anything about each 
‘component separately. The rapidly varying component 
corresponds to a large At, that is, to particles that spend 


. a long time on the average in the scatterer, and the 


slowly varying component corresponds to a small AL, 


——— 


= —*Su ported in part by the U. S. Atomic Energy Commission 


and the Australian Commonwealth Scientific and Industrial 
Research Organization. 

1 For example H. Feshbach, C. E. Porter, and V. F. Weisskopf, 
Phys. Rev. 96, 448 (1954). 


3 F. P. Wigner, Phys. Rev. 98, 145 (1955). 
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that is to particles that pass rapidly over the scatterer 
without significant time delay. 

An experiment which measures the relative amount 
of direct and compound scattering has been suggested 
by Eisberg, Yennie, and Wilkinson? The experiment 
also defines an energy interval which is the reciprocal 
of the time delay. The information is obtained from 
the energy spectrum of bremsstrahlung from elastic 
scattering of charged particles. 

One would like to make a classical picture in which 
particles are described as being in the scatterer for 
varying times. To do this one must make a wave-packet 
argument. It will be shown in Sec. 2 that the physical 
situation corresponding to a meaningful wave-packet 
argument is necessarily a time-dependent scattering 
experiment. 

Wave-packet arguments are made for example by 
Friedman and Weisskopf for the case of shape-elastic 
and compound-elastic scattering. They show that for 
nonoverlapping levels (resonances) of the compound 
system the time delay is the average over the beam 
energy spread of the time delays for the individual 
levels. If time delay is defined according to Wigner's 
definition, this result is obtained at once. It has been 
done explicitly for example by Goldberger and Watson.’ 

If, however, time delay is defined as the average 
time delay in the emerging of a wave packet in a 
hypothetical time-dependent experiment complemen- 
tary to the usual energy-dependent experiment, then 
the result is not so clear. The delayed wave packet 
would be expected to interfere with the immediately 
scattered wave packet from the shape elastic scattering. 
It is commonly stated that if the two wave packets are 
sufficiently short in time they will not interfere.9 This 

* R. M. Eisberg, D. R. Yennie, and D. H. Wilkinson, Nud. 
Phys. 18, 338 (1960). 

iF. L. Friedman and V. F. Weisskopf, Niels Bohr and the 
Development of Physics (Pergamon Press, Ltd., London, 1959). 
d M. L. Goldberger and K. M. Watson, Phys. Rev. 127, 2284 

6 oe example, H. G. Preston, Physics of the Nucleus (Addison 
Wesley Publishing Co., Inc., Reading, Massachusetts, 1 


R. K. Adair, S. E. Darden, and R. E. Fields, Phys. Rev. 96, 50 
(1954). i 
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situation is now extreme hypothetical. It requires very 
good time definition in the experiment. In an actual 
experimental situation, time is undefined and the 
wave packets interfere completely. It may also be asked 
whether the scattering amplitude, and hence, for ex- 
ample, the angular distribution will not be dependent on 
the width of the incident wave packet, since, for short 
wave packets (in time), we might not expect such 
complete interference as for long wave packets." 

The question of what actually happens to a scattered 
wave packet has been investigated by Sasakawa.* He 
shows that for the scattering of a wave packet from an 
isolated resonance, the variation of the cross section 
with energy is characterized not by the width of the 
resonance but, in addition, by the width of the wave 
packet. In Sec. 3 we will re-derive this result using a 
different mathematical technique and Lorentzian wave 
packets. The Lorentzian shape is one shape for the 
energy spectrum that can be realized experimentally. 

In Sec. 4 we will consider the scattering of wave 
packets from a system in which the compound levels 
overlap. This gives new insight into the optical model, 
which is defined for this purpose as the model for which 
the time delay in the complementary time-dependent 
experiment would be small. Our treatment is more 
general than Sasakawa's treatment of the same situa- 
tion in a special case. 

The usual definition of the optical-model scattering 
amplitude? is that it is the average of the actual scatter- 
ing amplitude over an energy interval 6. In the actual 
situation, amplitudes for different energies are in- 
dependent and it is the cross sections that are experi- 
mentally averaged. The meaning of the energy averaging 
process will be discussed. 

In Sec. 5 we will discuss the possibility of performing 
scattering experiments with time definition which are 
complementary to the usual ones in which energy but 
not time is resolved. One such experiment has been 
suggested by the present authors.” 


2. DEPENDENCE OF THE SCATTERING CROSS 
SECTION ON THE INITIAL BEAM 
CHARACTERISTICS 


A beam consists of particles whose energy is defined 
within certain limits. The time of arrival of particles 
at a specific point may also be defined within limits by 
modulating the amplitude of the beam. In general the 
limits on the time and energy resolution of the beam 
are wider than those imposed by the uncertainty 
principle. For example some particles may be moving 


} i the International Symposium on 
7 I. E. McCarthy, Proceedings of vitse o pecu 


Direct Interactions and Nuclear Reaction I 
(Eos a Publishers, Inc., New York, 1963). 2i 
8 T. Sasakawa, Progr. Theoret. Phys. Suppl. 11, 69 (1959). 


] . 31, 893 (1959). 
outs le G. E. Brown, Rev. Mod. Phys. 31, 
19 L R Dodd and I. E. McCarthy, Phys. Rev. Letters 12, 136 


(1964). 
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faster than others because they were accelerated a little 
more in the accelerator. The fast and slow beams could 
in principle be separated by a magnetic spectrometer. 
This type of energy uncertainty will be called classical 
uncertainty. On the other hand, if the time of arrival of a 
particle at a point is defined within a time interval 7, 
we cannot in principle measure its energy with a 
definition better than 6=h/r. This type of uncertainty 
will be called quantal uncertainty. 

A beam with both classical and quantal uncertainty 
is described by a collection of wave packets with 
different mean wave numbers k; whose position at tirne 
Lis measured with respect to different initial position 
vectors r; The value of r; specifies the starting time 
of the wave packet at the source. One such wave 
packet may be written 


tutt) = Qe) Í A(ks5; k) expLik- (r—r))Pk. (2) 


Suppose that the number of packets of type j is 
given by z where 


n-;N. (3) 


N is the total number of wave packets. 

We wish to know the cross section for a scattering 
experiment with an incident beam which can be 
described in this way. For simplicity we will consider 
only one spin state. We must be careful to distinguish 
two cases. 

Case A. If the r; are such that the wave packets do 
not overlap both before and after scattering, the 
scattering of each wave packet may be considered as 
an individual event. The total cross section is the sum 
of the cross sections for the scattering of each wave 
packet. 

Case B. The r; are such that the wave packets 
overlap. In this case different wave packets will 
interfere and the cross sections for individual wave 
packets do not add incoherently. 

The mathematical description of these cases is as 
follows. For case A we will consider wave packets with 
the same value ro of r; and describe the beam by means 
of a density matrix. The density matrix at time / is 


p (rr^) =}; ojo; (r,t) £0j* (r^i) * (4) 


According to the standard theory of wave-packet 
scattering, £o; is given in the case where the space 
occupied by the wave packet is large compared to the 
size of the scatterer but small compared to the distance 
of the detector by 


£oj(r,)) = Qi fA (k;,ô; k) exp(—ik-ro—iEt/h) 


X[exp(ikr)/r]/(k,2)dk. (5) 

11 See, for example, E. Merzbacher, Quantum Mechanics (John 

Wiley & Sons, Inc., New York, 1961). A review of general wave- 

packet scattering with applications to nuclear reactions has been 

given by N. Austern in Selected Topics in Nuclear Theory (Inter- 
national Atomic Energy Agency, Vienna, 1963). 
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J(k,Q) is the scattering amplitude for scattering of 
particles into a detector whose angular position is Q. 
A(k;,5; k) is a weight function which confines the wave 
packet to a volume in momentum space centered at 
k; and of width ô. à is the quantal uncertainty. 

We will not be interested in the angular uncertainty 
in the wave packet, so we will write 


8(0,.—09)0 ($1.— po) 


A(k;,ô; k) 
(58; k) 27k? sinb, 


A(k;6;k), (6) 


where (6;,¢:), (00,00) are, respectively, the angular 
coordinates of k and k;, the beam direction. 

The probability P;(/) of detecting a particle at time 
lat the detector whose position is ra is, for the wave 
packet 7, 


P;(Q)=N | £oj(raff redo. (7) 


This defines the normalization JV; of toj. The differ- 
ential cross section is 


(de/dQ)(k,5) Nr Í [zos(ra) M, — (8) 


—o 


where vois the group velocity of the wave packet. 
We may write the differential cross section for the 
beam described by the density matrix (4) as 


da/dQ = urë | P (rara, Ddt . (9) 


Substituting from (4) and interchanging the order of 
summation and integration we have 


de/dQ— 2396 Suae | | £oj(ra,L) K 


=>}; $;do (k55)/dQ. (10) 


This is the result stated above for case A. Since we 
are only interested in the differences of the peak 
energies Æ; for different wave packets j, we may 
replace the sum in (10) by an energy integral. 


A d E! dE 
= | FALDA (11) 


dQ 


The cross sectionis the energy average of the cross 
sections for individual wave packets with different 
peak energies. In particular, for beams with negligible 
quantal uncertainty the cross section is the energy 
average of the cross sections for scattering events with 
plane wave and outgoing spherical wave boundary 
conditions. For this reason we will call $, the width of 
the weight function ®(Z), the classical uncertainty. 

We will now consider the more general case B and 

see under what circumstances the distinction between 


k- classical and quantal uncertainty can be maintained. 
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In terms of plane-wave states 7(k), the most genera] 
density matrix is 


so | [ak mee. — qa 


We wish to write (12) in diagonal form in terms of 
wave-packet states £;;. 


pri) - Diag jh". (13) 


We must know if this diagonalization is unique. If 
so, we can assign a definite classical weight factor 
with uncertainty @ and a definite quantal weight 
factor A with uncertainty 6. If not, the distinction js 
meaningless. 

In mathematical terms the question is as follows, 
Can the Hermitian matrix p(1,1’,t) in (12) be diagonal- 
ized by more than one transformation of the type (2) 
of the basis vectors? A necessary condition for the 
diagonalization to be unique” is that the states £;; must 
be orthogonal like the states 7. 

In fact we have 


f Lite dir (Qn) if A(k;8;)A(k;,3;k)* 


Xexp[— ik: (r:—rv)]dk. (14) 


The overlap integral in (14) is approximately zero 
if the wave packets have sufficiently different starting 
times given by r; and ry because of the rapid oscillations 
of the integrand. 

In case A the wave packets have a definite physical 
significance. The quantal uncertainty leads to altera- 
tions in the usual cross sections, as will be shown in the 
succeeding sections. In case B the wave packets may 
be regarded as mathematical aids to the visualization 
of the scattering process. For example we can de- 
compose a plane wave into an infinite number of 
overlapping wave packets and then follow the propa- 
gation of one of these packets. Nevertheless, this would 
be without physical significance because the scattering 
amplitudes for all the packets must be recombined to 
obtain the cross section. 

The vital point is that the wave packets both before 
and after the scattering must be quite distinct in space 
and time for the quantal uncertainty to have any 
physical significance. 


3. SCATTERING OF A WAVE PACKET 
EROM A RESONANCE 


In the usual theory of scattering it is assumed that 
(E) in Eq. (5) varies much more slowly with energy 
than the wave packet amplitude factor A(k;,ô; k); that 
is, that the quantal uncertainty ô is negligible in com- 


12 U. Fano, Rev. Mod. Phys. 29, 74 (1957). 
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parison with the width of the energy fluctuations of the 
scattering amplitude. 

We will consider the case where 6 is comparable with 
n. the width of the compound state |s) of the system 
comprising the incident particle and the scatterer 
whose energy e, is near the central energy Eo of the 
wave packet. In the present section we will suppose that 
only one state |s) contributes to the scattering ampli- 
tude at the energies covered by the wave packet."^ 
We will be interested in only one wave packet 7=0. 

Since we are interested in the energy rather than the 
momentum of the wave packet, it will be convenient 
to transform the integral in (5) to an energy integral. 
The transformation is trivial when & is proportional to 
E as it is for photons (E=ñck). 

For our discussion of hypothetical time-dependent 
experiments with finite-mass particles complementary 
to the usual energy-dependent experiments, we canmake 
the assumption, following Friedman and Weisskopf,‘ 
that the source and detector are sufficiently close to 
the scatterer for wave packet spreading to be negligible. 
We will neglect the final term in the following expansion 
of E: 


h2 
ED e+ [kk]. (15) 
m 


Photon wave packets do not spread for practical 
purposes. 

We must now consider a particular form for the 
energy amplitude factor A (Eo ô; E) corresponding to 
A(ko ô; k). Wave packets made by electronic means 
(see Sec. 5) would have a rise and decay function in 
time that is something like an exponential. We will 
therefore consider a wave packet that has an ex- 
ponential rise and decay, with time constant 1/6. We 
will assume that the beam is switched off as soon as it 
has attained full strength, so that there is no time for 
which the beam intensity is constant. The experimental 
difficulty is in fast switching on and off. Since we want 
as short a packet in time as possible, this is then the 
most realistic form. The energy amplitude factor for 
this form is 

6/27 
(E— Eo + 0/4 


One shape of wave packet is already available in 
nature. This is the photon wave packet from the decay 
of a level |s} of a many-body system. It has a halí- 
exponential time spectrum. If the moment of excitation 
of the level is taken as zero time, then the probability 
of emission of a photon decays exponentially with a 
time constant Z//T,. In this case we have 


i/2m 
E— Es4-iT,/2 : 


(16) 


A (Eo ; E) = 


(17) 


A (Eo Ts; E)= 


i i has been 
12a Vote added in proof. The decay of a single resonance 
EAA by R. E Newton, Ann. Phys. (N. Y.) 14, 333 (1961). 


We will use the form (16) for most of our discussion. 
For the scattering amplitude we will use the expan- 
sion of Siegert, Humblet, and Rosenfeld,” 


1 
SERCE), 9 |= 2. (21+ 1 )P cos) 
jl 


Rin 
x] cae. —l (18) 


E— ei MEL 


The nonresonant term C;(£) varies slowly with energy. 
The splitting off of this term is not unique and can be 
made according to some physical prescription. Equation 
(5) now becomes 


K z ô 
Etr) = E(2t4+-1)Plcos) | C ITEM 
2 o (E— 


T Fa) (E— E) 
m Rin 
xexyiEX.] cit ow (19) 
B= eiit 


where 
X 2 [(ro-7)/vy—t]/h, 
E; = Esg— 16/2 , 
KK-1/8xr*. (20) 


We have dropped the subscripts on £(r,t) because we 
are now interested only in one particular wave packet. 

The integral in (19) is now in a convenient form for 
contour integration if we make the approximation of 
extending the lower limit to — «, thus neglecting the 
contributions of bound states. 

We will first consider the scattering of the wave 
packet according to the nonresonant scattering ampli- 
tude C,(E) whose variation with energy can be neglected 
over the energies of the wave packet. Performing the 
integration in Eq. (19) we find 
£r) - K D1(2I+1)Pi(cos)Ci(E) expGE;X), X>0 

=K Yu (214-1) Pi(cos0)Ci( E) 
Xexp(E;X), X«0. (21) 

The wave packet is centered at X —0, i.e., at 

t= (ro-l-r)/vo. (22) 

Hence, the nonresonant packet is propagated without 
time delay. The time spectrum ef |£(r/)|* is an ex- 
ponential rise and fall, exp(—6t/i). 

The differential cross section is, according to Eq. (8), 

do/d2=}|5,(214+-1)P:(cos8)C,(E) |?. (23) 


Thus, the differential cross section for nonresonant 
(potential) scattering is independent of the quantal 
uncertainty 6 and identical with that for a normal beam 
in which 6 is negligible. 


13 A, Siegert, Phys. Rev. 56, 750 (1939). J. Humblet and L. 
Rosenfeld, Nucl. Phys. 26, 529 (1961). 
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We will now consider the scattering from an isolated 
resonance state |s). Taking one term of the sum in (18) 
we have for the integral in (18) 


go 6/24 R, 
I= = exp(;EX)dE, (24) 
_» (E—E;)(E—H;) E—E, 
where 
E,—€,—1T,/2. (25) 


Integration round an infinite semicircle in the upper 
half plane gives 


s 


I= exp(;E;X), X>0. 


(26) 


E,— E, 

This means that the leading edge of the wave packet, 
that is, the part for times greater than (ro+7)/v0, is 
propagated with the same shape as it originally had, 
but, of course, with a different magnitude. The propa- 
gation of the trailing edge (the tail) of the wave packet 
is given by integration round a contour in the lower 


half plane 


exp(;E;X) 
AaB 


R, 
16 = 
(E,— Es) (E,— E) 


expGE,X), X«0. (27) 


dor "1325 (E— €)-2-:(7,/2--6) i 
[GE— €) 2-:(T,--8)/4 P 


dQ 


Il 


—$R R.Co E) 
eL e [(E—e,)?+ (7,2-9)/4 


(E— e) (E— e)? -- (T:+8) (T. +38)/4 


I. E. McCARTHY 

The time spectrum given by |7|? is a decaying oscilla- 
tory function. In particular, if the time width of the 
wave packet is much less than the decay constant of 
the state |5), the time spectrum of the tail has the shape 
exp(—T,!/A). This corresponds to exciting the resonance 
suddenly and watching it decay with its natural time 
constant. 

The differential cross section is 


de | R,R,[(E— e T./24- P. 8) (0,/24-)/4] 
2T, [(E— e*4- (P.3-8)/4 P i 


28) 
dQ 


This reduces to the Breit-Wigner form for very 
small quantal uncertainty 6. The most significant thing 
is that the width of the energy spectrum is I’,+6. Note 
also that the differential cross section for scattering from 
a single resonance is reduced in magnitude for large 
quantal uncertainty. For large 6, do/dQ is of order 1/6. 

In view of the arguments often made about the 
noninterference of potential and resonance scattering 
for large 6, it is interesting to consider the interference 
term. Denote the additional cross section due to the 
interference of potential and resonant scattering by 
dør/dQ. It arises from the last term in the splitting 
of the scattering into potential and resonant parts 


[£() |*— | £e (1,0 2+ | Ex (n) |? 
+2 ReEp(r,!)Er(r,t). 


Taking only the s state for simplicity of notation, 


(29) 


(30) 


(E— eT ,/24- (T.+)? (T44-28)/8 


+4 mR — —————————— ————. (81) 


The coefficient of Im[R.Co(Z)] is of order 1/8 for 
large ô. Hence the interference term in the cross section 
does become smaller for larger ô, that is for better 
defined wave packets in time, but so also does the 
resonant scattering term, both being of order 1/8. The 
potential scattering term is not affected by 6. 


4. SCATTERING FROM MANY LEVELS 
AND THE OPTICAL MODEL 


We will now consider the energy region where, for a 
given channel, the resonances in the scattering ampli- 
tude are such that the average width is greater than the 

b. average spacing, so that many levels contribute at each 
iE energy over the energy spread of the wave packet. 
mM 


» 


(E— e.) t- (T.+6)?/4 


The levels in this region are defined by some plausible 
model such as has been discussed by Brown! for nuclei. 
Their widths are of the order of 1 eV, so that the 
corresponding lifetime, 10-15 sec, is too short for a 
practical wave-packet experiment. However, it is 
interesting to consider a hypothetical wave-packet 
experiment in order to give a proper quantum mechani- 
cal sequential description of the passage of a particle 
through the scatterer, when we know the S-matrix 
element for the scattering. 

This gives us another way of looking at the optical 
and direct interaction models. We will restrict ourselves 
to elastic scattering for simplicity. The optical model has 
been considered in two ways, each with a different 
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starting point. The first way starts with a Schródinger 
equation. The optical-model Hamiltonian is shown to be 
an approximation to the many-body Hamiltonian. The 
second way is independent of any Hamiltonian assump- 
tion and starts with the scattering amplitude. The 
average over some energy interval I of the scattering 
amplitude S is shown to be capable of being calculated 
from an optical-model Hamiltonian. The average cross 
section is split into two terms. 


(c) |(S)/?+(|S—(S)|?). (32) 


The first term is the optical-model or shape elastic cross 
section. The second term is the fluctuation or compound 
elastic cross section. 

A third, classical, definition of the optical model is 
sometimes given. The optical-model cross section is the 
cross section for propagation without time delay while 
the compound elastic cross section is the remainder. 

It is not clear that the last two definitions are 
equivalent. The splitting in Eq. (32) requires a definition 
of the averaging procedure. Some authors, for example 
Brown, have gone to much trouble to discuss the 
averaging procedure. Brown’s argument is essentially 
a wave-packet argument since it uses an average of the 
scattering amplitude over an energy interval I. We 
have seen in Sec. 2 that for a beam without significant 
time resolution it is the cross sections, not the ampli- 
tudes, that are averaged. 

Another condition for the validity of this definition 
of the optical model is required. The average over 
amplitudes must be nearly equivalent to the average 
over cross sections. This means that the compound 
elastic cross section must be very small compared 
with the optical model cross section. 

A quantal statement of the third definition of the 
optical model must be given by a detailed wave-packet 
description. It is quite conceivable that the scattering 
amplitudes for certain problems are such that a large 
proportion of the cross section is due to time-delayed 
wave packets. This question will be discussed in detail 
by one of us (LRD) ina subsequent publication. 

At present we will just consider the propagation of 
the wave packet in general and show how it leads to a 
large proportion of immediate propagation in the case 
DD where I is the average level width and D is the 
average spacing. E 

Considering only the S wave for simplicity and 
omitting the potential scattering term, Eq. (19) gives 
for the time dependence of the trailing edge of the wave 


packet R. 


E) - K expGEX) Lap -8)/2 


i8 exp([,X/2) expi(e,— E)X 
Mu ea e 
E— e i(T«4-9)/2 


[e x/2) 


| . (33) 


e bracket is the one containing 


The second term in th 
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the properties of the scattering amplitude rather than 
the wave packet. Each level contributes an exponential 
tail to the amplitude with a time-delay constant 
A/T,. However, this contribution is multiplied by a 
phase factor exp[;(e,— E)X ] which gives a partial 
cancellation of the tails when the packets frorn different 
levels are superposed. 

If the level widths T, are much greater than the 
average spacing D, the factor e,— E can be large so 
that the phase factor oscillates rapidly. In this case 
the phases of the contributions from different levels 
tend to be random so the tails cancel out giving a large 
proportion of propagation without time delay. 

Thus the sequential description of scattering gives 
the same result as the usual energy description. The 
optical model is valid when TD. 

The dependence of (33) on the magnitude of the 
quantal uncertainty 6 is also interesting. As 6 is in- 
creased the magnitude of the contribution from each 
level to the scattered packet decreases. However more 
levels contribute significantly to the sum. If the phases 
of the residues R, are random the magnitude of the 
scattered packet will decrease rapidly with increasing 6. 
If they are correlated, the magnitude will decrease 
less rapidly. The range of the correlations between 
levels can be determined in principle by varying 6. This 
will be discussed in detail in a subsequent publication. 


5. POSSIBLE WAVE-PACKET EXPERIMENTS 


The condition for a time-dependent scattering experi- 
ment is that the experimental definition of time must 
be accurate in comparison with the characteristic time 
of the scattering amplitude. That is, we must have 6 not 
much less than T. 

Experimental definition of time is at present possible 
for times as short as about 10-1? sec. Typical nuclear 
values for #/T are 10-!5 sec, so wave-packet experi- 
ments cannot be performed with nuclei except in special 
cases. These cases are metastable states which can have 
lifetimes as long as 10-* sec. 

One experiment has actually been done using the 
Méssbauer effect with the 14-keV y ray from Fe”, 
which has a decay constant of 10-7 sec, by Holland, 
Lynch, Perlow, and Hanna.“ The time spectrum of the 
incident wave packet was defined by using as zero time 
the time of formation of the 14-keV state, which was 
defined by the time of emission ef the 128-keV y ray 
(a fast decay) from the next highest state in the y-ray 
cascade from the decay of Co’. The wave packet has 
an exponential time spectrum with 6=h/10-7 sec. This 
wave packet was scattered resonantly from an Fe 
target. The time spectrum of the final state and the 
increased width of the absorption line were both 


“R. E. Holland, F. A Lynch, G. J. Perlow, and S. S. Han 
Phys. Rev. Letters 4, 181 (1960); F. J. Lynch, R. E. Holland, and 
M. Hamermesh, Phys. Rev. 120, 513 (1960). 
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observed. In this case 6=I',. The width of the wave 
packet in this experiment is of course fixed. By looking 
at the scattering at times less than 1077 sec, Holland 
et al. were able to observe greater widths, but for these 
cases the shape of the incident packet was not defined. 
The spectrum of total elapsed time gives only an upper 
limit to the time width of the incident wave packet 
because it is not known if the delay occurred in the 
source or the scatterer. 

The Méssbauer effect is, of course, a wave-packet 
scattering experiment in which the absorption cross 
section is measured. If we do not define time but 
merely observe the cross section we lose the wave-packet 
property, that is the quantal uncertainty. The cross 
section is obtained using Eq. (17) to define the energy 
amplitude factor of the wave packet. 

do R? 1 
WSS es Rm (34) 
dQ 20, (Eo—E,)?+T 2 


Thus, we have the well-known result that the line- 
width in the Méssbauer effect is twice the width of the 
level, assuming all the nuclei in the target are capable 
of absorbing resonantly. 

Defining the starting time of the excitation of the 
metastable state puts a quantal uncertainty into the 
beam equal to T.. The method of Holland e al. defines 
the starting time with a minimum tolerance equal to the 
lifetime of the next highest state in the y-ray cascade. 
This is shorter than the experimental time resolution, 
which is itself much shorter than A/T.. 

One way of varying the time width of the wave 
packet would be to vary the resolution with which the 
starting time is measured. It is possible by this method 
to decrease 6, but not to increase it. 

A better experimental way of varying the wave- 
packet width in the Móssbauer effect has been sug- 
gested by the present authors. A third resonant absorber 
is introduced between the source and the target. This 
absorber is accelerated in a very short time (10-? sec) 
to a speed sufficient to shift the resonance so that the 
incident beam is no longer absorbed and can hit the 
target. It is then slowed down again quickly so that the 
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time duration of the pulse is of the order of 10-? sec, 
This method would produce approximately the ex- 
ponential wave packet [ Eq. (16) ] that we have used in 
the calculations of Secs. 3 and 4. It is experimentally 
just possible to achieve the requisite acceleration by 
using a very thin foil of Fe?" as one plate of a parallel 
plate condenser in a vacuum which is charged first 
with one sign and then with the opposite sign by an rf 
pulse. However, the acceleration may be achieved more 
easily using a piezo-electric crystal. Another possibility 
is to use the Stark effect to shift the resonance. This 
requires a tightly bound dielectric crystal containing 
nuclei with a metastable state. 

Wave-packet experiments in the atomic energy 
region may be interesting. An absorber of laser material 
placed in a laser beam and moved for a short time as 
suggested above would produce wave packets of laser 
intensity. The quantal uncertainty would be much 
greater than that of a beam from a pulsed laser, whose 
quantal uncertainty can be no greater than the width 
of the laser state. 

All such experiments observe only the scattering 
of a wave packet from a single resonance with a trivial 
angular distribution. Overlapping resonances, even in 
atoms, would probably have widths of the order of a 
few tenths of an electron volt. Wave-packet experi- 
ments in this region with 6~I would require time 
lengths of 1015 sec which seems impossible at present. 
The possibility of doing time-dependent electron- 
scattering experiments from atoms is not experimentally 
remote and would be interesting. 

Using laser wave packets, it may be possible to 
observe the changes in angular distribution due to the 
interference of potential and resonant scattering as the 
quantal uncertainty is changed. The potential scattering 
could be obtained by diffraction of laser light round a 
small crystal of laser material. 
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This paper is concerned with the extension of some recently reported results, especially those of Bourret 
(1960), relating to coherence properties of blackbody radiation. An explicit expression for the complex elec- 
tric correlation tensor &ij(r,z) of blackbody radiation is derived, and on the basis of it spatial coherence 
(r=0) is discussed in detail. The behavior of diagonal as well as nondiagonal components is illustrated by 
contour diagrams. In particular, it is found that the nondiagonal components of the correlation tensor, even 
though being zero for zero-space separation (r=0), acquire, in general, nonvanishing values when r0. 
The magnetic and mixed correlation tensors are also discussed. 


1. INTRODUCTION 


T is a common notion that the most incoherent 
radiation is blackbody radiation in an equilibrium 
enclosure. Recent researches!^* have shown that even 
in this type of radiation there is coherence in a suffi- 
ciently small space-time region. Bourret! has derived 
expressions for the second-order electric correlation 
tensor of blackbody radiation, using techniques analo- 
gous to those employed in the theory of isotropic 
turbulence of an incompressible fluid. Such a correlation 
tensor describes the correlation between the Cartesian 
components (denoted by subscripts i, 7) of the electric 
field E” (r,t) at two space-time points rı and rz, at 
time instants /; and /5, 


6; (ry ro.) = (Es (xf) Ej? (19,5)), (1.1) 
where sharp brackets denote the time average: 


(EO (5) Ej? (ro,t2)) 


1 y 
= lim — E; (r, HHE; (1o,154- t)dt . (1.2) 
Toro TUR) Mm 


Sarfatt? rederived some of Bourret's results using 
quantum mechanical density matrix techniques. 

The components of the correlation tensors discussed 
by Bourret and Sarfatt are real functions of space and 
time. However, numerous recent researches on co- 
herence properties of light have shown that an appro- 
priate measure of coherence is provided by certain 
complex rather than real correlation functions (cf. Ref. 
5), and this is also supported by recent investigations 
of Glauber® based on quantum field theoretical calcula- 


* This research was supported by the U. S. Air Force Office of 


Scientific Research. 

1 R, C. Bourret, Nuovo Cimento 18, 347 (1960). 

2Y. Kano and E. Wolf, Proc. Phys. Soc. (London) 80, 1273 
(1962). 

3 T EET Nuovo Cimento 27, 1119 (1963). 

4C. L. Mehta, Nuovo Cimento 28, 401 (1963). 

5 M. Born and E. Wolf, Principles of Optics (Pergamon Press, 
Inc., New York, 1959), Chap. Nel 

* R. J. Glauber, Phys. Rev. 130, 2529 (1963). 
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tions. (See also Sudarshan,'*^ Wolf, Mandel, Sudar- 
shan and Wolf,’ Mandel.!°) The appropriate complex 
electric correlation tensor &;;(11,r2,l1,f2) may be derived 
from the real correlation tensor defined by (1.1), by 
using the concept of an analytic signal.!!^^ Assuming the 
field to be stationary in time, so that 4 depends on 
I and tz only through the time difference r— 4— lz, the 
complex correlation tensor is given by? 


Gi (ry ro,7) = 2[ 65 7 (ru ro,7) 2-66; 9 (112,7) ] , (1.3) 


where 


1 
6j 9 (ryrs7) =-P 


7T -5 Trai T 


^ 4 (tyt0,7') 
J ee 


is the Hilbert transform of 6;;(?(r;,r»,7) and P denotes 
the Cauchy principal value at 7^— r. 

Bourret and Sarfatt have restricted their discussion 
of the tensor to certain special cases only, namely, 
to those characterizing temporal correlation (ri— rz) 
and lateral and longitudinal spatial correlation (r=0). 
Their results do not provide information about the non- 
diagonal components of the correlation tensor, though, 
in principle such information could be obtained with the 
help of their formulas. 


? E. C. G. Sudarshan, (a) Phys. Rev. Letters 10, 277 (1963); 
(b) in Proceedings of the Symposium on Optical Masers (Polytech- 
nic Press, Brooklyn, New York and J. Wiley & Sons, Inc., New 
York, 1963), p. 45. 

3 E. Wolf, in Proceedings of the Syri on Optical Masers 
(Polytechnic Press, Brooklyn, New York, and John Wiley & 
Sons Inc., New York, 1963), p. 29. 

L. Mandel, E. C. G. Sudarshan, and E. Wolf (to be published). 

1 L, Mandel, Phys. Letters 7, 117 (1963). 

11 D, Gabor, J. Inst. Electr. Engrs. 93, Part III, 429 (1946). 

12 An alternative, but equivalent, definition of the complex cor- 
relation tensor, which will be needed later is as follows: With the 
real field EC? (r,t) we associate the complex analytic signal E(r,f), 


ie., 

É Ef) - Ec (4) ED, 
where E is the Hilbert transform of E(?. Thé 
relation tensor 8;; may be expressed in the fiit 

Sij (rura 7) = (E: (rttr) Ej" 
The equivalence of the definitions (1.3) dn 
Ref. 17, pp. 464-466, where also a certain T 
connected with questions of convergence is 
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In this paper an explicit expression for the complete 
second-order complex electric correlation tensor of 
blackbody radiation is derived and a number of 
diagrams are obtained, which show the behavior of the 
elements of this tensor. The magnetic and the mixed 
second-order complex correlation tensors are also 
discussed. 

In paper II (Ref. 13) the electromagnetic correlation 
tensors defined on the basis of quantum field theory will 
be evaluated for blackbody radiation and will be com- 
pared with the corresponding tensors defined in the 
present paper on the basis of semiclassical theory. 
Higher order correlation tensors will also be considered 
in paper IT. 


2. EXPLICIT EXPRESSION FOR THE SECOND-ORDER 
COMPLEX ELECTRIC CORRELATION TENSOR 
OF BLACKBODY RADIATION 


In this section a new expression for the normalized 
complex electric correlation tensor 7;;(r1,7) of blackbody 
radiation will be derived. For any stationary field, the 
normalized correlation tensor is defined by the formula 


6;;(11,12,7) 
[4r 1,0) JE 65 (rs, 1/0) ]* | 


where 6;; is the correlation function (1.3). The normali- 
zation ensures® that 0< |y;;| € 1. In the case of black- 
body radiation in a cavity whose linear dimensions are 
large compared with the mean wavelength of the radia- 
tion, the field is isotropic and 6;; and y:i; depend effec- 
tively on r, and rz through the difference r= r;— rz only. 

An integral expression for y;(r,r) for blackbody 
radiation bas been obtained by Kano and Wolf.? They 
showed that 


45o* 
wie | 


8x5 


(2.1) 


yi (ryrs7)— 


k30;;— kh; 
k(exp(ak) —1) 
Xexp{i(k-r—ker)}d3k, (2.2) 


where 
a=nc/KT, (2.3) 


c being the vacuum velocity of light, # the Planck’s 
constant divided by 27, K the Boltzmann constant and 
T' the absolute temperature. The integration in (2.2) is 
taken over the whole k space. The normalization con- 
stant in (2.1) has the value 


C 6a( Ii, 1,0) ] JE 65i( Th, 12,0) ] Ws &;:(0,0,0) 


(no summation) 
64 19 K4T4 
== (2.4) 
45 (hc)? 


13 Ç, L. Mehta and E. Wolf, following paper, Phys. Rev. 134, 
A1149 (1964). 
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Equation (2.2) may be written as 


45g 
ya(r,r) =—(0:0j— 85V?) 
id {i(k-1—Rer)} 
exp{i(k- r—ker 
ji ren Le d REDE) 
k(e**— 1) 


where 0;=0/dr;. Using spherical polar coordinates for 
k, with the polar axis along the direction of r, we obtain 


Sat T 
vii(t,7) = —-(0:9;— 84V?) 
875 
n g iter T 27r 
Í kdk l QUII singan | do 
0 eL Jo 0 
45g 1 f° sinkr ee 
———(8;0;— 04 V2)— | Shoe O) 
2r‘ rJg e2- 
The last integral can be written as 
o 1 co 
p3 pant feti er) — g-ik(rker)) g-nakgp 
n=1 24 0 
1 œ | 1 1 
2i »-ilna—i(r—cr) na+ilr+cr) ; 
so that y;; may be expressed in the form 
45a e 
VOU) E ETE 
275 n=1 (11a-1-ic7)?4- 7? 


Carrying out the differentiation on the right-hand side, 
we finally obtain the following expression for the normal- 
ized complex electric correlation tensor of blackbody 
radiation: 


90a* « m 
ae ETUR 
mi »-1l (nal icr)?--72)? 


Yii(t,7) = 


7i'j—120i 
44— —— — —L, QD 
((na-Eicr)?--72)3 
where z;, r; are the components of the vector r with 
respect to z and j axes, respectively. 
3. TEMPORAL COHERENCE 


Restricting ourselves first to the case r—0, Eq. (2.7) 
gives the following expression for the normalized tensor 
that characterizes temporal coherence: 


ysi(0,7) = (90/14)¢ (4, 1--icr/a) 6:;, (3.1) 


where {(s,a) is the generalized Riemann zeta function 


ME. T. Whittaker and G. N. Watson, A Course of Modern 
Analysis (Cambridge University Press, New York, 1958), p. 265; 
(also Doyer Publications, Inc., New York, 1962), p. 265. 
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defined by 
z 1 
(s,a) = x [SEU * 
n=0 (1-1- a)* 


It should be noted that the tensor +;;(0,r) is diagonal. 

The expression (3.1) for the temporal degree of co- 
herence of blackbody radiation is identical with the 
expression derived by Kano and Wolf? and discussed in 
detail by them [cf. Eq. (11) of their paper]. In particu- 
lar they gave diagrams showing the behavior of the 
modulus and the argument of y;;(0,7) as function of r. 


(3.2) 


4. SPATIAL COHERENCE 


Setting r=0 in Eq. (2.7) we obtain the following 
expression for the normalized tensor that characterizes 
spatial coherence: 


90 » 0i 2 Tifj— bir" 
yu(r,0) 7 — 2; + . (4.1) 
qi nit (n?2-7?/a2)? a? (n?--r?/a?)? 


It is seen from this expression that y;;(r,0) is real. 
The series on the right-hand side of (4.1) may easily be 
summed with the help of the result! 


a 1 T 
——L(ra), (4.2) 
n-17?--a? 2a 


where L(x) is the Langevin function 
L(x)- cothx— 1/x . (4.3) 
Successive term by term differentiation of (4.2) gives 


o 1 1d(m 
e [neo] b (4.4) 
n=l (n?-+-a?)? 2a da\2a 


o 1 m d(1dp[i 
E E Ga) E 
Z (n?--a3) 16a eA ( |} s) 


If we set 
r= (a/r) 3 (4.6) 


so that 7’ represents the separation of points in the 
cavity in units of a/r, (4.1) gives, with the help of 
(4.4), (4.5), and (4.3), 


/4 


45 i ri'r 
yir )o8vi(n0)— E (r)às-- B(r EE , (4.7) 
T 
where 
A(r)2 —r cothr'— r”? csch?z 
— 275 csch?r' cothr’-++4, 
(4.8) 


B(r’)=3r' cothr'4-37^* esch*r’ 
+-27'3 csch?r' cothr’—8. 


7 7 Publications, 
18 L, B. W. Jolley, Summation of Series (Dover i 
New York, 1961), p. 22, Series number 124. 
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Equation (4.7) is valid for all r’ but is not suitable for 
computing yau(r^) when 7’ is small. For small 7’, however, 
we directly obtain from (4.1), on expanding in powers 
of 7’, 


8 3 
Vill’) =7ii(1,0) = i| Io ii ‘| 


35 


rir 4 2 
E mee] (rm). (4.9) 
ri 738 


If we set i= j— x in (4.7) and (4.9) and take r’ in the 
direction of the x axis, we obtain the following expres- 
sions for the longitudinal coherence function ^y ,,I^"*: 


45 
"y 4, 9*1") "d cothz" 4-7"? csch?r — 2) 
7 


(<e), (4.102) 


4 1 
= {ae 


(<r), 
21 35 


(4.10b) 


where r'—x'—mx/a. 

On the other hand, if r’ is chosen perpendicular to the 
x axis, then we obtain from (4.7) the following expres- 
sion for lateral coherence function ~~z2'*: 


45 
yz íI*(r') S—[4—7 cothr’—r’”? csch?z 
Ar^ 
—2r'3 cothr’ csch?r ], (r«), (411a) 
8 3 
-1—-—/RLgEÁHMee. (<x), (4-116) 
21 35 


where y'= (y'2--2/2)3 — x (y? -22)1/2/0. 

Expressions for *,,^*, Yyy", Yz, and Ys" are, 
of course, strictly similar. The expressions (4.10a)- 
(4.11b) are in agreement with those derived by Bourret.! 

Next we will examine the three-dimensional distribu- 
tion of y;;(r). First, let us consider a particular non- 
diagonal component, say the xy component. From 
(4.7) and (4.9), we obtain 


ye EY ray cothr 3 csch? 
Yat) pu [3r' cothr’+-3r’? csc 
+27’? csch?r cothr’—8] (r«o), (4.12a) 
4 
cd T (<r). (412b) 


In Fig. 1, the xy section of the surface of constant 
yz,(r) is shown. ¥z,\) is positive in the I and IMI — - 
quadrants and negative in II and IV quadrants, att ain- —— 
ing peak values at four points, denoted by letters Ay Ag 
Bı, Bo in the figure, at distances r=ar'/x~2.3a/x from 
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Fic. 1. Contours of yzy(r’) in the xy plane. yzy(r’)=yzy(r,0) 
— ,0); N= &(0,0) = (64/45) (x2/he)®K*Ts; r=ar'/7; 
a=nc/K 


the origin, this distance being of the order of half the 
mean wavelength of the radiation.! The variation of 
Yzy(r') along the line x—y, z=0, is shown in Fig. 2. 
In Fig. 3, the contours of the surfaces z,(r’) in the plane 
x=y are shown. 

We see that the surfaces, yz,(r’)=constant, are 
closed surfaces, having the lines x— -£y, z=0 as axes 
of symmetry. (Thus, any plane section through one of 
these lines will be symmetrical about that line.) Near 
the origin these surfaces are more or less hyperbolic 
cylinders as seen from Eq. (4.12b). The surfaces for 
peak values |yz,(r') |2-0.143, shrink to the four points 
Ay, Az, Bi, By. The behavior of the other nondiagonal 
components, such as Yue(t') o Or ¥22(r’) is, of course, 
strictly similar. 

For diagonal components of the correlation tensor, it 
is again sufficient to consider one of them, e.g., that 


Vey (1!) 


x «o —r'— xX»0 


Fic. 2. Variation of *zy(r') with r', when r' is in the 
direction z—y in the plane 2=0. 
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corresponding to i= j=z. We have from (4.7) and (4.9) 
45 
vyz(r)— "md cothr' — 7^? csch?r' 


2/2 
— 2r? cothr’ cschr'44)--— Gr cothz" 
r 


--3r^? csch?r'4- 27? cothr’ csch?7 — 8)) , 


(r'cc), (4.13a) 


(r'cm).  (4.13b) 
Eq. (4.13b) shows that for small spatial separation 
r’ of the two points, the surfaces y;.(r')— constant are 
the ellipsoids 

2(x?4- y?) 4- z?— constant . 


The contours of y:.(r’) in the xy plane are circles, 
shown in Fig. 4. Figure 5 shows the contours in the 


Tic. 3. Contours of yz, (r^) in the plane x— y. 


yz plane. The actual surfaces y..(r’)=constant are just 
the surfaces of revolution generated by rotating these 
contours (Fig. 5) about the z axis. 

For the sake of completeness the variation of yz 
along the z axis (longitudinal coherence) and along the 
x axis (lateral coherence) are given in Figs. 6 and 7, 
respectively. These two curves are in agreement with 
Figs. 3 and 4 of Bourret.! 

In the present section we have discussed only the case 
of spatial coherence characterized by »y;(r')ey;(r,0). 
In the general case, when r is arbitrary, i.e., when the 
correlation is characterized by y;,(r,r) rather than by 
Yi;(1,0), the value of the correlation, for any particular 
separation 7 of the two points may, of course, become 
appreciably larger (or smaller), provided 7 is chosen 
appropriately. 


5. THE SECOND-ORDER COMPLEX MAGNETIC AND 
MIXED CORRELATION TENSORS OF 
BLACKBODY RADIATION 


Besides coherence effects associated with the electric 
field, coherence effects involving the magnetic field are 
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also of interest. Some of these effects are characterized 
by a magnetic correlation tensor 5C;; and mixed correla- 
tion tensors §;; and Gi, introduced in earlier publica- 
tions." These tensors may be defined in a similar 
manner as the electric correlation tensor 6;;. Assuming 


again that the field is stationary we have, in analogy 
with (1.3a), 


SC; (ry ro,7) = (Hi(r,t- 7) H* (150) , (5.1) 
Ga(ruto,7) = (Ei(riHH- 7) H j*(12,0)) , (5.2) 
Gri ror) = (Hr 7) Ej*(12)) , (5.3) 


where, as before E(r,) denotes the complex analytic 
signal associated with the real electric field E“(r,t) and 
H(r,) denotes the complex analytic signal associated 
with the real magnetic field H ?(r,/). 
We note that the tensors G and G are related as 
follows: 
Gi (ru ror) = gj (1s, n, —7) . (5.4) 


It is known!**)!7 that in vacuo each of these tensors 


Fic. 4. Contours of ys:(r’) in the xy plane. 


satisfies two homogeneous wave equations. Also, that 
the tensors are not independent but satisfy differential 


equations of the form!9^.1? 
ih : 
eid Gin —Gim=0, (5.8) 
c Or 
10 3 
ejud! Gin +- — nO, (5.6) 
c OT 


16 E. Wolf, (a) Nuovo Cimento 12, 884 (1954) ; (b) in Proceedings 


of the Symposium on A 
(North-Holland Publishing Compan 
17 P, Roman and E. Wolf, Nuovo 


ical Optics, edited by Z. Kopal 
Mie MENT 1956), p. 117 
Cimento 17, 462 (1960). 
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Fic. 5. Contours of +3:(r’) in the yz plane. (Surfaces of constant 
yz: are the surfaces generated by rotation of these curves about | 
the z axis). 


etc., where 0,! (k=x, y, z) are the components of the | 
gradient, taken with respect to the coordinates of r, and | 
eji is the completely antisymmetric unit tensor of | 
Levi-Civita. There is a similar set of differential equa- ; 
tions involving the components ð,’ of the gradient, 
taken with respect to the coordinates of rz. In the case | 
of blackbody radiation, one has, on account of isotropy | 
0,!=—0,2=0/dr, where ry are components of the | 
vector r— ri— ra. 
Now for blackbody radiation we have from (2.1)-(2.4) 


hc k20;— kiki 
57 | 
22? J k(exp(ak)—1) 
Xexp{i(k-r—ker)}d3k. (5.7) 


On substituting from (3.7) into (5.5) and solving the 
resulting equation for G, subject to the boundary condi- 
Xu 


1.0) 


0.5 


1 
o 5 ior 


Fic. 6. Longitudinal coherence. Variation of *,,'e(r) 2x; (r,0), 
with r^, when r’ is along the z axis. 
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tion Sjm(r,0©)=0, we obtain, if we also use (5.4), 


(nr) he i ky 
—@, (nr 7)=——ejne | -————— 
ab nt exp(ak)—1 


T 


Gin (r,7) = 


Xexp(i(k:r—kcr))d*k. (5.8) 


From (5.8) and (5.6) we find on solving for 3Cjm subject 
to the boundary condition 36; (r, © ) — 0, and on compar- 
ing the resulting expression with (5.7), 


S6, (r,7) 8 Sjm(r,7) . (5.9) 


In deriving (5.9) use has been made of the identity! 
EjklEmk'j= Om Oti — Okk’ Om. Relation (5.9) implies that the 
magnetic coherency tensor is identical with the electric 
coherency tensor, discussed in detail in the preceeding 
sections. ^ 

Let us now consider the mixed tensors G and Gg. 
From (5.8) we may readily derive series expansions for 


Yit) 


Lo 
0.5 


(0) r! 
-0.1 5 10 


Fic. 7. Lateral coherence. Variation of yz:"t(r')=vy=::(r,0), with 
r’, when r' is along the x axis. 


these tensors. For this purpose, we first rewrite (5.8) in 
the form 


Gim(r,7) = 


^. hc . 
— Gin (1,7) — — 6mi(— 101) 
273 


i(k-r—ker 
[ 380m (5.10) 


exp(ak)—1 


and apply to the integral on the right-hand side of 
(5.10) a similar procedure as used in connection with 
Eg. (2.5). We then obtain 


s 16he 
Gin(r,7) =— Gjn(r,T) =1—Ejmi lk 
zB 
2 nat-icr 
xE ——————. 
n=1 [(na+icr)? +r]? 


18 H. Jeffreys and B. S. Jeffreys, Methods of Mathematical Physics 
(Cambridge University Press, New York, 1950), 2nd ed., p. 73. 


(5.11) 


MEHTA AND E. 


WOLF 


Fic. 8. Contours of lez, (r^) |= |ezy(r,0)| in the yz plane. 
Szy (T) &ozy(r,0) = (1/N) Gry (r,0) ; N = (64/45) (a?/ hc) KT". 


We see that the mixed tensors Gim and Gim are anti- 
symmetric and that 


Gim(0,7) = Gim(0,7) =0 . (5.12) 


Equation (5.12) implies that at every point r, E;(r, !H- 7) 
and H;;*(r,t) [and also, of course H.(r, 14-7) and E;* (r,t) ] 
are uncorrelated irrespective of the time delay 7, i.e., 
there is no “temporal coherence" between the complex 
electric and the complex magnetic field. 

It will be convenient to normalize the tensors G and 
G in a similar manner as we normalized 6, ie., we 
define normalized mixed correlation tensors ø and ë by 
the formulas 


1 1 
vim(t7) Sim (tr) D je r,r) — Bir) D (5.13) 
where [cf. (2.1), (2.4), and (5.9) ] 


lexy h| 


0.5 


[ 
o 5 r 


Fic. 9. Variation of |cz,(r’)| with r', 
when r' is along the z axis. 
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I. CORRELATION TENSORS OF CLASSICAL FIELD 


N — [6,/00,0) ]^[565,(0,0) J"? 
64 a^ KAT* 


= 6;;(0,0) —365,(0,0) =— —,. 
45 (ho) 


Lj 
It follows from (5.13), (5.14), and (5.11) that ø and 
& may be represented in series form as follows: 


(5.14) 


180a 

Ojm(¥,T) = — 6 jn(t,T) =1— 7 Ejmk 

T 

z nar icr 
———————— . (5.15) 

n= [(11a-icr)?- 7? P 
Here the constant o is given by (2.3) as before. 
PHYSICAL REVIEW VOLUME 
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From (5.15) we obtain in the special case 7=0 
(spatial coherence between the electric and magnetic 
fields), for typical nondiagonal elements of o and 6, 


180 ^» 
ez, (r0) = — 84, (r,0) - i—2 pr E ach TERT S 
am^ nei (n?--r*/a2)? 


(5.16) 


The diagonal elements are, of course, identically zero, 
since c is antisymmetric. 

We see from (5.16) that in the plane s=0 (xy plane) 
a2,(1,0) is identically zero. In Fig. 8 the contours of 
|ozy(r,0)| in the yz plane are presented and in Fig. 9 
the variation of |oz,(r,0)| along the z axis is shown. 
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Coherence Properties of Blackbody Radiation.* II. Correlation 
Tensors of the Quantized Field 


C. L. Menta AND E. Worr 
Department of Physics and Astronomy, University of Rochester, Rochester, New York 
(Received 9 December 1963) 


Expressions are derived for the electromagnetic correlation tensors of blackbody radiation defined on the 
basis of the theory of the quantized field. Correlation functions of all order are considered, but second-order 
ones are discussed in detail; it is found that these are identical with those obtained on the basis of semi- 
classical theory in part I of this investigation. This result illustrates a recent theorem of E.C.G. Sudarshan 
relating to the equivalence between semiclassical and quantum mechanical description of statistical light 


beams. 


1. INTRODUCTION 


I! part I of this investigation; expressions were 
derived for the complex second-order electromag- 
netic correlation tensors of blackbody radiation and 
their behavior was discussed in detail and illustrated 
by a number of diagrams. The statistical methods used 
were based entirely on classical concepts, though 
quantum mechanical features of the radiation were 
implicit in that treatment, since the spectrum of the 
radiation was taken to be given by Planck’s law. 
In the present paper the second-order correlation 
tensors introduced recently by Glauber? on the basis of 
the theory of the quantized field, are evaluated for 
blackbody radiation and are shown to be identical with 
those defined and evaluated on the basis of the semi- 
classical theory. This result illustrates a recent theorem 
of Sudarshan,’ relating to the equivalence between 


_ * This research was supported by the U. S. Air Force Office of 


Scientifi h. 
TCL. ee and E. Wolf, Phys. Rev. 134A, 1143 (1964), 


recedi 1 : 
E REJS Glauber, (a) Electronique Quantique, Jeme Conference, 


edited by N. Bloembergen and P. Grivet (Dunod Cie, Paris, 1964), 


p. 111; (b) Phys. Rev. 130, 2529 (1963); (c) ibid. 131, 2166 


Letters 10, 277 (1963). 


1 ys. Rev. 
3E. C. G. Sudarshan, (a) Phys. Rev Optical Masers (Poly- 


~~ (b) in Proceedings of the Symposium on { 
E techie Press, Brooklyn, New York and John Wiley & Sons, 


Inc., New York, 1963), p- 49- 
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semiclassical^-? and quantum mechanical description of 
statistical light beams. 

In Sec. 4 higher-order correlation tensors of black- 
body radiation are briefly discussed. 


2. THE SECOND-ORDER CORRELATION TENSORS OF 
THE QUANTIZED FIELD 


It will be useful to begin with some results which will 
be needed later, relating to quantization of the electro- 
magnetic field and the definition of the correlation 
tensors of the quantized field. 

The electric-field operator, at the space-time point 
az (rci), when expanded in a Fourier series is given 
by? (with 2—2, y, 2) 


Ê; (x) =É; (x) +, (x) ? (2.1) 


‘The term “semiclassical” implies here that the distribution 
functions characterizing statistical properties of the beam are 
not nec ily non-negative may therefore not be true 
probabilities. They are essentially Wigner distribution functions 
(see Refs. 5 and 6), called also “quasiprobabilities.” However, in 
the present case (blackbody radiation) the distribution function 
turns out to be positive. [See Eq. (3.1).] 

5 E. P. Wigner, Phys. Rev. 40, 749 (1932). 

6 (a) J. E. Moyal, Proc. ; 

(b) G. A. Baker, E Phys. Rev. 109, 2198 (1958). (c) C. 
Mehta, J. Math. Phys. 5, 677 (1964). 
? All operators are denoted by 


Cambridge Phil. Soc. 45, 99 (1949). 
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where 


2qfhc ^? 2 
mowi) SEOs ENN Q2) 
V k =l 
2r 2 
Bow EVT) re" 
V k 


xtX eme). (2.3) 


A=1 


Here V represents the volume to which the field is 
confined and â (k) and &j!(k) are the annihilation and 
creation operators, respectively, for a photon of 
momentum Zik and polarization ^; they satisfy the 
commutation relations 


(4, (k) ânt (k^) ]= Sws , (2.4) 
[&(K),5(k)]- [à(k),5,1(k)]-0. (2.5) 


e% and e? are two unit vectors, such that ©, e? 
and k/k (with k=|k|) form a right-handed orthogonal 
triad and xx represents the usual four-vector product 


xv=k-r—ket. (2.6) 


In writing down Eqs. (2.2) and (2.3), the modified 
Lorentz condition was used in the form introduced by 
Gupta and Bleuler, employing an indefinite metric: 
Allowable states |) are only those for which 


x ð, 4) 


ly)=0, (2.7) 


= Ox, 


where A Œ is the positive frequency part of the potential 
of the field. This condition justifies our restriction to 
transverse photons (A= 1,2) as long as only expectation 
values of products of field operators are considered. 
The magnetic-field operator may be similarly written 
in the form 
Hx) - B, (z)--E;O (2), (2.8) 


where 


2a 1/2 
Aeg-( T) re" 


2 (kx); 
— á k dkr ; I 
ED : o doe (2.9) 


à=] 


2rheN 2 
nowe) 29 


2 (k xe); = 
x| 2 TUE âi(k)e | . (2.10) 


A= 


If is the density operator of the field, the second- 


3 See, for example, S. S. Schweber, An TRAY. p Quantum 
Field Theory (Harper & Row, New York, 1961), p. 


AND E. 


WOLF 


order correlation tensor of the electric field, introduced 
by Glauber?*? may defined as 


' Gg (xix) = Tr(8E; CO (a) Bj; (2) ,— 2-11) 


where the indices 7, 7 specify Cartesian components of 
the tensor (i, j—«, y, Z) and the prime on 6 is used to 
distinguish this tensor from the corresponding one 
defined in paper I in classical terms. In a similar way, 
one may define the second-order magnetic and mixed- 
correlation tensors 


"Se (1,2) = Tr (GEL? (5) 10? (9) (2.12) 


"Gu (x9) = Tr( GEO? (91) A; ()) (2.13) 
"Gig (1,22) ='G jf (o1) 
=Tr{pH,© (x); (x2). (2.14) 


To evaluate the correlation tensors '6, '3C, 8, and 'G 
for blackbody radiation, it will be useful first of all to 
express the density operator f in a representation in 
which the base vectors are the eigenvectors | (z)) of the 
annihilation operator d, (k): 


ay (k) | (2) 2 (K)| (2). (2.15) 

Here {z} denotes the set of z,(k) for all Ns and k’s, 
so that 

I7 ITI a. 09). (2.16) 


Since the operator &(k) is not Hermitian, its eigen- 
values z,(k) are in general complex, 


z,(k) — 2 (k)+ iy, (k), (2.17) 


(a(k), y (k) real) and the eigenvectors belonging to 
different eigenvalues are not orthogonal. However, they 
obey a closure relation (cf. Ref. 9). 


1 
= Í TII z, (K)) (m (k) | di, (K) 21, (2.18) 


where 1 is the identity operator if, as we shall assume, 
they are normalized so that 


(A (k)| n (K)) —1. (2.19) 


The eigenstate |z,(k)) has the following expansion 
in the number representation?: 


a, (k)) ^00 
Jabo OW |m(k)), 


|%(k))= EN ERG 
[o (k) 1]! (2:20) 


where the |z;(k)) form an orthonormal set of eigen- 
vectors of the number operator di (k), (k): 


ât (Kk) (Kk) | m (Kk) =m (K) | m. (K)). (2.21) 


The eigenvectors of the annihilation operator have 
been found very useful in the analysis of problems 


? J. R. Klauder, Ann. Phys. (N. Y.) 11, 123 (1960). 
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Il. CORRELATION TENSORS OF QUANTIZED FIELD 


relating to the statistical behavior of radiation.: 
An important property of these eigenvectors is expressed 
by a theorem established recently by Sudarshan? which 
we shall employ. According to this theorem, any 
operator and in particular the density operator p may be 
expressed in the "diagonal" form 


a= ADIE: (2.22) 


Using this representation of the density operator, it then 
follows from (2.2), (2.3), (2.11), (2.22) and (2.15) and 
its Hermitian adjoint that the second-order correlation 
tensor of the electric field may be expressed in the form 


'6i(xyx2) 
x if ea) | Bi (er) Bj (a) | (30243) 


= Dee (K)e; 9? (kB (Kk, k'; x12»), 


A,k A^ ,k^ 


(2.23) 


where 
Pyy (k,k’ ; X1,X2) E (2rhc/ V) (kk)? exp(— ikx14- ik x2) 


x | PE O 


The corresponding expressions for the second order 
magnetic and mixed correlation tensors defined by 
(2.12)- (2.14) can also readily be written down. We only 
need to use the expressions (2.9) and (2.10) in place of 
(2.2) and (2.3) where appropriate. We then obtain in 
place of (2.23) the following expressions for the other 
second-order correlation tensors of the quantized field: 


kxe9wN 7k’ x 2% 
‘ " i j 


By (k,k’; 1,22), (2.25) 
kx 20 
'Su(xyx2) = 22 «e / ) 
Ak M, k^ k E 
Xy (k,k'; 1,272), (2.20) 
k xe? 
Glee X X ( ) go^(y) 
ik Mk! k 5 
XP’ (k,k’; %1,%2) . (2.27) 


3. THE SECOND-ORDER CORRELATION TENSORS OF 
BLACKBODY RADIATION 


We will now evaluate the correlation tensors ’6, ‘SC, 
'G and 'G for the case of blackbody radiation. It would 
appear from a discussion of Glauber [Ref. 2c, Eq. 
(10.23) ], based essentially on the central limit theorem, 


: I. Mandel, Phys. Letters 7, 117 (1963). 
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that the “phase-space” distribution function $((z)) for 
blackbody radiation is given by 
1 |an(k) |? 
"nm 


Ak mik i fis 


where 4,27. (\=1,2) is the average number of 
photons with momentum k and polarization A, 


fiy 1/(e*—1), (3.2) 
with 

a—hc/KT. (3.3) 
That the “phase-space” distribution $((2)) given by 
(3.1) does indeed lead to the well-known expression for 
the density operator @ of a radiation field in thermal 
equilibrium at temperature T is verified directly in 
the Appendix to the present paper (see also Mandel"). 
If we substitute from (3.1) into (2.24), and also use 

the relation 


1 
| ax* (kJa (KTE — exp(— la (k) |2/aie) d'a (K) 


Ak TH 


1 
a | Ins 


TH 
xexp(— lav (k)|?/ñx)Pzy (K^) 
= hy dyn Oe" y (3.4) 
where 6 is the Kronecker symbol, we find that 
$, y (k,k’ 5 x1,x2) 


2rhc : 
cep HN exp{ —ik(x1— x2)) . (3.5) 


Next we assume that the linear dimensions of the 
enclosure are large compared with the mean wavelength 
of the radiation. Summation over k may then be 
replaced by integration over the whole k space, if use 
is made of the usual rules! 


1 
atlas, form, 69 


Ôrw c 6(k—k’) 3 (3.7) 


where f is an arbitrary function and 6(k—k’) is the 
three-dimensional Dirac delta function. One then 
obtains from (3.5) and (2.23), the following expression 
for the correlation tensor '6: 


he 
8 4;(e1,%2) =— [rm exp{ix(x2—x1)} 
dr? 


x » e 9 (k)g? (k)d*k. (3.8) 


=l 


uj. M. Jauch and F. Rohrlich, The Theory of Photons and 
Elecirons (Addison Wesley Publishing Co., Cambridge, 1955), 
p. 38. 


Runs tat aait 
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Next we substitute for ñ, from (3.2) and also use the 
relation 


2 : Rk; 
27 «9 (k)e;™ (k) Ue 


=1 52 


(3.9) 


which follows from the fact that £9, «©, and k/k form 
a triad of mutually orthogonal unit vectors. We thus 
finally obtain the following expression for the second- 
order electric correlation tensor ' 6 of the quantized field : 


hc Kk30;— kik; 
! Gs (1,9) = J 
4r? J k(et—1) 
Xexp(x(xo— 21)) d*k . 


(3.10) 


For the magnetic second-order correlation tensor, we 
obtain in a strictly similar manner from (2.25) 


c 
'5C 3 (471,02) 2 — [mm exp(ix(xo—21)) 
47? 
2 /kxe0WN /kxg0? 
<> ( X ) dk, (3.11) 
Mel k i k j 


Now the three vectors k x £0/k, k x £?/kE and k/k also 
form a triad of mutually orthogonal unit vectors, and 
hence a relation similar to (3.9) holds between them. In 
consequence, the right-hand side of (3.11) becomes 
identical with the right-hand side of (3.10) and hence 


we have 
"Feij (21,32) =" 655 (21,52) . (3.12) 


For the mixed second-order correlation tensor '9 we 
obtain from (2.26) in a similar manner 


hc 
‘Galore == | Bi exp(zx(xa— 21)) 
47? 


A=] 


2 kxe 
X» GN w( ) dk. (3.13) 
k /j 
It may readily be verified that 


2 k x £9 
$49 a ) = ek, 


NS k E 


(3.14) 


where e;; is the completely antisymmetric unit tensor 
of Levi Civita. Hence, if also (2.14) is used, (3.13) 
reduces to 

z hc ki 
"Gs (91,22) = — 921,32) pon EU 
xXexp(ix(xo—21))d*k. (3.15) 
Noting that x(x—23) is the four-vector product 


K(x2—931) — kcr—k-r, (3.16) 


MEHTA AND EF. 


WOLF 


where 7—1/;—1» and r— ri— rs, it follows on comparing 
Eqs. (3.10), (3.12), and (3.15) with Eqs. (5.7), (5.9), 
and (5.8) of paper I that 

' és (21,92) = 16i (3,22) , 

"SC; (1,2) = 396" (9,22) , 

'ga(xyx2) = 395" (1,2) , 


Galant) = 36 (02) . 


(3.17) 


The relations (3.17) show that the second-order 
electromagnetic correlation tensors of blackbody radia- 
tion defined for the quantized field by Eqs. (2.11), 
(2.12), (2.13), and (2.14) are proportional to the 
complex conjugates of the corresponding tensors defined 
for the classical field by Eqs. (1.3a), (5.1), (5.2), and 
(5.3) of paper I.?« 


4, HIGHER ORDER CORRELATION TENSORS 


Up to now we have considered correlation tensors of 
second order only. For the sake of completeness, we 
will now briefly consider electromagnetic correlation 
tensors of higher orders for the case of blackbody 
radiation. Again these correlation tensors may be 
defined either as appropriate averages of products 
involving the field vectors of the complex classical field 
at a number of space-time points,’>!6!7 or as quantum 
mechanical expectation values involving the corre- 
sponding field operators.? Since the equivalence of these 
different definitions has been demonstrated by Sudar- 
shan,? Mandel” and Mehta and Wolf,!® we may restrict 
our discussion to correlation tensors defined on the 
basis of the theory of the quantized field. The electric 
correlation tensor of order m-+1 is then defined by the 
equation 
COLOR eee (21,22, tYXmjYXmep»yttá ancy) 

T Tr{pÊ, O (23) 8, (£2) is Ej, CO (x 2) 

XÉ;, (Emy) ET Ej, CO (man) , (4.1) 


where É;C2 (x), É;C9 (x) are Cartesian components of 
the operators EC? (x), ÊH (x) at the space-time point 


12 The correlation tensors of the classical field could readily be 
redefined without any loss of generality, so as to lead to strict 
identities ^8;;— 8;;, etc. In particular, the factor } could be 
suppressed by employing a slightly different normalization in 
defining the analytic signal (Refs. 13 and 14) by means of which 
the complex fields are associated with the real fields. However, we 
preferred to retain the customary definitions throughout this 
investigation. 

13 M. Born and E. Wolf, Principles of Optics (Pergamon Press, 
London and New York, 1959), Chap. X. 

M D. Gabor, J. Inst. Elec. Engrs. 93, Part IIT, 429 (1946). 

15 L, Mandel, in Electronique Quantique 3eme Conference, edited 
bN Bloembergen and P. Grivet (Dunod Cie, Paris, 1964), 
p- 101. 

16 E. Wolf in Electronique Quantique 3 eme Conference, edited by 
N. Bloembergen and P. Grivet (Dunod Cie, Paris, 1964), p. 13. 

11 E, Wolf, in Proceedings of the Symposium on Optical Masers 
(Polytechnique Press, Brooklyn, New York and John Wiley & 
Sons, Inc., New York,'1963), p. 29. 

18 C. L. Mehta and E. Wolf (to be published). 
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II. CORRELATION TENSORS OF QUANTIZED FIELD 


x, defined by Eqs. (2.2) and (2.3). It follows from 
Sudarshan’s discussion’ that the expectation value on the 
right of (4.1) may be expressed as a phase-space average 
of the products of the components of the complex classi- 
cal field, in the form 


9mm) 
jij2 ejm (x1,52,* Emi Emil" “Empn) 


» [seme eoe cen SE roy) 
XEjs am): Liman (Xmes)d (2) , (4.2) 


where $((z]) is the “phase space" distribution function 
associated with the density operator 9 [cf. (2.22) ] and 


2ahc\ 2 
B,(2)=i(—*) D (Rye; (k) a (k)e**; (4.3) 


k,À 


(j=x,y,z) is the eigenvalue of the operator Ej (x) 
corresponding to the eigenstate | (z)). 

In the case of blackbody radiation, the phase-space 
distribution $((z)) is given by (3.1), which is a multi- 
variate Gaussian distribution. Now since according to 
(4.3) the field components £;,(x,), E;*(x,) are linear 
combinations of the z,(k) and z;*(k), they will, accord- 
ing to a well-known theorem on the Gaussian random 
process, be also distributed as Gaussian variates? and 
hence all moments involving them may completely be 
expressed in terms of the second-order moments? 
as follows: 


If m=n 


o(m,n) a rr on y 
On cy 112, mj mtl” Y m+n) 


-Y.8 ina; x5) 6C (gs; $4): 


X6 (rnm). (4) 
If mn 


WHEN CORE "Xm; mtl" "Xmpn) — 0. (4.5) 

The second-order coherence tensor 6fbP (x1; x»), etc. 
on the right of (4.4) is precisely the second order 
electric correlation tensors given by (3.10) and T 
denotes all permutations p, g, ---s of the non-negative 
integers m+1, m--2, ---2m. i 

Strictly similar expressions can readily be written 
down for the magnetic and mixed tensors of an arbitrary 
order. : 

Finally, it should be mentioned that the expression 
(4.4) for the electric correlation tensor of blackbody 
radiation has also recently been derived by Glauber.” 


19 Proof of this result for a real Gaussian random process is 
given, for example, in Wang and Uhlenbeck, Rev. Mod. Phys. 17, 
323 (1945). Generalization to a complex Gaussian random process 


: : md i 
i ees ane I. S. Reed, Inst. Radio Engrs. Trans. Inform. 


Theory ITS, 194 (1962). 


CC-0. In Publi 


A1153 


APPENDIX 


In this Appendix we will verify that the phase-space 
distribution (3.1), namely, 


= (A1) 


1 
&(E3)-TI— ep] = 


dk T 


Ny 


corresponds to the density operator f for radiation 
field in thermal equilibrium at temperature T. 

On substituting from (A1) and (2.20) into (2.22), 
we obtain 


* E 1 
= AL = 
Ak na(k)=0 m) (k)=0 Tür 
|z|? gni c zm (k) 
ess 
fix (ny (k) Im (k) pua 
Xexp{—|2|?} | (k))my(k)|d?z. (A2) 


Next we set z—r exp(i0), d'z— rdrd& and note that the 
integration over 0 gives 2205, (x), mx)» We then obtain 


a. Í 1 
i x | —ew|-r(1+—)| 
Ak m(x) Jo fp l ny 


1o 9X9. 09 d 
1 1N-»00 
z (142) am. a3 


Mk 


x (Ane (r) 


Next we substitute for ñ, from (3.2) and find that 


peal? eal 


àk exp(ak) 0 
Xm) 09], (A4) 
where a is given by (3.3). But 


2j exp{—m(k)ak} |m (k))( (k) | 
ny (ke) 
=exp{—4'(k)&(k)ak} Z Im Q9) (91 (AS) 
nA 
and we have also the completeness relation 
E |m(k) Xo ()| - 1. (A6) 
ny (ie) ¢ 
Using (A5) and (A6) in (A4), we- finally obtain the — 
following expression for the density operator p corre- 
sponding to the phase-space distribution function (A1): 
MN ia GLA 
p= Sn ne ae 
Xx exp (ak)/Lexp(ak)—1] 
This is precisely the density operator for a 
field in thermal equilibrium at temperature T.” 


21 A. Messiah, Quantum Mechanics (John Wiley & 
New York, 1961), Vol. I, p. 448. 
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We have measured the change in the thermal conductivity of superconducting tin, indium, and lead films 
upon application of a magnetic field in the plane of the film. These experiments were undertaken to explore 
the dependence oí the energy gap upon magnetic field and to determine the thermodynamic order of the field- 
induced transition in films. Within the range of temperatures available to us (0.35 T to 0.65 T.) the thermal 
conductivity of indium and tin films increases nearly as H?. In films of thickness 22:2800 A the thermal 
conductivity jumps at 7, to the normal state value, indicating a first-order phase transition in thin films, con- 
sistent with the Ginzburg-Landau (GL) theory. If the effect of a field upon the superconducting state can be 
adequately represented as a change of the e; of BCS, we may use the theory of Bardeen, Rickayzen, and 
Tewordt to compute ¢o(H) from our data. At T=0.65 T, we find e(I1)/e (1 —0) ^ [1— (I/H.)*]^ in 


agreement with the GL prediction. At T=0.35 T. 


satisfactory fit to our data. Orientation of the field 1 
effect on the thermal conductivity to 2-196. This is interesting in view of Bogoliubov's prediction of a 
pr* Varift term in the excitation spectrum of a current-carrying superconductor. When the field is not parallel 
to the surface of the film the thermodynamic transition is still sharply defined, but Meis reduced. Thin lead 
films gave results similar to those of indium and tin films except that at low fields the field-dependent con- 
ductivity increased more slowly than H? at low temperatures. 


I. INTRODUCTION 


T is now well accepted on both experimental"? and 
theoretical*4 grounds that near T. the superconduc- 
ing transition of a thin film in a magnetic field is of 
second order. The predictions of the Ginzburg-Landau- 
Gor'kov theory*-* (GLG), applicable here, are in good 
agreement with experimental results. At reduced tem- 
peratures /— T/T«&1, however, the GLG theory is 
inapplicable.” Bardeen? has made a calculation which 
leads to a first-order transition at very low temperatures, 
while Nambu and Tuan? find a second-order transition 
for a very thin film at all temperatures. 

The elegant experimental measurements of the field 
dependence of the energy gap employing the tunnel 
effect? have generally been limited to a few metals at 
moderately high reduced temperatures and to the case 
of fields applied parallel to the film surface. Only re- 
cently, while this report was in preparation, have meas- 
urements been extended to lead films at low reduced 
temperatures by Douglass and Meservy.?? The seex- 


* Supported in part by the A. P. Sloan Foundation, the National 
Science Foundation, and the Office of Naval Research. 

1 Present address: Morris Electronics (P) Ltd., Poona, India. 

1T. Giaever and K. Megerle, Phys. Rev. 122, 1101 (1961). 

? D. H. Douglass, Jr., Phys. Rev. Letters 7, 14 (1961). 

3 V. L. Ginzburg and L. D. Landau, Zh. Eksperim. i Teor. Fiz. 
20, 1064 (1950). : 

1 D, H. Douglass, Jr., Phys. Rev. Letters 6, 346 (1961). 

5L. P. Gorkov, Zh. Eksperim. i Teor. Fiz. 36, 1918 (1959) 
[English transl.: Soviet Phys.—JETP 9, 1364 (1959) ]. 

6V. L. Ginzburg, Zh. Eksperim. i Teor. Fiz. 34, 113 (1958) 
[English transl.: Soviet Phys.—JETP 7, 78 (1958) ]. 

7V. L. Ginzburg, Zh. Eksperim i Teor. Fiz. 30, 593 (1956) 
[English transl.: Soviet Phys.—JETP 3, 621 (1956) ]. 

3 J. Bardeen, Rev. Mod. Phys. 34, 667 (1962). 

9 Y. Nambu and S. F. Tuan, Phys. Rev. Letters 11, 119 (1963); 
Phys. Rev. 133, A1 (1964). 

3) D. H. Douglass, Jr., and L. M. Falicov, Progress in Low Tem- 

eralure Physics, edited by C. J. Gorter (North-Holland Pub- 

fishing Company, Amsterdam, to be published). 


however, €o(H)/eo(0)=1—(H/H.)* provides a more 
and || to the direction of heat flow produces the same 


periments support our findings, to be reported here, that 
the transition for a thin film remains second order even 
at very low temperatures, and they also find a similar 
field dependence of the energy gap at low temperatures. 
It should perhaps be noted that the tunnel-effect meas- 
urements selectively indicate the effective energy gap 
at the film surface for electrons moving normal to the 
surface. 

We have developed an independent experimental 
method to study the thermodynamic parameters of a 
superconducting film in a magnetic field. This method 
consists of measuring the effect of an externally applied 
magnetic field on the thermal conductivity of a super- 
conducting thin film. A preliminary report of our re- 
sults on tin and indium films has been published else- 
where." The experiments were then extended? to a wider 
range of experimental conditions and to another metal, 
lead. We report these more extensive experiments here. 
The experimental data reveal the thermodynamic order 
of the transition by inspection. By using the relation de- 
rived by Bardeen, Rickayzen, and Tewordt? (BRT) 
the data may be inverted point by point to give eo(H), 
the field-dependent energy gap averaged through the 
film. 

The external field has been applied at various angles 
with respect to the plane of the film with unexpected 
results, and the relative directions of the field and the 
heat flow in the plane of the film have been varied in 4 
search for vestiges of the anisotropy of the energy gap 


(1961) E. Morris and M. Tinkham, Phys. Rev. Letters 6, 600 
1? D. E. Morris, Ph.D. thesis, University of California, Berkeley; 
1962 (unpublished). 
13 J. Bardeen, G. Rickayzen, and L. Tewordt, Phys. Rev. 113, 
982 (1959). 
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Fic. 1. Sample and thermometer mounting arrangement. 


in a current-carrying superconductor predicted by 
Bogoliubov.4 


II. EXPERIMENTAL METHOD 
A. Low-Temperature Arrangement 


The use of carbon resistance thermometers,5 with 
their low thermal inertia and high sensitivity, makes it 
possible to measure the low-temperature thermal con- 
ductivity of thin metal films deposited on glass sub- 
strates. During our experiments, the temperature drop 
along the film is typically 0.1°K with 10-7 W total heat 
transport along the sample film and substrate. Under 
these conditions a change of 2 udeg in the temperature 
drop along the film is detectable. 

'The sample and thermometer arrangement is shown 
in Fig. 1. A 4-W carbon-composition resistor is pre- 
pared for use as a thermometer-heater (I) by grinding a 
flat surface until the resistance material just becomes 
visible. The lead wires are cut off short, and lengths of 
No. 42 Formvar insulated manganin wire are attached 
with nonsuperconducting solder. The thermal conduc- 
tivity of these lead wires is negligible in our experi- 
ments. The flattened surface of the resistor is glued with 
GE-7031 cement at one end of the glass sample-film 
substrate. Using the same cement, the opposite end of 
the substrate is attached to a copper heat sink, which is 
in turn connected to the lid of the experimental chamber. 
A similarly prepared second resistor, to be used as a 
reference thermometer (II), is similarly glued to the 
copper heat sink. 

The experimental chamber is placed in the lowest part 
of the interior of a conventional, liquid-nitrogen 
shielded, glass Dewar for liquid helium use. A thin- 
walled stainless-steel tube extends from the experi- 
mental chamber to the top of the Dewar. This tube is 
used to evacuate the chamber; it also carries No. 40 
manganin wires for connection to the thermometers. 
Since the usual power input to the heater-thermometer 
during the experiments is only ~10-7 W, thermal 


1 N. N. Bogoliubov, Zh. Eksperim. i Teor. Fiz. 34, 58 (1958) 
[English transi.: Soviet Phys—JETP 7, 41 (1958). 

18 J. R. Clement and E. H. Quinnell, Rev. Sci. Instr. 23, 213 
(1952). 
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anchoring of the manganin leads to the inside of the 
chamber lid and radiation baffling in the pumping tube 
above the chamber are both essential. The chamber is 
sealed to its copper lid with a Woods metal joint. When 
this alloy is cooled below ~8°K, it becomes super- 
conducting.!^ In order to prevent the superconducting 
diamagnetism and trapped flux at the joint from dis- 
turbing the magnetic field applied to the sample film, 
the sample is suspended near the bottom of the 4-in.- 
long chamber from a copper rod directed downward from 
the copper lid. 


B. Thermometry 


The thermometers used were Allen-Bradley 3-W 
nominal 100-0 resistors, connected in a wheatstone- 
bridge circuit. Satisfactory cancellation of temperature 
drifts was achieved without special matching of the 
reference and heater resistors. The resistance of the 
reference thermometer was measured as a function of 
the vapor pressure of the helium bath over the range 
1.2-4.2*K at a small enough current to prevent appreci- 
able heating above the bath temperature. The results 
could be approximated closely by a relation of the form 
InR=A+B/T, where A=2.89 and B= 3.26". 

The insensitivity of the resistance thermometers to 
magnetic fields was checked during several experimen- 
tal runs. If the magnetic field is increased to several 
times the critical field of the experimental film, any 
changes in the thermometer bridge balance above the 
critical field may be attributed to magnetoresistance in 
the thermometers. This is so because any effect of the 
magnetic field on the thermal resistance of the experi- 
mental films, which have electron mean free paths 
7-800 A, would in the normal state be negligible even at 
the maximum field applied. In the case of a film of rela- 
tively low critical field the resistance bridge balance was 
unaffected by an increase from H — 400 Oe to H=800 
Oe. In another experiment, increase of the field from 
7500 to 15 000 Oe produced only a small shift in the re- 
sistance thermometer bridge balance. These results are 
consistent with the behavior of similar resistors as de- 
termined by Clement and Quinnell.'^ They found that 
magnetoresistance effects were small and proportional 
to H?. Further, we expect the small magnetoresistance 
of the thermometer heater and the reference thermom- 
eter to nearly cancel in measurements of resistance 
bridge balance. ‘ 

The relation between power input and resistance 
change was measured separately for the heater ther- 
mometer and the reference thermometer in several ex- 
periments. The temperature rise of the reference ther- 
mometer at a given power input was only about 10% of 
that of the heater thermometer at the same power level. 
We may conclude that the thermal resistance of the re- 


16D, Shoenberg, Superconduclivity (Cambridge University 
Press, Cambridge, 1952). 
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sistors themselves and of the glue used to fasten them 
in place amounts to ~yp of the thermal resistance of the 
sample film and substrate. For example, at 1.2° anda 
power input of 0.090» 10-* W, the temperature rise of 
the heater thermometer was ~0.060° and of the refer- 
ence ~0,006°. This gives a value of dP/dT — 1.5X 10? 
W/deg for the reference thermometer. Berman" estab- 
lished that in the 100-0 3-W Allen-Bradley resistor (the 
same type and value as we have used) dP/dT=5.2X 10 
W/deg. The difference is accounted for by the fact that 
thermal contact was made to only a fraction of the sur- 
face of the resistor in our experiments. 


C. Temperature Stability 


The initial (unsuccessful) experiments were attempted 
at 1.3°K, without a reference thermometer. A mono- 
tonic, but not perfectly uniform, decrease of the bath 
temperature at the rate of ~1 mdeg/min during the 
measurements resulted from the decreasing helium film 
creep as the liquid-helium level declined. The resulting 
drift in the thermometer resistance overwhelmed the 
temperature changes expected from the desired effect 
of a magnetic field. Attempts were made to stabilize 
the bath"temperature with an electronic regulator con- 
sisting of a separate resistance thermometer in a 33-cps 
ac bridge, followed by a tuned amplifier, lock-in de- 
tector, and dc heater in the liquid helium. Although 
satisfactory long-term stability was achieved, amplifier 
noise in the heater produced objectionable variations 
over periods shorter than a minute. Since the total ex- 
perimental effect is ~5 mdeg, the noise level should be 
less than 50 udeg to ensure an adequate signal/noise 
ratio. 

With the introduction of the reference thermometer in 
a bridge circuit with the thermometer heater, the effec- 
tive drift rate was decreased by a factor of 10, even 
without special matching of the resistors. This per- 
mitted data to be taken without the use of any noise- 
producing feedback stabilization schemes. 

Temperatures of 1.2°K were reached by pumping at 
300 liters/min with a conventional rotary vacuum 
pump. Operation at 1.7°K was attained by adjustment 
of a valve in the pumping line until the temperature 
drift rate at the desired temperature was reasonably low. 
When operation near 2.1°K was desired, a Cartesian 
manostat was connected in the helium vapor pumping 
line. A 1-cu-ft ballast tank was connected in the pump- 
ing line between the Dewar and the manostat to reduce 
fluctuations in the pressure above the helium bath which 
were caused by the oscillation of the manostat throttle. 

Spatial nonuniformity of the bath temperature and 
bubbling of the helium usually results in intolerable 
temperature fluctuations of the sample if operation is 
attempted above the à point of the helium bath. How- 
ever, it was possible to conduct experiments at the nor- 


17 R. Berman, Rev. Sci. Instr. 25, 94 (1954). 
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mal boiling point (4.22°K) by leaving the bath com- 
pletely undisturbed and isolated from vibration for three 
hours after the liquid-helium transfer. By that time, the 
bubbling produced only a relatively small rapid fluctua- 
tion of the signal; however, the temperature drop along 
the film had to be increased to ~0.6°K (several times 
the low-temperature values) to obtain an adequate 
signal-to-noise ratio. 


D. Temperature Drop and Critical Field Spread 


'The approximate temperature drop along the film 
during the experiments at various temperatures is shown 
in Table I. The increase in AT at higher temperatures 


TABLE I. Temperature drop along experimental films and 
corresponding spread in critical fields. 


Mean AH</He 
temperature AT Indium Tin Lead 
1522 0.06° 1.5% 1.0% 0.2% 
lyfe 0.1° 4% 3.5% 0.6% 
Dat? 0.15° 10% 796 1.306 
4.5? 0.6? 15% 


was necessary to maintain the signal/noise ratio in the 
presence of greater uncontrolled temperature fluctua- 
tions and decreasing thermometer sensitivity. The per- 
centage difference in the critical fields at the hot and 
cold ends of the film which result from the difference of 
temperature is also shown in Table I. These values were 
determined from the appropriate bulk critical field 
curves. 


E. Magnetic Field 


A high-power water-cooled Helmholtz pair produced 
fields up to 1000 Oe. This relatively high-field, iron- 
free system was employed to permit precise control of 
the direction of the field by using auxiliary trimming 
coils, in order that the effect of applying the field 
slightly out of the plane of the sample film could be in- 
vestigated. The terminals of the trimming coils could 
be connected by a rotary switch to taps on a low- 
resistance nichrome strip wired in series with the main 
coil pair. In this way the angle of the resultant field, 
independent of amplitude, could be adjusted in 0.02° 
steps up to 4°. In experiments on films with critical 
fields greater than 1000 Oe, conventional iron-core elec- 
tromagnets were used. The resulting uncontrolled angle 
of the field with respect to the plane of the film is less 
than 1° and apparently it has no major effect upon the 
experimental results for these very thin films, as may 
be seen in Fig. 11. 


F. Sample Films 


Sample films were produced by rapid evaporation onto 
glass substrates held at 77°K, after thorough ion- 
bombardment cleaning of the substrate surface. 
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The substrates were No. 0 microscope-cover glasses 
about 0.11 mm thick which had been broken into pieces 
about 5 mmX 15 mm, suitable for our experiments. The 
substrates were washed in a saturated solution of deter- 
gent, rinsed in hot tap water, and air dried. The sub- 
strate was then clamped against the surface of an un- 
filled cold trap inside the vacuum chamber. The cham- 
ber was evacuated and an intense (20 mA) high-voltage 
(5000 V) glow discharge was operated for 5 min to heat 
and clean the substrate by ion bombardment. Then the 
pressure was reduced to 3X 10-?^ mm Hg and the sub- 
strate was cooled to 77°K by filling the attached cold 
trap. The vacuum pumping line contained a liquid nitro- 
gen cooled baffle to rule out contamination of the 
vacuum’ chamber by condensation of oil vapor. 

A tantalum evaporation boat containing the pre- 
weighed and pre-fused sample charge was rapidly 
heated to about 1400?C and the film evaporation took 
place in a period ranging from 1 sec for a 3-mg charge 
(giving a 1300-À film) to 6 sec for a 30-mg charge. The 
source-to-substrate distance was only 2 in. The deposi- 
tion rate of over 1000 Á/sec, along with the residual gas 
pressure of less than 107? mm Hg, ensured that the 
flux of residual gas molecules was a great deal smaller 
than that of the metal atoms during the evaporation. 
Films prepared under these conditions should therefore 
be quite pure. Cooling of the substrate during the 
evaporation is necessary to prevent migration of the 
metal atoms after they first strike the surface.'® After 
the films were warmed to room temperature and re- 
moved from the vacuum, they were annealed at room 
temperature for at least one week. (The lead films were 
annealed in vacuum to retard oxidation.) 

The sources and purities of the metals used in evapor- 
ation of the experimental films are given in Table II. 


TABLE II. Metal sources and purities. 


Metal Supplier Stated purity 

Indium Indium Corporation 99.999075 
of America 

Lead A. D. Mackay 99.999% 

Tin Johnson, Matthey 99.999% 


CONLE MM E EE E 


Films ranging in thickness from 250 to 12 000 A were 
prepared. Thickness was estimated from the measured 
critical field and published data relating the critical field 
to film thickness. As described above, all films were 
deposited on substrates cooled to 77°K. In all cases it 
was possible to prepare films which were mirror-like 
in appearance with very little diffuse reflectivity, in 
contrast to results of room-temperature evaporations.'* 
No structure was visible under magnification of 1500 
diameters except a fine granularity. — 4 

A. 250-À indium film showed no noticeable deteriora- 


Films (John Wiley & 


13 L, Holland, Vacuum Deposition of Thin 
Sons, Inc., New York, 1956), pp- 206-208. 
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tion after seven months exposure to the atmosphere. A 
700-À tin film similarly exposed for four months showed 
no visible deterioration. Oxidation of lead films, how- 
ever, was more rapid. The experimental film designated 
Pb I was exposed to the atmosphere for approximately 
5h while Pb II was exposed approximately } h. In order 
to determine the durability of lead films and the extent 
of deterioration produced by atmospheric exposure while 
preparing for an experiment, measurements have been 
made of the transmission of 0.9-u radiation through a 
lead film of about 180-À thickness, as a function of the 
duration of exposure to the atmosphere. The results may 
be roughly interpreted as a decrease in thickness of the 
metallic film due to oxidation, resulting in decreased 
reflection and increased transmission. We assume that 
Z, the impedance per square, is inversely proportional 
to d, the film thickness. Then, for transmission T««4, 
T~Z~1/d. On this basis, during the period between 
15-min and 2 h after the beginning of exposure to the 
atmosphere, Ad was ~7 A. After one day Ad was ~20 A, 
and the oxidation of the film proceeded at ~7 A per 
day for the next week. Now, PbO is a semiconductor, 
not an insulator as the above analysis has assumed; 
also, the estimation of ds from the weight of the evapor- 
ated Pb charge may be somewhat in error. Nevertheless 
we may safely conclude that the atmospheric oxidation 
of the Pb films used in our experiments was of negligible 
importance. 


II. EXPERIMENTAL RESULTS AND ANALYSIS 
A. Electron Mean Free Path 


An estimate of the electron mean free path A, in the 
experimental films will be useful in the interpretation of 
our data. According to BRT” the mean free path of the 
electronic excitations is unaffected by the transition to 
superconductivity. We may determine A, in the normal 
state from the relation for thermal conductivity 
Kn=4CentrAe where the electronic specific heat Cen 
~4X10-XT cal/mole-deg for both tin and indium,” 
and vpz109 cm/sec is the Fermi velocity. A direct 
measurement of K, is impractical because of the 
dominant thermal conductance of the glass substrate. 
It is possible, however, to determine experimentally the 
ratio of the thermal conductivity of the films to that of 
the substrate and combine this information with pub- 
lished values of the thermal conductivity of glass, Ke, 
in order to find Kn. We will denote by r the fractional 
change in the total conductance of the film plus sub- 
strate combination when the film is driven into the nor- 
mal state by a field HZ H.. The quantity r is measured 
experimentally as the change in the thermometer bridge 
balance position upon application of H>H., (ie., the 
total signal") divided by the shift in bridge balance 
position resulting from increase in the bridge power in- 


1 C. Kittel, Introduction to Solid State Physics (John Wiley & 
Sons, Inc., New York, 1953). 
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put from a small value up to the full operating current. 
When r is combined with the published values” of the 
temperature dependent ratio K,/K, of the thermal 
conductivities of the superconducting and normal 
states,!® and with d and D, the thickness of the film and 
of the substrate, we may determine K,/Kg=7rD/ 
(1—K,/Kn)d. Then Ae=3Kp/Cenvr. The thermal con- 
ductivity of the glass, A, needed in this calculation, is 
estimated from a value! of Kgu=5X10-! W/cm-deg 
at 2°K, and the assumption that Kj T!? at liquid- 
helium temperatures. The resulting values for Kg 


were used in the computation of the A, values quoted 
in Table IIT. 


TABLE III. Data leading to estimates of electronic mean free 
paths A, in the experimental films. Symbols are defined in text. 
In all cases thickness of glass substrate D=0.11 mm. 


Heater Ks Kg Kg 

d T power — (mW/cm- (mW/cm- A. 

Sample (A) (°K) (aW) r Kn deg) deg (A) 
SnII 2600 1.2? 0.00 0.2 0.1 0.26 0.11 600 
SnIII 700 1.2? 0.08 0.046 0.1 0.26 0.07 500 
InII 1600 1.7? 0.3 0.07 0.39 0.40 0.18 600 
1.8? 0.1 0.07 0.45 0.40 0.20 600 
2.19 0.22 0.11 0.61 0.55 0.73 1600 
2.149 0.85 0.07 0.64 0.58 0.5 800 
InllI 650 1.25? 0.14 0.05 0.16 0.26 0.2 600 
1.79 0.11 0.07 0.33 0.40 0.12 1000 
2.19 1.3 0.016 0.65 0.55 0.58 600 
InIV 4000 1.2? 0.00 0.41 0.14 0.26 0.26 800 
2:19 1.3 0.10 0.67 0.55 0.44 600 


a From Ref. 16, p. 83, Fig. 29. 


Note that A, has a value of 600-900 À, and seems to 
be constant with increasing film thickness between 
650 A (InIII) and 4000 A (InIV). This indicates that 
A, is limited by imperfection and impurity scattering 
rather than boundary scattering in these films. In either 
case, A. would be independent of temperature. Thus, 
the range of calculated values of A, at different tempera- 
tures in the same film gives an indication of the lack of 
precision in the determination of Ae. 

Phonon scattering of the electrons is negligible in our 
films. The phonon-limited K.g would be 2.56 10?/ T? 
W/cm-deg in tin, according to the compilation of 
Klemens.” From K,— (1/3)yTvrA, we may calculate 
at 1.2°K, A.,~0.4 cm, and at 2.1°K, A,,2:0.075 cm. 

An independent experimental estimate of the thermal 
conductivity of the glass substrate, Kg, also listed in 


. P? For tin and indium, K,/K, as a function of T/T., measured 
in bulk samples, is in good agreement with the theoretical cal- 
culation of BRT for the case of electronic thermal conductivity 
with lattice-defect scattering dominant. Boundary scattering 
would give essentially similar results. In the films we have studied, 
even if we neglect defect scattering, the electronic mean free 
path would be limited by boundary scattering to a few thousand 
angstroms at most, and thus we can have considerable confidence 
in the applicability of the experimental and theoretical bulk 
metal values of K,/K,. 

21 D. Bijl, Physica 14, 684 (1949). 

2p. G. Klemens, Encyclopedia of Physics (Springer-Verlag, 
Berlin, 1956), Vol. XIV, p. 245. 


CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar zm 


AND M. 


TINKHAM 


Table III, was made from the power input to the heater, 
the resulting temperature change in the thermometer 
resistance, and the approximate dimensions of the sub- 
strate. We ascribe the difference between Kg: and Ky 
at the lower temperatures to poor calibration of the 
thermometer resistance as a function of temperature, 
This calibration does not affect the normalization or in- 
terpretation of the experimental data; it was used only 
in the computation of Kg and in the estimation of the 
temperature difference between the ends of the film 
during the experiments (Table I). 


B. Temperature Variation of H./H.» 


We note an increase in  ,/ H e with temperature in all 
of the experimental films, (see Figs. 2, 5-8). This may 
be anticipated as a result of the increase in À with tem- 
perature, since H./H.» is determined by the ratio 
A/d. The effect has been studied in detail by Appleyard 
ef al and by Alekseyevsky™; their results are dis- 
cussed by Shoenberg.!® Our measurements of H, are 
limited to temperature T<0.7 T. where the changes of 
^ are relatively small, and our results are only of 
sufficient accuracy to show an agreement in sign and 
order of magnitude with the expected effect. Our results 
fall between (H.—H 4)/H a (T)--[1— (T/T9* 25, 
which applies for thick films (@>>)), and H./H (T) 
~[1—(T/T.)*}2, predicted by Pippard* for very small 
specimens. 

Pippard? and Ginzburg have predicted substantial 
superheating and supercooling effects in films a few 
penetration depths thick. In several experiments we 
approached the transition from above and also from 
below and found no detectable hysteresis. This may be 
a result of the thermal gradient along our films, or of 
their crystalline imperfection. 


C. Behavior of Very Thin Films 


The results of our experiments are easiest to interpret 
in the case of very thin films (d «). The applied field 
penetrates almost uniformly, and eo is also expected 
to be essentially uniform. 

Figure 2 shows results for thin (7700 A) films of tin 
and indium. For both metals, the thermal conductiv- 
ity K increases nearly as H? up to the critical field, 
the thermal conductivity in the superconducting state 
joins smoothly onto the field-independent conductivity 
of the normal metal. The location of the shoulder at 
this point fixes H. for the pruposes of normalizing the 
data. The absence of any discontinuity at He indicates 
that the superconducting transition in a magnetic 
field is second order for films of this thickness, 7 
contrast to the first-order transition of bulk super- 
conductors in a field. In other words, since K is related 


* E. T. S. Appleyard, J. R. Bristow, H. London, and À. D. 
Misener, Proc. Roy. Soc. (London) A172, 540 (1939). 

^ N. E. Alekseyevsky, J. Phys. USSR 4, 401 (1941). 

35 A. B. Pippard, Phil. Mag. 43, 273 (1952). 
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Fic, 2. Change of thermal conductivity of thin superconducting 
films with field. Note that the phase transition is of second order 
for films of this thickness. 


directly to the gap width, we can infer that the gap closes 
continuously to zero when a field is applied to a thin 
film, whereas in a bulk sample it drops discontinuously 
from nearly the full value (~98% even at the surface) 
to zero. 

The measured dependence of K upon H may be used 
to compute a dependence of the energy gap upon field 
in a simple way if the electronic term K, is primarily 
limited by scattering of electrons by lattice imperfec- 
tions. Under these conditions the ratio of the thermal 
conductivity in the superconducting state Ke, to that 
in the normal state Ken is given by Bardeen, Rickayzen, 
and Tewordt? as their Eq. (3.6). This may be written 


es 


=G(é/kT) 


en 


= | i (9 f/E)dE if Í j E:(0f/0EE, 


0 


where f(E/RT) is the Fermi function. The function 
G(ey/ ET) is plotted in Fig. 3. If we assume that the effect 
of a field upon the superconducting state can be ade- 
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Fic. 3. The function G(eo/kT) relates th 
conductivity to the energy gap; according tot 
Rickayzen, and Tewordt. 


quately represented as a change of the e; of BCS, then 
the observed quantity Ax plotted in Fig. 2 should be 
given by 


s Ka (H)—K e(O) GLeo(H)/kT]—GLeo(0)/kT] 
Fee One 


1—G[es(0)/kT] j 


This relation was used to invert the experimental data 
on indium IIT, point-by-point, to yield e;(//), with the 
result shown in Fig. 4. Because G(«o/ ET) initially drops 
only cubically as eo/ET increases from zero (see Fig. 3), 
the gap must drop more steeply to zero near H, than the 
approach of K,,(/I) to Ken there. A rather steep final 
drop ~ (H ,— H) V? in the gap near H, is necessary to re- 
produce the almost linear approach of K,, to Ken ob- 
served at that point. 


GLG theory 
Sn I| d2:2600 A 
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0.6 |- ri D 
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Fic. 4. Magnetic field dependence of superconducting energy 
gap for several tin and indium films computed from the data in 
Figs. 2, 6, 7, and 8, assuming that the energy gap is independent 
of position in the film. The curve [1—(//H,)*}/? is that given 
by the GLG theory in the limit of very thin films. 


The difficulty of distinguishing between a broadened 
transition because of film inhomogeneity and a truly 
rounded approach to K.s. because of the form of 
G(eo/kT) results in an uncertainty in the choice of He. 
The bars on the plotted experimental points in Fig. 4 
indicate the size of this uncertainty. 

Our results for the dependence of the energy gap upon 
magnetic field in a thin film are qualitatively similar to 
those from the more direct tunnel experiments of 
Giaever and Megerle! and of Douglass,” although their 
data were taken at higher reduced temperature and in a 
different metal (aluminum). From our data shown in 
Fig. 4, we see that ¢o(H)/«o(0) seems to approach 1— J£", 
where k= H/H., at low temperatures, but it is moder- 
ately well fitted at T/T.=0.63 by (1—/^)! ^. The latter 
form is that given by Douglass‘ based on the Ginzburg- 
Landau-Gor’kov (GLG) theory, which is expected to 
hold near Te. We also note that our data at T/7',— 0.36 
deviate from 1— k? and approach (1— /2)!? when the gap 
has dropped so that e (H) « £T. This behavior indicates 
that the GLG approach becomes successful, as expected, 
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when there is a large amount of thermal excitation 
present. 

We may find the approximate value of d/X for 
In III from the GLG theory. Ginzburg? has shown that 
the solution of the Ginzburg-Landau equations, in 
the limit d «3, leads to H.= (24)? (/d)H a. For In TII, 
H./H»~7, so that d~0.7\. The sevenfold increase in 
critical field over the bulk value shows that the field 
penetration must be very nearly complete, so that we 
are in this case observing the effect of a field in the thin 
film limit. 


D. Behavior of Thicker Films 


The film indium II (Fig. 5) of intermediate thickness 
d~1800 A, is still thin enough to display a second-order 
transition, but the increase of K near H. is much faster 
than H?. This behavior may be explained in terms of an 
increase of the penetration depth as the gap decreases, 
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Fic. 5. Change of thermal conductivity of a superconducting film 
of intermediate thickness with magnetic field. 


as would be expected on the basis of sum-rule argu- 
ments? Since the film thickness is comparable to the 
electron mean free path (2600 A in this sample accord- 
ing to Table III) which determines the coherence length 
£ we expect the energy gap to be only weakly dependent 
on position in the film. We may then apply the approach 
of Pippard;?? considering the balance between condensa- 
tion energy and magnetic energy to determine the gap 
(or order parameter) as a function of field. T aking 
into account the dependence of film susceptibility on 
(d/d) and the dependence of ^ on eo, one is led to expect 
a change in the form of eo( H/H) of the observed sense 
for films with d>. A similar conclusion follows from 
the GLG theory, as discussed by Douglass.* 

In Fig. 6 we see the experimental results on a moder- 
ately thick film, tin II. The data taken at 2.12? shows a 
variation of K with H which is purely quadratic (within 
1%) up to the critical field. At this point A«— 2400. 
With a further increase of field, K increases rapidly 


26M. Tinkham and R. A. Ferrell, Phys. Rev. Letters 2, 331 
(1959). 
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Fic. 6. Change of thermal conductivity of a 2600-A tin film with 
magnetic field at various temperatures. Because of poor tempera- 
ture stabilization, we have little confidence in the 1.20 and 1.65°K 
curves except for the determination of critical fields. The right- 
hand curve is for H perpendicular to the film plane. 


up to the normal-state conductivity. The breadth of the 
transition can be accounted for partly, but not com- 
pletely, by the difference in temperature of the ends of 
the film. We do not have much confidence in the 1.2 and 
1.65? curves, except for the points which determine the 
transition, because the temperature stabilization was 
poor during those particular runs. 

When we apply our method for calculation of the 
energy gap as a function of field to the 2.12? data, 
assuming a uniform gap throughout the film, we find the 
energy gap decreasing exactly as H? to eo(H)/ eo(0) 
— 0.842 at the critical field (Fig. 4). If weapply the GLG 
theory we find that the observed critical field ratio 
HJH.,—1.50 implies that d/\=3.6 and e0(H-<)/€0(0) 
—0.78. The agreement with the above value of 0.842 is 
quite good considering that at H, the gap in the interior 
of the film should be depressed somewhat less than that 
at the surface, because in this film the coherence length 
is limited by A, to a value less than the film thickness. 

We next consider the data for In IV (Figs. 7, 8, 9, 10). 
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Fic. 7. Change of thermal conductivity of a thick supercoD- 
ducting film with magnetic field, at T~0.38 T.. The first points 
are shown on an expanded scale for greater clarity. At the right 
5 the result of applying the field perpendicular to the surface 0 

e film. 
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Fic. 8. Change of thermal conductivity of a thick superconducting 
film with magnetic field, at T~0.64 Te. 


Because of the large film thickness the change in thermal 
conductivity does not exceed about 10% up to the criti- 
cal field, and the calculated energy gap variation assum- 
ing e) is independent of position, i.e., £d (Fig. 4) is 
correspondingly small. The numbers at the right-hand 
margin of Fig. 4 are found by extrapolating the com- 
puted values of the energy gap to H, at the points 
shown. These points fall almost exactly on straight lines 
until we reach the transition region, so we may be con- 
fident of the extrapolation. There seems to be a very 
slight curvature to the 0.38 T. curve, probably reflecting 
a trace of the peculiar hysteresis effects which are so 
prominent in this sample with a field orientation H 
perpendicular to d, the direction of heat flow (Figs. 
9, 10). 


E. Initial Decrease of Ax 


In indium II (Fig. 5) at the lowest temperature we 
see an apparent initial decrease of thermal conductivity 
with field. This may be a spurious effect arising from a 
slow approach to the equilibrium distribution of any 
trapped flux. We note from Table III that the heater 
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Fic. 9. Change of thermal conductivity of a thick supercon- 
ducting film with magnetic field applied perpendicular to d, the 
direction of heat flow. At the right, the effect of prior exposure o! 


sample to a large field perpendicular to the film is shown. 


A1161 


power in our experiments is only about 1075-10-? W and 
that r, the total change in thermal conductance of the 
metal and glass-sample sandwich with magnetic field 
is between 5 and 40%. Thus, evolution of heat at a 
rate of only 10-1? W («4 erg/h!) when the field is 
applied could produce a spurious effect of 1% of the 
total. Such an evolution of heat could result frorn induc- 
tion heating of the film during a slow approach to the 
equilibrium distribution of trapped flux. In many cases 
during the experiments, a small effect with a time con- 
stant on the order of minutes was noticed in addition to 
the ordinary thermal relaxation time constant, when the 
field was increased. This effect did not appear upon de- 
creasing the field or upon increasing H from zero to a 
value previously exceeded. If indeed this is related to a 
slow approach to equilibrium in the flux distribution and 
associated induction heating, the sign of the bridge un- 
balance produced by the heating would give an apparent 
depression of Ax, consistent with the sign of the observed 
effect. 
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Fic. 10. Hysteresis effects observed in a 
thick superconducting film. 


Another possible explanation of the long time con- 
stant term is heat released during adsorption of helium 
gas on the thermometer or sample surface when the 
equilibrium temperature of the heater-thermometer 
changes. It would be difficult, however, to explain the 
absence of the effect with decreasing field on this basis. 

Yet another explanation might be depression of 
phonon conductivity by scattering from excitations as 
the gap decreases. It may be easily shown that the 
phonon conductivity of the experimental films, if we 
neglect the substrate, is very small in comparison with 
the electronic contribution." However, diffuse scatter- 
ing of the substrate phonons by the electronic excita- 
tions in the film when the substrate phonons cross the 
boundary into the film may lead to an effective “modu- 


E Keetectron = 3C FAS while K ptonea = AC yt op. C. and Gs are 
comparable in the range 1-4^K in lead, tin, and indium, while 
vr/0572103. If APA, "ed, then K,/K.~10%. 


um T 


A1162 DOE 


lation" of the phonon conductivity of the substrate. The 
resultant effect may be comparable to that of the elec- 
tronic thermal conductivity of the film. This possi- 
bility is examined in the following section. 


F. Modulation of Phonon Conductivity of 
Dielectric Substrate 


The wavelength of thermal phonons in glass at 1°K 
is approximately 2000 A. Thus we may expect almost 
perfect specular reflection at the optically polished sur- 
faces of our film substrates. Thermal phonons at 1°K 
have x= 1/400 of the Debye cutoff frequency, and there- 
fore behave essentially as ordinary ultrasonic phonons. 
Experiment and theory indicate that the acoustical 
match between quartz and indium is quite good and 
leads to less than 1% reflection. Therefore, thermal 
phonons from the glass substrate pass quite freely into 
an indium surface film. Neglecting electron-phonon 
scattering in the film, the phonons will be specularly 
reflected at the smooth film-vacuum surface, and pass 
back into the glass. The effect of electron-phonon scat- 
tering in the indium film is to diffusely reflect the 
phonons back into the glass. If the phonon mean free 
path is a significant fraction of the substrate thickness, 
then the effective change from specular to diffuse sur- 
face scattering will materially decrease the thermal 
conductance of the substrate-film system. As a field- 
induced transition from the superconducting to the 
normal state will increase the electron-phonon interac- 
tion in the film, it will depress the thermal conductance 
of the system. 

From the known thermal conductivity of glass at 
2°K, and the relation K — $C"A, we calculate A~ 5X10 
cm. This is only ~ 1/20 of our substrate thickness, and 
limits the change in the substrate thermal conductance 
which may be produced by surface scattering to less 
than ~5%. 

The degree of diffuse phonon scattering in the film 
depends, of course, upon the size of Ape, the phonon 
mean free path as limited by electron scattering, com- 
pared with the film thickness. We may estimate Ape by 
using the principle of detailed balance and the measured 
values of Kep, the phonon-scattering limited electronic 
thermal conductivity, and its temperature dependence. 
Kittel? has done this in the high-temperature limit. 
Carrying out a similar analysis in the low-temperature 
limit, one finds Ap-~10-* cm for tin and 10-9 cm for 
lead at 1.2°K. Since our experimental films range from 
5310-5 to 410-5 cm in thickness, substantial diffuse 
scattering of phonons may occur within the film in the 
normal state. When the film becomes superconducting, 
this diffuse scattering will nearly vanish, since most of 
the electrons in a superconductor at temperatures far 
below T. are in the superconducting ground state and 
thus unable to scatter phonons. Even though the total 
modulation of the conductance of the substrate is 
limited to 5% in our case, since the substrate conduc- 
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tance is typically 2 to 10 times that of the film, the par- 
tial modulation obtained at moderate fields might wel] 
be enough to account for the small decrease in conduc- 
tance at low fields found for In II (Fig. 5). By employ- 
ing a substrate which has a larger mean free path than 
glass, the effect discussed in this section might be made 
quite large. Use of a material such as crystalline quartz 
(with A of several cm at 2°K) would appear to permit 
complete dominance of the thermal conductance by 
surface effects. 


G. Hysteresis Effects in a Thick Film 


During the measurements made on In IV (d=8n) 
rather peculiar hysteresis effects were observed (see 
Figs. 9 and 10). These effects appeared only when H was 
perpendicular to q, the direction of heat flow. There 
was no detectable hysteresis when H was parallel to d, 
nor was any comparable hysteresis observed in any ex- 
periments with the other (thinner) experimental films. 
We consider the most reasonable explanation to be 
scattering of the heat-carrying electronic excitations at 
normal-superconducting boundaries. 

We may assume that field inhomogeneity and film 
curvature make necessary small normal-state strips to 
carry flux through the film. If the super and normal re- 
gions have the shape of narrow lamina lying parallel to 
the direction of H, then when H.1.d a large number of 
interfaces would obstruct the heat flow when a moder- 
ate field is applied.1* Hysteresis would be a result of 
trapped flux affecting the arrangement of domains. 
Saturation of the film with trapped flux by application 
ofa large field perpendicular to the film surface before a 
measurement will ensure that the maximum amount of 
scattering at S-N boundaries occurs before the measure- 
ment field is applied. This procedure does appear to 
suppress the initial decrease of K (H), as we see in 
Fig. 9. Following saturation of the film with trapped 
flux, application of a subcritical field parallel to the 
plane of the film would sweep out some of the previously 
trapped flux and partially restore the original behavior, 
consistent with the results shown in Fig. 10. 


H. Search for pr-Vari Term in the 
Excitation Spectrum 


In view of Bogoliubov’s prediction of a pr: Varift 
term in the excitation spectrum of a current-carrying 
superconductor, resulting in an anisotropic energy gap, 
we varied the orientation of the field in the plane of the 
film between parallel and perpendicular to the direction 
of heat flow in the film. The directional dependence of 
AK(H) was studied in several films. In tin 1, Fig. 6 
(d= 2600 A, T=2.12°K, A.~600 A), no effect was founi 
within the experimental accuracy of about 1%. In 
indium II, as may be seen in Fig. 5, the result js not 50 
clear-cut. Nevertheless, the apparent deviation of the 
Hå points from those for H||à is not of significant 
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magnitude, and the deviations may be explained by poor 
control of the alignment of H in the plane of the film 
during the experiments. An experiment was conducted 
at 2.1°K on film Pb I, which was similar to Pb II 
(Fig. 11) except for an unexplained initial negative por- 
tion of its A«(/I) curve. The curves with Hq and Hjá 
were identical within the experimental uncertainties, 
except for a shift in //, probably arising from a differ- 
ence in the accuracy of alignment of H in the plane of 
the film. Since the induced diamagnetic currents are 
orthogonal to the field, these measurements also show 
the absence of a dependence on the angle between heat 
current and diamagnetic current. Thus, if we interpret 
our results in terms of a decrease of the energy gap with 
increasing field, the modified gap appears to remain 
essentially isotropic. Of course, even if there were a 
p-v term, first-order effects would cancel in a transport 
property such as K, which involves an average of exci- 
tations over the entire Fermi surface. However, as the 
change of e; with H approaches 100%, noncancelling 
second-order effects would be expected to appear, and 
these would be expected to differ by a factor of several 
for the parallel and perpendicular cases, because of a 
different angular average. Thus, another factor must 
enter to explain the lack of anisotropy. It may well have 
to do with the fact that in these films, the two equal and 
opposite surface currents are separated by a distance 
comparable to £o. Alternately, the short mean free path 
(7800 A) in these samples would be expected to aver- 
age out anisotropies as in the dirty superconductor 
theory of Anderson. Both of these characteristics of 
the experimental situation contrast strongly with the 
Bogoliubov idealization, which corresponds to a spatially 
uniform current arising from a displaced sphere of mo- 
mentum eigenfunctions characteristic of an ideal sample. 

The apparent lack of anisotropy is important in 
evaluating the results of tunnel effect measurements on 
the magnetic field dependence of the energy gap since, 
for specular tunneling, this method determines the en- 
ergy gap normal to the surface of the film, this direc- 
tion being perpendicular to the directions of both the 
applied magnetic field and the induced current. This 
directional aspect of the tunneling experiment would 
also interfere with any attempt to observe the v-p 
effect in the presence of an externally introduced uni- 


form current flow. 


I. Magnetic Fields Not Parallel to 
the Surface of the Film 


the thermal conductivity of 
al films with field intentionally 
e surface. The results were 


We have measured 
several of our experiment 
applied at an angle to th 


quite striking and unexpected. : 
Tin II ual indium IV are films thick enough to show 


first-order transitions in fields parallel to the surface; 


23 P. W. Anderson, Phys. Chem. Solids 11, 26 (1959). 


that is, the energy gap is only slightly depressed frorn 
its zero field value even just below the transition. This 
may be seen by inspection of Fig. 4. Application of a 
field perpendicular to the film might be expected to pro- 
duce an intermediate state configuration with the frac- 
tion of the material in the normal state approximately 
given by H/H.” Thus, we might expect Ac(H) to in- 
crease linearly from zero at H=0 to 1.0 at H -— H,., 
although the curve might “sag” somewhat as a result 
of both additional scattering of the electrons at S-V 
boundaries and the partial flux penetration of the super- 
conducting regions. 

Our measurements contrast strikingly with the above 
prediction. As we see in the right-hand portions of Figs. 
6-8, Ax(H) apparently increases exactly as H? up to the 
normal-state value, which is reached in these samples 
at a field between 42% and 55% of the bulk critical 
field. Our results suggest that the transition is essen- 
tially complete when K (H) first reaches K n, as there is 
no change in K with further increase of H (see Fig. 7). 

A possible explanation of the above behavior lies in 
the fact that for these films d is comparable with A, £s, 
and A,, in contrast to the usual case of a thick plate in 
the intermediate state with a transverse field. In the 
latter case the superconducting and normal domains are 
arranged as thin lamina parallel to the field in order to 
minimize the free energy.!^ This is not possible in our 
thin film samples, since domains shaped as thin lamina 
parallel to the field would require adjacent S-N bound- 
aries to be closer together than a coherence length £o, 
and this would contribute an excessively large positive 
term to the free energy. We might suppose that, when 
our films are in a moderate magnetic field, the super- 
conducting domains are blunt lamina whose width is 
determined so as to minimize the sum of the free energy 
terms arising from the demagnetizing factor of a blunt 
superconducting domain and from close spacing of the 
S-N domain boundaries? On general grounds, one 
might expect an array of vortices? about normal lines 
to be more favorable energetically than lamina involv- 
ing normal surfaces. Such a structure would have 
qualitative properties very similar to the laminar 
structure. 

We have measured Ax(H) for Pb II with the field 
applied at angles 0 of 0, 1.3 (0.4), 5, 10, 20, and 90° 
with respect to the plane of the film at T=2.16°, as 


s 

2? This is expected because the field at a superconducting-normal 
interface in a bulk sample must be equal to He; then Z will be 
close to Z/, throughout the small normal domains, while in the 
interior of the superconducting domains H=0 (if the domains 
are much larger than the penetration depth X). Thus, the flux HA 
which passes through area A of the film penetrates normal regions 
of total area 4, — I4 /H.. 

3 Since our domains will have dimensions of the same order as 
the coherence length, the energy gap may be nonzero nearly 
everywhere. In this case we may imagine the energy gap to oscillate 
as a function of position, with the magnetic flux density being 
greater in the regions where the gap is small. The remainder of the 
argument is qualitatively unchanged. 

31 M, Tinkham, Phys. Rev. 129, 2413 (1963). 
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Fic. 11. Change of thermal conductivity of a superconducting 
lead film with the field applied at various angles with respect to 
the plane of the film. 


well as at 0, 30, and 90° for T=4.5°. The results are 
shown in Fig. 11. We see that as @ is increased from zero 
the apparent critical field decreases, but the transition 
appears to remain well defined. The general shape of 
Ax(H) does not change rapidly with 6, but for large 6, 
AK(H) becomes more nearly linear in H. At 6=90°, the 
apparent critical field has decreased to a value of the 
order of H e». 

An attempt has been made to fit the experimental 
values of H,(6) with various analytical interpolation 
functions. These are based on the assumption that the 
parallel and perpendicular components of the magnetic 
field produce additive contributions to the free energy, 
and that when the sum of these effects reaches a critical 
value the film is driven normal. Assuming the contribu- 
tions to the energy to increase as simple powers of the 
components of the field, this leads to a form 


H.cos0N"  /H. sinfN^ 
eas 
Ha I Ha 
Since magnetic field energies normally are proportional 
to H?, one might expect that m=n=2. However, this 
gives a very poor fit at small angles. In fact, a consider- 
ably better fit was found to result from the choice 
m=n=1. Subsequently, it was shown?! by detailed con- 
sideration of the vortex model for the perpendicular 
field case that one would expect »—1, but m=2. A 
comparison of the degree of fit of these three interpola- 
tion formulas to the.critical field data on Pb II has been 
given previously?! A similar comparison with data of 
White on a thin indium film has also been given.” Both 


of these comparisons favor the theoretically based func- 
tion, with m=2 and n—1. 


J. Behavior of Thin Lead Films 


As we have discussed previously, the predictions of 
the GLG theory fail at low temperatures. The low- 


2 M. Tinkham, Rev. Mod. Phys. 36, 268 (1964). 
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temperature limiting behavior of ¢(77)/¢o(0) would be 
of interest in guiding theoretical efforts. Since facilities 
for producing stabilized temperatures below 1.2°K were 
not available to us, experiments were undertaken on 
lead since 1.2°K equals only 0.18 T. for this metal. Our 
detailed results are not easy to interpret and may be a 
reflection of the peculiarities of superconducting lead, 
rather than characteristic of superconductors in general. 
However, we may assert with some confidence that the 
superconducting transition in a field is of second order 
for thin films (dX) over the range of temperatures 
from 0.63 T, at least down to 0.18 Te. This conclusion 
follows from the observed absence of any discontinuity 
in Ax at H,, as was discussed in Sec. ITIC. 

Experimental results for Pb II, a lead film about 500 Á 
thick, are shown in Fig. 12. Another film,” Pb I, gave 
similar results except for an unexplained initial decrease 
in Ax which appeared at 4.2, 2.1, and 1.2°. The initial 
decrease of Ax for Pb I at 2.1°K was the same for 
Hq and H||d. Because of this divergence of results be- 
tween Pb I and Pb II, we have less confidence in the 
data in the low-field region than we have near He. In 
Fig. 12, the rounded approach of the 4.5° data to the 
normal-state conductivity may be accounted for quanti- 
tatively by the 0.6° temperature drop along the film, 
and the consequent 15% critical field spread (Table I). 
In contrast, at 2.16 and 1.27° the critical field spreads 
are small (see Table I), and the rounding of the curves 
at the transition results from the form for small €o/kT 
of the G function (Fig. 3) connecting K and €o/kT, as 
discussed in the analysis of the In III data. 

If we naively invert the data of Fig. 12 point-by- 
point in the usual manner we find the apparent rela- 
tions between e; and Z which are given in Fig. 13 for 
the various temperatures. At each temperature, the 
upper curve follows from assuming the zero-field gap 


A Pb Il 
K o x 
d=500 À H.Là 
DT= 45 °K 
He = 3700 oe 
- 7.4 Hep 
oT = 216 °K 


0.6F 


He = 5500 oe 


=7.4H 
0.4 cB 


an e T=1.27 °K 
Hc = 5900 oe 
=7.6 Hep 


3000 4000 
Field, Oersteds 


0 1000 — 2000 5000 6000 


Fic. 12. Change of thermal conductivity of a superconducting 
film with magnetic field at several temperatures. 


. This film was prepared in a similar manner to Pb II, and has a 
similar critical field, but it was exposed to the atmosphere for 5h 
before mounting, compared to an exposure of à h for Pb II. 
However, we expect that this duration of exposure would have pro- 
duced negligible oxidation (Sec. IIF). 


CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


Digitized by Arya Samaj Foundation Chennai and eGangotri Y 


Mt MS ml 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


EFFECT OF MAGNETIC FIELD 
value based on the BCS result e95=1.75 AT, at T=0°K. 
If we take instead the value 2.1 kT, as suggested by 
several experiments^-** the lower curves result. 

If one were to take Fig. 13 at face value, one would be 
forced to conclude that the variation of ey with H is dif- 
ferent in lead and indium at the same reduced tempera- 
ture /(— T/T.). Although the 4.5? lead curve and 2.1? 
indium curve (Fig. 4), both with 22, seem to be consis- 
tent with the GLG result e9(H)/¢9(0) =[1— (H/H) I”, 
we see that lead at 2.16° seems to be incompatible with 
indium at 1.2? even though the reduced temperatures 
are nearly the same (¢~0.35). It is just possible that the 
experimental curves for Pb II reflect some spurious 
effect of the sort that produced the initial negative 
curve for Pb I, although the deviation would have to be 
of opposite sign from that observed in Pb I in order to 
make the Pb II and In III results compatible. Inversion 
of the Pb I data produces an eo(H)/eo(0) versus H? 
curve with the same qualitative deviation from the 
indium curve, once the field is greater than about 
(H/H,)?=0.3, [so that we are well above the region 
where K (H) decreases from the zero-field value J. 

An alternative, and probably more sound, explana- 
tion of the discrepancy is that the relation derived from 
BRT which we have used to connect the thermal con- 
ductivity variations with changes in the energy gap is 
not completely valid, at least in the case of lead. 

It is well known that the thermal and electrodynamic 
properties of lead and mercury deviate considerably 
from the law of corresponding states. It has not been 
possible to accurately test the applicability of Sec. 
3 of the BRT theory to these metals, because phonon 
scattering contributes to the thermal resistance as a 
result of the low values of Opebye. But, since K,—3 
C.orA, and the experimentally determined?" C.(T) is 
different from that predicted by BCS, on which the 
BRT theory is based, we may expect that the BRT cal- 
culation will not account satisfactorily for the thermal 
conductivity of lead and mercury. 

Upon comparison of Figs. 4 and 13, we note that if 
the effective value of the energy gap of lead were to 
decrease from the usual value at 4.5? to about 1.4 kT. 
at 2.16? and about 1.0 kT. at 1.27°, both of the low 
curves for lead could be made to coincide with the 1.3° 
indium III curve. This suggests that we might combine 
the Ax(H) curves of Pb IT with the €o(H)/eo(0) result 
found for In III at 1.3°, and find the function Gi() 
which would be necessary to connect them. If we now 
subtract G(x) from Gi(x), we find that the difference, 
AG (a), is a function which is zero at x— 0, increases to a 
maximum of about 0.35 for v between 4 and 6, and then 
decreases with further increase of v. Let us gener- 
alize G(x) to include an «effective density of states 


2 :chards and M. Tinkham, Phys. Rev. 119, 575 (1960). 
D. ML Glacberg and M. Tinkham, Ds 118, 990 (1960). 
8T. Gi Phys. Rev. Letters 9, 1 ; 

x p D. E. Mapother, and R. W. Shaw, Phys. Rev. 


112, 1888 (1958). 
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0.2} 


— — «9(T=0) = 175 kTc 
Cty, «o(T-0) = 21 kTc 


9 
0 0.2 0.4 0.6 0.8 1.0 


Fic. 13. Dependence of energy gap of a lead film on field as cal- 
culated from the data of Fig. 12 using the G(e/kT) function from 
Fig. 3. For each experimental point, two gaps are computed as 
shown: one assuming the gap (2e) approaches 3.5 kT, at T=0 
as found by BCS; and one assuming the gap goes to 4.2 kT, as 
found experimentally. 


pLE/ e(H) ]. Note that this brings us completely outside 
the BRT theory, and thus we can have no confidence in 
the result except as a phenomenological description. Let 


Gix)— | p(E/ e) (0 f/3E)dE d | EXOf/9E)dE 
0 0 


and take p(E/«)) 21 for E> ey (the usual BRT form). 
Then 


AG(x) = Jj p(E/ e) EO f/ EE vf | Fe f/AE)dE. 
0 9 


We have attempted to fit AG(x) with several simple 
forms of p(£/eo). Within our very limited accuracy of 
AG, we find a satisfactory fit for the delta function 
p(E/€o) = 36 (E/€o)—43], or for the parabolic function 
p(E/ e) — 3(E/ey?. We emphasize that we do not take 
the above to be conclusive evidence for the existence of 
“states in the gap” of lead, or for the location of any 
such hypothetical states. Rather, these considerations 
are intended to indicate the major scale of the departure 
of the data on lead from that on indium at low reduced 
temperatures. We conclude that lead behaves in some 
respects differently from other superconductors even at 
the same reduced temperatures, but the cause is not at 
present fully understood. b 


IV. CONCLUSIONS 


From the results of the preceeding sections we may 
draw several conclusions: First, the thermodynamic 
transition in a field may be quite sharp and well- 
defined even for films sufficiently thin (700 A) that 
H_->7 Ha. The thermodynamic transition appears to be 
reversible and without hysteresis for typical films up to 
2500 A thick. Our experiments give direct evidence that 
the superconducting transition is of second order in 
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films which are thin (compared with the penetration 
depth) for temperatures ranging from T. down to 0.18 
Ta the lowest available temperature, in disagreement 
with the theory of Bardeen but in agreement with that 
of Nambu and Tuan? 

We have applied the theory of BRT," which is based 
on BCS, to our data on thin films of Pb, Sn, and In, on 
the assumption that their theory is valid for 70 with 
a suitable field-dependent gap parameter e (H). At 
T=0.65 Ta, the function eI) which results from our 
analysis is essentially identical to that predicted by 
GLG and experimentally found in tunnel measurements. 
At lower temperatures (0.36 T), our eX (I1) computed 
with the aid of BRT is different, indicating a break- 
down of the GLG theory for TKT., or possibly a failure 
of the method of data analysis via BRT. 

By varying the relative orientation of the direction of 
heat flow and the applied magnetic field in the plane of 
the experimental films, we unsuccessfully sought evi- 
dence for vestiges of the pr-v term in the excitation 
spectrum of a current-carrying superconductor, pre- 
dicted by Bogoliubov.^ Since the experimental condi- 
tions differed radically from those envisioned by 
Bogoliubov, this negative result is not surprising. 
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Application of magnetic fields at various angles up to 
90° with respect to the film plane still produced sur- 
prisingly well defined thermodynamic transitions, al- 
though H, was reduced considerably below the parallel 
field value. The transition in a perpendicular field 
appeared to be second-order even for films of thickness 
up to 4000 A, whose parallel field transition was 
definitely first order. The perpendicular critical field 
values and the angular dependence of H. seem to be 
explained satisfactorily by the theory of Tinkham.* 

Lead films gave results qualitatively similar to indium 
and tin films, but quantitatively the form of the derived 
e9(H) at low temperatures was different from that of 
tin and indium. We suspect that this is connected with 
the well-known peculiarities in the thermal properties of 
superconducting lead. 
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With the recently available very intense coherent maser light, interesting higher order coherent Raman 
effects have been observed in solids, liquids, and gases. Garmire, Pandarese, and Townes have given a 
phenomenological theory of these higher order coherent Raman effects. In this paper a somewhat more de- 
tailed theory of these effects is given. T' he analysis is based on a perturbation solution of the Boltzmann 
equation for the density matrix of the Raman active medium. General expressions for all the relevant in- 
duced dipole moments and absorbed or emitted Stokes and anti-Stokes radiations in the ordinary or higher 
order coherent Raman processes are given. These cover a great variety of possible experimental situations. 
Several specific simple examples are studied in some detail and the results of the phenomenological theory 


are reproduced from the general results given here. 


I. INTRODUCTION 


apps first-order Raman effects have been investi- 
gated extensively in the past both experimentally 
and theoretically. With the recently available very 
intense maser light, interesting higher order coherent 
Raman effects in solids, liquids? and gases? have also 


* Work supported in part by Contract No. AF 19(628)-3862, 
AFCRL, Hanscom Field, Bedford, Massachusetts. 

1 G. Eckardt, D. P. Bortfeld, and M. Geller, Appl. Phys. Letters 
3, 137 (1963). 

2B. Stoicheff, Symposium on Molecular Structure and Spec- 
troscopy, Ohio State University, June 1963 (unpublished); 
Phys. Letters 7, 186 (1963); H. J. Zeiger, P. E. Tannenwald, 
S. Kern, and R. Herendeen, Phys. Rev. Letters 11, 419 (1963). 

3R. W. Mink, R. W. Terhune, and W. G. Rado, Appl. Phys. 
Letters 3, 181 (1963). 
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been observed. Garmire, Pandarese, and Townes‘ give a 
detailed macroscopic theory of these higher order 
coherent Raman effects, particularly those through 
excitation of intense coherent molecular oscillations at 
infrared frequencies. Many interesting conclusions were 
drawn from the theory and some of these have already 
been observed experimentally. In this paper an attempt 
is made to give a somewhat more detailed analysis of the 
same effects on the basis of the quantum-mechanical 
Boltzmann equation for the density matrix characteriz- 
ing the Raman-active medium and to include in the 
theory the effects of various relaxation processes. 


4E. Garmire, F. Pandarese, and C. H. Phys. Rev. 
Letters 11, 160 (1963). SS) EMI Townes, Phys 
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The effects of spatial dispersion? are not, however, 
explicitly taken into account in the calculations here. 
This may be a serious omission in some cases involving 
liquids and solids, but in the cases of gases it is most 
likely not. When spatial dispersion effects are neglected, 
a local description of the Raman-active dielectric 
medium is possible. The study of the various higher 
order coherent Raman effects can then be carried out in 
several steps. 

In the first step, we consider a volume of the di- 
electric with linear dimensions large compared with the 
intermolecular spacings but small compared with the 
wavelengths of the interacting photons and the coherent 
molecular vibrational waves (or other polarization 
waves). Within a volume of this size, the radiation fields 
can be assumed coherent and uniform, and the medium 
can be characterized by a spatially independent density 
matrix. The quantum-mechanical Boltzmann equation 
governs the time response of the density matrix due to 
the applied fields. The average induced dipole moment 
per molecule or molecular oscillations in the medium 
produced in various Raman processes can be obtained 
from the corresponding higher order terms in the per- 
turbation solution of the Boltzmann equation. From 
these one can determine the frequencies of the Raman- 
scattered Stokes and anti-Stokes radiations and the 
rate at which they are absorbed or emitted locally. In 
Sec. II an explicit formulation of the problem in terms 
of the density matrix is given along with a formal pro- 
cedure for obtaining the appropriate perturbation 
solution. 

In the second step, we consider a volume of the dielec- 
tric medium with linear dimensions large compared with 
the wavelengths of all the waves of interest but still 
small compared with the distances over which the ampli- 
tudes and phases of the waves change appreciably. The 
harmonic spatial dependences of all the waves can 
therefore be directly introduced into the various expres- 
sions obtained in the previous step. When macroscopic 
spatially coherent effects are considered, one assumes a 
fixed phase for each of the electromagnetic waves. Upon 
averaging the resulting expressions over such a volume, 
one obtains the conditions under which each Stokes or 
anti-Stokes radiation will be absorbed or amplified on 
the average. These include the so-called phase-matching 
conditions which give the propagation vectors of the 
Stokes and anti-Stokes radiations. 

In the final step, one may consider questions which 
are only significant on dimensional scales over which the 
amplitudes and phases of the varlous waves vary 
appreciably. These involve questions such as energy 
transport, boundary effects, geometrical effects, etc. 

In the present study we do not, however, go beyond 
the second step in any detail. The primary concern here 
is in the intrinsic properties of the higher order coherent 


5 See, for example, J. J. Hopfield and D. G. Thomas, Phys. Rev. 
132, 563 (1963). 
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Raman processes as discussed by Garmire, Pandarese, 
and Townes. 

The above assumptions and basic steps are implicit in 
the calculations and discussions given below. 


II. THE DENSITY-MATRIX EQUATION 


We ignore first the spatial dependence in all the 
variables. Under the usual assumptions,’ the quantum- 
mechanical Boltzmann equation for the density matrix is 


dp/dt= — (i/ F)LH,o]—3ET (o—p)+ (o—5)F], (2.1) 
where the total Hamiltonian 
H=Hot+V=Ho—p-E (2.1a) 


is the sum of the unperturbed Hamiltonian of the mole- 
cule, Mo, and the interaction energy of the molecules 
with the total electric field, E. In a representation in 
which Ho is diagonal, T, which characterizes phe- 
nomenologically the effects of various relaxation proc- 
esses, is a diagonal matrix I'44 7/5055: 1, where 1 is a 
unit matrix with the dimensionality of the corresponding 
eigenvalue of Hy. The matrix p is diagonal, and the 
diagonal elements are proportional to the thermal 
equilibrium populations of the corresponding energy 
levels at a given temperature T in the absence of V, or 


p=exp(—Ho/kT)/Trace[exp(—Ho/kT)]. (2.1b) 


p(t) determines the average induced electric dipole 
moment, p(/), per molecule: 


p()=TrLpe()], (2.2) 


where p is the electric dipole operator. 

In studying Raman effects we consider three energy 
levels, ei, €2, and e, of the molecules (Fig. 1) where level 
1 is the ground state and level 2 could be a vibrational 
level, a rotational level, or an electronic level if electronic 
Raman effects are considered. Level 3 is the inter- 
mediate state. All these levels will in general be de- 
generate. For the basic first-order Raman process, it is 
sufficient to assume that the total electric field consists 
of two components: 


E(t) » [E,ci«st4- E sees] 
HCE Eet]. (2.3a) 


The elements of interest in the perturbation term V 
of the total Hamiltonian are assumed to be: 


: 
Via 15 V 2a,28= V30,38= V 1a,28=0, 

Via.38= V 15,35?) e22t4- V35,35 (7 *e~ #2! 7 
V2a,387 V20.28 Pett Vsayant ere, 


for wa and w sufficiently close to the correspond- 


(2.3b) 


*R. Karplus and J. Schwinger, Phys. Rev. 73, 1020 (1948); 
W. E. Lamb, Jr., Lectures in Theoretical Physics (Interscience 
Publishers, Inc, New York, 1960), Vol. II, p. 435; Third Inter- 
national Symposium on Quantum Electronics, Paris, France, 1963 
Cruises 
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(o) Fic. 1. Schematic diagram of 
energy levels and fields involved in 
(a) ordinary and (a), (b) higher 


3 ——— — — —6-—— order Raman processes. 
^ wd 
2 
1 
(b) 


ing transition frequencies wsı=4(es—e:) and wsz 
=) (es—ea), and 


Vma, ng = — (mjo.| p- Ea,»| 2,8) . 


In Eqs. (2.3b) and (2.3c), œ and f designate the de- 
generate states of the energy levels em and en, respec- 
tively. We also assume that 


(2.3c) 


(2.3d) 


wa— Gb: li" (e3— e) = 21, 


but wa and cy are not so near the corresponding transi- 
tion frequencies that the direct processes become 
important. 

As in the usual theory’ of ordinary Raman effects for 
a single molecule, which is based on the perturbation 
solution of Schródinger's equation, we obtain the 
steady-state solution of the Boltzmann equation (2.1) 
by perturbation techniques.5 Specifically, we formally 
expand the elements of the density matrix: 


Pma,nB= Dma,nBO + pma,ng VF Qe -+ Qna, ng Y+ d (2.4) 


in successive orders of approximation corresponding to 
ascending powers of the amplitude of the electric field. 
The lowest order term 


fom) 
exp| wmn 

a! 1I s 

X(—1)Bna,nsd! , (2.5) 


which would simply be equal to fma.ne if the latter is 
purely diagonal and. time-independent. In Eq. (2.5) 


Io > 3 (Ymt Yn) D (2.6) 


which is independent of the degeneracy indices a and f. 


(0) — 


Dma,nB 


7See, for example, M. Góppert-Mayer, Ann. Phys. 9, 273 
(1931); G. Placzek, Handbuch der Radiologie (Akademische 
Verlagsgesellschaft Leipzig, 1934), Vol. VI, Part II, pp. 209-347. 
For more recent studies in connection with maser work see, for 
example, A. Javan, J. Phys. Radium 19, 806 (1958). 

8 C. L. Tang and H. Statz, Phys. Rev. 128, 1013 (1962); 
H. Statz and C. L. Tang, Third International Symposium on 
"Quantum Electronics, Paris, France, 1963 (unpublished). 
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The successive terms, i.e., j2 1: 


t 1 
exp ioni) G= i) 
(ort 


mn 


i 
Pma, ng? a TEN 2L 
h ly 


X [ Vina tyPiynp TRE Pma,ly G-Dy py ng JUV . (2.7) 


This series solution, (2.4)- (2.7), is formally identical to 
that used in the study of nonlinear interaction of 
oscillating modes in a gas maser, or in similar problems, 
where the nonlinearity is due to multiple successive 
single-photon processes." Raman processes are true 
multiple-photon processes. 

Mathematically, the distinction is quite simple, and 
it lies in how the intermediate steps in the iterations 
implied in (2.7) are carried out, apart from the reso- 
nance conditions. Very briefly, in the successive single- 
photon process, one calculates alternately the off- 
diagonal and the diagonal elements in the successive 
steps of (2.7) (see Ref. 8). In the ordinary Raman 
process, as in the situation depicted in Fig. 1(a), in 
which simultaneous absorption of a photon at wa and 
emission of a photon at w» occur, one calculates, for 
example, the second-order term pia,ss ? directly from 
piss, and ps,,55 7. This will become clearer in the 
following sections. 


II. ORDINARY COHERENT RAMAN EFFECTS 


In the ordinary Raman process, only two photons are 
involved as shown in Fig. 1(a). Following the usual 
procedure,® we obtain from Eqs. (2.7), (2.5), (2.3b), and 
(2.1b) the lowest order terms in the perturbation 
expansions of the off-diagonal elements of the density 
matrix 

ANG la gy Deitat 
Pia,3y D = ONU ae : (3.1a) 
AL1+i@a—os1)T 13] 


iA Bai Ts og as Veeti 
dc eue ee (3.1b) 
h[1--2(0v— 032) Tos ] 


where 


DnB.nB* Dmm Dnn (3.2) 


Apnn= Pma,ma 


for any « and f. Afmn is diagonal and is independent of 
the degeneracy indices by virtue of (2.1b). Substituting 
(3.1a) and (3.1b) into (2.7), we obtain for the second- 
order term in the off-diagonal matrix element 


V35,3, ® Va ag D *T 3563 05001 
D1,28 E p S 
y J[14(o.—o:—o)T12] 
[ Apa21 23 ApaT3s 


xeu Ameis —á— (3.3) 
1—i(@p—w92)P23  1+-7(@a—ws1) T'1s 


In the proper limits this term corresponds to the second- 


? B. Senitzky, G. Gould, and S. Cutler, Phys. Rev. 130, 1460 
(1963); H. Haken and H. Sauerman, Z. Physik 173, 261 (1963); 
N. Bloembergen and Y. R. Shen, Phys. Rev. 133, A37 (1964). 
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order term in the perturbed wave function of the 
Schródinger equation in the theory of ordinary Raman 
effects for a single molecule.” It contains all the neces- 
sary information for calculating the coherent molecular 
oscillations excited by the light waves. If electronic 
Raman processes are being considered and dipole transi- 
tions between levels 1 and 2 are allowed, it describes the 
induced dipole moment oscillating at «&:1—«,—« pro- 
duced by the light waves at wa and ws. 

This coherently driven molecular oscillation described 
by pia2a gives rise to a third-order term pi4,5, 9, 
which results in an oscillating dipole moment per 
molecule at wa: 


p, ea Ds [D2s 1a012,2y 9 + Pax, 1a P Pia,a7 | 


any 


- om 


Yay’, B 


; (b)* 17 ( 
1X aD3y,10V 12,337 © Vay" 28 * V 28,5, 09 


Keivatteic, ; (3.4a) 


where 
TT 13 
Xa 
T [1--i(o4—0,— 2) T 12 ]L1H-$ (064—031) T 15] 


Apa»T oa Apa T 13 y 
e ss 
1—i(e,—032)T»s 1+ilwa—wz)Ti3 


and an oscillating dipole moment at w: 


Po) =}, [pay.28028,2y ?) + p3y,28 Dog, ay] 
By 


= Y) iXepsy.20V 28,99 Vay 1a *V10,37 
YB y'a 
Xett e.c., (3.4c) 
where 
T's T os 
X= 
[1 —i(wa—wo—Wa1) T 12 Hilwa 632) 23] 
D321 Apa 
[ ApaoT 23 Pail 1a | (3.44) 
1+ilws— ws) T23 1—ilwa—wz)T 13 


with the help of (3.3), (2.7), (2.3b), and (2.2). Not all 
the terms in the sums over the degeneracy indices in 
(3.4a) and (3.4c), as well as (3.3), are nonzero. Many 
will vanish of course due to selection rules or other 
special circumstances. . 

These dipole moments at frequencies wa and Wb, 
Pa® and pè, induced in the Raman process will lead 
to energy exchange between the molecules and the 
oscillating electric fields at the corresponding fre- 
quencies. The average absorbed power is equal to the 
time average of the scalar product of the rate of decrease 
of the dipole moment and the corresponding oscillating 
component of the electric field. We obtain from (3.4a) 
for the average power absorbed per molecule at the 


frequency wa 
Pa= Ref M Dus Xs V ay 1a 9* Via ay © 


Yab 1 
XVay 28 Vag”). (8.5) 
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Similarly, from (3.4c) the average absorbed power per 
molecule at c; is 


Py=Ref 25, 2wWXwVy 25 9 * V 25,3, P 


TB! en 


XViy 12 (9* V5, Oe (8:0) 


We next introduce the spatial dependence of the 
perturbation terms V'*? and V% into (3.5) and (3.6) 
and obtain the absorbed power averaged over a volume 
large compared with all the wavelengths of interest but 
small compared with the distances over which the 
amplitudes and phases of the waves vary appreciably. 
We consider homogeneous plane waves: 


Via 35? = V 15,35 /?e7 tkt , 


(3.7) 


7 EST (b) g— ik ps 
V 2a,3'”) T Vso,s8 ZE iky 2 


where r is the position vector of the average molecules 
characterized by the density matrix and V,,,,,5/*-" is 
spatially independent but complex. 

Substituting (3.7) into (3.5) and (3.6) and averaging 
over r, we obtain the average power per unit volume 
absorbed at the frequency wa: 


(24); Ref z 2NwaXaV ay, 1a V4 V 14,5, (0 


vah 


XV 2T a83}, (38a) 
and that at ws: 


(Poje=Ref X) 2YoeyG V ar, HV ag, O 


YB y'a 


X Vay 12 9*V 155,1) D (3.8b) 


where Ñ is the density of the active molecules coupled 
to the fields. In both cases the final results are inde- 
pendent of the propagation vectors. Unlike the higher 
order Raman effects which will be considered in the 
following section, there are no phase matching condi- 
tions here. The average absorptions per unit volume, 
(3.82) and (3.8b), are always directly proportional to 
the local absorptions (3.62) and (3.6b). 

On the other hand, substituting (3.7) into (3.3), we 
have 


ee » V 1,2, 9 V 1,25 9 *T1 
Par OREL Qo RE 
A paa T 23 ApaTis 
1—i(ws—w32)T 2, 1+i(we—wa1)T 13 
Xexpi[(wa—ws)t— (k.—k:»)-r], (3.9) 


which means the molecular-oscillation wave (or other 
polarization waves) produced in the coherent Raman 
process has a propagation vector equal to k,— ka. 

If wa designates the primary light, say the maser light 
at w, then (3.8b) describes the first Stokes radiation at - 
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ws; =w): 
(Ps,)r= Ret x Nos Xs V sy aa 50 * V 29,5, SY 

ypy’ a a K 
; Visa 25V qas, 0), (3.102) 
where 

T2T 25 

Xs: 


WL m i(@»—ws,— a1) T 12 ]L14- i (ws, —W32) To3 | 
ApsoT 23 Apalis 
x a6 | . (3.10b) 
1+i(ws;—ws2)T23 1—i(i—en)Tis 


If w» designates the primary light, say the maser light 
at œn then (3.82) describes the anti-Stokes radiation at 
MAS=Wa? 


(Pas)r=Re{ 2j 


DNoASXASV sy 14 4 *V 14,2, (49? 


vay. 8 
XWVsyag(0*V59,, 00), (3-100) 
where 
TisTis 
Xas= 3 : 
AS [1--i(o4s—e1—0n)Ti 1-1 (645—035) Tis] 
ApseT 23 Apalis 
: |. 6409 
1—4(91— 0932) T»; 1-Fi(oas—o3)Tis 


Equations (3.102)- (3.10d) contain all the information 
about the Stokes and anti-Stokes radiations in the 
ordinary Raman process. These expressions are very 
general though somewhat complicated. In order to see 
the physical interpretations of these results better, we 
consider some simple limiting cases. 

We first simplify the x's. Let us assume that 


T's, T o3 > (3.11) 
and 
Apsa7 — po», Api — Pi. (3.12) 
Therefore, we have from (3.10b): 
" 1 Trot il 12? (01—08,— 21) 
lm Xs,=— | 
Wares WL 1+ (wm wsi wz) Ti? 
1 a 
x| rias asd —iP— — | 
Qs, — 32 
» A 1 
X| — serios mon) + ipa — 
Qs, W32 
= py (wim ws) ipu. | (3.13a) 
@1— W31 


where P in (3.13a) designates principal values. The 
6-functions’ contributions correspond to the effects of 
the single-photon processes between levels 1 or 2 and 3. 
These effects are usually not considered as a part of the 
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Raman process. One should note, however, that a clear- 
cut distinction can easily be made only in the case 
Tis, To5 — 09. Experimentally, in the cases where the 
Raman processes are important, the direct single-photon 
processes usually are not, so that one does not have to 
make a distinction and both the 6-function contributions 
and the principal value signs in (3.13a) can be omitted: 


1 [ Tiz 
lws — 32) L1H lomosin) T?" 


Xsı = — 


lwir wsi — 92) T 12” 
+ | 


i 
1+ lwi os, — 921)? T 1» 


pu P22 
x dip Pee Ey 


Qj—031 WS,— W32 


] . (3.13b) 


We also assume that the primary light is linearly 
polarized in, say the 2 direction 


E;-$Eg-kU, (3.142) 
and that the Stokes radiation is 
Es,— [sin6s,$--cosó s 2 JE s, 
Xexp(—iks,-r+ips,). (3.14b) 


With (3.13b), (3.142) and (3.14b), (3.102) and (3.10b) 
can be readily evaluated if, for example, levels 1 and 2 
are nondegenerate; one obtains for the Stokes radiation 
the familiar result‘: 


2N (pu-— po2)ws,T 12 
953 A; [14- (t1— 9 s,— 921)? Ti] 
X |2ul?l pa EE cos'ós,, (3.152) 


(Ps) 


where Ay=w31—w1~w32—ws, near the resonance peak 
at w)—wg,~wn and |pmn| is the magnitude of the 
reduced matrix element of the operator p. Since pu 25; 
(Ps,);<0, and Stokes radiation is emitted. It is emitted 
in all directions with the radiation pattern determined 
by the angle, 05, between the polarizations of the maser 
light and the Stokes light according to cos*6;. 

Similarly, one obtains for the absorbed anti-Stokes 
radiation per unit volume‘: 


2N (p11— pz2)94 ST 12 
973 A[1-- (wrm wsio) T1] 
X | p13|2| pos] SEE As? cosas, (3.15b) 


(Pa S)r 


where Ao=ws1—was~w32—@1. It is clear from (3.15b) 
that anti-Stokes radiation is always absorbed. 

These are only some special cases. A great variety of 
more complicated situations may arise in experiments 
and the general expressions (3.10a)—(3.10d) must then 
be used. 


j j 


MT We LL P 


* 
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IV. HIGHER ORDER COHERENT 
RAMAN PROCESSES 


A. General Theory 


In the higher order Raman processes more than two 
photons are involved simultaneously. Let there be 
photons at angular frequencies e, and wa as shown in 
Fig. 1(b) in addition to the photons at wa and w, as 
shown in Fig. 1(a). 

Physically, a light wave at we, propagating through 
a medium which is being modulated by coherent 
molecular oscillations, excited by two other light waves 
at wa and w, as described by the second order off- 
diagonal density matrix element pia,23, (3.3), will in- 
duce an oscillating dipole moment at w=w.+ (wa—ws). 
Similarly, the light wave at wa can induce an oscillating 
dipole moment at w-=wa—(wa—w»). Both are higher 
order Raman effects. This is not unlike the double- 
resonance effects? where the coherence between levels 
land 2 is induced directly by an applied field of angular 
frequency «s. In the present case, this coherence is 


ENT RAMAN EFFECTS A1171 
induced by two applied fields at w, and «wy, where 
Wa — Wb 031 — 02327 Wal. 

Mathematically, the presence of two additional 
oscillating fields adds new elements in the perturbation 
term, V, in the total Hamiltonian for the density- 
matrix equation: 

V 14,55 (D eioat-L- Via ag Pe iat, (4.1a) 
and 


Vraag Pett V, agg fout, (4.1b) 


In the perturbation solution of the density-matrix 
equation, a third-order term in the off-diagonal density- 
matrix element pi5,5, ? appears which comes from sub- 
stituting into (2.7) the second-order off-diagonal ele- 
ments pia,28 and p214 and the new perturbation 
terms given in (4.1b). Similarly, a third-order term 
p23,34? appears. These give rise to new induced dipole 
terms. 

From (2.2), (2.7), (3.3), and (4.1a), we then obtain 
the average induced dipole moment per molecule at 
wa— (wa—ws) : 


Pa-at P= $ iXd_apvP3y,28V 28,377 Vaya’ *V 14,3, Meted-watovitt cic, (4.22) 
TB Y'a 
where 
Ti2T 23 Aps2T 23 ApaiT 13 
Xda ee MEE mo (4.2b) 
WTA —1(wa—wy— 21) P19 JL 1-2 (e4—95-95— 032) Tos ]-1--1(05— 032) Tz 1—ilwa 921) T 1a 
and the induced dipole moment per molecule at wet (e — wa): 
Pepa—b?? = SE 1X c¢a—bP37,1aV 12,347 7? Vay og P * V ag, ay (gioca ot) tt c.c, , (4.3a) 
ve T. 
where 
TT 13 Afs2T 23 ApsiT 13 1 
Xepa-d= - - e |: (4.3b) 
T3[14-:(ea—05— 021) T'12 ]L 14-2 (Wet 94 — 095—631) T 13 ]- 1 ilw) — 12-2 (954—031) T'13 
Let the primary light be the maser light at wz; we can where E 
study all the higher order Stokes and anti-Stokes LaL BENE 
radiations with the help of (4.22), (4.2b), (4.3a), and ^% P1 — (ras rn Pa EE REED 
4.3b). 
6) a ps ApsaT 23 ApaT 13 
1. First Stokes Radiation NU ae (4.5b) 
1+7(@;—w22)T 23 1-i(was—en)T 13 


The following combination of frequencies can lead to 
the first Stokes radiation in the higher order Raman 
process involving four photons: 
Wp=O1, 


Wa=WAS, Wal, 


and (4.4) 


We=Wg— Wat Oo=O1—OASTOL=O5;- 
Substituting (4.4) into (4.2a) and (4.2b), we obtain the 
induced dipole moment per molecule at ws, produced in 
the process: 
ps (P= » 


yy'a 


: Dy (AS)* 
iX sy Pay,28V 28,37" V sy" 1a 


KV 10,37 D 69 5:H- c.c. , (4.52) 


X F, Bitter, Phys. Rev. 76, 833 (1949). 
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and use is made of the fact that w4s—w;=w1—w5,. 

Following exactly the same steps as used in the 
previous section, we find the avérage absorbed power 
per molecule: 


Ps/—Re( X 2esXs'Vay 25/5 * Vg o P 
yya 
XVay aa (49*V1,,4,00).— (4.6) 


By introducing the plane-wave spatial dependence of 
the fields, (3.7) and 


Via sy (49) = V3, (49 exp(—ik4 sro (4.7) 
into (4.6) and then averaging over r, we obtain the 
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average absorbed power per unit volume: 
(Pg) = Re( p» 2Nos Xs, V ay, sg 59 *V op, ay? 
1 Bry’ x 
XV ay? aa S)*V 15,54 0) Okt ikv ksi-kas- (4.8) 


In contrast to the ordinary Raman processes, in the 
present case there is a phase matching condition‘: 

ki tk) —Kks,—kas=0 (4.8a) 
in order for the higher order coherent Raman process to 
take place. Whether (4.8) corresponds to absorption or 
emission will depend upon the detailed nature of the 
molecular states and the relative phases of the fields 
involved, as expressed in the bracketed factor in (4.8). 
We shall examine some special cases later. 


2. First Anti-Stokes Radiation 

For the first anti-Stokes radiation we take the follow- 

ing combination of frequencies in (4.3a) and (4.3b): 
Qa—0], Wb=Ws,, QGc—0Ol, 
and (4.9) 
Ga—ori-91—095,— 048. 

The induced dipole moment per molecule at was 
produced in the higher order Raman process is 


pas ®= È 


ya,’ b 


iXAs' pay 1aV 12,317 P Vay, og SY 


X Vap 3y V 65948t4- c.c. , 
A TiTi 
* Pitie os on) Tic Titi was—on)Tu] 
| ApssT 23 Apail is 
1—i(ws,—w32)To3 1--2(9—031)T 13 


from (4.9) and (4.3a) and (4.3b). The average absorbed 
power per molecule is then 


Pas'=Re{ »] 2wasXas Vay 1a 49 * V 15,3, O 
qe, y’ b 


(4.10a) 
where 


] (4.10b) 


XVayag(59*V55,5,0). (4.11) 


Upon introducing the spatial dependence of the fields, 
(3.7) and (4.7), and averaging over r, we obtain the 
absorbed power per unit volume: 
(Pas)—Re( X, 2NwasXas Vs, 14 (49*V ay O 
a.v. B 
XV sy 26S) *V 08,27} Gepekr- ks xas. (4.12) 


Again, one has the phase matching condition, (4.8a).4 


Y= DL Fera P Vagar O V ay 104 9*V 10,37 
YB y'a 


TANG 


3. Higher Order Stokes and Anti-Stokes Radialions 


By taking suitable combinations of frequencies, 
properties of successive higher orders of Stokes and anti- 
Stokes radiations can be studied in much the same way 
using (4.2a), (4.2b), (4.3a), and (4.3b). 

For example, for the second Stokes radiation, one 
takes the following independent combinations of 
frequencies: 

Q,7 01, WEUS, O4—O$8, 


(4.13a) 


w= Vd — Va Hw = OS lwr ws,) — O85 
and 
Wa=WAS, WEW, Wd=W5), (4.13b) 
w=ws (wAs—01) 7 95; 
Physically, these correspond to the cases where a light 
wave at ws, propagates in a medium which is modulated 
by the coherent molecular oscillations excited by two 
other light waves at o; and ws, (4.13a), or was and ci, 
(4.13b). As a result, absorption or emission at ws, may 
take place. It is now obvious how the absorbed powers 
could be calculated. It may be of interest to look at the 
phase matching conditions. 
It can be shown readily that for the first case, (4.132), 
the phase matching condition is! 


ks ks/—k;—ks,— 0; (4.142) 
and for the second case, (4.13b), it is 
kas—ki—ks,+ks,=0. (4.14b) 


It is clear that the general results given can be readily 
applied to a great variety of interesting experimental 
situations. We now study some specific simple cases. 


B. Special Examples 


Consider the first Stokes radiation in the higher order 
process. As in Sec. III, let us assume again that all the 
light waves are linearly polarized: 


VO —$E;, (4.152) 
V0 — (pz sin&s,;-- p. cos0s;) E s,e'*5:, (4.15b) 

VAs) = (pz sinÜ4s cosdast+py sinÜ4s sins 
+p, cos0as)Base¥4s, (4.15c) 


in a suitably chosen coordinate system. With (4.15a)— 
(4.15c), we can evaluate the sums in (4.8), assuming 
that level 2 is a vibrational level; for example, 


= 5 EE s Ease tsitas, p] T(1,0) | 3,8Y (1a T (1,0) | 3,a) 
XL[3,e| T (1,0) |2,8) cosés,+ (1/2) ((3,0| T(1, —1)|2,8) — (3,| T(1,1)|2,8)) sin6s, ] 
X[X3,6| T (1,0) | 1,0) cos@as— (1/V2)(3,8| T (1,1) | 1,2) (sin&4 s cosóA s-I-i sinbas sin$As) 


a, B 


+ (1/v2)(3,8| T(1, = 1)| 1,0) (sin&4s COSÓAs—4 sinÜ4s sin$4s)] D 


(4.16) 
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where the tensor operators are 


1 i 1 i 
T(1,0)— 5;, T(, —1)2— EAT ES , T(1,1)2 — Sob 
) ae ig ( , ) d "a Pv S 


A1173 


(4.16a) 


If, furthermore, levels 1 and 2 are nondegenerate, Eq. (4.16) simplifies to 


Y -i pul l| pal BrE s Ease 7948 costs, costas. 


(4.16b) 


Assuming in addition that Tiz, T2; —5 and none of the single-photon frequencies are very near any of the tran- 
sition frequencies, substituting (4.16b) into (4.8) and (4.5b), and taking the proper limits, we obtain for the average 


absorbed first Stokes radiation per unit volume: 
2N (Bui— Pos), T 12] pial ?| p23! eE s E, S 
973 [14- (e4s—01— 921)? T3?? ]A142 


(Psy')r= 


cos6s, CoSÜA s 


X[cos (Vs, tas) - T12(94s—01— 921) sin (Ysi HYas) Jxickr—gi-kas, (4.17) 


where in the denominator, as before, 


4Ai—0635—08, and 4s—0933—017:031—0948- 


On resonance,‘ one has absorption of the first Stokes radiation, or (Ps,'),>0, if 7/2« (Ws,+Was)<3m/2, and 
emission of the first Stokes radiation, or (Ps,), €0, if —7/2« (Ws,tWas)<x/2. 
Under similar circumstances, one finds the absorbed anti-Stokes radiation per unit volume: 


2N (B11— Po) AST 19| fia] ?| £23] 2B PB s Bas 


(Pas): = 
OHTI (e45— wiw) T1? JArA2 


COSÜ s, coSÜAs 


X [cos(Ws,+Wa s) —T (wa s— wr wn) sin (Yst ya 8) ]6xp4-4y—ks1—kas 3 (4. 18) 


where, in the denominator, use is made of the fact that 
€)1— 031740 8,— 0032 and oco4sg—9031701— 032. 


Therefore, on resonance, when 7/2« (Ws,+Was) « 37/2, 
(P 4s), X0, and one has emission of anti-Stokes radia- 
tion; when —/2« (Vs;d-VAs) «7/2, (Pas) 2:0 and 
one has absorption of anti-Stokes radiation.* 

It follows also from (4.17) and (4.18) that on reso- 
nance and when the phase matching condition issatisfied 


(Pash r was = — (Psi h os, s (4.19) 


which is similar to the so-called Manley-Row relation, 
well-known in the theory of parametric amplifiers in 
microwave tube work and derived classically.” 

One should note also that when, for example, three 
photons are simultaneously involved in the higher 
order Raman process, the net absorption of the first 
Stokes radiation is not just (P's, }r. It should be (Ps) 


11 p. K. Tien and H. Suhl, Proc. IRE 46, 700-706 (1958), with 
references to the original work of Manley and Row. 
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plus (Ps,);, since both terms could be of the same order 
of magnitude. Similarly, the net absorption of anti- 
Stokes radiation is the sum of (Pas): and (Pas): An 
examination of these expressions shows that when 
Es, É 4s one could have coherent amplification of anti- 
Stokes radiation of the right phase. 

It is clear that specific results for other higher order 
Stokes and anti-Stokes radiations can be obtained in 
much the same way. 

Most of the results and conclusions obtained in these 
simple examples, as well as those given in the previous 
section, have been given and discussed in detail in Ref. 4 
on the basis of a phenomenological theory. In more 
complicated cases, the general results given in Sec. IITA, 
as well as those given in Sec. II, will have to be used. 
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Experiments by Hobart and Franken in 1961 demonstrated a shift in the free-electron spin-resonance 
frequency due to collisions with sodium atoms whose electron spins were polarized. The results have been 
interpreted on the assumptions that the S-wave phase shifts were proportional to the wave number $ and 


that higher partial waves were not important. The 


triplet and singlet electron-sodium scattering lengths 


of Salmona and Seaton were used, but no check on the calculation was made since the sodium polarization 
was not known. Recently Balling, Hanson, and Pipkin have measured the shift for collisions with highly 
polarized rubidium atoms. They have also derived complete formulas for the frequency shift and line broad- 
ening in terms of the phase shifts with all partial waves included. In the present paper, an alternate deriva- 
tion of the frequency shift and line broadening formulas is given and the experimental results are discussed 
on the basis of the phase-shift information that is available for hydrogen, sodium, and cesium. The small 
size of the observed ratio of frequency shift to line broadening for rubidium appears somewhat surprising in 


view of the larger ratios calculated for other alkalis. 


1. INTRODUCTION 


MEASUREMENT of the free-electron g factor 

has been reported by Dehmelt.1 His method 
involves polarization of the electrons by exchange 
collisions with sodium atoms which in turn have been 
polarized by optical pumping.?^? An exchange broaden- 
ing cross section of 2607a4 or larger at 400°K was also 
measured. Salmona and Seaton‘ discussed this result 
on the basis of their calculated S-wave scattering 
lengths and found that the zero-energy-exchange cross 
section is large because the triplet and singlet scattering 
lengths are of opposite sign and are both large. 

Hobart and Franken have repeated the electron 
g-factor experiment using a more homogeneous mag- 
netic field.55 During the experiment, they discovered 
a shift in the electron-spin-resonance frequency with the 
sodium polarization and density. The sample tempera- 
ture was about 410°K and the magnetic field about 
3.7 G. The pulsed discharge which produced the elec- 
trons was short and uncorrelated with the modulation 
frequency used to observe the electron resonance. An 
argon buffer gas at pressures of between a few Torr 
and about 50 Torr was present, so the electrons were 
presumably thermal. The frequency shift was in- 
terpreted as due to phase shifts in the off-diagonal 
elements of the electron-spin density matrix introduced 
by the electron-sodium exchange collisions. Maximum 
shifts of about 100 cps were observed under conditions 
where the electron resonance broadening by electron- 
sodium exchange collisions was about 2000 cps (half- 
width at half-maximuim).” 

"The expected ratio of frequency shift to line broaden- 

1H. G. Dehmelt, Phys. Rev. 109, 381 (1958). 

? A. Kastler, J. Phys. Radium 11, 255 (1950). 

3H. G. Dehmelt, Phys. Rev. 105, 1487 (1957). 

* A. Salmona and M. J. Seaton, Proc. Phys. Soc. (London) 77, 
617 (1961). 

5J. Hobart and P. Franken, International Conference on 
Optical Pumping, Heidelberg University, April, 1962 (unpublished 


abstract). ^ 
6J. L. Hobart, thesis, University of Michigan, 1962 (un- 


published). ee 
1 P, Franken (private communication). 


ing for the conditions of the Hobart and Franken 
experiment has been estimated? using Salmona and 
Seaton's triplet and singlet S-wave scattering lengths. 
'The formula obtained, including only the S-wave 
contribution to the shift and broadening, was 


(d'/s) = — Pya (sin2Ao))av/{(1—cos2Ao))av- (1) 


Here d’ and w’ are the frequency shift and half-width, 
Pyais the sodium electron polarization, Ao— 89(1) —8o(0) 
is the difference of the triplet and singlet S-wave 
phase shifts, and the average is over the thermal 
distribution of electron energies. If the scattering 
length approximation for the S-wave phase shifts is 
adequate and higher partial waves are not important, 
the observed ratio of shift to broadening requires a 
sodium polarization of about 10%. However, since the 
sodium polarization was not measured, the obser- 
vations do not provide a check on the correctness of 
Eq. (1) or of the assumptions which were made. 

Recently Balling, Hanson, and Pipkin? have measured 
shifts in the free-electron spin-resonance frequency due 
to collisions with polarized rubidium atoms. In this 
work the rubidium polarization was 90% or higher and 
the electrons were believed to be well thermalized. 
They also derived complete theoretical formulas for 
the shift and broadening of the electron spin resonance 
due to electron-atom exchange collisions in terms of 
the differences between the triplet and singlet scattering 
phase shifts for all partial waves. Their interpretation 
of the observed ratio of shift to half-width of —0.05 
-E0.01 was that the S-wave phase-shift difference was 
near some multiple of (7/2) for electrons of the average 
energy, with (m/2) itself most likely, and that higher 
partial waves were probably not important. 

In Sec. 2 of the present paper an alternate derivation 
of the frequency shift and line broadening formulas for 
electron-atom exchange collisions is given. This deri- 


3 P. L. Bender, International Conference on Optical Pumping, 
Heidelberg University, April, 1962 (unpublished abstract). 

? L. C. Balling, R. J. Hanson, and F. M. Pipkin, Phys. Rev. 
133, A607 (1964). 
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vation is based on the expressions one would obtain 
from Baranger’s general impact theory" if the triplet 
and singlet states were energy eigenstates during the 
time between collisions. The proof that the resulting 
formulas hold for arbitrary alkali nuclear spin is given 
in the Appendix. 

In Sec. 3 the available theoretical information on 
phase shifts for electron-alkali scattering is reviewed. 
The modified effective range theory expansions of the 
S-wave phase shifts in terms of the wave number k 
appear useful for hydrogen and sodium but break down 
for cesium in the temperature range of interest. Phase- 
shift calculations as a function of energy for hydrogen 
and cesium and the scattering lengths for sodium are 
used in discussing the frequency-shift ratio for rubidium. 
The experimentally observed result is not easy to 
understand in this way, and thus more extensive 
phase-shift calculations as well as further frequency- 
shift experiments with cesium and sodium appear to 
be desirable. 


2. ALTERNATE DERIVATION OF ELECTRON SPIN 
RESONANCE FREQUENCY SHIFT 


For the experimental conditions of Hobart and 
Franken> and of Balling, Hanson, and Pipkin,’ the 
alkali and buffer gas densities are low enough so that 
the impact approximation is very good. The number of 
electrons is much less than the number of alkali atoms, 
and we thus consider only electron-alkali collisions. 
Magnetic interaction energies are small enough to be 
neglected during collisions. We take the axis of quanti- 
zation as along the magnetic-field direction. The 
components of the free-electron spin and the valence- 
electron spin along the axis are labeled as m. and my, 
respectively, while the total electron spin and its 
component along the axis are S and ms. We assume 
that the alkali nuclear spin is zero for simplicity and 
show in the Appendix that the result is independent 
of the nuclear spin. However, we do require that the 
off-diagonal elements of the alkali atom density matrix 
in the energy (ie, F, Mr) representation between 
collisions be negligible. 

If the triplet and singlet states of the coupled free 
electron and alkali-valence electron were nondegenerate 
energy eigenstates between collisions, we could use the 
general impact theory results of Baranger'? to obtain 
the shift and broadening of the transition between these 
levels in terms of the scattering phase shifts &i(1) and 
8,(0) for the triplet and singlet states, respectively. 
The subscript refers to the /th partial wave. Taking 
the triplet as the higher level, the half-width and the 
shift are given, respectively, in angular frequency by 


W= Cos Í dal f (9) — fo(9) D (2) 


> 
av 


10 M. Baranger, Phys. Rev. 111, 481 (1958). 
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is e (2rnv/k) R[ f (0) — fo(0) ] 


Himi Í a0 fo") fl2) — fol) /i63) » (3) 


Here fi(Q) and fo(@) are the triplet and singlet elastic 
scattering amplitudes as a function of direction, R 
means the real part, is the number of scatterers per 
aj, v is the electron velocity in ao/sec, ao is the Bohr 
radius for hydrogen, and # is the electron energy in 
units of 13.6 eV. The average is over the electron- 
velocity distribution, which is assumed to be Max- 
wellian. Since 


fs(8) = (2ik) Z (21-1) (24) —1) Pi(cosé), (4) 
L 
where S=1 and 0 correspond to the triplet and singlet 
amplitudes, we find 


w= (Grnv/ E): (21+-1)[1—cos2Ar])av, (5) 
i 
d (— (xnv/ k) (214-1) [sin2A;]av , (6) 
1 


where A;=6,(1)—6:(0). Converting n and v to cgs units 
and labeling the most probable electron velocity as v5 
and the corresponding value of k as km, we have 


Tg HUN A eS. 
w=( =~ )E(aH-1)((1—~cos2a)/(/sn))or (7) 
L 


2 l 


TOG Heg, c M 
= (S paren eina / Ge. O 
This is equivalent to the following collisional rates of 
change of the density matrix elements “os; in the 
electron-spin-coupled representation: 


(d°o;;/dt)co.= — (w— id)'oi; (9) 


for i corresponding to a singlet state and j to a triplet 
state, and 


(d0;;/dt)co.=9 (10) 


for i and j both corresponding to singlet or both to 
triplet states. 

In reality, the states between which transitions are 
observed are the free-electron spin states. We will let p 
be the free-electron spin density matrix in the usual 
representation. Ignoring the off-diagonal elements for 
the alkali spin density matrix s and also ignoring 
correlations between the free-electron and alkali- 
electron density matrices, we can write the total spin 


om Ea 
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density matrix c— pX1 as 


(11) 
dp =e 0 0 Bou Box 
E 0 0 Box Bpss 
Transforming to the electron-spin-coupled representation by the transformation 
*g —T'oI", (12) 
where M, Mr 
mon —ü os P P ME MET 
2» 2 2) 2 2» 2 2) 2 
S, Ms 
i, fl 1 
1,0 1/v2 1/v2 
=: p (13) 
7 il, il 1 
$ 0,0 —1/v2 1/v2 
gives S, Ms 
x 1,1 1,0 ‘ly al 0, 0 
^ S, Ms 
1,1 


— (Api3)/V2 
(pu— 4)/2 


(Bpx1)/V2 
0, 0 — (401)/V2 (911— 4)/2 (Bp) /V2 


other elements are not necessarily zero but are not needed. From Eq. (9), setting p=4(w— id), 


S, Ms 
1,1 0, 0 
S, Ms 


1,1 
1,0 


V2P* Apis 

—$* (pu — A) 

—V2p* Bpa 
V2pApn —$(p1— A) —V2pBpi2 0 


resentation by the transformation 
(do/ dt)co1. =T" (deo/dt)coi. T 
y +4, X 
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Finally, by contracting we obtain 
M. 
M, : Ei 
X j —jdw(pu— A) —[jw+hid(A — B 
b (dp/df) coi. = f a Meee (18) 
—2 | —[iw—4id(A—B) Ion 2w(p1—A) 
^ The above collisional rates of change for the off- We have k4,(290?K) — 0.0429 and &, (410? K) — 0.0510. 
diagonal elements of p correspond to the following For the hydrogen S wave, we take the results of 
half-width and frequency shift for the free-electron spin — Schwartz. However, his lowest points are for #7=0.01 
resonance: and 0.02. We must therefore use an interpolation 
w'-iw, (19) formula. O'Malley, Spruch, and Rosenberg" have 
he RES , shown that the normal effective range theory expansion 
d'=1P 4d, (20) ; ie y c 
is not valid in the presence of a long-range potential 
where P4— A4 — B is the alkali electron spin polarization. such as the polarization potential V(r)- —a/r*. How- 
These formulas are the same as those obtained by ever, a modified effective range theory expansion which 
Balling, Hanson, and Pipkin? by essentially repeating is valid for atoms which have spherically symmetric 
Baranger's calculation for the case of electron exchange ground states has been given by these authors. From 
collisions. They have been rederived here mainly for Eq. (2.3) of O’Malley," we obtain 
pedagogic reasons. The ratio of shift to broadening DA VITIS 
for the case where only S-wave collisions are important Boats) AC 273 . 
agrees with our earlier estimated value except that the -F$LAQ) —A(0) J(a/ad)EInk-F0(7), (28) 
velocity averaging procedure is now specified. A,=0(&?). (29) 
3. LOW-ENERGY ELECTRON-ALKALI Here A(1) and A(0) are the triplet and singlet S-wave 
ELASTIC SCATTERING scattering lengths and « is the atomic polarizability. 
n 24 D |o nek : 

We are now ready to review the available information momi A eue ; E Lat) je ] Ne bere | 
on low-energy electron-alkali elastic scattering phase : ae ee D. (s PEE "s f F: tonge o ki | 
shifts. The only calculations available which extend coefficient of kink and including a &' term, we obtain 
over the necessary energy range are for hydrogen and Ao—4.1965--25.2//Ink-4-2AR^ , (30) | 
cesium. For sodium, only the S-wave scattering lengths | 
are available. where the coefficient of k’ has been fit to Schwartz’s | 

We define value at k=0.1. With this interpolation formula, 

: *[5—0.339 and *19— 0.066. 
T1=((sin2A1)/(0/2m) av » (21) For the P wave we use the results of Burke and 
eT; ((1—cos2A;)/(0/%m))av, (22) Schey.!® Including only a ° term and fitting to their 
I f j lowest point, we find A1— 3.85, *I1— 0.001, *7,— 0.000. 
| R=(—d'/Paw'), (23) Also from Burke and Schey, higher partial waves are 
Den». (24) negligible. Thus, Rz (290^ K) — 5.2. r 
L. Then s (0/tm)= (k/km) For cesium, the only calculated phase shifts which 


R-[X:QH-D*I4)/ P» QH- 1*1], (25) 


where 


sJ = (4/A/m) | "asin2A «(B eds, (26) 


ef = (4/4) | "E —cos2A,(E)]e-?d«. — (27) 


For hydrogen and cesium, We will use T= 290°K, 
corresponding to the rubidium experiments of Balling, 
Hanson, and Pipkin,’ while for sodium we will also use 
T=410°K, corresponding roughly to the sodium 
experiments of Hobart and Franken** and of Dehmelt. 
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appear to be useful for the present problem are the very 
recent ones of Stone and Reitz. Values of Ao, A, and 
As obtained from their results are given in Table I. 
It appears that the first three or four terms of the 
modified effective range theory expansion are not 
adequate to represent the results even at moderate 
values of k. This is not surprising in view of the high 


11 C. Schwartz, Phys. Rev. 124, 1468 (1961). 

nT. i B red L. Spruch, and L. Rosenberg, J. Math. Phys. 
2, 491 (1961). ; 

5T. F. O'Malley, L. Rosenberg, and L. Spruch, Phys. Rev. 
125, 1300 (1962). st 

“T. F. O'Malley, Phys. Rev. 130, 1020 (1963). dc 

15 N. F. Mott and Í. N. Sneddon, Wave Mechanics and its 
Applications (Clarendon Oxford, 1948). fet 

16 P, G. Burke and H. M. , Phys. Rev. 126, 1 St 207 3 

17 P, M. Stone and J. R. Reitz, Phys. Rev. 131, 2101 (1963). $ 
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TABLE I. Triplet-singlet phase-shift differences A; for the S, 
P, and D partial-wave scattering of electrons from cesium.^ 
The functions fo=(0.78)[1—exp(—36k)] and f;=—(0.66k) 
X[1—exp(— 30042) ] are chosen to approximate Ap and A; in the 
energy range of interest. 


k? Ao ^ As fo fi 
0.001 0.55 —0.05 0.00 0.53 — 0.05 
0.003 0.65 —0.21 0.01 0.67 —0.21 
0.005 0.70 —0.37 0.01 0.72 — 0.36 
0.007 0.74 —0.49 0.00 0.74 —0.48 
0.010 0.77 —0.64 0.03 0.76 — 0.63 
0.013 0.79 —0.75 0.04 0.77 . —0.74 
0.020 0.83 —0.93 0.15 0.77 —0.93 

* See Ref. 17. 


polarizability of cesium.!5 Instead, we represent Ao and 
A; by functions fo— (0.78)[1—exp(—364)] and fi 
= — (0.66)k[1—exp(—300k?)] which give adequate 
agreement with Ao and A; over the range of interest 
and which have leading terms proportional to k and k’, 
respectively, near zero energy. These leading terms 
agree with the modified effective range theory. If one 
interprets the coefficient of the leading term in fo as a 
rough approximation to the triplet-singlet scattering- 
length difference, this gives A(1)—A(0)=28. Using fo 
and fi, we find for cesium */9—0.94, *79—0.61, I 
— —0.03, and all other contributions are 0.01 or less. 
Thus, we obtain Rc,(290°K)—1.4. Since the contri- 
butions from P and higher partial waves are small for 
cesium as well as hydrogen, the S-wave approximation 
is presumably good for the other alkalis also. 

A possible difficulty with the interpretation given 
above appears when one looks at the calculated triplet 
S-wave phase shifts rather than just the phase-shift 
differences. The triplet phase shifts decrease fairly 
slowly, but the first one is 0.37 rad above 37. Since the 
zero-energy limit must be a multiple of r, this raises 
the possibility that the zero-energy value consistent 
with the finite energy values is 4r. If so, all the values 
of Ao in Table I should be increased by m. This would 
change only the way in which one extrapolates from 
the lowest calculated point to zero energy and would 
probably not change the value of R by more than 2097. 

Actually, a scattering length difference large enough 
to give such a steep initial slope for Ao as a function of 
k seems questionable. Some support for this view comes 
from the fact that the P-wave triplet phase shifts are 
also slowly decreasing and the lowest energy one iS 
0.05 rad above 2m. Sirice a zero-energy limit of anything 
but 2m seems inconceivable, it appears that the low- 
energy values for the S-wave phase shifts may also be 
somewhat offset by the computational procedure used 
by Stone and Reitz. This makes the uncertainty in Ao 
appear to be of the order of 0.4 rad or larger and raises 
the possibility that Ao could approach or go through 
(1/2) at thermal energies and give a very small value 
for Res. 


18G. E. Chamberlain and J. C. Zorn, Phys. Rev. 129, 677 
(1963). 
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In the case of sodium the main pieces of information 
available are the triplet and singlet S-wave scattering 
lengths A(1)=9 and A(0)=—12 calculated by Salmona 
and Seaton. If the scattering length approximation 
under which 8e(1)— &A(1), ôo(0)=kA(0) were valid, 
we would have Ao—21&k. Including only the S-wave 
contribution, this gives *79— 0.68, “To= 1.28, Rx, —0.53, 
at410?K while at 290°K *79—0.80, “To= 1.10, Rx4— 0.73. 
In the same approximation, defining 


Qu (90: ())./ ©) , 


Qex = ray (Vr 0/4km?) . (32) 


At 410°K this gives Q.x=220ra,?. Dehmelt’s experi- 
mental result, adjusted to correspond to ? instead of 
Urma, IS Qex 2 280mag.. Salmona and Seaton's zero energy 
cross section is Q«.— 440a". 

We will next calculate the modified effective range 
theory expansion for sodium. From Eq. (4.9) of 
O'Malley, Rosenberg, and Spruch? we find 


(31) 
we find, 


Ao— [A(1)—A(0) Jk+-ARUnk+ B--Ck*-----. (33) 
Here, 
A —$(e/as)LA(1) —A(0)], (34) 
B= (A-In[1.23(a/ad)!*] 
$n (a/a) PLA (0) — A? (1) ]—-3[A3(1) — A3(0) ] 
+3LA?(0)rpos—A?(1)7por]/@o} , (35) 
C= $m (o/a) (1) — (0) ], (36) 


where 7por and 7pos are the triplet and singlet modified 
effective ranges and æ is the polarizability. Using? 
a=21X10 cm, we calculate 4 —3970, C= —18 700, 
and B= 10 600-1-72(rpss/ao) — 40.5 (r5o7/a0). Transform- 
ing to the variable «= (k/km) 


Ao(4109K) = 1.074— 0.162? 
+[0.0095 (r,,s/ao) — 0.0054 (r5, 7/a0) J2 


-F0.52221nx—0.127a5-----. (37) 


We do not know the modified effective ranges, but at 
least one of them must be very large in order to make 
the coefficient of a? approach one. Since the other 
coefficients of à? and higher terms are fairly small, it 
appears that the modified range theory expansion may 
be useful for sodium at 410?K. The exchange cross 
section and the ratio of electron frequency shift to line 
broadening calculated in the scattering length approxi- 
mation thus may be fairly accurate. 

Another procedure is to approximate the sodium 
phase-shift differences by the same function of (k/kex) 
as was used to fit the cesium results of Stone and Reitz. 
Here kex is the excitation energy for the atom in 
question. The reason for trying this is that the values 
of Ao for cesium derived from Stone and Reitz are 
rather similar to those for hydrogen if both are plotted 
as a function of (k/kex). At 410°K we obtain *7o7-0.94, 
*197-0.60, Rx4771.6, and Q..—100rad. This procedure 
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gives poor agreement with Dehmelt’s quoted lower 
limit for the experimental exchange cross section. While 
Dehmelt's number is not actually a lower limit, since 
no uncertainties are included, it would indicate a definite 
error in the experiment if the cross section were as low 
as 100zag. Thus, the somewhat similar behavior of the 
presently available phase-shift differences for cesium 
and hydrogen does not appear to be characteristic of 
sodium. 

One other possible source of information concerning 
the phase-shift difference for sodium comes from a rough 
estimate of the sodium polarization in Hobart's experi- 
ment based on a trace he gives? of the sodium Zeeman 
resonances at 3.68 G. This trace (Fig. 3.10 of Ref. 6) 
indicates a ratio of roughly 2.8:1 for the strongest and 
weakest F=2 Zeeman components. Assuming that the 
sublevel populations are independent of F and that 
sublevels differing in M r by 1 have a constant ratio of 
populations leads to an approximate sodium-electron 
polarization P4-— —0.17. These assumptions correspond 
to spin-exchange equilibrium between the sodium atoms. 
Actually, the small amplitude of the F—1 resonances 
indicates that the magnitude of the polarization was 
probably higher. However, since we do not know 
precisely what correction to make, we will use the 
above number. Combining this with the value Rya 0.53 
calculated on the scattering length approximation using 
the scattering lengths of Salmona and Seaton gives 
a ratio of shift to half-width of 0.09. While the maxi- 
mum ratio of shift to half-width observed was only 
0.05, it is not clear that the trace given by Hobart 
applies to the same experimental conditions. Thus, 
little can be said about the agreement of theory and 
experiment for sodium. However, it would not be 
surprising if the sodium phase-shift difference curve 
actually rose somewhat more rapidly than the scattering 
length approximation curve which we have used. 


4. DISCUSSION 


We are now ready to discuss the results of Balling, 
Hanson, and Pipkin? Their experimental value of 
R=0.05+0.01 for rubidium at about 290°K is more 
than an order of magnitude less than the corresponding 
value for cesium calculated from the phase shifts of 
Stone and Reitz." If the phase-shift difference curve 
for rubidium had the same shape as that calculated for 
cesium, it would have to have more than twice as high 
a maximum in order to agree with experiment. How- 
ever, it seems uncertain whether a calculation for 
rubidium done by a method similar to that of Stone 
and Reitz for cesium would give this much different a 
result. Also, it appears doubtful whether the rather 
high value for the cesium polarizability used by Stone 
and Reitz can account for the posit pem a 

iv 7 ing, Hanson, and Pipk 
Teucer RE d and that therefore 


that the Rb- ion is just barely boun f 
&«(0) should be about 7/2 appears to be open to question. 
This argument, wh 
Na, is based on 


ich was also used by Dehmelt! for 
the result for a deep square-well 
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potential! and is only valid if the binding energy is 
small compared with the incident electron energy. 
The modified effective range theory expansion’: shows 
that the square well is not a good approximation at 
extremely low energies. Also, the binding energies of 
alkali negative ions are believed to be much higher than 
thermal energies.” In particular, a calculation by 
Szasz”! gives binding energies of 0.43, 0.53, 0.49, 0.43, 
and 0.42 eV for Li, Na, K, Rb, and Cs. These results 
are not expected to be very accurate, as indicated by 
the value for Li of 0.62 eV obtained by Weiss in a much 
more thorough calculation.” However, the relative 
values may be good and fairly strong binding for Rb- 
seems likely. 

In view of the possible discrepancy mentioned above, 
the assumptions made in the frequency-shift calculation 
were reviewed. The least certain ones appeared to be 
that the alkali matrix in the F, M p representation can 
be taken as diagonal and that direct interactions 
between the free electrons can be neglected. 'The first 
assumption is discussed briefly at the end of the 
Appendix and appears to hold for the rubidium experi- 
ments. The second assumption is more difficult to 
investigate because of the very long range of the 
electron-electron interaction. However, if such inter- 
actions were important one might expect that normal 
variations of the experimental conditions would have 
given erratic results. 

Assuming that the experimental results and their 
interpretation in terms of the scattering phase shifts 
are correct, we are left with two possible hypotheses. 
One is that the rubidium S-wave phase-shift difference 
Ao increases rapidly at very low energies and fortui- 
tously passes through or approaches (7/2) at the proper 
energy to make R small. This seems somewhat unlikely 
but cannot be ruled out. The other possibility is that 
Ao for the higher alkalis rises rapidly initially but for 
some reason flattens out near (7/2) at roughly thermal 
energies. This would require that the low-energy 
phase-shift differences for cesium taken from Stone and 
Reitz be off by approximately 1 rad. 


5. CONCLUSIONS 


In view of the large difference between the experi- 
mental ratio of shift to broadening for rubidium and 
the ratio calculated for cesium using the phase shifts 
of Stone and Reitz, the present situation appears to be 
somewhat unsatisfactory. The quickest approach to 
seeing if there is a real discrepancy between experiment 
and theory would be an experimental measurement 
on cesium.” However, in view of the difficulty of calcu- 


9 N. F. Mott and H. S. W. Massey, The Theory of Atomic 
Collisions (Clarendon Press, Oxford, 1949), 2nd ed. 

2 L, M. Branscomb, in Atomic and Molecular Processes, edited 
by D. R. Bates (Academic Press Inc., New York, 1962). 

21 L, Szasz, Acta Phys. Acad. Sci. Hung. 6, 307 (1956). 

2 Note added in proof. Recent results for the ratio of frequency 
shift to line broadening in cesium ap to be similar to those 
for rubidium: L. C. Balling and F. M. Pipkin, Bull. Am. Phys. 
Soc. 9, 91 (1964). There is thus a definite discrepancy with the 
cesium phase-shift calculation of Stone and Reitz. 
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lations on cesium, it also appears to be desirable to 
repeat the experiments for sodium or for hydrogen. 
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APPENDIX: DERIVATION OF ELECTRON SPIN 
RESONANCE FREQUENCY SHIFT FOR 
ARBITRARY NUCLEAR SPIN 


We will treat the case of electrons with an arbitrary 
2X2 spin-density matrix p colliding with alkali atoms 
having an electron spin of 3 and a nuclear spin J. p is 
taken to be in the M, representation, where M, is the 
free-electron spin component along the axis and the two 
states are ordered according to decreasing Me. The 
alkali spin density matrix zr, y, in the F, Mr represen- 
tation is assumed to be diagonal. The alkali sublevels 
are taken to be listed in order of decreasing M p and 
subordered according to decreasing F; i.e., the F, Mr 
order of listing is [(7-+3, 23-3), (+4, 7-3), U-3, 
1—3), (4-3, I—3), ---]. When ur is transformed to 
the Mr, My representation in which the states are 
ordered according to decreasing Mr and subordered 
according to decreasing M z, the resulting matrix # will 
have off-diagonal elements connecting only the second 
and third, the fourth and fifth, etc., states. This is 
because there can be no off-diagonal elements connecting 
states with different values of (M ;--M ). The diagonal 
elements can be labeled (A7,Br, A 1-1,Br-1,---), where A 
corresponds to M ; —3, B to M ; — —$, and the subscript 
to Mr. Note that the expectation value of the electron 
polarization for the atom is given by 


I 
Pa= X (Am Bm). (38) 
MI 


We now form the density matrix o for the electron- 
plus-atom system in the (M.M z,M r) representation by 
taking the direct product of p and of # in the (M 7, M r) 
representation. The states of c are ordered according 
to decreasing Mr, first subordered according to de- 
creasing My, and, second subordered according to 
decreasing Me. o can be thought of as made up of 
(212-1? 4X4 submatrices. The 4X4 submatrices ey, 
along the diagonal of @ which correspond to different 
Mr are of the form given in Eq. (11) except that A and 
B are replaced by A m; and By. 

The transformation of ø to the representation with 
the electron spins coupled and the states ordered by 
decreasing Mr, first subordered by decreasing values 
of the total electron spin S, and second subordered by 
decreasing values of M; is given by the following 


BENDER 


transformation: 
cg —D'roT r. (39) 
T; is made up of 27--1 identical 4X4 submatrices T 
along its diagonal, with T given by Eq. (13). We find 
then that the 27-++1 4X 4 submatrices along the diagonal 
of o° are given by 
OM, Em Toul” ` (40) 


We can now make use of the results for the case of 
zero nuclear spin. When we find (de/dt)coi. we will 
contract it to obtain (dp/dt)co1. as was done in Sec. 2, 
The only parts of (do/dt)co1. which we will need are 
the 4X4 submatrices (doar,/dt)co1.. However, since 


(dom ,/dt)cor.=1" (do yr;/ di)coi.T (41) 


and since (d%osr,/dé)co. is obtained from ^e, in the 
same way that (d'*c/di)cao. was obtained from *e in 
Sec. 2, the contribution to (dp/dt)co1, from each cr, is 
identical to that given in Eq. (18) except with A and B 
replaced by Ayr, and Bu, Thus, (dp/dt)coi. is the. 
same as for the case of zero nuclear spin except that 
A Doi A My and Bo Day Bm,- In particular, 
w =4w and d' — idP,, which are identical to Eqs. (19) 
and (20) for the case of zero nuclear spin. 

In this calculation it was assumed that «r,w, was 
diagonal. However, w#r,ar, does have two off-diagonal 
elements which have approximately the same time 
dependence as the off-diagonal elements of p. While the 
rf field at the electron spin resonance frequency cannot 
directly produce finite values for these elements 
because the corresponding transition would require 
AM,=2I-+1, these elements can still be produced by 
the electron-alkali collisions. Thus, in order to be sure 
that off-diagonal elements of 4r, do not change the 
electron frequency shift and line broadening, we must 
show that these elements are much smaller than the 
off-diagonal elements of p. 

A direct calculation for arbitrary nuclear spin 
appears to be difficult. However, the calculation can 
be carried through for zero nuclear spin with the special 
condition that the rf field does not affect the alkali 
atom directly. In this case, it is found that the condi- 
tion for the off-diagonal elements of « to be much 
smaller than those of p is as follows: The ratio of alkali 
density to electron density must be large compared with 
the ratio of the width of the electron spin resonance 
due to alkali collisions compared with the reciprocal 
of the decay time for the alkali off-diagonal elements. 
If the decay times for the off-diagonal elements of 
both p and u were determined by electron-alkali 
exchange collisions, this condition would not hold. 
However, as long as the decay time for the alkali 
off-diagonal elements is determined mainly by some 
other cause such as light absorption rather than by 
electron-alkali exchange collisions, then the assumption 
that u is diagonal should be quite good. This condition 
was apparently satisfied in the experiments of Balling, 
Hanson, and Pipkin, and of Hobart and Franken. 
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Under the expectation that experiments will soon 


give values for the fine structure intervals of the 2P 


state of helium to an accuracy of 1/105, we have undertaken a program of calculations which, it is hoped 
will lead to a new determination of the fine-structure constant a. This paper gives a brief survey of the 
over-all program, and a detailed report of the successful completion of the! first task: The construction of 
approximate solutions to the Schrédinger equation which lead to average values of the leading fine structure 


operators accurate to about one part per million. 


~I. INTRODUCTION 


HE early studies of the fine structure of the 

helium atom by Breit! provided very important 
confirmation of the newly developing theory of the 
quantum interactions of electrons and the electro- 
magnetic field. Over the intervening years there have 
been several efforts to increase the accuracy of this 
analysis, chiefly by the construction of successively 
more complicated approximations to the nonrelativistic 
2-electron wave function. Breit,! with a 2-term trial 
function got values of the fine-structure intervals » 
(see Fig. 1) accurate to several percent. Araki? and 
co-workers with 8 terms came within about 1%, and 
also considered two small corrections to the first-order 
theory. Traub and Foley? with an 18-parameter 
function reduced the error to about one part in 10°; 
and most recently Pekeris, Schiff, and Lifson,* with up 
to 220 terms in the trial function, came within one part 
in 10! of the final results we have attained. (These 
numbers refer to the larger interval vo. In the calcula- 
tion of the small interval vi? there is a cancellation 
between separate terms amounting to a full order of 
magnitude, and a corresponding loss of accuracy 
results.) 

While these studies have been chiefly aimed at check- 
ing the theory, we here take another position: Assuming 
the current theory of quantum electrodynamics, one 
can combine the calculated and measured fine-structure 
intervals of helium in order to determine the value of 
the fine-structure constant a. At present, the best value 
of a comes from the work of Dayhoff, Triebwasser, and 
Lamb? on the fine structure (2P1j2—2P3/2) of hydrogen. 


* Thi k was supported in part by the U. S. Atomic Energy 
ET the U. S. Air Force under Grant AF-AFOSR 
130-63, and in part by the Advanced Research Projects Admin- 
istration through the U. S. Office of Naval Research. 

+ A preliminary report of this work was presented at the 1963 
Annual Meeting of the American Physical Society, Bull. Am. 
Phys. Soc. 8, 20 (1963). 

?G Dus É e d 36, 383 (1930). References to other early 

ie be found in this reference. * à 
BI DR and K. Mano, p Rev. 116, 651 (1959), 
: earli rs referred to therein. A 
a i Foley, Phys. Rev. 116, 914 (1959). : 
4 C. L. Pekeris, B. Schiff, and H. Lifson, Phys. Rev. 126, 105 


sE S. Dayhoff, S. Triebwasser, and W. E. Lamb, Phys. Rev. 
89, 106 (1953). 
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That value is uncertain to about one part in 10*, due 
essentially to the short lifetime of the 25 state, and it 
has not appeared feasible to improve on those measure- 
ments. However, the lifetime of the?P state of helium is 
about 2 orders of magnitude longer, and it is expected^ 
that these fine structure intervals can be measured to 
an accuracy of one part in 105 or perhaps better." It 
then becomes necessary to calculate theoretical values 
for the helium fine structure to 1/10^, and this is the 
task we here commence. It may be remarked that in 
the case of the hydrogen fine structure the theoretical 
formula? is very simple; this is not the case for helium. 
However, we believe that those circumstances which 
made the hydrogen problem simple, will have the effect 
of reducing the helium calculation from “impossibly 
difficult” to merely “difficult.” 

We should add that there is at present a specific 
need for a better value of a. Analysis? of the hyperfine 
structure of hydrogen reveals effects due to the electro- 


Yo, = 0.988 cm—! = 29 619 Me 
Fic. 1. Energy levels ! 
of the 15253P state of 
helium. 


1 Vz = 0.076 cm™! = 2291 Mc 


* V, W. Hughes (private communication). 

7 For the best measurements to date (accurate to about 0,5/109), 
see J. Lifsitz and R. H. Sands, Bull. Am. Phys. Soc. 6, 424 (1961), 
and earlier references given therein. 

8 See, for example, H. A. Bethe and E. E. Salpeter, Quantum 
Mechanics of One- and Two-Electron Atoms (Academic Press Inc., 
New York, 1957), p. 105. 

?C. K. Iddings and P. M. Platzman, Phys. Rev. 115, 91 
(1959); also see D. E. Zwanziger, Bull. Am. Phys. Soc. 6, 514 
1961). 
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magnetic structure of the proton, but a clear interpreta- 
tion of those results is spoiled by the present uncertainty 
in the value of a. 


II. PLAN OF THE CALCULATIONS 


We will now outline the several parts of the task of 
calculating the fine structure intervals in helium to an 
accuracy of about one part in 10*. The zeroth-order 
problem is the nonrelativistic two-electron Hamiltonian 
in a fixed Coulomb field: 


pr pe Ze Ze? e 
Hy=—+—-—-—+—. (1) 


2m 2m Tri T2 T» 


The next terms in a nonrelativistic expansion are the 
well-known” fine-structure terms which we shall denote 
as o2H . [Our reference for energy is /ao— o?mc?, and 
we shall not note factors of Z (—2) as being pertinent 
to the classification of the expansion terms.] We thus 
expect an energy level formula something like 


1 
Es—Eo= e(Ha)sta| (r+ {He 15) 
(=I 


+ (terms of order o? and smaller 
which we shallignore), (2) 


where H4 is some higher order operator which has not 
yet been worked out. 

There are still many terms not represented in this 
formula (2). The most apparent is the correction for the 
anomalous magnetic moment of the electron, of order 
o3. This is quite simple and has already been included 
by some authors.?* The reduced mass must be put in 
properly, and there is also the operator pi:po/M 
correcting for the motion of the nucleus; this is itself 
spin-independent but will contribute in a second-order 
calculation mixed with H». There are then very many 
diagrams one could write down describing radiative 
corrections of many higher orders. 

The most extensive work on the fine structure and 
Lamb shift in hydrogen has produced terms up to, but 
not including, the order a’mc?, which is just to the 
same order that we now need. But the two-electron 
atom seems so much more complicated than the one- 
electron atom that our project might appear too difficult 
to attempt. A thorough relativistic theory of the two- 
electron atom has thus far been analyzed? only as far 
as the leading Lamb-shift terms, o5mc^X (Ina-+-const), 
and our goal is well beyond this. What encourages us 
to proceed is the expectation that most of the difficult 


10 See Ref. 8, p. 181. 

u A. J. Layzer, J. Math. Phys. 2, 308 (1961), has calculated 
terms of order mea’ In’a and mca” Ina. Only the latter contributes 
to the fine structure, and we hope it will not be difficult to include 
the term of this order in our problem. 

12 J. Sucher, Phys. Rev. 109, 1010 (1958). 
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terms at these higher orders are spin-independent, and 
thus, while affecting the absolute energy level, do not 
contribute to the fine structure. 

As an example, consider the effect on the fine structure 
of some deviation from pure Coulomb potentials 
occurring at some small distance R. The integrals which 
occur are of the general form of the familiar ((1/7) 
X(0V/8r))—-(1/r), and are taken over radial wave 
functions for 77» 0 orbitals. Thus the relative correction 
would appear to be of the order 


R a 
| Pars | r?drr =~ (R/a)! € (R/a)?. 
0 0 


Thus the effect of finite nuclear size would seem to be 
completely negligible, and the effect of vacuum polar- 
ization would appear to be of order a(a)*—o? on the 
fine structure intervals, again negligible. 

The entire project may be divided into three rather 
well-separated parts. First, is the task of constructing a 
sufficiently accurate eigenfunction of Ho so that the 
leading term in Eq. (2) can be determined to about 
1/105; this is mostly a computer problem. Second, is the 
task of carrying out a proper relativistic analysis and 
determining the operator H, and whatever else may 
appear to the required order; this will be a matter need- 
ing both formal technique and enlightened short- 
cutting. The third job will be the evaluation of these 
smaller corrections to the fine structure, and this will 
require a combination of modest computer effort and 
much algebraic detail. 

TThe rest of this paper is concerned with the first of 
these tasks. 


Ill. THE SCHRODINGER WAVE FUNCTION: 
FIRST ATTEMPT 


We wish to construct a good approximation to the 
lowest *P eigenfunction of the equation 


Hop= Eq. (3) 


This will be done by setting up a sequence of trial 
functions y; using the variational principle 


&(V | Ho— Eo| V) - 0 (4) 


and seeing how results of interest converge as we make 
the trial functions larger and larger. 
We started with the functions 


1— Piz l+m+n Sw 
y= ( ) ps C isa Tiri? ro"ri)! 
Am V2 1mn=0 


den toldre, (5) 


where P,» exchanges coordinated 7; and 72, and the P- 
state character is represented by the vector sign. The 
unit of length is ao—72/m6'; and the scale parameters * 
and o were taken, by extrapolation from the results of 


| "BA 


| 
1 
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TABLE I. Result of variational calculations of 23? with the 
standard Hylleraas basis (5). 


Number Energy y 
w of terms — E (e/a) Chet Ry) D(ke Ry) 


10 2.132678402 


2 — 0.07065132 —0.05404532 
3 20 2.133085039 — 0.06829756 —0,05379667 
4 35 2.133140222 —0.06771029 —0.05394716 
5 56 2.133157595 — 0.06745614 — 0.05400336 
6 84 2.133162268 — 0.06733560 — 0.05402833 
7 120 2.133163594 —0.06728399 — 0.05403823 
8 165 2.1331639812 — 0.06726722 — 0.05404284 
9 220 2.1331641067 —0.06726320 —0.05404503 

10 286 2.1331641530 —0.06726453 — 0.05404627 

Final result 

from Table II 2.1331641908 —0.06727529 —0.05404839 


Traub and Foley; to be 4.62 and 0.29,? respectively, 
and were not varied. The variation of the linear 
parameters Cimn leaves us then with the numerical 
problem of finding the eigenvalue and eigenvector of a 
symmetric matrix of degree 


N (9) =§(@+1) (e-2) (+3). (6) 


Some of the details of the construction of the matrix 
elements and the numerical methods used will be found 
in the Appendix. We give in Table I the resulting 
eigenvalues for the series of trial functions of degree 
w=2, 3, ---, 10. This was done with 16-decimal 
arithmetic on an IBM-704 and consumed a total of 
some 20 h of machine time. 

This work very closely parallels that of Pekeris, 
Schiff, and Lifson,‘ differing only in the specification of 
the exponential parameters, x and ø. This makes a 
non-negligible difference: At 220 terms our eigenvalue 
is an order of magnitude better converged than theirs. 

Our chief interest anyway is not the energy eigen- 
value, but the eigenvector, or more properly the average 
values of the two spin-dependent parts of Hz. These 
are the well-known” spin-orbit terms 


eh N? /oi +02 (TX Pr Xp 
H(s0)=(—) ( lx + 
2mc 2 rj ro? 


3(11— 12) X (pi— =| (Ta) 


719° 


and the spin-spin term 


7i 2j 301° 11202* T1? 
H(s.s.)— (=) (cre: z = ) . (Tb) 


2mc/ ri»? Tur 


i Y 07? smaller 

13 The precise values of x and g used were about 1 : l 
than the PA given here, due to the decimal-to-binary conver- 
sion operation of the machine. MES 
14 For a discussion of relative convergenc “sf ] 
by C. Schwartz in Methods in Computational Physics (Academic 


Press Inc., New York, 1963), Vol. 2, p- 241. 
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[We have dropped here a term in H(s.0.) which has 
only off-diagonal matrix elements. | The first-order fine- 
structure levels are customarily given in terms of two 
constants C (from s.o.) and D (from s.s.) : 


(H3) o const— 2C — (10/3)D , 
(H3) 1-17 const—C-+ (5/3)D , (8) 
(H5) 7-27 const--C— (1/3)D. 


[These values will subsequently be given in units of 
(eh/2mcy'ag?— (a?/4)(£/as)- 3a? Ry]. These results 
are also shown in Table I. The accuracy of our C and D 
values here is hardly better than that of Pekeris e£ al.‘ 
(about 1/105) and far short of the desired 1/105. At the 
time when these results were obtained we attempted to 
extrapolate the apparent rate of convergence of the 
output numbers, and thus estimated that something 
approaching 2000 terms of the series (5) would be 
needed to obtain the required accuracy in C and D. It 
seemed that, at best, this would be an extremely 
expensive venture; and we thus decided that this 
attack had failed. 

Two possible paths were then considered. We might 
start with the wave functions already constructed, and, 
by use of auxiliary variational calculations of modest 
size, attempt to increase the accuracy of the C and D 
integrals to something approaching that which is 
obtained directly for the energy eigenvalue.’ However, 
it appeared that, due to the rather singular nature of 
the operators H», the auxiliary functions needed for 
this method would be of a complicated analytical 
nature; and so we did not make a serious attempt in 
this direction. 

Alternatively, we could seek a better set of basis 
functions for the calculation of y. The most likely cause 
for the slow convergence noted above appeared to be 
the weak logarithmic singularity in the two-electron 
wave function studied by Fock.!9!7 It appeared, how- 
ever, that putting into y the explicit logarithm term 
given by Fock would be a very messy job. Furthermore, 
it was felt that the reward would be rather slight if only 
the first term were accounted for; and the higher terms 
of Fock's expansion appear to be complicated beyond 
our ability to manage. We then sought, in a much more 
ad hoc manner to introduce just any covenient terms 
which would add flexibility to y; especially in the region 
(ry — 0 and r2— 0). We thus chose to try the addition 
of the simple factor (rı+72)"? to the series (5). The 

efficacy of this extension of the basis functions was first. 
tested on the relatively simple calculation of the 


15 The method referred to is that described by C. Schwartz, 
Ann. Phys. (N.Y.) 6, 170 (1959). 

15 V, A. Fock, Izv. Akad. Nauk SSSR Ser. Fiz. 18, 161 (1954). 

17 See also the discussion and analysis in Ref. 14, p. 256. 

18 Half powers were earlier introduced into the Hyller 
expansion by H. M. Schwartz, Phys. Rev. 120, 483 (1960). 
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TABLE IT. Results of variational calculations of 2 °P 
with the extended basis (9). 


CHARLES SCHWARTZ 


TABLE IIT. Extrapolation of the calculated interval v;; 
(in units of 3a? Ry). 


In units of ja? Ry 


Number 
t of terms = E(e*/ao) Cz Ce D 
1 7 2.132580318470  0.127974449  —0.198448086 — —0.053285232 
2 19 2.133100285189  0.136827714 — —0.203765520  —0.053888850 
3 39 2.133155369966  0.138223897  —0.205167872  —0.053987664 
4 69 2.133162942721  0.138619978 —0.205707289  —0.054026808 
5 111 2.133164012406  0.138640520  —0.205858266 —0.054041649 
6 167 2.133164164417  0,.138638567  —0.205898542  —0.0541046487 
7 239 2.133164186301  0.138635834  —0.205908639  —0.054047869 
329 2.133164189955  0.138636386  —0.205911292  —0.054048248 
9 439 2.133164190626  0.138636755  —0.205911984  —0,054048351 
Extrap- C =Cz+Ce— —0.067275287 — —0.054048390 


2.13316419080 
+5 


olation £19 £13 


helium ground state (14S), and those very successful 
results have already been presented.” 

The calculations reported in this section are not a 
total loss, since we can expect to use these medium- 
accuracy wave functions for the later calculation of the 
small correction terms schematically represented in 
Eq. (2). 


IV. THE WAVE FUNCTION: SECOND ATTEMPT 


The expanded basis now looks like the expression (5) 
with the replacement 


Cian Cina Dis (rid 72); (9) 


and the single term Dono is omitted. The scale parameter 
c was left fixed as before at 0.29," but after a short 
search we changed the value of x to 4.0 (exactly). The 
matrix elements of Ho and H> are more complicated 
with the half-power terms, but once we had learned 
to evaluate the integrals C and D even with the old 
basis, the new techniques were not very difficult (see 
Appendix). The results converged much more rapidly, 
than before, but it was found, as might be expected 
that as one approached more closely to the exact fitting 
of the function numerical accuracy became more and 
more critical at an alarming rate. We found it necessary 
to construct special programs to do all our arithmetic 
to an accuracy of 30, and then 52 decimals; the resulting 
cost in machine time was very great. The total labor 
took some 30 h on an IBM-7090. 

In Table II are presented our results with the half- 
power functions up to the ninth order. We show here 
separately the contributions from the two parts of (7a) : 


V. RESULTS 


From the numbers in Table II we now must give our 
best values for the fine-structure intervals along with an 
estimate of their probable uncertainty. This we do by 
attempting to extrapolate the results shown to w — co ; 
‘the smoothness of the sequence of computed numbers 


— 3 C, Schwartz, Phys. Rev. 128, 1146 (1962). 
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Ratios of 


Number Calculated Successive successive 
of terms 2D—2C differences differences 
7 0.0343768105 
—0.008279 
19 0.0260979112 0.022 
—0.00018529 
39 0.0259126200 — 0.800 
0.0001483866 
69 0.0260610066 1.962 
0.0002911883 
111 0.0263521949 0.257 
0.0000747812 
167 0.0264269761 0.306 
0.0000228957 
239 0.0264498718 0.151 
0.0000034446 
329 0.0264533164 0.127 
0.0000004379 


439 0.0264537543 


Projected increment — 0.000000064 
2:21 


Extrapolation = 0.026453818 
+21 


will be used to indicate the reliability we may place 
on this extrapolation. 

Table III shows the details of such an extrapolation 
for the small interval 


v=} Ry: (2D— 2C). (11) 


(It does make a helpful difference on the final error 
estimates that we extrapolate the combined integrals, 
rather than combine the extrapolated integrals.) The 
table shows values, differences, and ratios of successive 
differences; attempts to extrapolate must be based on 
the behavior of these ratios, especially at the bottom 
of the sequence. It is clear from Table III that these 
ratios cannot be very well described in any simple 
analytical terms; but it is to our advantage that they 
are very small.” After some playing with these numbers, 
attempting various analytical and graphical fits, it was 
decided that a reasonable procedure would be simply 
to extrapolate from the last step as if we had a geometric 
series, and assign as our uncertainty 4 of the added 
increment. That is, if A’ is the last difference recorded 
and R’ is the last ratio, our final answer is gotten by 
adding to the last computed value the quantity 


area Jaen. EEY) 


2 By way of contrast one may note that the corresponding 
ratios in the work of Pekeris ef al. (Ref. 4) are much smoother, 
but also quite a bit larger. One may also notice that while their 
value for vo, calculated with 220 terms is 0.011% larger than our 
final result, their extrapolated value is 0.009% too small. Their 
extrapolation here was thus in error by about 45%. (For viz their 
extrapolation erred by only about 16%) We point these numbers 
out intending not to chasten others, but to emphasize the need 
for a critical attitude toward any attempts at numerical extrapola- 
tion, and in particular our own. 


4 
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We thus arrive at the following results for the intervals 


of the ZP state of helium, bare of any higher order 
corrections whatsoever. 


7473? Ry[0.33751721(7)] 


R'—0.210 M 
v12= 4a? Ry[0.02645382(2)] ca 
R'=0.127 "v 
7» 3o? Ry[0.36397101(9)] E 

c) 


R'—0.72. 


The number in the inner parenthesis gives the un- 
certainty in the last figure quoted, arrived at according 
to (12). If our extrapolations are accepted, then we 
have achieved the desired goal of 1/10" accuracy. If it 
becomes necessary, we could, without too much expense, 
carry the numerical work one order further and thus 
increase the certainty of these numbers by about a 
factor of 5; but at present we feel this part of the 
program can rest, and wait for the experimental work 
and the analysis of higher-order corrections to catch up. 


VI. CONCLUSION 


The first major goal of the plan outlined in Sec. II 
has now apparently been achieved, and this encourages 
us to proceed with the other parts of the over-all 
program for the redetermination of o. There are two 
rather apparent cautions which we wish to make. First, 
regarding the extrapolation and error estimation, we 
have tried to be both clever and objective in our 
analysis, but each reader will have to find his own 
measure of skepticism regarding the precise accuracy 
of our results. Secondly, while we have endeavored to 
check and recheck all our algebra and computer 
programs, the considerable complexity of the work 
reported here must leave open the possibility of some 
undiscovered error of importance. To this point we 
report the following experience. 

Six months after we had completed the last of our 
computations, there was discovered an error in our 
computer program for multiple precision division. It 
turned out that the last ten decimals were sometimes 
treated incorrectly; but some conditions under which 
this error would not arise could also be precisely 
identified. A detailed scrutiny of all our programs then 
revealed the astonishing fact that this mistake did not 
have any effect at all on any of the matrix element 
computations. The error could work only in the process 
of solving for the eigenvectors of the matrices; but our 
method was, at the end, an iterative process, so that 
we suffered only a slower convergence (paid more 
dollars) as a result of the mistake, and all our final 
answers were unaffected. net T 

We hope that someone, working RU Sud 
will repeat and check all this work; this is the bes 
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insurance we can imagine for the reliability 
important results. E 

One can ask for a comparison of the presen 
experimental and theoretical values for the fine 
ture intervals. Adding to our results (13) the simpli 
correction for the anomalous magnetic moment of th 
electron? (the only correction of relative order a) and 
using the value of the Rydberg for helium, A 


ho? Ry - 2.921374 cme, 
we get 
54 0.987837 cmt, AS 
v1277 0.0765302 cm^7! , (45) e. 
Vi = 1.064367 cm. ES. 


These may be compared with the experimental values — iB 
991 0.98791 (4) cm~, NL. 
V127- 0.076423(3) cm, d 
Via77 1.06434 (3) cm. denm 


[Note added in proof. The numbers quoted here ha: 
been corrected, relative to those published (Ref. 7 
according to private communication from Dr. R. 
Sands. See also F. M. J. Pichanick, R. D. Swift, 
V. W. Hughes, Bull. Am. Phys. Soc. 9, 90 (1965).] - 
The differences are about 0.0001 cm~, which is roughly 
what one would expect due to the neglected terms 
relative order a°. Araki? has considered one such higher- 
order term, the mixing with the nearby 2!P stat P 
Perkeris el al.‘ give for this effect a resulting downwa 
shift of 0.000158 cm to the J=1 level. The addition 
of this correction appears to do little more than change 
the signs of the discrepancies for the 0-1 and 1-2 ii 
tervals; and this indicated that all the higher ord 
terms of Eq. (2) will have to be treated togeth 
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APPENDIX: NUMERICAL METHODS 


The many matrix elements of the operators . 
Hz in the basis used were evaluated in terms 
tabulated integrals of the following general type: —— 


Ice SEMEN) edu pde —— 
Aa(tx)=(—*) f e firme 
2 Am J da 
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/, M, N are integers greater than zero; ¢(0<o<1)isa and [ 
ee parameter which is put=0. 29 for the direct 1-EgN MEN 
integrals and=1 for the exchange eee and fa car )-( ) f dre tier Ml 
is a homogeneous function of degree }A in the two 2c 0 
lengths rı and rə. x 
By use of the following identity referring to in- «| dse-*s" fa (s,r) 
tegration over the angle 012 (signified by the angular A E 
brackets) , 
1--c M-- N-F (CA/2) A 
=?) ryt? = ( ) rare) 
(sin?0isris) = (cost ) , (A3) 2c 2 T 
(L+2) rire 1 M—1 
fa (1x) 
one can easily establish the following recursion formulas: x : | a nm Joy prive oum : 
A(L,M,N)=A(L—2, M+2, N)+A(L—2, M, N F2) We shall again tabulate these two dimensional arrays, # 


—2B(L—2, M--1, N-+1), 


F and G, by recursion; but all the obvious formulas we 
have looked at (proceeding from small to large values 


B(L,M,N)=[(L—2)/(L+-2) ]LB (.—2, M+2, N) of M and N) are badly behaved regarding loss of 
+B(L—2, M, N+2) accuracy from subtractions. We thus use the backward 
—2A(L—2, M+1, N--1)]. (A4) iteration formulas 
ae : : OC) 1 

Our general method for building tables of integrals is — F(M,) vor ree RU) (M+1, N) 3-F(M, N--1)] 
by the use of such recursion formulas; but one must (M+N+3A) 1 
be careful that the particular recursive scheme chosen 2/ (12-o) (A7) 
does not build by taking differences of nearly equal G(M,N)= (G(M+1, N)+oG(M, N--1)]. 


numbers (most obvious recursion schemes do suffer 
from this serious drawback.) For the above integrals— 
for large values of L, M, N in particular—the main 
contribution comes from large values of 712; thus cos612 
is mostly negative, the integrals 4 and B have opposite 
signs, and the recursion schemes (A4) are safe. 

In order to start the recursion sequence we need to 
evaluate 4 and B for L—1 and 2. The angular integrals 


(M+N+4A) 


These recursions start from the values of F and G along 
the line M+N=constant (about 30 in our work). 
We have found the following devious, but numerically 
safe, procedure for these evaluations. After Eq. (A6) 
make the change of variables 


are simple and we can express the results in terms of two c= ine (A8) 
families of two-dimensional integrals. (The subscript A tu’ 

is understood in what follows.) implying 

B(2, M ,V)-0, fa(x1)— (14-u)- 4? f, (1— u, 1+1) D (A8) d 


A(2,M,N)=F(M+1, N+1)+G(V-+1, M+1), 
B(1,M,N)=3F (M4-2, N —1)2H1G(N -2, M—1), (A5) 
A(1,M,N)=F(M-+1, N)--G(N4-1, M), 


where 


F(M,N) - [(1--0)/20 ttv m | drer 


0 


xX i dse-*l«sN— fa (r,s) 


fa (1,2) = (12-w)- 9/2 fy (14-0, 1—1); 


then expand the resulting denominator of F in an 
infinite power series in 


[3(1—2) (1—1)], 


and the denominator of G in terms of {[(1—2)/(1+¢)] 
Xu). Also make the binomial expansion of the resulting 
numerator factor (14-u)^-! according to the separation 


(12-u) =2u-++ (1—1). 


The resulting expressions are 


Sm MA) F(M,N) | * N (N— 1) 273(M4-N—J— 1) ! 
1 2c ( 2 G(M,N) TER (N—J) aree ami | 
1 qM fa (x1) FOL--N) | 
dx — — ————, (A6 , (A9 ah 
xJ a/a tam eS) x [eae ( . 
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where 


F (KC) [3 (+o) ]* (4/2) x (=) 


imo \ 2 


(K-iA—1--E)! . 
— FEK), 
OE 


2 2 /l—oN* 
GJ (K)= 2 m ) 
k=0 c 


(A10) 
(K+3A—1+2)! 


ees — G7 (AY, 
kIU —1) (K—1—J)! 


and 


I 
PreK = | dufi(1—u, 14-12)u7- (1— u) I+ 
0 
; (A11) 
6/040 | dufa (1+u, 1—20)u7-**(1— 4) &-1-7 , 
0 


For most of the integrals of interest the function 
fa(riuro) is (rı+r:)*? (A=0, +1, or 2); and so in the 
final expressions (A11) the function fa is merely a con- 
stant and these integrals are trivial. However in treating 
H we also run into the spectral case of Eq. (A3) for 
L= —2, and this leaves us with 


dese) 


| (rid r2) 


Jaruri) =n (A12) 


rire 


in Eq. (A6); but this reduces to 24? Inv in Eq. (A11), 
and these final integrals are again easy. 

The infinite sums in Eq. (A10) are rapidly converging 
for our values of c; and the entire procedure worked 
very well. 

Matrices of dimension 440 in sextuple precision 
require over half a million words of machine storage 
space (an order of magnitude more than what is 
available in the core) and so we made extensive use of 
magnetic tapes. The most time-consuming operation 
was the matrix inversion carried out at the start of the 
eigenvector calculation. For the largest, dimension 
438, this took 10 h of 7090 time—equivalent to about 
one billion simple 8-figure multiplications on this 
machine. 

We will now describe the method used for finding the 
eigenvector. Given two symmetric matrices H and V 
(V>0) of order n, we separate the problem 


X (Hij—XN55)x;-0, i—1, 2, tta d (A13) 
j=1 


as follows: Set x; — 1, and solve for x: from 


25 UI—AN)xj- — (Ha—ANa), 72,3, +--+, n (A1) 


j=2 
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once À is known; but we can get a stationary estimation 
for ^, once an approximate vector x is known, from the 
Rayleigh quotient, 


h= (x,Ix)/ (x, Vx). (A15) 
In this work we always have a very good guess for À 
to start with, and the alternating iteration (A14), 
(A15) is very rapidly convergent. Most of the time 
goes into solving (A14); the process of solving 
simultaneous linear equations 


(A16) 


by the direct method of elimination takes 41 operations 
(for a symmetric matrix A). On the other hand, iterative 
attacks on the solution of (A16) require only of the 
order of n? operations per cycle, but many iteration 
schemes do not converge at all well. The general 
criterion is that one have some “good” approximation 
to the inverse of the matrix A; then 


rD z gy T. (b— Ax (9), (A17) 

where 4 approximates A~, can be expected to con- 
verge "rapidly." For our problem the matrix to be 
inverted (A14) changes only very slightly, as our value 
of À is improved, so that it is necessary to calculate 
only once an inverse matrix (by the direct method?*), 
and then we use this as the kernel of the iterations 
(A17), until both eigenvalue and eigenvector are 
converged. 

This process works well, but as one goes to larger and 
larger matrices, loss of numerical accuracy becomes a 
serious problem at a rate much beyond any that could 
be attributed to statistical phenomena. Obviously, the 
full matrix (/J—XV) in Eq. (A13) is singular—for the 
correct A—but one would think that the remnant 
matrix in Eq. (A14) is far from singular since the most 
important element of the basis has been removed. In 
actuality, however, the removal of a single tríal function 
does not prevent the convergence of the variational 
sequence, it may merely slow it. Thus, as we go to larger 
and larger bases, the matrix in Eq. (A14) does come 
closer and closer to being singular, and at a painfully 
fast rate. We handled this problem by the brute force 
method of using higher precision arithmetic, but the 
cost was quite great. (It may be that the problem would 
not have been quite as bad as it did appear to us, had 
we caught earlier the error mentioned in Sec. VI.) 


?! After completion of this work the following, possibly more 
efficient, method occurred to us. The new terms one is adding to 
the basis at each step in the sequence of variational calculations 
must be numerically of decreasing importance (if the entire 
process is converging at all well). Then it may be sufficient to get 
(4) by carrying out the direct calculation on only a small 
submatrix representing the dominant terms, and simply using the 
diagonal elements for the higher parts of the matrix. 
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For static potentials which are proportional asymptotically to qP»(cos0)/r*, the low-energy expansion 


of the scattering amplitude is found through terms of O(&), 


using a modification of the method developed by 


Levy and Keller for central potentials. The resulting expansion to lowest order in £ is found to be f(0,6)~—A 
-+ (q/3)P» (cos0x) --O (k), where A is the scattering length and 6x is the coordinate of the momentum transfer 
vector. Applications are attempted first to electron-atom elastic scattering where results are somewhat more 
complicated than for the potentials above, secondly to transitions between magnetic quantum states of 


atoms caused by slow electrons. 


1. INTRODUCTION 


FFECTIVE range expansions developed originally 

for short-ranged potentials! have been extended 
in recent years first to spherical potentials with 7~* 
tails? and finally to potentials with arbitrary r^^ radial 
dependence asymptotically3 This has enabled the 
usefulness of the low-energy expansion to be extended 
to electron-atom collisions and to any scattering 
problem where long-range Van der Waals forces are 
operative. However there are physical problems, ex- 
emplified by the scattering of electrons from atomic 
oxygen or other nonsymmetric atoms, for which the 
asymptotic form of the interaction is not the 7~ polari- 
zation potential but rather the quadrupole interaction 
proportional to 7-3P»(cos6) ; in particular: 


(2m/12)V (r)~ QP»(cos6)/r —7-1[ p+ p’ P»(cos?) | 


-cO(r5). (1.1) 


Since such a potential is not spherically symmetric, 
the usual phase-shift analysis is not applicable for one 
has a coupling of different angular momentum states. 
Hence the expansions of Levy and Keller? for the phase 
shifts are not applicable. But while phase shifts for 
this problem are not well defined, the scattering ampli- 
tude, f(0,), is. In this paper we will look for an energy 
expansion for /(6,¢),> adapting to this purpose the 


* A preliminary report of this work was given at the Winter 
Meeting in the West, Bull. Am. Phys. Soc. 8, 608 (1963). 

t Present address: Defense Research Corporation, Santa Bar- 
bara, California. § 

1J. M. Blatt and J. D. Jackson, Phys. Rev. 76, 18 (1949) ; 
H. A. Bethe, ibid. 76, 38 (1949) ; G. F. Chew and M. L. Goldberger, 
ibid. 15, 1637 (1949). 

2T. F. O'Malley, L. Spruch, and L. Rosenberg, J. Math. Phys. 
2, 491 (1961). 

3B. R. Levy and J. B. Keller, J. Math. Phys. 4, 54 (1963). 

«T. F. O'Malley, L. Rosenberg, and L. Spruch, Phys. Rev. 
125, 1300 (1962) ; T. F. O'Malley, tbid. 130, 1020 (1963). 

5 An alternate approach might be to look for an expansion of 
the elements of the reactance matrix connecting states of different 

r momentum. If one is interested in the scattering amplitude 

only, this approach is more complicated. But as other information 
in addition to the scattering amplitude can be found this way, 
work along this line is being done. 


approach which was used by Levy and Keller? (LK) 
to expand tany for symmetric potentials. 

In Sec. 2, we shall briefly review the LK method, 
modifying it to a more heuristic form which we find 
easier to apply to the present purpose. In Sec. 3, the 
method is then applied to expanding the amplitude, 
{(0,), for potentials with the asymptotic form (1.1). 
The dipole potential is also briefly considered. Section 
4 is devoted to considering the relation of the results 
of the previous section to the electron-atom scattering 
problem. In particular, the derivation of the form (1.1) 
of the potential is indicated and the relation of g to the 
atom’s quadrupole moment is discussed. It is seen that 
that electron-atom problem is somewhat more com- 
plicated, due to the possibility of atomic transitions 
from one magnetic quantum state to another. The 
amplitude for these transitions is obtained to lowest 
order in the energy. 


2. MODIFIED LEVY-KELLER METHOD 


To introduce the method which will be applied in 
Sec. 3, consider the L=0 scattering of a particle by a 
long-range central potential, U(r), proportional to r^ 
asymptotically. Levy and Keller’s starting point in 
attacking this problem was the first-order nonlinear 
equation? for the phase shift 


d/dr tann(r)= — kU (r) 
X[sin(kr)+tann(r) cos(kr) (2-1) 


with boundary condition 7(0)=0. n(%) is the desired 
phase shift. They then represented tany as a series in 
odd powers of k 


tann(r)=k 2; (r). (2.2) 


Equation (2.2) is substituted into (2.1) and equating 
like powers of k gives the equations for each of the 


_ $ For a discussion of this equation, see Larry Spruch, in Lectures 
in Theoretical Physics, edited by W. E. Brittin, B. W. Downs, and 
J- Domas (Interscience Publishers, Inc., New York, 1962), Vol. 4, 
p. 1 
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(r). l(%) is the negative of the scattering length, A. 
Its equation is found to be > 


dA/dr- U(r)[r—A(r) f, (2.3) 


where we have changed the symbol /g to —A. The 
equations for l, etc., are similarly found. If the 
potential were of short range, the problem would be 
finished, with the /;(« ) furnishing the desired expansion 
of tang. However for long-range potentials, which are 
proportional asymptotically to r~”, it is found that at 
most the first few of the /; tend to a finite limit as 
f — ©. For example, if n=4, only to(%) exists. [It 
should be pointed out for later reference that all the 
quantities /;(r) are still well defined for any finite r, 
and the expansion, Eq. (2.2) is still meaningful. One 
is merely prevented from going to the limit. | 

Levy and Keller's procedure at this point was to 
consider the difference, 


m 
tang(r) —k X El;(r) - kBr,R), 
7=0 

where /, is the last one whose limit is finite. Upon 
substituting this into (2.1), an equation for the function 
B was derived. This equation was then solved by an 
iteration or perturbation approach. It was found that, 
to lowest order in k, 8 was determined by the asymptotic 
region r— ©. In this region, the usefulness of the 
iteration method arises from the fact that successive 
iterations produce successively higher powers of &, so 
that a series in powers of £ tends to fall out immediately. 

For the purposes of this paper, a somewhat heuristic 
version of this method will be employed. We shall use 
the perturbation approach from the very beginning, 
and work directly with tany, rather than introduce the 
difference function, 8(7). The result of this is a greater 
directness and brevity, and a more intuitive approach. 
This will be done at the expense of introducing certain 
anomalous terms into the results, which however we 
shall see to have a clear interpretation. 

To begin, let us rewrite Eq. (2.1) in integral form 


tann(r)=tann(R)—k* 
x | í U (r)[sin(&r)--tang(r) cos(&r) P- (2.4) 


R will be taken to be any very large but fixed distance 
(independent of k) and we vary k so that kR— 0. The 
region r< R is described by Eq. (2.2), as already pointed 
out. Now since R may be taken to be arbitrarily large, 
the integral may be considered to be a very small 
quantity for any allowable potential. (It is shown below 
that this imposes the mild restriction that r°U approach 
zero.) Thus the aptness of a perturbation approach to 
Eq. (2.4) for r>R is clear. y 

e is arbitrarily large, we substitute for U(r) 
its asymptotic expansion, 

U(r) 3; Ur. 
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We further introduce the new variable kr= x. Equation 
(2.4) then becomes 


tanz(r) = tang(R) — 5 £^? U , 


kr 
xj dxx-"[sin(x)--tang cos(x) f. (2.5) 
k 


ER 


If we temporarily ignore the k dependence of the lower 
limit of integration, the way in which the various terms 
in the asymptotic expansion of U contribute to the 
energy expansion of tanq follows by inspection from 
Eq. (2.5). In applying perturbation theory, one first 
takes tan(R) as the zeroth-order quantity and sub- 
stitutes this for tanņ on the right-hand side of Eq. 
(2.5). From Eq. (2.2), we see that this quantity is 
O(k). In first-order perturbation theory then, each r~” 
term in U contributes a &"-? term to tang plus terms 
of higher order. Now when these first-order terms are 
substituted back into the integral on the right-hand 
side, they produce in second order an additional factor 
k"; and so provided the lowest 7, mo, is greater than 
2, the leading term in second-order perturbation theory 
is O[ £2? ], and so on, just as found in LK. It is in 
this property that the power of the method lies. How- 
ever, because of the & dependence of the lower limit of 
integration, there will be additional lower order terms 
in the total result. 

We now proceed to study these terms and illustrate 
the method at the same time by solving for tann(~) 
in first order, for a potential with 75— 4, the case which 
LK treated in detail. We assume for convenience that 
there are no other terms in the asymptotic expansion, 
and look for terms of order less than #*. It will be seen 
that all the terms found by LK are reproduced very 
simply. In the zeroth order, we set for rz R 


tang? (7) —tang(R)- —A(R)k4-O(), — (2.6) 


where the second equality follows from Eq. (2.2) with 
the notation 4-— —/,. (This equation, as mentioned 
above, is valid up to any R which is fixed and finite.) 
Substituting this in the approriate places on the right- 
hand side of Eq. (2.5) gives for the first-order result 


tang (&) 2 —A(R)&— U k? 


x f z [sinx— A (R)£ cosx F 
kR 


Xx~dx+0(k°) 


Us UA(R) 
[4 ayes 
peur 


U,A*(R 
S eeu 


R? 


+4U,A(R)E Ink-+0(#). (2.7) 
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The integrals, which are elementary, are simplified if 
one notices that with the exception of the k? term all 
other terms come from the singularities at the lower 
limit of integration. [In all this analysis, since R is 
fixed, the quantity $R is considered an infinitesimal. ] 
If one compares the result, Eq. (2.7), with the corre- 
sponding result Eq. (35) of LK? which is likewise first 
order, one sees that they are essentially the same. The 
only difference is that their zo (the negative of the 
scattering length) is replaced by the bracket term in 
the present result. A moment’s reflection shows in what 
sense they are the same, namely the bracket represents 
the beginning of a perturbation expansion of A, starting 
with 4 9 — A(R) in Eq. (2.3). This was to be expected 
since we are using a perturbation method throughout. 
Tt can be verified explicitly if one looks for the solution 
of Eq. (2.3) in first-order perturbation theory with the 
present potential. This is given by 


AO (2) — A(R)4- of [r— A(R) Pdr 


R 


which when evaluated is exactly equal to the bracketed 
quantity in Eq. (2.7). These perturbation corrections 
to A(R) are easily recognized in a calculation like Eq. 
(2.7) first by their k dependence and secondly by their 
explicit dependence on R. What they amount to is a 
renormalization of our zeroth-order scattering length. 
[If we had kept higher powers of k, there would have 
been renormalizations of the effective range as well.] 
The other terms in Eq. (2.7), the k? and £? Ink terms, 
are quite independent of the choice of R. Our practice 
in applying the method hereafter will be to replace, 
in all final results, both terms like the bracket term in 
Eq. (2.7) and quantities like A(R) in the log term by 
the fully renormalized quantity A=A(%), to which 
they tend as either R— oo or the order of perturbation 
theory is increased. 

Since the method just described is to be used in the 
following section to find the expansion of the scattering 
amplitude, a brief review of the procedure may be in 
order. One first finds the exact first-order nonlinear 
equation for the quantity f of interest corresponding 
to Eq. (2.1) above and writes it in integral form as in 
Eq. (2.4). One then considers the region 7 less than some 
large but fixed R, and makes the expansion corresponding 
to Eq. (2.2) for f(R). Since this is the same whether 
the potential is snort or long ranged, the form of the 
expansion will usually be known already. However, it 
is very simply derived by substituting an assumed 
expansion such as Eq. (2.2) into the exact equation 
and equating powers of k. Having done this, one is 
ready to study the contribution from 7>R by the 
perturbation or iteration method. R is assumed very 
large, the asumptotic expansion of U(r) is substituted 
into the equation, and the new variable x— kr is intro- 
duced. With f(R) taken as the zero-order quantity, 
the iteration is begun. One then notices that the leading 
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power of k introduced by each succeeding order of 
perturbation theory is of higher order in & than the 
last (apart from the renormalizations already dis. 
cussed), and uses this to decide at which order to stop, 
Finally, one replaces partially renormalized quantities 
like the bracket and A(R) in Eq. (2.7) by the fully 
renormalized quantity, such as A. ý 


3. EXPANSION OF THE SCATTERING 
AMPLITUDE, f(0,¢) 


In this section we are interested in the solution of 
the Schrédinger equation 


[v--P- U()y()-0 
with the asymptotic form 
v~exp(ik;-r)+ / (6,9) exp (ikr)/r , 


where the vector k; is in the direction of the incident 
plane wave. The noncentral potential U(r) may be 
represented by a Legendre expansion relative to some 
z axis 


U(r)= 2; Ux(r)Pi(cosó), 


L even 


(3.1) 


where the Uz are assumed to have the following asymp- 
totic dependence as r— œ% : 


Usc-g/n— p! [4-0 (77), 
Usc- — p/r^4-O (r9), (3.2) 


and higher Uz, if they exist, are O(r-5). The reason 
for the particular asymptotic dependence assumed for 
the potentials by Eq. (3.2) is that this is the asymptotic 
form which one finds for the electrostatic interaction of 
a charged particle with a neutral quantum mechanical 
system, for example an electron and an atom. In this 
case the asymptotic form of Us will be proportional to 
the electric quadrupole moment of the target system, 
and this will be the dominant long-range interaction if 
it exists. The 7-4 term in Up will be proportional to the 
target's electrical polarizability. The derivation of the 
known asymptotic forms Eq. (3.2) for an electron-atom 
system is indicated in Sec. 4. The restriction of the sum 
to even terms only is also dictated by the application. 
Physically this follows from the target system's having 
a well-defined parity." In addition to being noncentral, 
we may allow the potential U(r) to be nonlocal and 
energy dependent, provided that the nonlocality is of 
short range and the energy dependence can be repre- 
sented by an expansion in k? near k=0. (These points 
will not be explicitly discussed further.) 

In order to apply the method discussed in the pre- 
ceding section, the Schródinger equation must first be 


1 The only exception to this is the excited states of the H atom, 
where there is degeneracy between states of different angular 
momentum. 
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replaced by the appropriate first-order nonlinear 
equation for the quantity of interest, in this case the 
scattering amplitude, f(0,5). This equation was found 
by Spruch? to be 


fj (k; — ky) = f, (k; —> kj)— ai) f dr’ 


x [anger + fos k’)e*r’/r ]U (1^) 


XLE f, (—k,— ker], (3.3) 


where the argument r is now written as a subscript. 
The vector k; is again in the direction of the incident 
plane wave and ky, is a vector in the final direction, or 
the direction of observation. The vector k’ has as its 
angles the variables 0 and ¢ of integration. The lower 
limit of integration, ro is arbitrary. 

Before looking for the contribution to f from the 
asymptotic region of the potential, let us first, as in the 
last section, consider the expansion of f, for r less than 
some finite and fixed, though possibly large, R. It is 
easily verified that the appropriate expansion which 
takes the place of Eq. (2.2) of the last section is one in 
powers of ik, namely 


f= GE^f, 


n=0 


(3.4) 


for r<R fixed. Substituting this in Eq. (3.3) and 
equating like powers of ik one finds the equations for 
the individual functions, fn. In particular the equation 
for jo is 


jo = (40) [rir Lat flr [evo 63 


where for the present purpose we have set 79— 0. Since 
the angular integral over the potential simply gives 
4r U^», we see that Eq. (3.5) is identical with Eq. (2.3) 
for the scattering length (rather for —A), with the 
potential given by the spherically symmetric com- 
ponent Uo. Therefore we shall refer to the quantity 
— fo as the scattering length and use the notation 
—fo=A. An important property of A is that it is 
entirely independent of the L>0 (the nonspherically 
symmetric) terms in the potential. f 

The equation for the next function, fı, in Eq. (3.4) 
may similarly be found. It is more complicated but 
simplifies when we use the fact that the potential U (r) 
contains only even angular terms. We shall not write 
the equation for fı but it is exactly satisfied by the 


function 


fh fé. (3.6) 


If we temporarily imagine that the potentials are of 


* Reference 6, p. 209. 
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short range so that we may go to the limit r= ~, Eq. 
(3.6) gives the optical theorem in the zero-energy 
limit. Again we note that Eq. (3.6) explicitly assumes 
that the potential has no component proportional to 
P(cos6). 

We may now sum up the result of Eqs. (3.4) through 
(3.6) to write the expansion of the amplitude, fg for 
any finite though possibly very large R, as kR — 0. It is 


fa=—A(R)+ikA?(R)+0(K). (3.7) 


Equation (3.7) might have been simply written down 
immediately as the known contribution to the scattering 
amplitude due to a potential in the region r less than 
some finite R. However, it seems more consistent to 
rederive it in the context of the present method. 

We now proceed to our main purpose, namely to seek 
the contribution to the amplitude Í from the region 
r2 R, using Eq. (3.3) and writing ra= R. As in the last 
section, the arbitrary distance R will be considered to 
be very large so that the asymptotic expansion (3.2) 
of the potential may be used. Accordingly we substitute 
the potential from (3.1) and (3.2) into Eq. (3.3) and 
multiply out the two brackets. We again introduce new 
variables 


x-kr, 


y- Kr =(K/k)x, (3.8) 


where K is the magnitude of the momentum transfer 
vector, K— k;—k. It is related to k by K= 2k sin(0;,/2) 
(see Fig. 1). The equation becomes 


kr 
dx 


full —> ky) = f(s kg) (1/42) | 


kR 


x | dotar costs? e Cp p Palcos JE) 


X (eiv cos, 9x) $ (h) x) fy (ky — k)eiz 0 -208(47.9)) 
+ (k/x) fe (— kj — k'e‘ Utos, 


T (k) fe (ki k) fe (ki — kA}, (3.9) 
where the notation cos(61,92) is used here and through- 
out the paper to denote the cosine of the angle, 615, 
between the vectors kı and ks. Cos(6;,92) is given by the 
relation 


COS (01,02) — cos£; cos6»-l-sinf; sinfs co$($y—42). (3.10) 


There remains nothing now but to go ahead and 
evaluate the integrals in Eq. (3.9) by the iteration or 
perturbation method, beginning with 


f= fg —A(R)+ikA(R)+0(H), — (3.11) 


where the superscript henceforth denotes the order of 
perturbation theory. We proceed to sufficiently high 
order in perturbation theory to obtain all the powers 
of k which are desired. As implied in Eq. (3.11) only 
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terms through O(&) will be sought here. (These will be 
seen to provide more than sufficient difficulty in evalu- 
ating while at the same time furnishing a great deal of 
detail in the results.) 

By inspection of Eq. (3.9) it may be seen directly 
to what order in perturbation theory we must go in 
order to find all terms through O(k), by taking the 
heuristic point of view explained in the last section of 
neglecting temporarily the k dependence of the lower 
limit of integration. Of the four exponential terms in 
the last bracket, the first term is O(1). This will be 
called the Born term since it appears also in the Born 
approximation. Since it is not multiplied by any factor 
fas are the subsequent terms, it makes its contribution 
once and for all in first order. It contributes to f a q 
term of O(1) and p and p’ terms of O(k). Higher terms 
in the potential would give contributions of O(f) 
which is why they have not been written explicitly. 
Skipping ahead to the last exponential term in the 
bracket, it is seen that this already contains a factor k? 
which makes it higher order from the very beginning, 
and so it makes no contribution. Finally the two linear 
f terms in the bracket behave alike. They have a factor 
of & to start with. Since the lowest order terms in f (0) 
and f? are O(1), the p and ?' terms produced by these 
will always be O (£2). In first order, these linear f terms 
will produce g terms of O (k) proportional to f©. When 
we set these f's equal to f? to obtain the second-order 
result, we see that the Born term proportional to q will 
result in a g?k term in this order. If the iteration process 
is continued further, it is seen that only terms of O(?) 
are produced and so second order suffices. 

We now proceed to evaluate the first-order contri- 
bution, {® to the amplitude. First, the zero-order 
quantity, f, given by Eq. (3.11) is substituted for f 
on the right-hand side of Eq. (3.9). The angular 
integrations are easily performed using the relation 


[anrstcosne' 008(8,94) — Ami j 1(x)Pr(cosbx). (3.12) 


This reduces Eq. (3.9) to 
fr (ks k)—E—4 (R) +ikA?(R) ] 


kr 


kr 
dxx3js(y)H- bk | dax? jo(y) 


= P2(cosdx)q i " 
— p'EPs(cosüx) i dao? ja(y) 
—qkA (R)[P2(cos6)+ P (cos6;) | 

vale dxx-?ei*j,(x)H-O(E). (3.13) 


The lower limit of integration has been extended to 
zero whenever the error introduced into f thereby was 
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O(k?). The first three terms, resulting from the Born 
term in Eq. (3.9) are complicated by the presence of 
two different variables, x and y. It is easiest to eliminate 
the variable x in favor of y by use of Eq. (3.8). In the 
first integral, this merely changes the limits (kr — Kr). 
Tn the second and third it also replaces the initial factor 
k by K. 

Before proceeding to second order, let us look at the 
physical amplitude, f,(? in this order. Letting r— c, 
we have 


fo? (k; 5 kj)-7 —LA (R)— ?/ R H-3aP:(cosóx) 
—1rpK— (1/16) p’K P»(cosüx)-- ik (A? (R) 
—154 (R)gLPs(cos0;)d-.Ps(cos0;) ]] --O(&?). (3.14) 


The first term on the right is of course the scattering 
length, partially renormalized in first order. The second 
term represents the principle new result of the paper, 
a term generated by the quadrupole potential which is 
O(1) but with angular dependence. As in previous work, 
this leading term is given by the Born integral. The 
angular factor Ps»(cosóx) requires some explanation. 
The angle 6x is not the scattering angle. It is the angle 
made with the z axis by the momentum transfer vector 
K-k;—k; (the magnitude, K, of this vector appears 
in the following term). The vector K is represented in 
Fig. 1. The magnitude K is given (since |k;| = |k;| =) 
by the familiar relation 


J 2k sin (85/2) = kv2[1— cos (0:,0) ]?. 
The quantity cos#x can be found from the relation 
KE cosĝ;— cos0; 


cos(6x)=—= : 
K v2[1—cos(6:,6s) |!” 


(3.15) 


The denominator might in turn be eliminated in favor 
of the angles 6;, 0; and $;—49; by the relation 


cos (0,,0;) — cos6; cos8,-++ sind; sind, cos($;— $;)- 


So if the initial angle 0; is arbitrary relative to the 2 
axis from which the quadrupole potential is measured, 
the amplitude will depend on ¢ as well as 6. Now for 
the special case 6;=0, Eq. (3.15) reduces to 


cosfx=sin(@/2) (0;—0), 
where we have written 0;=0. P2(cos@x) is then given by 


P2(cos6x)= P2[sin (6/2) ]=4(1—3 cos#) 
@:=0). (3.16) 


The third term in Eq. (3.14) is the known term‘ 
giving the lowest order effect of the r~ potential. As 4 
consistency check, if one considers the case 0;— 0, the 
imaginary terms for 0,—0 may be deduced from the 
two leading (real) terms by means of the optical 
theorem? and they agree with those in Eq. (3.14). 


9L, I. Schiff, Quantum. Mechanics (McGraw-Hill Book Com- 
pany, Inc., New York, 1955), p. 105. 
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Now, to proceed to second order, we substitute f. 
from Eq. (3.13) with the required changes of angles 
back into the right-hand side of Eq. (3.9). The leading 
term in f, — f? is the first q term in (3.13) which is 
O(1). This term multiplied with the q term in the 
potential will given a gk term as mentioned above. 
All other terms are O (k?) and hence higher order (except 
for second-order corrections to the scattering length 
arising from the p terms). Since further iterations may 
be seen to produce only higher order terms, f® is as 
high in perturbation theory as we shall go, and so, 
letting r — ©, we find 


fo (k; kd f= - af | adx 
kR 


x IL (q/x?) P» (cos0) (k/x)qP»(cosb x) 


XL} ji (K'r')/ Kir" icti on 
tse —k)H0(P), (317) 


where, according to the requirements of Eq. (3.9), 0i 
is defined as the angle made with the z axis by the 
vector K’=k;—k’. An analysis like that leading to 
Eq. (3.15) for 0x gives 


cosé;— cosh 


V2[1— cos(0,0) ]? 


COSÜ&; = (3.18) 


while the magnitude K’ is equal to &V2[ 1— cos(0;,0) ]?. 
The corresponding quantities for (k; +> —k;) should be 
clear. The r’ should be eliminated by the relation 
K'r = (K'/E)x. At this point it does not seem that the 
integrations in Eq. (3.17) can be carried out analyti- 
cally, so we will simply write 


f, — fa = eI (0,6) +0 (E), (3.19) 


where 


I(8,9) =— (1/47) | dx *e'* 


x J dO Ps(cos0) P2(cos8x:) 


x[5— j(K7)/ Kr ]e* cos (87,6) 
(Eie —ky) 


x i : dx! | dQ'x-3 Ps(cos6") 
0 


xen cos(97,8K"") 1 (k; e —ky) D (3.20) 
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Fic. 1. Shows schematically the 
meaning of the angle 0x. 


N 
2c 
* 


I(0,9) has been written in the alternate form of a 
double integral from which it arose, since it might prove 
more amenable to solution in this form. At any event 
it should be possible to evaluate the integrals numeri- 
cally for various values of 0, and $; if it is ever needed. 
The angle 0x^ represents the vector K” which is the 
difference between the vectors k; and k', the latter 
having the angles 6’ and $' of integration. (0,0) should 
of course have both a real and an imaginary part. 

As mentioned earlier, an evaluation of f® leads only 
to terms of O(#?) aside from further renormalizations 
of the scattering length, 4. The final result then may 
be written, combining Eqs. (3.19) and (3.14), 


f(k: =) kj)- —A+ 4qP2(cosOx) 
—łrpK+ gk Rel (0,9)— (7/16) p'K P2(cosOx) 
+ik[ A?— 35qA (P2(cos6;)-+ P2(cosd,)) 


+e Im/(6,9)]+O(#), (3.21) 


where the renormalized quantity, A4=A(#) has been 
substituted throughout, and 9x may be found from 
Eq. (3.15), ff. Eq. (3.21), the final result of this section 
represents the expansion of the scattering amplitude 
through terms linear in & for the potential of Eqs. 
(3.1) and (3.2). An attempt will be made in Sec. 4 to 
apply this result to low-energy electron-atom scattering. 


Dipole Potential 


As a digression, let us attempt to apply the above 
method to a potential whose asymptotic form contains 
a leading dipole term 

U (r)—- D cos(0)/7r*: (3.22) 
(This kind of potential arises in connection with the 
scattering of electrons from the excited states of the 
H atom, because of the accidental degeneracy between 
states of different angular momentum.) We substitute 
this asymptotic form into Eq. (3.3), neglecting any 
higher order terms in U since we shall only look for 
the leading k term in (6). In addition, we again sub- 
stitute the new variables kr’=x and Kr -—y, and 
multiply through by a factor k. We find in place"of 
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Eq. (3.9) 
kfk: my, kj) E kfr— a/m) f 


kR 


kr 
xx 


x [ax cos(0)x-? [civ co (62x) 


pata (ks Keine mtn 
bath fa( ky Kee vota 


xk f (k; k)kf.(—k;— ke]. (3.23) 
Now by Eq. (3.11) fr is O(1). (As mentioned before 
the ik term would be different for a dipole potential, 
but this is a higher order term anyway.) So k fr is O(K). 
Let us neglect terms of O(k) as higher order and look 
only for terms of lower order than this. We then have 
kf — O(k). Using this on the right-hand side we find 
for f 


D kr 
kf (k; x-- (3 | dx 
4r. kR 


x| dQ cosĝe?” eos (6,0) --O (k) 


= —iD(k/K) cosék[1— jo(y)] 
+0(k). 


Thus, in first order, kj is O(1) as we could see by in- 
spection of (3.23). If we now proceed to substitute 
kf® on the right-hand side in order to find kf, we 
see immediately that the new terms are also O(1) and 
so to all higher orders of perturbation theory. So that 
the situation here is very different from that for an 7? 
or 7-4 potential, where the new terms in higher orders 
of perturbation theory represented successively higher 
powers of &. With the dipole potential, all orders of 
perturbation theory give the same power of k, namely 
f « k3. Thus for the dipole potential, Eq. (3.24) (for 
y=) gives only the first Born approximation to the 
k term in f(0), and so will only be of use if D happens 
to be very small. For moderate D higher order per- 
turbation terms would be needed, while for still higher 
D the expansion will very likely not converge at all. 
So we see that unfortunately the present method 
breaks down for a potential which falls off as slowly 
as 7-2. From Eq. (3.24) we may at least salvage the 
qualitative result that the amplitude diverges as kal 
in agreement with the classical result,” and also with 
that of Seaton!! who considered the electron-hydrogen 
problem for a fixed value of the total angular momentum 
L, and with Gailitis and Damburg? who carried this 


(3.24) 


10H. Goldstein, Classical Mechanics (Addison-Wesley Pub- 
lishing Company, Inc., Cambridge, Massachusetts, 1953), p. 91. 

11 M. J. Seaton, Proc. Phys. Soc. (London) 77, 174 (1961). 

12 M. Gailitis and R. Damburg, Proc. Phys. Soc. (London) 82, 
192 (1963). 
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result somewhat farther. If one wishes to go beyond 
the Born approximation or the contribution to the 
amplitude from single angular momentum states, what 
seems to be needed is an exact solution of the wave 
equation with the dipole potential (3.22). 


4. ELECTRON-ATOM SCATTERING 


The particular asymptotic form of the potential U(r) 
given by Eq. (1.1) was chosen because it corresponds to 
the long-range interaction between a neutral atom (or 
molecule) in a particular state and a charged particle 
(electron). 

The potential between an atom and an electron is 
taken (neglecting spin-dependent forces) to be the sum 
of all Coulomb forces between particles (electrons and 
nucleus). When the separation r is very large, the inter- 
action assumes the asymptotic form 


[ 


Z 
V(rr)e X. &/r**? X r,"Pi[cos(66))], (4.1) 


L=1 j=l 


where Z is the atomic number of the atom, r and 0 are 
the coordinates of the separated electron, and r; repre- 
sents the position of the jth atomic electron. The 
absence of an L=O term results from the assumed 
neutrality of the atom. From this, the limiting form 
Eq. (1.1) of the effective potential seen by the scattered 
electron has been derived by a number of authors.? 
The simplest way to derive an effective potential is 
to treat the asymptotic potential (4.1) asa perturbation 
of the atomic ground state and to find its expectation 
value in first- and second-order perturbation theory," 
treating 7 as a (large) parameter. If we assume the 
atom to be and to remain in a particular state of mag- 
netic quantum number M, the first-order result is 


e » Qri(M)Pz(cos0) 


(OS , 
2 L2 rih 


(4.2) 


where the Qz(M) are the permanent electric multipole 
moments of the atom in its ground state with quantum 
numbers J and M. Only those moments will exist for 
which L<2J, so that for example a spherically sym- 
metric atom will have no permanent moments. Further, 
since all atoms in their ground states have 2 well- 
defined parity, all the odd moments vanish so that 
the leading term in (4.2) is that for L=2, the quad- 
rupole term. This immediately gives us the first term 
of Eq. (1.1). The M dependence of the quadrupole 


13 See for example J. Holtzmark, Z. Physik 55, 437 (1929); E. 
Gerjuoy and S. Stein, Phys. Rev. 97, 1671 (1955). 

4 Reference 9, p. 152, ff. 

15 This is generally true of excited states as well with hydrogen 
being an important exception. 
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moment is given by!^ 
Q(M)z — — — Q (4.3) 


where the number Q is defined! as the expectation 
value of the quadrupole operator for the state M — J, 


Z 
Ome [ar > r£ P2(cosd;) | Pol? sms (4.4) 
j=l 


with the integral extending over all the coordinates of 
the atom (a sum over spins is implied). Equation (4.3) 
has as a consequence that the average of Q(M) over 
M vanishes 


Qav=0. (4.5) 


Unfortunately the physical situation is somewhat 
more complicated. Due to the degeneracy of states 
with the same magnetic quantum number M, transitions 
between these states may be caused by electrons with 
vanishingly small energy, so that the first-order po- 
tential should really be written as a matrix of dimension 
2J--1. What has been done in (4.2) is to neglect all 
but “elastic” collisions, i.e., those in which M remains 
constant. This neglect is not justified, and an attempt 
is made below to remedy it. 

Returning to Eq. (4.1) and evaluating it now in 
second-order perturbation theory gives 


yo —e/(2r)[a-4-o' (M)Pz(cos0)]H-O(r-*), (4.6) 


where a is the (average) electric polarizability of the 
atom and a’ (which exists only for nonsymmetric atoms) 
has the property that, like the quadrupole moment, its 
average over M vanishes. In evaluating this result, the 
variable r is treated as a constant parameter. This is 
known as the adiabatic approximation. However, it 
has been shown!® that the result gives the correct 
asymptotic behavior of the effective potential, in other 
words that the adiabatic approximation is correct 
asymptotically. This gives rise to the second term in 
Eq. (1.1). 

A more consistent and satisfying way of arriving at 
the long-range electron-atom potential is to start from 
the exact one-body optical potential equation” for an 


16 J, M. Blatt and V. F. Weisskopf, Theoretical Nuclear Physics 
(John Wiley & Sons, New York, 1952), p. 28. A 

17 We have used the definition of Blatt and Weisskopf. The 
molecular definition of the quadrupole moment used by Gerjuoy 
and Stein (Ref. 13) can be made to agree with this by the relation, 

= —(2J/2J+3)Qes. 
: ds a aie it, (Cz Pe and M. J. Seaton, Proc. Roy. 

: A54, 259 (1000. — 

ns a BUT Ann. Phys. (N. Y) 5; 357 (1958). See M. H, 
Mittleman and K. M. Watson, Phys. Rev. 113, 198 (1959); B. 3 
Lippmann, M. H. Mittleman, and K. M. Watson, ibid. 116, 
(1959). 
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electron scattered from an atom in its ground state 


[V-k — (2m/ 8) V op ]u(r) 0, (4.7) 
where 


V. (r) = Vot V(E— H) V+. (4.8) 


Vos is the expectation value of the full electron-atom 
potential averaged over the target ground state. It 
corresponds exactly with the first-order perturbation 
theory result whose asymptotic form is given in Eq. 
(4.2) and the same comments made there again apply. 
In the second term, the vector V has as its components 
the matrix elements Vs; connecting the target ground 
state with each of its excited states. Similarly the 
matrix H has components Hi; connecting any 2 excited 
states of the atom (but with no ground-state com- 
ponents). The Pauli principle may be considered to be 
implicit in (4.8) or else exchange*terms may be added 
explicitly. In either case it may easily be shown?’ that 
its effects fall off exponentially as r— » and so will 
not affect the asymptotic behavior of V... 

Now if one lets r— « in Eq. (4.8) using (4.1) for 
the potential one obtains in a straight forward manner 
Eq. (4.6) as the limiting form for the second term but 
with error this time” of O(r-*d/dr). And so the asymp- 
totic form of the optical potential (with the restriction 
mentioned below to “elastic,” i.e M conserving, 
collisions) is 


2m (M) P»(cos) 1 
pate Lata! OL) P (cosb)] 


D eto ti aor ^ 
+0(=) , (4.9) 
7 


where d9=72/mé is the electron Bohr radius. Com- 
paring Eq. (4.9) with Eq. (1.1) we see that in applying 
the formulas of Sec. 3 to this case we should set 


q— Q(M)/eao, ?/-a' (M)/as. (4.10) 


The foregoing, however is not really adequate to 
describe the scattering of electrons by atoms, since 
transitions of the atom from one magnetic state to 
another have not been taken into account. 


p= a/do D 


Allowance for Transitions Between Magnetic States 


If the target atom is initially ina magnetic state M 
relative to some z axis, and an electron beam is incident 
with momentum k; the asymptotic form of the wave 
function may be written 


J 
y~ya (reit = , roe OAW ehm, (4.11) 
M'e— 


where Ya (rj) are the ground-state wave functions for 


2 M, H. Mittleman, Ann. Phys. (N. Y.) 14, 94 (1961). 
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the atom in the magnetic state M". The amplitude 
{(0,) is now a matrix of dimension 2/ 4-1, connecting 
these states. Correspondingly the potential Voo of Eq. 
(4.8) is likewise a matrix, as mentioned earlier. If the 
matrix elements of the potential Voo connecting the 
states M and M’ are labeled Voo(M,M’), where V is 
still given by Eq. (4.1) the asymptotic form of the 
potential matrix elements is given by 


(omi I8) V (MM) r7 (4x / S) SY o, e (0,0) 


X (Qar,arr/eao)+O(r-*), (4.12) 


where 


20sra0=eCdn/5)* f aep 


Z 
XE rY u-u” (050). (4.13) 


j=l 


e 
The Qa, are related to the ordinary quadrupole 
moment, Q— Qz,; by the relation 


— us 
0-1) | 
xcV, 2; J; M’, M—M?Q, 


(4.14) 


where Q is the quadrupole moment defined by Eq. (4.4), 
and C is the Clebsch Gordon coefficient?! The Q(M) 
defined before are simply the diagonal elements, Qarar. 

The second term of the optical potential, Eq. (4.8), 
is still O (7), so that Eq. (4.12) represents the leading 
term in the optical potential matrix as r— co. If the 
amplitudes faa are now written as functions of r as 
was done in Sec. 3 for f, they satisfy coupled equations. 
These equations may be derived from the coupled 
optical potential equations for the coefficients of the 
various Ym, whose asymptotic forms appear in Eq. 
(4.11), exactly as was done for the amplitude f in Ref. 
8. The equations are 


far aei (li k)- - G4 | rear | as 
0 


XE X Vu; ki—> K)U me, wC’) 
M 


Tier 
h Xyu (r; —ky > k’), 
where 


yu (r; k; Kk") mac aee 
Ffu mne (k;— k’)e*r’/r’ y 


The vector k’ again has the angles €" of integration. 
The sums are over all magnetic substates, and the 
asymptotic form of the Ua", are given by Eq. 
(4.12) plus terms of O(r~‘). If we are satisfied with 
terms in the amplitude of O(1), we see that these were 


21 M. E. Rose, Elementary Theory of Angular Momentum (John 
Wiley & Sons, Inc., New York, 1957), p. 33 ff.; E. U. Condon and 
G. H. Shortley, The Theory of Atomic Spectra (The University 


— Press, Cambridge, 1951), p. 76 ff. 
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given in Sec. 3 by the Born term in Eq. (3.3), together 
with the 7 term in the potential, apart from con- 
tributions from the short-range part of the potential. 
The same will be true in the present case. The Born 
term for the fa,w, with the potential given by Eq. 
(4.12), is 


1 i) 
fu w (k > km fun | ^ir | ane 
4n Jn 


1/4041? 
xx) Y». u-u (0,9) 


PNS 
Qu, ar 
x +0(k) 
eao 
1 Quar dm us 
=— Ay, a- (=) 
3 eto 5 


XY -m (0x, pK)+O(k). (4.15) 


K is again the momentum transfer vector k;—k;, A has 
the same meaning as in Sec. 3, and the substitution of 
A for A(R) has again been made as in Secs. 2 and 3. 
The relation of 6x to the angles of k; and k; was given 
by Eq. (3.15). Similarly tanéx=K,/K, and this is 
equal to 

sin; sin$;— sin; sings 


tanóx — (4.16) 


sinĝ; cos$;— sins Coso s 


The delta function arises from the fact that the (scat- 
tering length) equation which determines the leading 
term of fr contains only the angular average of the 
potential and so does not connect states of different M. 

Equation (4.15) represents the final result of this 
section, the scattering amplitude to lowest order in k 
for collisions of very slow electrons with a nonsymmetric 
atom initially in magnetic state M. We shall now 
mention two obvious applications of this formula. The 
first is the scattering of slow electrons from unoriented 
atoms. For this case one should construct the cross 
sections and then average over initial states M and 
sum over final states M". Using Eq. (4.14) for Qm, 
we find 


1 1/0\2 
¢(6)av=—— 2, 2. Mame (5) 


2J-F1 M m’ 9Neao. 
J 4-1) 2J--3 
(24-1) QJ 4- LOO. (4.17) 
5J (2J —1) 


where J is the total angular momentum of the atom.” 


22 Tt is interesting to compare the above result with those of 
Gerjuoy and Stein, Ref. 13, although they were not concerne 
with elastic scattering. If one nevertheless sets Je=Ja in their 
results and k»=ka and uses the relation between the quadrupole 
moments mentioned in Ref. 17, one reproduces the second term 
of Eq. (4.17). (There is of course no A? term in these results since 
they did not consider elastic scattering.) 
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rl y 
We have used the sum rules for spherical harmonics (Ost a. 
and for Clebsch Gordan coefficients? together with Eq. 7f."a2= G/ is (A 
(4.5). Tt should be noted that the differential cross e: 
section given by Eq. (4.17) is spherically symmetric. — (1/15). i 
If the target atoms have been EL so that they les 
are initially in magnetic state M, then Eq. (4.15) may These total cross sections still depend on t 
T be used directly to determine the differential cross between the direction of the incident electr 
section for transitions to any other state M’ caused by z axis along which the atoms have been ori 
collisions with very slow electrons. In this case, changes Finally with regard to the energy range o 
^ of M by 0, 1, and 2 units are allowed. To find the total Eq. (4.15) and the results of Sec. 3, as in pre 
cross sections for these transitions, the integrals over necessary conditions can be found for the € 
$; and 6; may be done, after substituting Eq. (3.15) the iteration process begun in Sec. 3 and impli 
for 0x, giving work of the present section, the iterated quai 
Qua consisted of the terms gk and pk each multiplied I 
ou, i — 47A?— (2/3)0A P2(cos6;) factor of order unity. Necessary conditions then 
[d edo applicability of the preceding results is that 
Quai sionless quantities &Q/eas and K'a/as be small com 
+(1 /9n( ) [1— (3/8) (sin0;)*] to unity. The latter condition is that already foui 
eao the polarization potential. i 
) +0(k), 
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Lifetime of the First !P, State of Zinc, Calcium, and Strontium* 


ALLEN LURIO 
IBM Watson Laboratory, Columbia University, New York, New York 


AND 


R. L. peZarrat AND ROBERT J. GOSHEN 
Columbia Radialion Laboratory, Columbia University, New York, New York 
(Received 9 January 1964) 


The lifetime of the first excited 1P; state of zinc, calcium, and strontium has been measured by the zero- 
field level-crossing technique. The results are: zinc 7(1P;)=1.4140.04X10™ sec, calcium 7 (1P1) 24.48 
-E0.15X 10? sec, and strontium 7 (P1) 24.97 £0.15 X 10^? sec. The results are compared to earlier experi- 
mental results and also to theoretical predictions. It is found that our results are in excellent agreement with 
the Bates and Damgaard Coulomb approximation method for estimating the oscillator strengths. 


I. INTRODUCTION 


HE problem of determining atomic transition 
probabilities and excited-state lifetimes is an old 
one in physics, although it has received relatively little 
experimental attention in recent years. At the same 
time, advances in photoelectric recording and optical 
techniques make many early methods well worth 
reviewing and revising. Surveys of existing literature 
show a few advances in techniques since the early 1930's, 
while revealing upon occasion a rather considerable state 
of disagreement—not uncommonly by an order of 
magnitude or more—in many "absolute" measure- 
ments. Several excellent compilations of previous litera- 
ture, tables of numerical values, and review articles 
exist. The best reference for all of these as well as 
individual articles, up to 1962, is the National Bureau 
of Standards monograph by Glennon and Wiese.' 
Most absolute experimental determinations of oscil- 
lator strengths or transition probabilities of spectral 
lines have used the “hook” method developed by 
Rozhdestvenski and co-workers at Leningrad over the 
years from 1912 to the present? This method and the 
various line absorption methods of measuring absolute 
oscillator strengths all require a knowledge of the 
density of the scattering atoms. It is this requirement 
that presents a limitation to the precision of these 
methods. (For the measurement of relative oscillator 
strengths, however, the “hook” method seems to be 
excellent.) The zero-field level crossing or Hanle effect,‘ 


* Work supported in part by the U. S. Navy Office of Naval 
Research under Contract Nonr 266(45), in part by the National 
Aeronautics and Spacé Administration grant NsG-360, and in 
part by the Joint Services (U. S. Army, U. S. Navy Office of 
Naval Research, and U. S. Air Force Office of Scientific Research). 

1 Permanent address: State University of New York at Stony 
Brook, Stony Brook, Long Island, New York. 

1B. M. Glennon and W. L. Wiese, Natl. Bur. Std. (U. S), 
Monograph No. 50 (1962). 

2See various articles in Optical Transition Probabilities, a 
collection of translated Russian articles published by the National 
Science Foundation, Washington, D. C., 1962. 

3 A. C. G. Mitchell and M. W. Zemansky, Resonance Radiation 
and Excited Aloms (Cambridge University Press, New York, 1961). 

4 W. Hanle, Z. Physik 30, 93 (1924); Ergeb. Exakt. Naturw. 4, 
214 (1925). 


on the other hand, does not suffer from this requirement 
in the limit of low vapor densities, and should therefore 
be much more reliable. The level-crossing technique does 
require the production of strong unself-reversed reso- 
nance lines and consequently is limited to measurements 
of the oscillator strengths of the first few resonance lines 
of an element. In this respect the “hook” method would 
seem superior since it does not require resonance lines. 

In a previous paper? the measurement of the lifetime 
of the first excited 1P; state of Cd by the zero-field level- 
crossing technique is reported. In the present paper we 
report on the lifetime of the first excited 1P; state of Ca, 
Sr, and Zn. In these experiments the resonance radiation 
is scattered from an atomic beam so that the difficulty 
of finding a transparent cell which is not attacked by 
Ca or Sr vapor is avoided. Our results are compared 
to previous measurements and to theoretical calcula- 
tions for the oscillator strengths and are found to 
agree surprisingly well with the Bates and Damgaard 
approximation.* 


IIl. THEORY 


We first present a simple and somewhat restricted 
theory derived on strictly classical grounds, to give as 
directly as possible a physical picture of the Hanle or 
zero-field level-crossing effect. Later, a more generalized 
quantum-mechanical treatment is given leading to the 
same conclusions. 

More than one geometrical arrangement is possible, 
and polarized light need not be used, but the simplest 
arrangement conceptually is probably the following. 
Consider a vapor of the atoms to be investigated—either 
in the form of a free atomic beam, or in a closed evacu- 
ated container—with the scattering region at the 
coordinate origin (see Fig. 1). A beam of resonance 
radiation with the E field polarized in the Y direction is 
sent in along the X axis. Absorption of the resonant 
light by the vapor raises the atoms to the excited state 
whose lifetime is to be measured. Classically one may 
consider the resonance light to excite electric dipoles 


5 A. Lurio and R. Novick, Phys. Rev. 134, A608 (1964). 
6 D. R. Bates and A. Damgaard, Phil. Trans. 242, 101 (1949). 
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oscillating along the direction of the Y axis. These 
dipoles then re-radiate energy with the usual dipole 
radiation pattern, producing maximum intensity per- 
pendicular to, and zero intensity along, the Y axis. 
However, if a magnetic field is present, aligned with the 
Z axis, the dipoles will precess about this field at their 
Larmor frequency w until they re-radiate. w depends on 
the field strength H via the Larmor relation w=yH 
where y is the gyromagnetic ratio: in this case 


y=o/H= g yuo/Tt , (1) 


with gy=the electronic g factor and yo=the Bohr 
magneton. The angle of rotation through which a given 
dipole precesses before re-radiating is thus dependent on 
H. Multiplying the sin? dipole-radiation pattern by an 
exponential damping factor to account for the random 
finite excited-state lifetimes over a large continuously 
excited collection of atoms, and integrating over all 
times, yields the following results for the observed 
intensity of scattered resonant radiation along the Y 
axis (at 90? to the incident beam) as a function of the 
magnetic field strength H: 


^ G r? 
r-c| Cant sitae] 732) 
: 2L aye 


where T is the reciprocal of the mean life of the excited 
state and C is an arbitrary constant relating to the 
incident light intensity, the experimental geometry, 
and the number of scatterers. The result is just an 
inverted Lorentzian line shape as a function of H, with 
a full width at half-maximum equal to T. The half- 
maximum is seen to occur at a field value Zj/» for which 
Ay? H?,,5— I?, or using Eq. 1, when 


r= (2gzuoH 2/4) sec™!. (3) 


Thus, a simple determination of the value of the 
magnetic field required to produce the half-maximum 
level in the Lorentzian line shape serves in principle to 
determine the mean life 7— 1/T', since gy is generally 
well known. In practice fairly detailed profiles of the 
scattered intensity were taken as a function of H, as 
described in the next section, in order to carefully 
analyze and verify the Lorentzian shape. If the incident 
light along the X axis is unpolarized, one may carry out 
the theoretical analysis by resolving this light into two 
orthogonal polarizations. For the component polarized 
parallel to the Y axis the treatment 1s the same as given 
above. For the component polarized parallel to the Z 
axis, the scattered light is independent of the applied 
magnetic field since a dipole oscillating parallel to H 
does not precess. In practice one polarizes the incident 
light parallel to the Y axis in order to eliminate this 
constant background. 


al treatment gives the correct results for the simple 


itation f = te toa J=1 excited 
he excitation from a J=0 ground sta! ! 
np have not examined the classical treatment in more 


complicated cases. 


7 The classic 
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Fic. 1. Geometry 
used in Hanle effect 
experiment. 
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In odd A isotopes the presence of hyperfine structure 
may alter the Lorentzian profile in particular cases. 
This occurs since the gp value of the odd isotopes will 
differ from the gy value for the even isotopes. Moreover, 
the Hanle effect is only a special case at zero field of the 
more general *evel-crossing" phenomenon discovered 
by Franken and co-workers? and the observational 
technique used here will also detect these nonzero-field 
crossings. This may then introduce distortions in the 
wings of the Lorentzian profile if fields large enough to 
reach nonzero-field level crossings are used, which may 
be the case with comparatively small hyperfine intervals 
and short lifetimes (e.g., r € 10-? sec). This was not of 
importance for the elements studied here since the 
naturally occuring group II elements contain at most 
small amounts of odd A isotopes and those odd A 
isotopes which do occur all have J>4. Thus for these 
isotopes the level-crossing signals are a very small frac- 
tion of the scattered resonance radiation because of the 
large number of Zeeman levels. 

We shall now give a quantum-mechanical derivation 
for the intensity of the scattered resonance radiation 
from a collection of identical atoms with J=0, excited 
from the !So ground state to the !P; state and decaying 
back to the So state. We will take the externally applied 
static magnetic field to be along the Z axis but will allow 
for arbitrary incident light and observing directions 
(see Fig. 2). The general expression for the rate of 
scattering of resonance radiation has been given by 
Breit? and Franken? and is 


f mie "E es 
ReNXET———————-. 4) 
M Pil (By Ey] 
where fam= (u| ê- r|m), gum=(u!é’-1’|m), p and u^ refer 
to the excited state magnetic sublevels, and m’ refer 
to the ground-state sublevels, and Æ, is the energy of 


the pth excited state. Figure 2 shows the spherical 


3 P, Franken, Phys. Rev. 121, 508 (1961). 
9G. Breit, Rev. Mod. Phys. 5, 91 (1933). 
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Fic. 2. Coordinate and polarization system used in quantum 
mechanical calculation of zero-field level-crossing signal. 


coordinate system used to evaluate R. The incident light 
is directed toward the origin at coordinate angles 0 and 
e and the light polarization vector ê makes an angle a 
with respect to 0 and lies in a plane normal to r. & andr’ 
refer to the scattered light. With this coordinate system 
we have 


ê= f(cos0 cose cosa—sing sina) 
+7(cos@ siny cose-+cos¢ sino) — k(sin cosa). (5) 


We may define a new set of complex vectors which will 


Puy Gy 1 
R(Au=0)= £ i np'^Tp'n E 
AE T— (G/W) (E,—Ew) 2T 


Puy Gury Zerez 
Rüw-i- E, 
mH 
Iu —a'| = 


1 1 
20202 


dL LÀ 
T?+ (E,9— £1,1)?/h 


FyuywGy 1 
RO= SS up'STp'p 


Iu —2at| 


For our experimental arrangement 0—0'—3m, 9—0, 
¢’=4r. Also Ej1— —E1,1— gzuoH. and E1,5—0 so that 
we may simplify the above equations to 


R(Au=0)= (1/T) ( sin’a sin?o/4-cos'a cos’a’), (13) 
1 gzuoH sin2a sin2a’ 


ROn 
e 0 UE (14) 


CP-G/AYE,-E,) T+ Qa Ei/h 


x È (e 2e,” + ege?) + 
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prove useful by the relations 
£47 est iey= (cosh cosa+i sina)eti? , 
€;— — sinl cosa, (6) 
TX zd, 


so that é-r=3(e,r_te_r+)+e.r:. In order to evaluate 
fmu and gmp we need the matrix elements” 


(7,0,0| ETF | ae 1, ZE 1)= FV2e (r) D 
(,0,0|er;| y^, 1, 0) er); 


where (r) is the reduced matrix element of r. Let us 
now define the excitation matrix Fyw =} m funfmu' Ot 
more explicitly 


(7) 


Wee) cede NA. T yer 
Faw = |—e462/V2 ez e; /V2||()]*.. (8) 
—te,? e2€,/V2 lee 


For our simple case the re-radiation matrix Gwu 
= Pom! £um'£» ls the same matrix as F,,; except that 
all quantities are primed since they refer to the scattered 
light. 

It is convenient for understanding the coherent 
scattering process to break R up into different Au com- 
ponents where Au— |u— g|. Then 


R(f- R(Au-O--R(Au-1)FR(Au-2), (9) 


and 


(1—e2—e,?+3e2e.") , (10) 


i(E1,1— E1,0) 
h 


È (ee +ee—) + (e-e,! — exe-") ] 


VE, — E11) 


Ir (ee4/+e,e") + 
h 


(ce! ee]. (11) 


EX. TGE- EV) 40+ (Era Er)/A] 


2(E131— Ei 
S ue het] . (12) 


L 


T/2 sin’a sina’ 
T2+ QgzuoH/hy 


If the incident light is polarized with a—27 (i.e; 
parallel to the Y axis) and the outgoing intensity !5 


R(Ap=2)=— (15) 


X E. U. Condon and G. H. Shortley, The Theory of Atomic 
Spectra (Cambridge University Press, Cambridge, 1935); 
Chap. MI. 
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measured without polarizers then 


ip f r 
Rha) = — — — — |. (16) 
2LT P+ (2¢ jyoll/hy? 


To obtain Eq. (16), the result for the intensity of the 
scattered light, we have summed Eq. (9) for o/ and for 
o/--7/2 since this takes account of both orthogonal 
components of the outgoing beam. 


Experimental Details and Results 


In Fig. 3 is shown a schematic diagram of the 
apparatus. The beam source is a stainless-steel oven 
positioned about 4 in. below the scattering region. The 
beam effuses through a }-in.-diam crinkle foil aperture 
at the top of the oven and is further collimated into a 
j-in.diam region as it passes vertically through the 
scattering chamber. The beam is then collected on a 
liquid I? trap. A beam flag is provided to determine the 
degree of instrumental scattering in the absence of the 
beam and to aid in aligning the optics. Under actual 
conditions of operation the instrumental scattering can 
be reduced to several times the dark current of the 
photomultiplier while the resonance radiation scattered 
from the beam can easily be made 30 times the dark 
current, In this arrangement the dc scattered resonance 
radiation signal is collected by an aluminized light pipe, 
passed through a narrow-band interference filter for the 
19,5-1P, resonance line in order to reduce the scattered- 
light background, and detected by a Dumont 7664 
photomultiplier. The photomultiplier output is further 
amplified by a Hewlett Packard 412A vacuum-tube 
voltmeter and displayed on a servo recorder. 

The externally applied vertical magnetic field at the 
scattering region was calibrated by proton resonance 
and found to be 7.51+0.02 G/A. The principal compo- 
nent of the stray magnetic field at the scattering region 
was 0.35 G in the vertical direction. This field was not 
compensated but was taken into account in evaluating 
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Fic. 3. Schematic diagram of apparatus. 
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Fic. 4, Normalized plot of level-crossing signal versus magnetic 
field for zinc. 1, 2 and 3 are theoretical curves. Only the portion 
of the curve near the half-width is shown. 


the data. Most of the measurements were made with the 
incident light polarized with the electric vector in the 
direction of observation. Measurements were also made 
with unpolarized incident light and gave the same 
results. 

The data was taken in the following manner. By 
means of a reversing switch, the intensity J of the 
scattered light was recorded successively with the mag- 
netic field in the “up” and then in the “down” direction. 
A series of about ten such determinations for values of 
H between 0 and 200 G comprised a run. A plot of I 
versus H with the zero-field scattered-light intensity 
subtracted should be an inverted Lorentzian curve. 
From an approximate value of 7 and the value of J at 
the larger magnetic-field values we may obtain the 
constant C in Eq. 2. For each run, after C has been 
determined, the experimental curve is normalized to 
unity amplitude and plotted together with a family of 
theoretical Lorentzian curves with different values of 7. 
In this way 7 and an estimate of the uncertainty in 7 
can be obtained. Between 10 and 15 runs were taken 
for each element and the values for 7 quoted below are 
the average values derived from these independent 
runs. Figures 4 and 5 show typical data for zinc and 
calcium. The slight asymmetry between the field up and 
the field down points for calcium is due to the incident 
and observing directions not being exactly perpen- 
dicular. Careful arrangement of the optics can eliminate 
this asymmetry. We find for the lifetime of the !P, 
states the following values: r(Znj— 1.412:0.04X 10-? 
sec, 7(Ca)=4.48+0.15X10-® sec, and z(Sr)— 4.97 
+0.15X 10-? sec. The quoted errors are twice the stand- 
ard deviations for the different runs and represent a 
conservative estimate of the accumulated uncertainty 
arising from possible instrumental inaccuracies. We 
have taken g;— 1.00 in obtaining 7 from the measured 
linewidths. 

The beam density was varied over a factor of about 
20 during these measurements in order to detect and. 
correct for coherence narrowing. At the highest beam 
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Fic. 5. Normalized plot of level-crossing signal versus magnetic 
field for calcium. 1, 2, and 3 are theoretical curves. 


densities a narrowing of a few percent was obtained. For 
this reason the results quoted are from those runs taken 
at low-beam densities. 

The possibility exists that self-reversal in the lamp 
might affect the results. One would expect this effect to 
distort the Lorentzian line shape. This would occur 
because as the magnetic field is increased at the scatter- 
ing atoms, the Am=-t1 Zeeman levels of the atom 
would scan a higher intensity part of the lamp profile, 
and consequently the scattered light would increase 
faster than it would for a flat profile. No such distortion 
of the measured Lorentzian shaped lines occurred to 
within the accuracy of the experiment. 

Finally, we wish to mention that originally data for all 


TABLE I. Zinc. 


DEZAFRA, 


AND GOSHEN 
TABLE II. Calcium. 
Quantity 
Author measured Result 

Prokofev* 7(3P1)/7(4P1) 1.25X10-5  — 
Steinhäuser? 7T(1P1) 3.5X 10^? sec 
Ostrovskii ef al.* T(1P1) _ _6.241X10™ sec 
Ostrovskiid T(LP3) 5.40 -E0.14X 107 sec 
Odintsov® T(1P1) 5.0+0.3X 107? sec 
Present T(1P1) 4.482c0.15X 107? sec 


a V. K. Prokofev, Z. Physik 50, 701 (1928). 
b A. Steinháuser, Z. Physik 95, 669 (1935); 99, 300 (1936). 
e I. Ostrovskii, N. P. Penkin, L. N. Shabanova, Dokl. Akad. Nauk. SSSR 
120, 66 (1958) [English transl.: Soviet Phys.—Doklady 3, 538 (1958). 
d'I. Ostrovskii, N. P. Penkin, Opt. i Spektroskopiya 11, 565 (1961) 
[English transl.: Opt. Spectry. (USSR) 11, 307 (1961). 
e A. I. Odintsov, Opt. i. Spektroskopiya 14, 322 (1963) [English transl.: 
Opt. Spectry (USSR) 14, 172 (1963) ]. 


three elements, with standard deviations of 3-666, were 
taken with a somewhat cruder version of the present 
apparatus. The data presented here were obtained in a 
physically different location, with entirely different 
hardware, including different recording instruments and 
calibrating standards. The results presented here fall 
within the standard deviations of the preliminary 
results. This degree of care was taken in part because of 
disagreement between the preliminary data and pre- 
vious values in the literature. 


Discussion of Results 


'Tables I, II, and III list the present and previous 
results for the lifetime of the first excited 1P; and ?P1 
states of zinc, calcium, and strontium. These lifetimes 
have been calculated from the quoted f values by use 
of the relation? 


mC g2 £2 
fr= N= 1.4993. 
Bre gı £1 


Tn the case of zinc, one has excellent agreement between 
the present results and those of Landman although it 
should be observed that both of these experiments 
employ the zero-field level-crossing technique. More- 
over, the ratio of our 7(1P1) to the 7(#P;) of either Byron 


Athor Qusatity Result or Billiter-Soleillet is in excellent agreement with the 
: ratio measured by Filipov. For the cases of calcium 
Filippov GP )/rQP.) 6.76X 1075 E Potes decade p h 
‘Ausiander® TP) 23*10- s and strontium our z(/P;) is lower than those of the 
Present work 7 (LP) 1.412:0.04X 10^? sec 
Landman* T(LP1) 1.382:0.05X 10^? sec 
Billiter-Soleilletd — ^ T(3P1) 2.1X 10-5 sec "TABLE III. Strontium. 
Byron? À T7(3P1) 2.0+0.2X nr sec 
t 3P 2.9-3.5X 105 
Dumon TËR) x a Author Quantity Result 
i si r " Prokofev* T($P))/r(1P1) 2.69105 
a A. N. Filippov, Physik Z. So: 1, 289 (1932). See also Optical i 1 
Transition Probabilities, à feaciatiomtotieriected| ASTEA published Ostrovskii eż al.> T(LP1) 6.3810 sec 
for the National Science Foundation, Washington, D. C. by the Israel — Ostrovskii and Penkin* T (LP) 6.2 10~ sec 
Program for Scientific Translations, Jerusalem, 1962. In this paper the ratio Present work = ( Pi) 4.972-0.1 5x10? sec 


of the oscillator strengths is given and we have converted to the lifetime 
ratio. 

b J, Auslander, Helv. Phys. Acta 11, 562 (1938). 

e A, Landman and R, Novick, Phys. Rev. 134, A56 (1964). 

dW. Billeter, Helv. Phys. Acta 7, 841 (1934); P. Soleillet, Compt. Rend. 
204, 253 (1937). E 

eF. W. Byron, M. N. McDermott, R. Novick, B. W. Perry, and E. 
Saloman, Phys. Rev. 134, A47 (1964). 5 

t M. Dumont, Thèse de Troisième Cycle, Paris, 1962 (unpublished). 


a V. K. Prokofev, Z. Physik 50, 701 (1928). o 
b I. Ostrovskii, N. P. Penkin and L. N. Shabanova, Dokl. Akad. Nauk 
EA 66 (1958) [English transl.: Soviet Physics—Doklady 3, 53 
1 E 
o Yu. I. Ostrovskiiand N. P. Penkin, Opt. i Spectroskopiya 11, 565 (1961) 
[English transl.: Opt. Spectry. (USSR) 11, 307 (1961) ]. 
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TABLE IV. Intermediate coupling coefficients. 


Lifetime ratio 


Wolfe method method 
Zn c 0.5656 0.5657 
; €; 0.8247 0.8246 
Sre cı 0.5519 0.5526 
í ca 0.8339 0.8334 
Gas cı 0.5709 0.5717 
ca 0.8210 0.8204 
.^ We use the Prokofev lifetime ratio in the case of Ca and Sr since no 
direct measurement of 7\3P1) is available for these elements. 


Russian measurements, however, we note that their 
most recent results are in better agreement with ours. 
We do not know the source of these discrepancies but 
we should point out the following fact: The recent 
ingenious improvement in the hook method" utilizes 
total absorption to measure JV fT. The use of this method 
requires that the absorption follow a Lorentzian law at 
about three Doppler widths from the line center or 
typically about 200 times the natural linewidth from 
the line center. It is important to note that this has 
never been checked experimentally and any deviation 
from a Lorentzian line shape well out in the wings of the 
line could lead to erroneous results. 

The excellent agreement in the case of Cd and Zn 
between the hook measurement of r(P1)/7(@P1) and the 
recent measurements of 7(1P;) and 7r(*P;) for these 
states indicates that one may get good estimates of 
7(3P;) by using our 7(!P;) and the ratio obtained from 
the hook measurements. 

It is also of interest to compare the intermediate 
coupling coefficients!” c; and cs for the !P; and °P; states 
as derived from lifetime measurements with those 
derived from Wolfe's?:3 theory which makes use of the 
deviation of the fine structure from an interval rule. 
This theory takes account of spin-orbit and spin-other- 
orbit interactions but does not allow for spin-spin inter- 
actions or the fact that the 1P, and *P; states can have 
slightly different radial wave functions. The values of 
cı and cs calculated by the two different methods are 
given in Table IV and are in very good agreement. 


n yu, I. Ostrovskii, N. P. Penkin and L. N. Shabanova, 
Dokl. Akad. Nauk. SSSR 120, 66 (1958) [English transl.: 
Soviet Phys.—Doklady 3, 538 (1958)]. M 

1 A. Lurio, M. Mandel, R. Novick, Phys. Rev. 126, 1758 

1962). y 
E a Kopferman, Nuclear Moments (Academic Press Inc., 
New York, 1958). 
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TABLE V. Comparison of theoretical and experimental values 
of the oscillator strengths. 


Ca Sr Zn 


ACSo —1P1) 4227 À 4608 A 2139 À 
J (exp)? 1.80 1.92 1.46 

J B&D)» 1.83 1.90 1.49 

f (Vainshtein)* 2.32 

J (Trefítz)? 


1.737-1.223 


a Present work only. 

^ D. R, Bates and 4. Damgaard, Phil. Trans, 242, 101 (1949). 

* L. A. Vainshtein, Transactions of the P. N, Lebedev Institute, Vol. XV. 
(English translation available from Consultants Bureau Enterprises, Inc, 
New York, 1962.) 

4 E, Trefftz, Z. Astrophys. 29, 287 (1951). 


As pointed out in the introduction, the lifetime of the 
1P, state provides a test of approximate methods used 
to calculate the oscillator strength of the 'P; to Ss 
transition. The prediction of the Coulomb approxima- 
tion method of Bates and Damgaard’ can be obtained 
by making use of the formula of Garstang.” 


(Sy PAP) = —V2 at V2e2)a£ — 6a, 


and 
SQ) = | —V2 (6: +-V2c2)0 | "607, 


where o° is obtained for the p electron from the tables 
given in the Bates and Damgaard paper. The estimate 
of the oscillator strengths f from this calculation is 
shown in Table V. Also included in Table V are other 
estimates for f using explicit forms for the wave func- 
tions of the two states. The surprising conclusion to be 
drawn from Table V is that the Coulomb approximation 
is in very good agreement with the experimental results. 

Note added in proof. After this article was submitted 
for publication a paper with the following reference ap- 
peared: E. Hulpke, E. Paul, and W. Paul, Z. Physik 
177, 257 (1964). In this paper the following results are 
given: 7(Ca)=4.67+0.11X10~ sec and r(Sr) - 4.56 
2-0.21X 107? sec, which are in excellent agreement with 
ours. As in their paper we have also considered the 
transitions from the !P; to the 1D; state to be negligible 
compared to the decay to the ground state. 


ACKNOWLEDGMENTS 


Wewould like to thank Robert Holohan for his help in 
assembling the apparatus and in taking much of the 
final data. We also appreciate several helpful discussions 
with Professor R. Novick and A. Gallagher. 


“u R, H. Garstang, J. Opt. Soc. Am. 52, 845 (1962). 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


PHYSICAL REVIEW VOLUME 


134, NUMBER 5A 1 JUNE 1964 


g, of the (5s5p)*P, Level of Cd and the (6s65)*P; Level of 
Hg by High-Field Double Resonance* 


R. KOHLER 
Columbia Radiation Laboratory, Columbia University, New York, New York and 
Physics Department, New York University, University Heights, New York, New York, 


AND 


P. THADDEUS 
Goddard Institute for Space Studies, National Aeronautics and Space Administration, New York, New York 
(Received 20 January 1964) 


An optical double resonance experiment at high magnetic fields has determined gy for the (5555) 3P; level 
of the even cadmium isotopes to be 1.499846 (13), and gy for the (6565) ?P; level of the even mercury isotopes 
to be 1.486094(8). Resonance was observed in all cases at a frequency of exactly 24 Gc/sec, and magnetic 
fields near 11430 G. At these high fields the Zeeman energy is of the order of 1073 of the fine structure 
separations of the triplet terms of the (#s#p) configuration, and the Am= 1 transitions are split by 9.51(7) 
G for Cd, and by 2.99 (7) G for Hg. This splitting represents several linewidths in the case of Hg, and many 
linewidths in that of Cd. The average field of the two Am=+1 transitions, however, determines gy to high 


precision independent of second-order corrections. 


I. INTRODUCTION 


HE double resonance}? and level crossing tech- 
niques** have been used in extensive studies of 
hyperfine structure (hfs) in the lowest lying (sup) *P1 
level of mercury, cadmium,?— and zinc.5-? Suf- 
ficient sensitivity has been obtained to allow the deter- 
mination of the hfs, and consequently the nuclear 
moments, of radioactive isotopes with half-lives as 
short as a few hours.!*:6 
While the level crossing method has the great ad- 
vantage in practice of doing away with all radio- 
frequency and microwave apparatus, aside from nuclear 


* Work supported by the Joint Services (the U. S. Army, the 
U. S. Navy Office of Naval Research, and the U. S. Air Force 


: Office of Scientific Research) and by the U. S. Office of Naval 
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magnetic-resonance (NMR) magnetometers, it allows 
a determination of only the ratio of the hyperfine inter- 
action constants to gy, and an independent measure- 
ment of g; must therefore be made to determine these 
constants to high precision.!® While gy can of course be 
measured by double resonance in the odd isotopes at 
various fields, or by observing both double resonance 
and level crossings, it is most directly found from 
double resonance in the even isotopes of the element. 

In principle, the higher the frequency and the greater 
the magnetic field, the more accurate the determination 
of gy. In a series of experiments designed to measure gy 
of the lowest lying ?P, level of zinc, cadmium, and 


3 
P 


Energy 


ultraviolet 
intercombination 
line 


Fic. 1. Zeeman effect of the 3P; state of the even isotopes of 
zinc, cadmium and mercury, showing the effect of second-order 
interaction with the other fine structure levels of the configuration. 
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greater than the isotope shifts and the hyperfine split- 
tings of the natural cadmium isotopes (and the optical 
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Scattered 
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Fic. 2. Geometry of the experiment. 
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TABLE I. Atome constants of Zn, Cd, and Hg. A(P;) and A(P1), the wavelengths of the resonance lines of the lowest lying (nsnp) 
configuration, and à» and âs, the fine structure separations of the triplet terms, are taken from C. E. Moore." 
T (3P;) r UP) AGP) AP) âz 9» 
is. uS MARE 2) (sec) (A) (A) (cm) (erm) 
Zn 2.00(20) X 10-5 » 1.38(5) X 10°: 3077 2139 388.927 - 190,082 
4 Cd 2.39 (4)X 10-6 4 1.66(5) X10? © 3261 2288 1170.866 542.113 
Hg 1.18X107 f 0.3-1.6K 1079 « 2537 1849 4630.677 1767.220 
po Mere. Atomic Energy Levels (U. S. Ru Printing Office, 1919-58). E li 1 
^9 * See Ref. 20. 
d See Ref. 23. 
* A. Lurio and R. Novick, Phys, Rev. 134, A608 (1964). 
f See the last paper of Ref, 22. 
g A. C. G. Mitchell and M. W. Zemansky, Resonance Radiation and Excited Atoms (Cambridge University Press, Cambridge, 1961), 2nd ed., p. 147. 
mercury to high precision (it is $ on the basis of LS Doppler width),* and only the m=0 state is therefore 
e coupling), we therefore chose to work at a frequency of illuminated. It is then readily calculated in terms of the 
24 Gc/sec, and a field near 11 400 G—approaching the — transition probabilities that at double resonance the 
limit attainable with the available 12-in. electromagnet resonance fluorescence at 90° is reduced by a factor of 4. 
when operating with a gap of 2 in. For mercury, on the other hand, the isotope shifts 
The Am=-+1 transitions for these states coincide at and hyperfine splittings are comparable to the Zeeman 
low magnetic fields, but at high fields interaction with shift of the Am=-+1 states," and we must rely mainly 
the neighboring fine structure terms of the configuration on the nonisotropic nature of the resonance radiation. 
(and to a negligible degree with levels of other configura- In the least favorable case, on the assumption that all 
tions) splits the resonances into two well-resolved com- three states are equally illuminated, it is calculated that 
ponents, as shown in Fig. 1. Precise measurement of the the resonance fluorescence at 90° is reduced by a factor 
splitting of the two resonances therefore serves also as of yz at double resonance. 
a check on the theory of the second-order fine structure 
interactions. Linewidths 
II. THEORY Several effects determine the linewidth and shape 
Signal St th of the double resonance effect, and have all been con- 
LSE EIEN sidered in detail elsewhere. Most fundamentally, the 
It is well known that double resonance can be ob- natural lifetime 7 of the state gives a Lorentzian line 
served between hyperfine or Zeeman states excited by shape with full width at half-maximum of AH 
resonance radiation in a variety of ways. Population =Ava=1/x7.'-6 The natural linewidths for zinc, cad- 
differences can be produced by resonance radiation that mium, and mercury calculated from the lifetimes in 
is polarized, nonisotropic, or confined to a narrow wave- Table I are listed in Table II. z 
length interval, and all of these methods have been put If the rf magnetic field is strong enough to induce 
to practical Ter many transitions during the lifetime 7, the resonance 
. . . g - 1,6 : . 
% To dispense with ultraviolet polarizers, we relied on will be power poe In rie © n rf pedi 5 
the latter two effects and used the experimental geom- always attenuated to the point where this effect 15 
j in Fi Only 1 taining the natural small, and it will therefore not be considered further. 
etry shown in Fig. 2. y lamps containing Puppe 3 
ff cada the Z Double resonance owes its high resolution to the fact 
metal were used. In the case of cadmium, the Zeeman hat Donoler broaden te deti DRIN CE 
À shift of the m=1 states at 11400 G is somewhat that Doppler broadening is determined by the radio 


rather than by the optical frequency. At microwave 
frequencies, however, the Doppler linewidth Ava= (v/c) 
X(8N&T In2/ M)", where N is Avogadro’s number and 


TABLE IL. Theoretical and observed linewidths in milligauss. 
AHeate is taken as the sum of the natural, Doppler, magnetic field, 
and wall collision widths. 


Atom AH, AH AHm Alle Ahi — Abg. 
Zn 8 29 30 37 124 m 
Cd 6 18 50 26 162 175 
Hg . 1295. 10. 50 15 1270 136 


1 F. Bitter, S. P. Davis, B. Richter, and J. E. R. Y. 
Phys. Rev. 96, 1531 (1954). pr 
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M is the molecular weight, may not be altogether 
negligible, as is shown in Table IT. 

Inhomogeneity of the magnetic field over the scat- 
tering sample contributes a linewidth AZ, that can 
often be reduced by careful alignment of the pole pieces. 
We found, however, that as our magnetic field ap- 
proached 11 400 G, its homogeneity fell off rapidly due 
presumably to uneven saturation. We estimate that 
AH ,,— 50 mG over a 5-mm sample. 

Collisions with the walls of the scattering cell ef- 
fectively shorten the lifetime of the excited state, and 
contribute a wall-quenching linewidth Av, which de- 
pends mainly on the size of the cell. We have made 
estimates of A», in Table II on the assumption that 
Avy=1/atw= (3NkRT/M)?/za, where Tw is the time 
for an atom at the temperature of the optimum vapor 
pressure to travel the radius a of the scattering cell. 

Collisions between the scattering atoms will broaden 
the resonance at high vapor densities, while multiple 
scattering of the optical photons may actually decrease 
the linewidth (coherence narrowing). Both of these 
effects may be avoided by working at low enough 
temperatures. 

Quenching by atomic or molecular contaminants in 
the scattering cell, on the other hand, may contribute 
a linewidth Av, which cannot be easily estimated in 

advance, and cannot be reduced once the cell is made. 
We shall see that possibly except for zinc we found no 
evidence of quenching of this kind. 


Zeeman Effect 


Interaction with the two neighboring triplet levels 
of the (nsn) configuration contributes a term which is 
of the order of the (Zeeman energy)?/ (fine structure 
energy) ~10-*X (Zeeman energy) to the energy of the 
m=-1, 0 states. More precisely, from perturbation 
theory,” 

a? (g5— gr) uo H* 
Waegusdt- 5 ET (1a) 


2 1 
Woo seiner —>) , (1b) 


360 30» 
o? (gs— £1) u’ H? 
ls gne (1c) 
46» 


where po is the Bohr magneton, a is the mixing coef- 
ficient which indicates the departure from Russell- 
Saunders coupling, and à and à» are the magnitude of 
the fine structure separations. We adopt the convention 
that gy is positive. Interaction with the singlet term of 


72 T. P. Barrat, J. Phys. Radium 20, 541, 633, 657 (1959). 

2 Y. W. Byron, M. N. McDermott, and R. Novick, Phys. Rev. 
134, A615 (1964). 

3 A. Lurio, M. Mandel, and R. Novick, Phys. Rev. 126, 1758 
(1962). 
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the configuration, and with terms of other configura- 
tions, contributes less than one part in 105 to the energy, 
and third-order terms vanish. i 
For double resonance at a fixed frequency var, the 
field H+, for which m=0— +1, is greater than H-, 
for which m=0— —1. To within one part in 10° we 


find from Eq. (1) that in terms of the average field - 


H= (H++H-)/2 - 
gy — hvaz/uoH . (2a) 
'This becomes 


£17 go(vas/ 95) , (2b) 


if the field is measured in terms of the NMR frequency 
vp of protons. gp is defined such that the proton mag- 
netic dipole-moment operator is up= poly. For protons 
in mineral oil we will take gp=3.041978(3) X 10-4, from 
the average of the measurements of g,/gp for protons in 
mineral oil of Koenig, Prodell, and Kusch,” and Ber- 
inger and Heald,” and the recent measurement of g, 
of Wilkinson and Crane." 

For the separation of the two resonances we find that 


a? (gs— gL) uH? 4 1 
ee 
EJT 380 68» 


AH—H*— H- 


where ôo and ôs are in ergs. When the field is measured in 
terms of the proton frequency, 


o? (gs— gL)? 27 4 1 
Avp= 3 ( -—) ; (3b) 
CE»EJ 38g 665 


where ôo and 8» are in cm. 


IIl. EXPERIMENT 


A schematic illustration of the experiment is shown 
in Fig. 3. Resonance radiation from an rf lamp was 
focused with a quartz condensing lens onto a small cell 
containing cadmium or mercury vapor. The cell was 
located within a microwave cavity between the poles 
of a 12-in. electromagnet. This cavity in the case of 
cadmium also served as an oven to control the vapor 
density in the scattering cell. 

Resonance fluorescence of the atomic vapor was ob- 
served with a photomultiplier at an angle of 90°. A 


light pipe allowed the photomultiplier to be removed 


from the magnetic field without loss of solid angle. A 
glass color filter effectively removed all scattered light 
except the resonance radiation. 

The microwave cavity was tuned to resonance with a 
klystron kept at a frequency of exactly 24 Gc/sec, the 
field being modulated at a low audio frequency, typi- 
cally 37 cps, by small rectangular coils attached to the 
pole faces. At resonance, the light scattered at 90° 

23S. H. Koenig, A. G. Prodell, and P. Kusch, Phys. Rev. 88, 
191 (1952). 


26 R. Beringer and M. A. Heald, Phys. Rev. 95, 1474 (1954). 


272. T. Wilkinson and H. R. Crane, Phys. Rev. 130, 852 
(1963). 
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Fic. 3. Schematic diagram of 
the experiment, 


Light 
Pipe 
Photomultiplier 


Scattering Cell 
in Cavity- Oven 


changed in intensity, and the signal from the photo- 
multiplier was displayed on an oscilloscope. 

Field measurements were made with an NMR mag- 
netometer, whose small mineral-oil probe was located 
as close to the scattering cell as possible. The offset 
correction was measured with a second probe and 
oscillator. 


Optics 


The cadmium lamp used consisted of a small quartz 
cylinder, 1.5 cm in diameter and 2.5 cm long, carefully 
baked out, purged with an rf discharge in argon and then 
filled with a small amount of distilled natural cadmium 
and about 1 torr of spectroscopically pure argon. It 
was identical to the lamps previously used in this 
laboratory for several level crossing and double reso- 
nance experiments.^!* Tt was excited by being placed, 
without cooling, in the tank coil of a Hartley oscillator 
operating in the vicinity of 30 Mc/sec. Experience has 
shown that these lamps furnish about 1 W (total out- 
put over 47 solid angle) in the 3261-À intercombination 
line. 

For mercury an Osram lamp was used that had the 
outer jacket removed, and was cooled with a gentle 
air stream to eliminate self-reversal. It too was excited 
with a radiofrequency oscillator operating near 30 
Mc/sec and was run at the same current specified by 
the manufacturer for operation at 60 cps. 

The scattering cells were designed to have as small an 
effect as possible on the electrical properties of the 
microwave cavity. They were very thin walled quartz 
bubbles, 5 mm in diameter, blown at the end of a length 
of 2 mm outer diameter quartz tubing, which, when cut 
off at a length of 5 cm, served as the tail of the cell. 
These cells were baked out at 1000°C for a day or longer, 
purged with a Tessla coil for several minutes, and filled 
with a small quantity of distilled metal, all under very 
high vacuum (less than 10-8 Torr for the last stage). 
The light pipe used to remove the RCA-1P28 photo- 
multiplier from the vicinity of the magnet was a glass 
tube aluminized on the inside, 1 in. in diameter, and 
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about 2} ft long. To pass the 3261-A line of cadmium, 
or the 2537-À line of mercury, and exclude other scat- 
tered light, a Schott UG-11 or UG-5 colored glass 
filter was cemented to the end of the light pipe. 


Microwave Cavity and Oven 


The TEs cylindrical aluminum cavity oven used 
for cadmium is shown in Fig. 4. It had an internal diam- 
eter of 1.4 cm, a Q of about 4000 when loaded with the 
cell, and was coupled about 50% to the microwave 
line. A hole 6 mm in diameter at the top of the cavity, 
below cutoff at 24 Gc/sec, allowed illumination of the 
scattering cell. Small slits in the side of the cavity, 
also below cutoff, allowed the light scattered at 90° 
to be observed. 

The waveguide connecting the cavity to the micro- 
wave line was thinned to provide thermal insulation. 
The cavity was heated with a small gas-oxygen flame, 
the temperature being monitored by a small copper-con- 
stantin thermocouple. Thermal regulation was provided 
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Fic. 4. The cavity oven used for cadmium. 
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by a Honeywell R7086A thermocouple-activated regu- 
lator that played a stream of cool air against the cavity 
when the thermocouple voltage exceeded a preset 
figure. 

The long tail of the scattering cell passed through an 
aluminum tuning plunger, and an insulating refractory 
section, and terminated in the aluminum cylinder shown 
in the bottom of Fig. 4. This cylinder was heated with 
another gas-oxygen flame, and its temperature regulated 
with a separate regulator unit. It was always kept at 
least 40°C cooler than the oven, and consequently 
determined the cadmium vapor pressure. 

For mercury, in an attempt to gain a higher Q, a 
thin-walled copper cavity was used that was kept at 
room temperature. With a brass tuning plunger, a 
loaded cavity Q of about 8000 was obtained. The tail 
of the cell was usually kept at room temperature, but 
could also be cooled in an ice bath. 


Microwave Measurements 


'The Varian VA-98E klystron used in this experiment 
is an extremely stable tube that oscillates only over the 
region from 23.6 to 24.4 Gc/sec, with an average output 
of about 30 mW. To achieve maximum stability, we 
used batteries for both the heater and repeller voltages, 
and a well-regulated power supply for the beam voltage. 
No electronic stabilization of the tube was employed. 
During a run it was kept manually to within an audio- 
beat note of the 24th harmonic of the 1000 Mc/sec 
output of the laboratory's frequency standard. This 
represents a stability of about one part in 10°, and is 
therefore quite adequate for the purpose of this 
experiment. 


Field Measurements 


The proton probe of the NMR magnetometer used 
for field measurements consisted of a small spherical 
cavity filled with mineral oil drilled in the center of a 
teflon spool. The inductance for a transistorized margi- 
nal oscillator was wound on the spool which was then 
placed in a brass or copper shield. It was located as 
close as possible to the scattering cell (about 4 cm away), 
and the output signal of the marginal oscillator was 
displayed with the double resonance signal on a dual 
beam oscilloscope. The NMR frequency, near 48 Mc/sec, 
was measured with a Hewlett-Packard direct reading 
frequency counter whose reference frequency was taken 
from the laboratory standard. The offset correction was 
measured by removing the oven, placing a second probe 
driven by another identical oscillator at the scattering 
cell location, and measuring the beat frequency (~4 
ke/sec) of the two oscillators with the frequency 
counter. A pentode mixer gave the difference frequency 
without excessive cross coupling of the two oscillators. 

The offset correction was found to depend on the 
field history, and a correction was therefore made after 
each field measurement. 
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IV. RESULTS 


For cadmium the onset of power broadening was ob- 
served when about 5 mW of microwave power were in- 
cident on the cavity, in rough accord with our prelimi- 
nary estimates based on the lifetime of the (5555) 3P; 
level, and the electrical properties of the cavity. With 
oscilloscope display giving a detector bandwidth of 
about 1 kc/sec we succeeded in obtaining a signal-to- 
noise ratio of about 30 for both the Am--E1 double 
resonance transitions [Fig. 5(a)]. The observed line- 
width (Table II) shows no evidence of contamination 
broadening. 

The shorter lifetime of the (6565) *P1 level of mercury 
was found to require for maximum signal strength 
somewhat higher rf fields than the VA-98E klystron 
and aluminum cavity could provide. A higher Q copper 
cavity subsequently used for mercury just allowed the 
level of power broadening to be reached. The observed 
resonances [Fig. 5(b)] had a signal-to-noise ratio of 
about 5. It should be recalled that because of the il- 
lumination of the m==1 states the mercury signals 
are expected to be about 3 times weaker than those of 
cadmium. The additional discrepancy in signal strength 
can be attributed to the somewhat weaker output of the 
mercury lamp. The linewidth was in agreement with 
expectations (Table II). 

We had originally hoped to extend our measurements 
to the (4549) ?P; level of the even zinc isotopes, although 
we were aware that the long lifetime of this level would 
produce serious quenching of the excited state atoms 
against the wall of the scattering cell. We, in fact, made 
several runs using narrow-band phase-sensitive detec- 
tion with time constants as long as 5 sec, but failed to 
detect either double resonance or fluorescence. Beyond 
the quenching and broadening mechanisms listed in 
Table II, we are inclined to attribute this to contami- 
nants produced by the high cell temperature required 
for zinc. 

The magnetic fields at which double resonance was 
observed for cadmium and mercury are listed in Table 
III. From Eq. (2) we then find for the (5s5p) *P1 level 
of cadmium that 


£7 1.499846(13) — 3—154(13) X 1075, 


in good agreement with the values g;1—$—171(9) 
X10-5 and g,15—3—157(9)X 10-5 found for the odd 
isotopes from level crossings? and zero-field double 
resonance? (we have here recalculated gy"! and gj" 
taking g;—3.041978X 10-3). 

Likewise, we find for the (6565) 3P; level of mercury 
that 

£77 1.486004(8) — $—13 906 (8)X 10-5. 


This is in reasonably good agreement with a recent 
preliminary value of g;/?—1.486118(16) found by 
Smith, who considered both his own level-crossing 


28 W. Smith (private communication). 
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study of Hg™, and the results of low-field double reso- 
nance." It is also in agreement with the less precise 
value 1.486350(300) found by Dodd? from double reso- 
nance in the even isotopes. It is in disagreement, how- 
ever, with the self-consistent values gj? — 1.486147 (10) 
and g,7!—1.486156(18) calculated by Kaul in his 
thesis? from his study of level crossings, and the low- 
field double resonance results.9:! 

For cadmium the known lifetimes of the *P, and ‘P; 
levels allow o to be calculated directly from the relation 


BP ^ TUPQN(P:) q 
== RUNE NNI (4) 
a? 7GPi)MCP1) 


and the normalization condition a?+6?=1. Using the 
lifetimes and wavelengths of Table I we find that 
a=0.99900(5) and B= —0.0448(11). The phase con- 
vention of Condon and Shortley is used to determine 
the sign of 8! From Eq. (3) the splitting of the reso- 
nances is then calculated to be Av5— 40.245 (4) kc/sec, 
or AH —9.452(1) G. This is seen from Table III to be 
0.255 kc/sec lower than the experimental value, but 
just within the experimental uncertainty. 

Knowing « and 8 independently from the lifetimes, 
we are in a position to determine the sum of the rela- 
tivistic and diamagnetic corrections to gy. Theoretically 


gr=tor(grt+gs) +B grt Aget Agaiam: (S) 
Taking gz—1,? and gs= 2.002319,” we find that?! 
Agre t Agdiam= —318(40)X 1075. 


Since for mercury the lifetime of the !Pi level is 
poorly known, the sum of the relativistic and diamag- 
netic corrections cannot be found. Instead, however, 
we may use Eq. (5) neglecting these corrections to ob- 


+-Agaiam= —135(81) X10 * given in Ref. 13 are in error. These 
values were calculated from Eq. ( 
outdated lifetimes—not t 
best of our knowledge, t 
self-consistent. 
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TALLE III. Measured fields for double resonance in cadmium 
and mercury, in terms of the NMR frequency of protons in mineral 
oil, and in G, taking for 1/27X the gyromagnetic ratio 4.25760(4) 
kc/sec-G.^ A series of runs were made for each atom, care being 
taken to randomize systematic errors from run to run by slightly 
relocating the cell and probe within the field, reversing the polarity 
of the magnet, varying the modulation frequency, changing the 
vapor pressure, reassembling the optics, etc. The uncertainties 
quoted are 3 times the standard deviation of the means of the 
individual runs. 


Atom H* H AH 


Cd  48(9651(66) kc/sec — 48 65640(35) kc/sec 40.50128) ke/sec — 
11 437.64(19) G 11 428.13(15) G 9,5107) G 

Hg 49 133.45(35) ke/sec — 49 120.73(38) ke/sec — 12.72(26) kc/sec 
11 540.17(14) G 11 537.19(14) G 2.9917) G 


* R, L. Driscoll and P. L. Bender, Phys. Rev. Letters 1, 413 (1954). 


tain a=0.9849(4) and B= —0.1733 (24). We then calcu- 
late from Eq. (3) the splitting of the mercury resonances 
to be Av, — 12.468(8) kc/sec or AH —2.928(2) G. It is 
interesting to note that this is lower than the experi- 
mental value by 0.252 kc/sec, nearly the same as in the 
case of cadmium. Since the measurement of the splitting 
AH is not susceptible to the same systematic errors as 
the absolute field measurements, the uncertainties in 
AH given in Table IJI must be considered quite con- 
servative, and this discrepancy may be significant. 


V. DISCUSSION 


While an accuracy of about one part in 10° has been 
attained in the field measurements reported in this 
experiment, for cadmium this represents an uncertainty 
of one linewidth, and can certainly be improved upon. 
In retrospect we believe that higher precision could 
have been realized at somewhat lower frequencies and 
fields. Most of our uncertainty can be ascribed to the 
inhomogeneities which crept in as the field approached 
11400 G. Had we worked at half the present field and 
frequency, for example, the field would have been con- 
siderably more uniform, the scattering cell would have 
been twice as large (and easier to construct and more 
durable), the Doppler and wall collision linewidths 
halved, and the exit and entrance apertures increased 
in area by four. It is probable that under these circum- 
stances we would have had sufficient signal strength to 
observe double resonance in zinc, and that the cadmium 
resonance could have been measured to a small fraction 
of its width. 
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Determination of the Polarizability Tensors of the Magnetic 
Substates of *P, Metastable Argon*} 
EDWARD PoLrack,] EDWARD J. Rosinson,§ AND BENJAMIN BEDERSON 
Department of Physics, New York University, U niversity Heights, New York, New York 
(Received 8 January 1964) 
H 
The atomic beam 77-7 gradient-balance method has been employed to measure the z-component as (m7) b 
of the electric polarizability tensors of the my = +2 and zi; — 4-1 magnetic substates of Pa metastable argon. 
The measurements were normalized to the polarizability of $51 metastable helium, which is calculable to 
within an accuracy of 1%. In this method, a metastable beam is passed through a region containing trans- D. 
verse, congruent, inhomogeneous electric and magnetic fields. The condition that atoms ina particular 
magnetic substate experience no deflection in such a field region is es (mj) E(8E/65) = pest (61/83), where 
heft is the magnetic moment in the field H, and 6//és, òH /ôz are the z components of the gradients of the 
magnitudes of the electric and magnetic fields, respectively. Because of the congruency of the £ and H fields 
(8E/8:)/ E — (8H /0z)/ H , and therefore a: (ms) —uerH / E?. Because of the normalization, it was unnecessary 
to find either E or H. The results, in units of 10! cm’, are a:(+1)=50.4+3.5 and a:(+2)=44.5+3.1. 3 
The measured values of (4-1) and «:(+2) can be used to determine the polarizability tensors of each of 
the magnetic substates. The other components of the diagonalized tensors, with œz (mys) 2a, (n7), in units 
of 10-24cm? are a (0) —46.42-2.4, az (21) —47.24-2.3, a, (2-2) —50.42-3.5, and a; (0) =52.4+4.8. 
INTRODUCTION We will here briefly review our basic technique, 
ECENTLY an atomic beam method has been applied to an atom possessing a tensor polarizability. 
developed to measure the electric polarizabilities Consider an atom traversing a region of space containing 
of atoms.-? Using this technique, called the “E-H both an inhomogeneous transverse electric field and an 
- o ? E E s j 
gradient balance method,” polarizability determina- inhomogeneous transverse magnetic field, with both 
J . B i 7 
tions are independent of the atomic-beam velocity fields possessing large transverse gradients. We assume 
distribution and specific values of the field gradients that these fields are congruent, that is, the electric and 
and apparatus geometry except where employed in the magnetic fields as well as their gradients are propor 
calculation of the electric field at the beam position tional and in the same direction throughout the inter- 
from knowledge of the applied voltage and pole-face action region. The atom experiences a transverse force 
contour. Salop, Pollack, and Bederson®?? used this due to its magnetic moment given by 
method to measure the polarizabilities of the alkalis, 1) 
and the results of this work, as well as those of Chamber- Fm= pest (ms) (8H/62) , ( 
lain and Zorn,‘ are in agreement with recent theoretical A A ) E ; 
determinations, to within experimental error.56 For where pet (1707) is the effective atomic magnetic moment 
all these systems, the ground states are S states and of the magnetic substate my and 8H /àz is the transverse 
A ? = 2 
the polarizability is therefore scalar. component of the gradient of the magnitude of the 
f magnetic field. * 
For an atomic system possessing a tensor polariza- 
Projects A eU S mes RU ah bility the electric dipole moment y+ induced in the atom ‘ 
Contract N-ONR-285(15) and by the Defense Atomic Support by the electric field E is generally not along the field 1 


Agency, through the U. S. Army Research Office, Grant DA-ARO- 
(D)-31-124-G269, Durham, North Carolina. 

From part of a thesis submitted by E. Pollack to the Graduate 
Faculty of New York University in partial fulfillment of the 
requirements for the Ph.D. degree, October 1963. 

į Present address: Physics Department, University of Connecti- 
cut, Storrs, Connecticut, 

§ National Science Foundation, Science Faculty Fellow 1963- 
1964. 

1B. Bederson, J. Eisinger, K. Rubin, and A. Salop, Rev. Sci. 
Instr. 31, 852 (1960). 


direction. For the case where the polarizability tensor 
is in diagonal form, ue is given by 


u= fo E jay E y4- ko E; P 
where a, is the zx component of the polarizability 


tensor o, etc. For S-state atoms, o,—o,—0;-—0, 1.€., 
the polarizability is a scalar. For the magnetic substates 


*A. Salop, Ph.D. thesis, New York University, 1961 of P-state atoms, this is in general not the case and the 
(oop ep ished): polarizability of each of the substates is a tensor. In 
ee) Salop, B, Palais, emd i ISSR ony, Miya Rev Ms KOL iia present work, the electric and magnetic fields, which | 


4 G. Chamberlain and J. Zorn, Phys. Rev. 129, 677 (1963). 
5 The earlier Scheffers and Stark (Ref. 6) values for lithium and 
cesium are in error, being about 40% too low for lithium and 20% 


too low for cesium. 
6 H. Scheffers and J. Stark, Phyzik Z. 35, 625 (1934). 


are in the z direction, establish the z axis as a symmetry 
axis in the atom. Thus, o is in diagonal form, with the * 
and y components being equal. In terms of matrix 
elements, the polarizability components may be 
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written (in atomic units) as follows? : 


A1211 


N 
IOLE r:ļk)l? 


az(my) =ay(ms) = 3; [U?--J — mj2)8 (PE mP+3I 4-2)8.4 4, o (J+ ms = J)64 a ]-————— , (2a) 
D $ E,— Eo Y 
N 
OLE ril kyl? 
"n i=l 
acm) =2 2 Dm õss HIH m? sas Hm ss HIHI, (2b) 
z Er— Eo 


where J and my are the total angular momentum and 
magnetic quantum numbers of the unperturbed state 
[0), J^ is the total angular momentum quantum number 
of the state |k), (E,— Eo) the difference in energy 
between |k) and |0), r; the radial coordinate of the ith 
electron, and NN the number of electrons. Since the 
electric field is parallel to the z axis, the induced dipole 
is 


u= kaL. 


The resulting force is also in the z direction, and is 
given by 
F,—o;(mj)E(8E/8z), (3) 


where a:(my) is the “effective” polarizability of the 
atom in the substate my and 0£/óz is the transverse 
component of the gradient of the magnitude of the 
electric field. (It is assumed throughout that the 
nuclear spin is zero.) For substates possessing negative 
atomic moments, the fields can be adjusted so that the 
electric and magnetic forces given by Eqs. (1) and (3) 
are equal and opposite. (In the present experiment these 
are the positive my states.) When this is the case, it 
follows that 


Hert (rz) (6/62) a (m) E (64/62) . (4) 


Should Eq. (4) apply for several magnetic substates, 
o (m) for each substate can be determined separately. 

It can readily be shown that the polarizability 
tensors of all the substates can be uniquely expressed in 
terms of the effective polarizabilities of the my=+1 
and my=-+2 substates. Thus, 


a,(0)=$[4a.(+1)—a:(+2)], 
az(0)=4Le:(+1)+2e:(+2)], 
a(+1)=3[e:(+1)+e:(+2)], (5) 
a,(+2)=a.(+1), 
o (my) =a, (mz)- 


Note that @:=a@,=@-, where @ is the y component of 
the polarizability tensor averaged over the five mag- 
netic substates, etc. 


1E. U. Condon and G. H. Shortley, The Theory of Atomic 
Spectra. (Cambridge University Press, Cambridge, England, 
1951). The tensor polarizability is defined on p. 106. Eqs. (a) 
and (2b) are obtained by using the T-operator properties tabu- 


lated in Eq. (11), p. 63. 
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Because of the assumed congruency of the electric 
and magnetic fields, we may write 6///6s=kH and 
0E/02— kE, where k is a geometry factor. Substituting 
for the gradients in terms of the fields, the balance 
condition, Eq. (4), can be rewritten as 


a (my) = per (ms)H/ Ee. (6) 


The measurement of a@.(m,) is therefore independent 
of velocity, and specific values of the field gradients and 
geometry, except in the determination of E from the 
applied voltage. H can be accurately determined by 
means of a "zero-moment" experiment,* and pett) 
is an accurately known function of H. 

Relative measurements, using a normalizing element 
whose polarizability is known, can also be performed. 
In the present work, «.(1) and a.(2) of metastable *P» 
argon were measured in terms of the polarizability of 
metastable *$; helium, which has been calculated? 
(46.6 107?* cm?) to an accuracy of about 1%. Substi- 
tuting for the magnetic field in terms of a(He) in Eq. (6) 
we obtain 


a (mz) 9 pets (ms) / 2us( V te/ V Y'a (He) , (1) 


where yo is the Bohr magneton, Vre is the voltage at 
balance for metastable helium and V is the voltage 
balance for the argon substate. 

Although the balance condition is valid for beams of 
infinitesimal width, Salop et al? have shown that it is a 
good approximation for the narrow beams used in the 
present experiment. 


APPARATUS 


Figure 1 is a schematic diagram of the apparatus. 
The metastable beam is formed by effusion of argon 
into a region where it is cross fired by an electron beam. 
It then passes through a narrow collimator into the 
E—H field region, and is finally incident upon a 
tungsten surface which serves as the cathode of a 
Bendix electron multiplier. The metastable *P2 and 
3P, atoms are capable of ejecting electrons from the 
tungsten cathode, and these electrons are multiplied 


3 V. W. Cohen, Phys. Rev. 46, 713 (1934). 
? Charles Schwartz (private communication). Also, A. Dalgarno, 
Proc. Phys. Soc. (London) 72, 1053 (1958). 
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Fic. 1. Schematic diagram of the experiment. 


and counted using a preamplifier and scalar. Metastable 
helium is produced and detected in a similar fashion. 

The main chamber contains the magnet pole pieces, 
which also act as the electrodes for the inhomogeneous 
electric field. They are insulated from the magnet yoke 
by a rectangular glass tube which also serves as the 
vacuum envelope. The electric and magnetic fields 
between the pole pieces are congruent, since the 
boundary conditions which they obey are almost 
identical if the pole pieces are of high permeability 
and the magnetic fields are sufficiently low to avoid 
saturation effects. 

The pole pieces are 14 in. long, and the ratio of 
gradient to field is 2.5 cm! at the beam position, 
and the gap width is 0.056 in. The beam height as well 
as its location relative to the pole pieces was fixed by 
a 0.002-in. X0.050-in. collimating slit mounted directly 
in front of the pole pieces, at a distance of 0.025 in. from 
the convex edge. A beam-height limiter 0.060 in. high 
was mounted at the detector end of the pole piece 
assembly. 
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Fic. 2. The electron gun used for metastable production. 


A Sensitive Research electrostatic voltmeter was used 
to measure the applied voltage. Prior to being put 
into service, its calibration was checked at the Nationa] 
Bureau of Standards. Subsequent calibrations were 
performed by the manufacturer. The instrument 
maintained its calibration to within 0.1% throughout 
the course of the present work. 

Figure 2 shows two views of the electron gun.” 

The gun makes use of a Philips type A cathode, which 
stands up comparatively well to ion bombardment 
which results from its operation. The entire gun is 
mounted between the pole faces of a magnet which 
produces a field of approximately 1000 G parallel to the 
electron beam. Recoil broadening was essentially 
eliminated by the use of a 0.002 in. wide slit mounted 
at the beam exit of the gun. 

The usual operating voltages were 23 V for argon 
and 26 V for helium, with a current of about 30 mA. 
At these potentials the photon background was negli- 
gible, although it rose rapidly with increasing voltage. 
For example, at 30 V using argon, about 6% of the 
detected beam was due to photons. The full beam 
intensities were typically about 1000 and 200 cpm for 
argon and helium, respectively. Dark current noise 
from the multiplier corresponded to only several cpm 
and so reasonable signal-to-noise was achieved by 
counting for several minutes, at each pair of E and H 
field values. 


MEASUREMENTS 


Measurements were made of the beam intensity as a 
function of voltage across the electrodes, for a fixed 
value of magnetic field. Figure 3 presents such a run 
taken with argon at a field of approximately 260 G. 
The beam intensity when V=0 is due primarily to 
those atoms with perr=0, namely, those in the *Po 
metastable state, and those with my=0 in the *P» 
metastable state. As voltage is applied across the pole 
pieces, the beam intensity decreases at first as the atoms 
without magnetic moment are deflected out by the 
electric field. The first peak is in balance at about 11 
kV and the second peak at 17 kV, and any further 
increase in voltage results in a monotonically decreasing 
beam intensity. The curve shape is similar to that of a 
“Zero-moment” curve.’ Figure 4 shows the region of 
the single peak for a similar experiment performed with 
metastable helium at a magnetic field of about 300 G. 

Values of the polarizability ratio for the my=-++1 and 
mz=+2 substates of argon are obtained using 8 


10 Tn the early stages of the experiment an rf discharge was used 
for metastable production instead of the electron beam. The 
discharge was pulsed, and a suitably timed gate circuit was use 
to discriminate the detector against prompt-arrival photons, 1n 
favor of the more slowly traveling metastables. Further develop- 
ment of this method was forestalled by satisfactory operation of 
the electron gun. Nevertheless, the pulse discharge could certainly. 
be improved, and may ultimately prove superior to the gun 
method in some cases. 
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modified form of Eq. (7). Thus, 
a2(-+1)/a,(+2) = 4 (V2/V1)?, (8) 


where the subscripts specify the substates. Absolute 
values, normalized to helium are 


a:(+1)=$(1.5) (Vite/V1)?X46.6K10- cm?, (9) 
a:(+2)=$(Vite/V2)?K 46.6 1074 cm?. (10) 


The data are sufficient to determine @.(m,) only if 
the magnetic substates associated with the several 
peaks can be definitely identified. For example, from 
the positions of the peak in Fig. 3, a ratio ayz/a2: = 1 can 
be inferred if a1: represents the my=-++1 substate or 
12/022 =} if o; represent the m — 4-2 substate. (Here, 
the subscripts 1 and 2 refer to the peaks occurring at 
the lower and higher voltages, respectively.) It is very 
unlikely that the ratio should differ greatly from unity, 
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Fic. 3. Beam intensity as a function of applied voltage for 
metastable argon, at a magnetic field of about 260 G. 


since the valence electron, which contributes the 
principal amount to the polarizability, is in an s state. 
Nevertheless, a subsidiary experiment was performed 
in order to definitely identify the substates. 

The first peak was brought into balance at a mag- 
netic field of about 260 G and a voltage of 11.1 kV. A 
beam profile was taken for the balanced substate in 
steps of 0.001 in. about the beam maximum. The 
voltage was then dropped to 9.9 kV and a similar 
profile taken. These two profiles are shown in Fig. 5 
The apparatus as used here functioned as a substate 
selector, and in addition as a relatively crude de- 
flection polarizability ae i Wee 
i i vas performed on the secon 
eeu e Saree balance and at 1.2 kV below 


eak. Beam profiles at 
oe were CA Figure 6 shows the results of the 


second deflection experiment. 
An atom experiencing a constant force F transverse 


to its direction of motion in an atomic beam sodes 
a deflection s—cF/9?, where c' is a constant and v is the 
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Fic. 4. Beam intensity as a function of applied voltage for 
metastable helium in the vicinity of the single-balance peak, at a 
magnetic field of about 300 G. 


speed of the atom. In the deflection experiment just 
described, F is proportional to e;(Es4-0E)— peHo, 
where Eo and Ho are the balance electric and magnetic 
fields, and 8E, the change in electric field. Dropping 
the second-order term, F is proportional simply to 
a:E@E, or s=ca;E/v, where c is a constant which 
includes the o£. 

Since ôE is the same for the first and second deflection 
experiments, the ratio of the deflections is given by 


51/537 (e12/02:) ( Vy/ V2) . 


Now it can be shown" that for deflections small com- 
pared to the detector width, a triangular beam is dis- 


480 ~,003" 


460 


Fic. 5. Beam pro- 
files for metastable 440 
argon of the first 
balance peak, at a 
magnetic field of 
about 260 G. The 
left-hand curve is 
taken exactly at bal- 
ane V=11.1 kV 
The right-hand curve 
is taken slightly off 
balance with V=9.9 
kV. The displace- 
ment of the peak of 
approximately 0.003 360 
in.representsa rough 
measurement of the 


380 


COUNTS PER MINUTE 


polarizability, and 340 
therefore serves to 
identify the balanced 
substate as my — 4-1. 320 


o 24 6 8 1 12 
DETECTOR POSITION IN .OO! INCHES 


11 George E. Chamberlain, Ph.D. thesis, Yale University, 1961, — 
p. 83 fi. (unpublished). 
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'TABLE I. Summary of experimental results. d 
Approximate Balance Magnetic 
d magnetic voltage substate and 
lic. 6. Beam pro- field (gauss) (kV) element az (my) a:(-+1)/a:(+2) 
seo 004" files for metastable 
XD argon of the second 200 9.9 +1(Ar) 1.13 
F balance peak at a 200 14.9 +2(Ar) 
U yao magnetic field of 260 11.2 +1(Ar) 51.5 1.14 Lo 
É [ about 260 G. The 260 16.9 --2(Ar) 45.3 
z left-hand curve is 260 13.6 +1(He) 
= L taken exactly at bal- 300 123 +1(Ar) 49.3 1.13 
co ance with V—16.9 300 18.5 +-2(Ar) 43.8 
ai kV. The right-hand 300 14.6 -F1(He) (46.6)^ id 
D an curve is taken : 
z SUY os STER a Calculated by Schwartz and Dalgarno (see Ref. 8). 
o The displacement of 
280 the peak of ap- 
proximately Mum in. combined average of the results taken at magnetic fields ^ 4 
represents a rough 5 
260 re eeement of the Of 200, 260, and 300 G. We find , 
polarizability and N Ee z 
therefore serves to a; (4-1) = (50.4+3.5)X 107 cm, 
identify the balance e 
E substate as my= +2. a-(+2)= (44.543.1)X 107* cm?, 
Á 2)]2 l 
Eu Li [o (2- 1)/a. (2-2) ]=1.13+0.08. 
DETECTOR POSITION IN .00! INCHES Inserting the above into Eq. (5), and adding errors by 
the rms method, 
` 3 x =S / —24 3 
placed with an unaltered profile. That is, velocity a; (0) — (52.42:4.8) X10™ cm®, 
broadening of the beam shape is negligible. Our de- o (2:1) 5 (47.254-2.3) X 10?! cm’, 
flections are actually comparable to the beam width, so x z T 
SU : 2(2) = (50.42-3.5) X 10-4 cm? 
that some velocity broadening should occur. Neverthe- ede ) e 
less, the beam shape will be only slightly altered, and it a2(0)=a,(0)= (46.4 2.4) X 1077 cm. 
mie Rid le to C jore ae Pa E or Because of the generally low counting rate, statistical 
$ EN Ds Do d ER EU See at tae x fluctuations contributed significantly to the over-all 
Dm E È E REM 6 dee mM io. error. The range of standard deviations of each point 
x EE Em de dh Pe s 3 nE ; " E s € was between 1% and 3%, depending upon the duration 
BOR a D QDOME VEU E MD ese of the run and the magnitude of the beam intensity, 
values are consistent with the assumption that the ith the helium data acil having the larger devia- 
my=+1 substate comes into balance at the lower um SDN SEND oed Ses were somewhat 
voltage. 3 ; paisa ^ 
Dur led , ce broader than the helium peaks, owing to the greater 
etailed measurements were made in the vicinity overlapping of the three substates my=+2, +1, 0 of s 
of the balance peaks. A normal run consisted of counting 3p, argon, combined with the presence of the single 
for several minutes at 100-V intervals, usually so as to E of 3P, argon, as compared to the presence of i 
register several thousand counts. The results are e o abse m, d +1, 0 of the 3S; state of helium, = 
presented in Table I, where the balance voltages for ominai] unin ule pene Gi whe sual Sos gi > 
each of the measurable substates of argon at 200, 260, , S 
and 300 G, and for the single measurable substate of p: : 
q i Among the other causes of experimental error are the 
T Dr at a Eo e D G ay Epist Also taloulaiees! following : an apparent shift in the balance peak due to 
UO the polanzabiity ratios calculated by, fast atoms from unbalanced substates; fluctuations of 
T IN g "t 9 yat i GE ELE Hes tecih electron gun current, source pressure, and magnetic and 
aodo tne absoute values zormas] go Hilfen esi ue electric fields during the course of a run; measurement j 
o eo o do yeu ated using Pus 0) and (10). of the voltage using the NBS calibrated voltmeter ; Jack | 
i i t to the 
SUMMARY AND D of perfect parallelism of the beam with respect to © | 
IOWIEION OF RESULTS pole face; and deviations from the balance condition | 
The results of our measurements of the z component over the finite beam width. A detailed discussion of | 
of the electric polarizability tensor of the »;— 4-1 and most of these errors is contained in Ref. 3. i 
mys=+2 substates of *P» metastable argon are pre- We make an over-all assignment of +7% to the | 
sented below. The absolute values representa combined error in measuring œ:(+1) and a-(+2), based upon — — 
average of the results taken at magnetic fields of 260 conservative estimates of the above error sources. j 


and 300 G. The ratio measurement represents a 


It was to be expected that the polarizabilities of 
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*P2 argon would be quite large, since its structure is 
grossly similar to that of ground-state potassium, which 
has a polarizability of about 36x 10-^* cri. Further 
since the valence electron contributes the major 
amount to the polarizability, the polarizability ratio 
a:(+1)/a:(+2) should be quite close to unity (The 
valence electron is in an s state.) 

The P configuration of the atom is due to the five b 
electrons of the incomplete M shell, whose contribution 
to the polarizability should therefore contain a strong 
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anisotropy. However, since the contribution of the core 
to the total polarizability is small, the net anisotropy 
should also be small. 
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Microwave, optical spectrometric, and interferometric techniques have been used to investigate the 
nature of the electron-ion recombination process in low-pressure, pure helium afterglows at 300 and 77^K. 
To determine whether dissociative capture of electrons by helium molecular ions, i.e., Hes*-+e — He* 
+He+ (KE), is the recombination reaction taking place, we have attempted to detect the kinetic energy of 
dissociation of the excited atoms produced in the recombination process by a study of the widths of the 
emitted afterglow lines. A model of the low-pressure helium afterglow which reasonably accounts for our 
electron density, line intensity, and linewidth observations involves principally the creation of He* ions and 
electrons by the collisions of pairs of helium metastable atoms, the three-body conversion of He* ions to 
Hez* ions, the ambipolar diffusion of Het, Hes*, and electrons, and the diffusion of helium metastable atoms 
to the walls. The 45876 radiation (33D— 23P transition) from the later part of the afterglow, when recombina- 
tion of Hes* and electrons is expected to be the source of the excited atoms, exhibits a substantial broaden- 
ing over the approximately thermal width observed during the discharge and in the early afterglow, when 
the radiation is expected to originate from electron impact excitation of helium atoms. It is concluded that 


the dissociative recombination process is the likely source of the observed afterglow line broadening. 


I. INTRODUCTION 


UMEROUS experimental studies of the after- 
glows of pulsed microwave discharges'~* have 
shown that volume electron-ion recombination appears 
to be an important or dominating process in the re- 
moval of free electrons from ionized gases for several 
different gases. The priacipal technique in these experi- 
ments has involved the determination of spatially aver- 
aged electron densities during the afterglow by means of 
microwave probing techniques (see part I of this paper,® 
hereinafter referred to as I) and an analysis of the time 
dependence in terms of solutions of the equation of con- 
tinuity for electrons subject to losses by recombination 


* Present address: Physics Department, Thiel College, Green- 
ville, Pennsylvania. 
T Physics Department, U 


Pennsylvania. 
1M A. Biondi and S. C. Brown, Phys. Rev. 76, 1697 (1949). 


2 R. B. Holt, J. M. Richardson, B. Howland, and B. T. McClure, 
Phys. Rev. 77, 239 (1950). z 
SM. A. Biondi AS T Holstein, Phys. Rev. 82, 962 (1951); 
M. A. Biondi, ibid. 83, 1078 (1951). te 
1A. C. Faire and K. S. W. n Phys. Rev. 113, 1 (1959). 
* M. A. Biondi, Phys. Rev. 129, 1181 (1963). 
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and diffusion. This analysis, as commonly employed, is 
outlined in Eqs. (3)-(6) of I. 

In the rare gases neon and argon, it has been possible 
to obtain experimental data consistent with the recom- 
bination solution, 1/7.=[1/n-(0)]+at, and thus to 
evaluate the recombination coefficient œ with reason- 
able precision. In helium, however, due to the larger 
ratio of diffusion loss to recombination loss it has not 
been possible to obtain such a clear distinction. In ad- 
dition, the helium measurements have been complicated 
by the fact that at pressures of only a few mm Hg the 
atomic spectra, which are presumably related to the 
recombination process, give way to molecular spectra of 
excited Hes. Thus, serious doubts have been cast on the 
validity of measurements of the recombination coeí- 
ficient in helium by means of the afterglow time depend- 
ence studies.® 

By means of a set of controlled mixture experiments 
in helium and argon, it has been shown (see I) that the 
large recombination coefficients observed in the pure 
gases do not appear under circumstances that inhibit 


e C. S. Leffel, M. N. Hirsch, and D. E. Kerr, Ann. Phys. (N. Y.) 
(to be published). 
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the formation of diatomic molecular ions, thus support- 
ing the hypothesis of Bates? that the process under ob- 
servation is the dissociative recombination of molecular 
ions with electrons in the reaction, 


Xj-be— X*+-X-+kinetic energy. — (0) 


This naturally leads to the supposition that the same 
process is responsible for any large recombination loss 
occurring in moderate-pressure helium afterglows. 

The purpose of the experiments reported in this paper 
has been (a) to establish definitely the presence or 
absence of the dissociative recombination process in 
pure helium afterglows at low pressures (2 mm Hg or 
less) by a technique which avoids the complexity and 
uncertainty of interpretation inherent in the microwave 
electron-density determinations under mixed diffusion 
and recombination conditions, and (b) to provide an 
estimate of the magnitude of the recombination coef- 
ficient, if possible. The principal new technique intro- 
duced is the interferometric measurement of the inten- 
sity profile of an optical emission line of the helium 
atomic spectrum. 

The significance of this measurement can be under- 
stood by reference to Fig. 1 and to Eq. (1) and from the 
following considerations. The dissociative process can 
be efficient only if it is exothermic, i.e., the transition 
from the bound state A to the dissociated state B must 
be accompanied by the conversion of potential energy 
into kinetic energy in a significant amount. This con- 
verted energy is shown as the vertical scaled length 
Ep in the figure. The kinetic energy so produced is 
shared by the fragments which fly out from the dissoci- 
ating system. In the case of a homonuclear diatomic 


Potential Energy 


X+X 


Fic. 1. Schematic representation of the interatomic potential 
curves of interest in the dissociative recombination of X2* ions 
with electrons. The upper potential curve is that for a molecular 
ion and an electron at rest at infinity. 


7D. R. Bates, Phys. Rev. 77, 718 (1950); 78, 492 (1950); 82, 
103 (1951). 
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molecule, this energy of dissociation p is shared equally 
by the two identical atoms. Also, in the case of helium, 
the relative energies of the molecular-ion ground state 
and the molecular and atomic states are such that one 
of the atoms will almost certainly be produced in an 
electronically excited state. 

If this excited atom is produced in a radiating state 
of sufficiently short lifetime, it can be expected to radi- 
ate some of its excitation energy before it has time either 
to lose its kinetic energy through collisions or to lose 
its excitation energy by resonant transfer to a slowly 
moving atom. If the atom does radiate before experienc- 
ing energy loss of either kind, the emitted photon will 
be shifted from its normal wavelength by the Doppler 
effect due to the atomic velocity. Since there is no pre- 
ferred direction for dissociation, the velocities of the 
dissociating atoms will be randomly directed, and conse- 
quently, the observed effect will be a Doppler broaden- 
ing of the atomic spectral line in excess of the broadening 
due to the temperature of the molecular ions by an 
amount corresponding to the kinetic energy Ep/2. The 
direct observation of such anomalous line broadening 
may be taken as positive evidence of the predominance 
of the dissociative recombination process in the produc- 
tion of the afterglow emission spectrum. 

For this type of measurement helium has been chosen, 
rather than neon or argon, primarily because of the 
simplicity of its optical spectrum and the short lifetimes 
of some of the radiating states. Since the analysis in- 
volves considerable computation, a single line (A5876, 
corresponding to the transition 3*D—2 3P) has been 
chosen for study. 

The linewidth studies have been correlated with elec- 
tron-density measurements and with time-resolved 
emission spectra. In addition to the recombination study 
itself, attention has been given to the early part of the 
afterglow preceding the “recombination” period which 
has been the subject of most previous investigations. 
An attempt has been made to formulate a complete 
model of the afterglow processes for a simple case. 


Il. EXPERIMENTAL SYSTEM 


The complete experimental arrangement is shown 
diagrammatically in Fig. 2, except for the ultrahigh- 
vaccum gas handling system. The apparatus shown may 
be regarded as comprising three main units: microwave, 
spectrometer, and interferometer systems. 


A. Microwave System 


At the center of the figure is the microwave resonant 
cavity which encloses the discharge vessel and has hollow 
walls which may be filled with liquid nitrogen. A re- 
peated, pulsed discharge is produced in the quartz 
discharge vessel by power from the magnetron. Follow- 
ing this pulse a very low power probing signal is applied 
from the klystron. Power reflected from the cavity 
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Fic. 2. Block diagram of the combined microwave, optical spectrometric, and optical interferometric 
systems used to study afterglows. 


resonator is detected by the crystal detector and passed 
through an amplifier, which suppresses the strong 
magnetron pulse. The resulting signal is displayed on a 
synchronized oscilloscope. 


B. Spectrometer System 


A spectrometer port is provided by a narrow slit 
which tunnels through the double wall of the cavity 
resonator and an evacuated double glass window which 
tunnels through the foam insulation. The spectrometer 
has a synchronous prism table drive motor and an elec- 
tronically gated multiplier phototube detector, whose 


output goes to an integrating filter, dc electrometer and 


synchronously driven strip chart recorder. 


C. Interferometer System 


A second small opening in the cavity wall, similar to 
the spectrometer port but circular in cross section, pro- 


Fic. 3. Detailed cross section view 
of the photoelectric recording, Fabry- 
Perot interferometer. (See text for 
explanation.) 


8 M. A. Biondi, Rev. Sci. Instr. 27, 36 (1956). 


vides a port for the Fabry-Perot etalon. The etalon and 
detection system used here? is similar in principle to that 
used by Jacquinot and Dufour.’ It is illustrated in more 
detail in Fig. 3. 

The mechanical support and adjustment of the etalon 
plates (F) is provided by means of a conventional ar- 
rangement of invar spacers (G) and spring mounted 
compression pins (H) similar to those described in 
textbooks," with modifications as required to permit 
operation through the walls of the evacuated housing 
(C). The photoelectric detection, integrating, and re- 
cording system is identical with that of the spectrometer 
system. 

In use the etalon, Fig. 3, is placed so that the focal 
plane of the collimating lens (A) is near the center of 
the microwave cavity, i.e.,in the middle of the discharge. 
The collimated beam passes from the lens through a 
glass color filter and multilayer interference filter (B), 


? P. Jacquinot and C. Dufour, J. Rech. Centre Natl. Rech. Sci. Lab., Bellevue (Paris) 6, 91 (1949). 


10 See, for example, S. Tolansky, 


High Resolution Spectroscopy (Methuen and Co. Ltd., London, 1947). 
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through the optically clear vacuum window (D), the 
etalon plates (F), and the vacuum window and focusing 
lens (L). Here it is focused on an opaque disk (N) hav- 
ing in it a circular aperture of 0.051 cm diam which is 
centered on the interference pattern by the micrometer 
carriages (M) to within 0.001 cm. This aperture permits 
a very limited angular range of incident light, including 
a portion of the central spot of the interference pattern, 
to pass through to the photocathode. 

From simple considerations of geometrical optics, 
one can calculate the luminous flux reaching the cathode 
of the multiplier phototube, assuming lossless optical 
elements. This calculation, together with rough measures 
of the absorption losses and the photocathode quantum 
efficiency, forms the basis for estimating absolute in- 
tensities in the afterglow. 

'The observed resolution of the etalon in practice is 
two parts in 10°, in approximate agreement with the 
calculated value. The experimental value of resolution 
is based on measurements made (a) with a microwave 
discharge in argon containing a trace of isotopic Hg? 
using the narrow Hg line at 5461 À as a standard, and 
(b) on the assumption that the smallest attainable 
values of linewidth in helium are due solely to thermal 
Doppler broadening corresponding to the ambient 
temperature. The values deduced by the two different 
methods are in good agreement. 

A wavelength scanning system with linear time base 
is provided for the etalon by continuously increasing 
the refractive index p in the space between the etalon 
plates (see Fig. 2). This is achieved by first evacuating 
the interior of the sealed etalon housing with a vacuum 
pump to provide a starting point and then admitting 
gas through a preset needle valve connected to a source 
of argon at 200 atm pressure. The gas density in the 
etalon rises slowly from zero to slightly over one atmos- 
phere, at which point further increase is interrupted 
by the opening of a safety valve. This system offers a 
uniform scanning rate over approximately 10 fringes. 
Since the synchronous motor drive on the recorder pro- 
vides a linear time base for the chart paper, one obtains 
a linear presentation of intensity as a function of fre- 
quency (wavelength) within each interference order, 
thus giving us the desired line shapes directly. 


D. Vacuum System 


The gas is admitted to and removed from the quartz 
discharge vessel through a small bore tube which passes 
directly out of the cavity resonator through a close 
fitting hole at the upper end of the cavity axis. Through 
this tube the discharge vessel is connected to an ultra- 
high-vacuum gas handling system’! with a bakeable 
oil manometer” and a copper isolation trap.? The dis- 


1 D. Alpert, J. Appl. Phys. 24, 860 (1953); Handbuch der 
Physik (Springer-Verlag, Berlin, 1957), Vol. 12. 

12M. A. Biondi, Rev. Sci. Instr. 24, 989 (1953). 

13 D, Alpert, Rev. Sci. Instr. 24, 1004 (1953). 
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charge vessel itself is isolated from the rest of the vac. 
uum system by a liquid-nitrogen trap. 

The helium gas for these experiments is prepared by 
distillation from the liquid into 1-/ gas flasks. The flasks 
connected to a special glass manifold which can he 
opened under liquid helium, are first prepared by being 
outgassed under ultrahigh-vacuum conditions; then 
the manifold is sealed off from the outgassing system 
under vacuum and transferred to the liquid-helium 
storage Dewar. After being filled by distillation from 
the liquid helium, the flasks are individually sealed off 
from the special manifold and sealed on to the operating 
vacuum system. After a bake-out of the whole system 
at 400°C overnight and the attainment of a vacuum level 
below 10-? mm Hg, the sealed flask is opened. By com- 
parison of visible spectra of our prepared pure helium 
with those of mass analyzed samples of reagent grade 
helium, it appears that in all gas samples used the im- 
purity level has been less than one part in 10? and too 
small to detect. 


II. EXPERIMENTAL AND ANALYTICAL PROCEDURES 
A. Microwave Procedures 


The average electron density as a function of time is 
determined by the technique of microwave reflection 
from the cavity resonator. This technique has been 
discussed in detail elsewhere and will not be reviewed 
here." 

In view of the high Q of the cavity resonator, it is 
not surprising that the growth of the discharge in re- 
sponse to the applied power should be quite complicated. 
The initial conditions of the afterglow seem to be most 
sensitive to the length of the power pulse and less sensi- 
tive to the repetition rate and power delivered from the 
magnetron. With the magnetron frequency set slightly 
above the resonant frequency of the cavity without 
discharge, the buildup of plasma density during the 
power pulse shifts the cavity resonance through the 
magnetron frequency to a higher frequency, at which 
point further increase in plasma density is prevented by 
the self-detuning of the cavity toward still higher fre- 
quencies. Observations have shown that in this system 
the discharge has passed through a maximum of light 
output and settled down to approximately the steady- 
state emission rate after about 40 usec. Therefore, this 
value of the pulse duration has been set as an arbitrary 
standard for most of the experiments. Experience also 
showed that an afterglow period of about 2 msec 16 
the least time in which the essential features of the time- 
dependent afterglow processes can be exhibited and, 
therefore, this has been used as the pulse repetition 
interval in most of the experiments. 


B. Spectrometric Procedures 


Using the high-speed optical spectrometer with gated 
multiplier phototube and recorder, comparisons have 


4 M. A. Biondi, Rev. Sci. Instr. 22, 500 (1951). 
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been made of the intensities of several atomic lines at 
various times in the afterglow. The synchronous motor 
drives on the chart recorder and the spectrometer prism 
table provide a direct conversion of wavelength to chart 
length scale. By setting the electronic gate to any 
desired duration and to any desired onset delay relative 
to the end of the discharge, the afterglow spectrum has 
been recorded during each of several short periods 
(durations of 50-500 usec) in the afterglow. Also, by 
stopping the prism table drive on the peak of any desired 
line and periodically changing the onset delay time of 
the gate, the time dependences of several individual 
lines have been traced. Attention was centered mainly 
on the diffuse series lines 44472 and 4922 (4D-2P) 
and A5876 and 46678 (3D-2P), although other lines of 
the sharp, principal, and diffuse series were also 
monitored. 

Even at the low pressures of these experiments the 
spectrometer indicates the presence of some helium 
molecular-band intensity, which the etalon cannot re- 
solve but shows as a “white” background to the desired 
atomic line emission. Therefore, the spectrometer has 
been used to determine the relative strengths of the 
bands that fall within the transmission range of the 
multilayer interference filter used with the etalon. The 
transmission of the interference filter together with its 
associated color filter was determined by comparing 
the spectrum of an incandescent lamp with and without 
interposition of the filter. Although the spectrometer 
does not resolve detailed structure of the bands and 
records atomic lines only as impulses whose shapes are 
fixed by the time constant of the electronic integrating 
circuit, the relative total strength (intensity integrated 
over wavelength range) of a band to a line is given by the 
ratio of the areas under the respective recorded contours. 
This information has been used to correct for the ap- 
parent line broadening which would result from the 
superposition of the band radiation on that of the line 
recorded by the etalon. 


C. Linewidth Measurements with the Etalon 


Several factors enter into the production of the ob- 
served atomic linewidth. Some of these factors are 
analytically separable; some must be treated as they 
appear. We now outline the analysis in terms of which 
the line broadening measurements are interpreted. 


1. Single Component Line Shape due to Temperature 
and Dissociative Recombination 


We assume that line broadening is entirely due to 
Doppler effects arising from temperature motion and 
from dissociation of the excited molecule. The kinetic- 
energy distribution of the diatomic ions is assumed to 
be Maxwellian at the gas temperature, so that it may 
be written 


Fo(0)= (8/2)? exp(—8#*), (2) 
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where 8— M/ET, M being the mass of a single helium 
atom. The problem is further simplified by assuming 
that all dissociations leading to a given excited atomic 
state originate in a single specific molecular configura- 
tion, and, therefore, release a definite total kinetic 
energy Ep which is shared equally by the two dissoci- 
ating atoms. Thus, the speed of the excited atom relative 
to the center of mass of the diatomic ions is 
7;— (E p/ M)'?. The direction taken by the atom is as- 
sumed to be randomly oriented with respect to the ob- 
server, so the distribution F; in velocity relative to the 
center of mass is simply a unit impulse function (ô 
function) of speed at vj. 

For the computation of Doppler shift we are inter- 
ested only in the distribution in speed along the ob- 
server's line of sight (the z axis). For the thermal dis- 
tribution we get by integration over the x-y plane 


Fo(v.)= (8/7)? exp( —8v2), (3) 
while the integration of F, over the x-y plane yields 
F,(0,)= 1/2, (4) 


for —vı € v, v; and zero elsewhere. 

The two distributions Fo and F, are combined in the 
following way. For every v, at which atoms are repelled 
from the center of mass there is a complete Maxwellian 
distribution of velocities of the center of mass, giving 
rise to a group of atoms having a Maxwellian distribu- 
tion centered on the value v/. That is, the number of 
atoms in the speed range dz,’ at v,—;' is 


dG= Fo(v,—/)F1(v7)do7 . (S) 


The total resultant distribution in v. is then obtained 
by integrating Eq. (5) over the entire range of vy. If 
we first substitute for v everywhere the corresponding 
Doppler wave number shift, = řov/c, and simplify nota- 
tion by putting a= (Mc*/kT)?/ys and b= (Ep/ET)^^, 
we obtain as the resultant intensity contour (normalized 
to unit area) 


G(?7)— (a/Ab)Lerf (as-1- b) — erf(av— D) J. (6) 


Analysis of G(7) shows that at the center of the line 
(7 much smaller than %;=i0,/c) the slope of G(7) 
equals the slope of Fo(#) multiplied by exp(—Ep/kT). 
If Ep is reasonably large compared to £T, the line shape 
G(?7) will be tend to become very flat topped. 


2. Insirumental Broadening of the Single Component Line 


The fringe shape for monochromatic light of wave 
number # passing through an etalon at normal incidence 
is given by Airy's formula'? 


I=T(1—R [14K sin'(2zut2)]7, (7) 


where T is the transmission and R the reflectivity of 
the coatings on the etalon plates, » is the index of 
refraction of the medium between the plates, ¢ the plate 
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spacing, and K=4R/ (1—R)*. For present purposes we 
will omit the intensity reduction factor 7?(1— R)? and 
write for the normalized etalon transmission function, 


H(f)- (12-K sint f), (8) 


in which f=2ui7. The quantity f is the interference 
order number; it is used here as a dimensionless fre- 
quency variable measured in units of the free spectral 
range of the etalon. This is a simple scale change, which 
does not alter the form of G, Eq. (6), and provides 
a convenient unit for discussion of the etalon 
measurements. 

The line shape resulting from the transmission of the 
line G(f) through the etalon is obtained by considering 
the line as the superposition of fringes of all the mono- 
chromatic elements whose integral over frequency con- 
stitutes G(f). Thus, if the etalon is set or “tuned” for 
a frequency f (i.e., the interference order is integral for 
f), then light at frequency f’ produces a fringe intensity 
contribution of G(f)H(f— f')df'. The total resultant 
intensity transmitted at a given etalon setting is then 


o(f)= ] aug par. (9) 


This integral requires numerical computation. It has 
been computed at intervals of f— 0.01 for several values 
of T and Ep, initially by manual tabulation and then 
more extensively by the computer facility of the 
Mathematics Department of the Westinghouse Re- 
search Laboratories. From the results so obtained, it 
was evident that no simple procedure would satisfactor- 
ily separate the instrumental contribution from the 
actual line broadening. For this reason the computed 
over-all profiles are compared directly with the ob- 
served total linewidths. 


3. Choice of the X5876 Line and the Shape of the 
Triplet Line in the Etalon 


| Although the singlet lines offer the obvious advantage 
of simplicity in their lack of fine structure, the A5876 
(83D—2*P) line was selected because it has the short- 
est lifetime among all the visible transitions (Trad 
—1.34X10-3 sec), is well isolated from neighboring 
| lines, is known to be a bright line in low-pressure 
afterglows, and is in a wavelength region suited to 
| available photoelectric tubes. Moreover, its terminal 
state, 23P, is itself a radiating state and, therefore, 
not sufficiently populated to cause any spurious broad- 
ening by self-absorption. The high radiative transition 
rate of nearly 109 sec was initially considered necessary 
. to assure a high probability of radiation prior to excita- 
tion transfer or kinetic energy loss by collisions, which 
have been estimated to occur at a rate of the order of 
108 per second at a pressure of one mm Hg. Thus, meas- 
urements of line shapes have been carried out at pres- 
sures of a few mm Hg or less. 
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CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


A. BIONDI 


An analysis of the structure of the 15876 line had 
been made by Brochard et al. using an etalon and a 
helium source cooled in liquid helium. Their results 
have been adopted as a basis for the construction of the 
expected line shapes. In our experiments it is not pos- 
sible to separate each of the six components of the line 
because of temperature broadening. The etalon-free 
spectral range has been chosen to m ake the one separable 
component appear as à satellite one and one-half orders 
removed from the main peak composed of the other five, 
The fringe contributions of all six have been added 
graphically over the range of interest in the main peak, 
and resulting widths at half-intensity have been meas- 
ured from the composite graph for a variety of param- 
eter values. The half-widths of these lines have been 
compared with actually observed half-widths to deter- 
mine Ep. 


4. Spurious Line Broadening Effects 


In addition to the temperature and dissociative re- 
combination line broadening effects, there are two princi- 
pal spurious effects which may lead to an apparent broad- 
ening of the observed lines. These are (a) the presence 
of molecular band radiation in the light reaching the 
photocathode, and (b) the effect of electrical integrating 
networks between the phototube and the electrometer. 

Any molecular band radiation which passes through 
the interference filter and the etalon may be assumed to 
consist of numerous rotational transition lines, a fact 
which is corroborated by the shapes of the molecular 
bands as recorded through the spectrometer. These rota- 
tion lines are spaced nonuniformly in frequency and 
irrationally with the etalon-free spectral range. Conse- 
quently, they overlap each other in a random fashion 
and form what may be regarded as “white” light. To 
compare the effect of band radiation with that of an 
isolated atomic line we make the simplifying assumption 
that the etalon transmission function E ( f) is periodic in 
f with unit period, is unity over a range fe equal to the 
etalon half-width, and zero in the remainder of each 
unit range; that the atomic line intensity G(f) is a con- 
stant A over a range fa equal to the half-width of the 
line and zero elsewhere, fa being larger than f, but less 
than unity; and that the molecular band intensity is 4 
constant B over a frequency range f» which is much 
greater than unity and zero elsewhere. With these very 
simple line shapes it can be readily shown that the 
etalon reduces the ratio of total band radiation to line 
radiation by the factor fa. Checks on this procedure by 
graphical area measurements on the carefully computed 
etalon-Doppler lines show that the simple theory 15 
always conservative in that it overestimates the 
molecular background by about 10-30%. The effect 
of a white background on atomic line half-width meas- 
urements is to displace the apparent zero intensity 
reference of the line downward, causing the half-width 


15 J. Brochard et al., J. Phys. Rad. 13, 433 (1952). 
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to be measured too low on the line profile, thereby 
giving too large a value for the half-width. A correction 
for this effect has been made which is accurate for nar- 
row lines and/or small “white” backgrounds and which 
overestimates the spurious broadening for wider lines 
and/or larger "white" backgrounds. Thus, the deduced 
linewidths in the presence of molecular light background 
are smaller than the actual widths. 

The electrical integrating circuits necessary to sup- 
press noise and to convert the time-gated sample signals 
into dc signals for amplification and recording place a 
limit on the speed with which the line profile may be 
scanned. In practice, if the circuits are characterized 
by a response time constant 7 and the line profile is 
traced between half-intensity points in a time At, care 
is taken to assure that A/Z;5r for the narrowest line in 
a set of measurements, leading to a distortion of the 
recorded profile which increases the apparent line 
breadth by less than 2%. The error in measured width 
for broader lines, which take even longer to trace, is 
even smaller. 


IV. MODEL OF THE AFTERGLOW 


The variations of the measured electron densities, 
spectral intensities, and line shapes during the after- 
glow are sufficiently complicated that it is of consider- 
able help to have in mind the relevant atomic-collision 
processes which contribute to their behavior. The fol- 
lowing model of the afterglow is consistent with previous 
researches on helium!*!8 and with the observations 
obtained in the present studies. 

At the low pressures (~1 mm Hg) and moderate 
container size (fundamental diffusion length, A1= 0.735 
cm) used in these afterglow studies, the principal loss 
of the electrons, ions, and metastable atoms is by dif- 
fusion to the container walls. In addition, there is sub- 
stantial production of electrons and atomic ions by 
collisions of pairs of metastable atoms,” 


He¥++He™ — Het+e+ He. (10) 


Since the excitation energy of the metastable state of 
the helium atom is ~20 eV and the ionization potential 
~25 eV, the electrons produced by this reaction have 
~15 eV kinetic energy. As will be discussed more fully 
later, it is to be expected that before this excess kinetic 
energy is completely dissipated by elastic collisions, there 
is a substantial probability that these electrons will 
collide with metastable atoms, raising them to higher, 
radiating states, i.e., 

He-4-e(fast) > He*+e(slow). (11) 


«M. A.B iondi and S. C. Brown, Phys. Rev. 75, 1700 (1949). 
17 AY eed S. C. Brown, Phys. Rev. 86, 102 (1952). 
15 A. V. Phelps and J. Molnar, Phys. Rev. 89, 1202. (1953); 

A. V. Phelps, ibid. 99, 1307 (1955). i ; 
duris e Rev, BS 660 (1952. Contrary to the 

conclusi H. Myers, Phys. Rev. 190, : e 

ROGA dure (Refs. 17-19) strongly supp the ene 

clusion that metastable-metastable opis em sions de 

the production of atomic, rather than molecular, he 3 
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The atomic ions produced in the ionizing reaction, 
Eq. (10), are converted to molecular ions in three-body 
collisions," 


Het-+ 2He — Hez*-- He. (12) 


While the major loss of molecular ions is by diffusion 
to the walls, dissociative recombination between molecu- 
lar ions and electrons is expected to produce radiating 
atomic excited states according to the reaction 


Hez^4-e —5 He*--He. (13) 


If we neglect small terms in the equations governing 
the production and loss of the atomic ions and the 
molecular ions, we obtain the following approximate 
relationships: 


àni/O0tc Da Vni — K amni n, (14) 
and 
àn3/ 0c» D aV? ns4- K n^n; , (15) 


where D,; and Daz are the ambipolar diffusion coef- 
ficients of the atomic and molecular ions of densities 
nı and nz, respectively, is the neutral atom density, 
Nm the metastable concentration, Ke the three-body 
conversion coefficient for reaction (12), and f; is the 
two-body ionizing coefficient for metastable-metastable 
collisions, Eq. (10). It is assumed that the above equa- 
tions apply to thermal energy particles, since the elec- 
tron- and ion-energy decay times are short (3 10 usec) 
compared to the time scale of the measurements. 

Since the molecular-ion concentration remains small 
compared to the atomic-ion density throughout these 
low-pressure afterglows, we may take the electron den- 
sity, nn. Also, at low pressures (<1 mm Hg) the 
metastable atoms decay principally by diffusion to the 
walls; thus, the third term on the right of Eq. (14) may 
be approximated by Bm (0) exp(—2vmt), where vm is 
the diffusion frequency of the metastables. From the 
published values!^-!5 of the atomic ion, molecular ion, 
and metastable atom diffusion coefficients, of K. and 
B; (see Table I), and assuming a metastable atom con- 
centration sufficient to produce the observed maximum 
value for the electron density in the afterglow, i.e., 
114, (0)7-10?? cm7?, one can calculate the afterglow be- 
havior of the atomic ions, electrons, and molecular 
ions. The results of a model afterglow calculation at a 
pressure of 0.54 mm Hg and T=300°K are shown in 
Fig. 4. Starting from a very small number of electrons 


TABLE I. Coefficients used in the afterglow analysis 
(for references, see text). 


He* He.* He (2 3S) 
Diffusion, D 300°K 540 1015 520 
E IT. Hg) 77°K 173 230 130 
Recombination, a (cm/sec) <10 110-8 
Ionization, B; (cm*/sec) wis ett 1x107? 
Conversion, Ke (cm*/sec) 6X10-2 Ue aise 
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Fic. 4. Calculated behavior of various quantities in a helium 
afterglow at 300°K and a gas pressure of 0.54 mm Hg based on 
the model of the afterglow described in the text. 


and atomic ions at the beginning of the afterglow, meta- 
stable-metastable ionizing collisions cause an increase 
in 2, and nı. As the metastable density decreases, lead- 
ing to a smaller ionization rate, the diffusion to the 
walls and conversion of atomic ions leads to the final 
decay of nı and 7e. Meanwhile, the molecular-ion den- 
sity, which also started essentially from zero at the 
beginning of the afterglow, increases as a result of 
atomic ion conversion, reaching a considerably smaller 
maximum value than the electrons somewhat later in 
the afterglow. 

One may account for the intensity variation of the 
afterglow radiation, as follows; in the early afterglow 
when metastable atoms are plentiful and molecular 
ions are absent the radiation should be the result of the 
production of fast electrons by metastable-metastable 
collisions which then excite other metastables to radiat- 
ing states. Thus, the radiation produced by this process 
should decay at three times the metastable atom-decay 
rate. 

Considering the electron’s loss of kinetic energy at 
elastic collisions, it can be shown that approximately 
6X10 collisions are required to reduce the electron's 
initial energy from ~15 eV to the threshold value, 
~3 eV, for exciting 23S metastable atoms to the ob- 
served radiating states (e.g., 33D). For the inferred 
initial metastable density of ~10!? cm-?, the observed 
early radiation intensity can be accounted for, provided 
that the average cross section for excitation of the 
metastable state to a radiating state over the electron 
energy range from 3-15 eV is comparable to the cross 
section for scattering of electrons by normal atoms,” 
i.e., ~4X 1075 cm?. Although there are no direct meas- 
urements of electron impact excitation of metastable 
states, such a cross section does not seem unreasonable, 

since it is substantially less than the theoretical maxi- 
mum value? for such a process. The predicted shape of 


2 R. B. Brode, Rev. Mod. Phys. 5, 257 (1933). 
7 H. S. W. Massey and E. H. S. Burhop, Electronic and Ionic 
Impact Phenomena (Clarendon Press, Oxford, 1952), p. 144. 
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the early afterglow radiation (metastable excitation 
intensity) is indicated in Fig. 4. 

In the late afterglow, when the molecular ions have 
reached an appreciable concentration and the meta. 
stable atoms have diffused away, the radiation should 
be the result of excited states produced by dissociative 
recombination, reaction (13). The intensity of this radia- 
tion should, therefore, be roughly equal to (anana), 
where a is the dissociative recombination coefficient, on 
the assumption that recombination predominantly pro- 
duces the desired state either directly or by cascading 
from higher states. The recombination radiation in- 
tensity curve of Fig. 4 was predicted from the computed 
electron density and molecular-ion density curves and 
an assumed recombination coefficient of 1X108 
cm?/sec.? The total predicted intensity is then the 
sum of the metastable excitation and recombination 
intensities. The absolute intensities measured in the 
late afterglow are consistent with this model and the 
assumed recombination coefficient. However, the dif- 
ficulties of the absolute intensity determinations are 
such that the values may be in error by up to a factor 
of 3. 

It should be noted that, with the aforementioned 
model of the afterglow, the radiation resulting from 
metastable excitation by electron impact in the early 
afterglow should exhibit a thermal spectral linewidth. 
By contrast, the late afterglow radiation, which is as- 
sumed to result from the dissociative recombination 
process, should exhibit a broadened line shape. 


Photocurrent (up amp. ) 
8 


0 0.4 Ose cle 16 
Time (msec) 


Fic. 5. Observed behavior of the electron density and the ma 
tensities of two lines emitted from a helium afterglow at sp 
and p=0.54 mm Hg. The line \5876 corresponds to the 3 3D-2 
transition; 34472 to 4 3D—2 3P. 


2 This value was used as an approximation of the results of 
C. L. Chen, C. C. Leiby, and L. Goldstein, Phys. Rev. 121, 159^ 
(1961), who are the only investigators who achieved recombin2: 
tion control in their afterglow. 
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V. RESULTS AND DISCUSSION 


The observations which led to the construction of the 
afterglow model discussed in the previous section sug- 
gested a natural subdivision of the afterglow into an early 
and a late period, based on the line intensity and line- 
width variations. In Figs. 5 and 6 are shown the varia- 
tions of s, and of the intensities of 45876 and 4472 
observed during afterglows at T=300°K and 0.54 and 
2.05 mm Hg pressure, respectively. It will be seen that 
the observations at 0.54 mm Hg reproduce qualitatively 
the predictions of the model shown in Fig. 4, in that the 
electron density reaches a maximum value at ~300 
usec in the afterglow and the intensity of 44472 exhibits 
a slight hump at ~500 usec. No comparisons with the 
2.05 mm Hg data (Fig. 6) are made, since the simplify- 
ing assumptions used in the model calculation at 0.54 
mm Hg pressure are no longer valid at the higher 
pressure. 

On the basis of the proposed afterglow model one 
should expect significant changes in the character of the 
radiation emitted in the early and in the late afterglow. 
It is noted that in the early afterglow the relative in- 
tensities of the various lines emitted are quite similar 
to those observed during the discharge. In both cases 
the excited states are presumed to be produced princi- 
pally by electron impact excitation of lower excited 
states (ground state or metastable states). From the 
observed intensities in the early afterglow, one deduces 
a population of excited states which decreases mono- 
tonically with increasing principal quantum number, 
n. (Lines from upper states n= 3, 4, 5, 6 were observed.) 
By contrast, in the late afterglow a change in spectral 
distribution is noted, the relative population of the 
states of higher quantum numbers, e.g., n= 5, increasing 
markedly in comparison to the early afterglow values. 
As will be seen in the later discussion, such a behavior 
is consistent with dissociative recombination into higher 
excited states. 

A principal effort of the present investigation lay in 
the determinations of the widths of the 45876 line during 
the discharge and at various times in the afterglow in 
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Fic. 7. Observed linewidth (including instrumental width) 
during the pulsed discharge in helium at 300°K. The linewidths 
are expressed as fractions f of the free spectral range of the 
Fabry-Perot etalon, Ajp,=0,667 cm, The minimum width at 
~3 mm Hg corresponds to the computed width for an excited 
atom temperature of ~300°K. 


order to test the dissociative recombination hypothesis. 
As discussed in Sec. IIIC, the measured linewidths in- 
clude contributions from fine structure, Doppler broad- 
ening, and instrumental broadening. The results of the 
line broadening measurements are, therefore, presented 
in terms of the observed total linewidths, determined 
at the half-maximum intensity points and expressed as 
fractions f of the free spectral range of the etalon, 
Avter= (2ut)=0.667 cm~. The “temperature” of ex- 
cited atoms required to yield the Doppler broadening 
contribution is then determined from the line-shape 
calculations as described in Sec. III C. 
" The observed width of the \5876 line during the 
discharge in helium for various gas pressures is shown 
in Fig. 7. The minimum width observed at ~3 mm Hg 
corresponds to the linewidth calculated for an excited 
atom temperature of ~300°K. The increase in width 
at >3 mm Hg is the result of the appearance of some 
molecular light and pressure broadening effects. The 
increase in linewidth with decreasing pressure below 
2 mm Hg can be shown to result from radiation of the 
excited atoms before they lose their recoil kinetic energy 
gained at the inelastic impact of the electrons having 
more than 23 eV kinetic energy which create the excited 
states. The linewidth data extrapolated to zero pressure 
are consistent with the expected recoil energy of ~0.004 
eV, and the fact that this additional broadening does not 
disappear until ~3 mm Hg pressure suggests that excita- 
tion transfer to slow atoms and loss of kinetic energy by 
collisions of the excited atoms with normal atoms are 
slow enough to permit observation of line broadening 
due to dissociative recombination at pressures €2 
mm Hg.?* 
The observed 45876 linewidths in the afterglow, as 
well as during the discharge, are shown for several gas 


% An alternative possibility is that narrowing of the discharge 
line may be due to a change in mode of production of excited 
states with change of gas quee If metastable 2 3S atoms reach 
a very large density >10% cm as the pressure increases, then 
impact excitation to the 3?D state by ~3 eV electrons will 
produce the observed intensity of 45876 and the atom recoil 
energy wil be too small to appreciably add to the thermal 
linewidth. 
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Fic. 8. Variation of the observed linewidths with time during 
the discharge and in the afterglow of helium at 300°K. The widths 
are expressed as fractions f of the free spectral range of the etalon. 
The scales at the right refer, respectively, to the equivalent atom 
temperature required to yield the observed width and to the 
dissociation kinetic energy Ep which must be added to the initial 
molecular ion thermal motion to give the observed width. 


pressures at T=300°K in Fig. 8 and at T=77°K in Fig. 
9. Here, improved Fabry-Perot plates were used. They 
had dielectric reflecting coatings of higher reflectivity 
and lower absorptivity than the silver films used on the 
previous plates. Thus, the instrumental broadening was 
reduced relative to Fig. 7. The horizontal widths of the 
symbols indicate the time gate during which the photo- 
multiplier was made sensitive to light; the vertical 
extent of the symbols indicates the uncertainty in 
the linewidth determinations in these low-signal 
measurements. 

From the temperature scale at the right of the figure, 
it will be seen that, as noted previously at 300°K, the 
line during the discharge at 77°K shows broadening in 
excess of ambient temperature effects, consistent with 
atom recoil at impact of the exciting electrons. Im- 
mediately afterwards, in the early afterglow at 300 and 
at 77°K, the linewidths decrease to values indistinguish- 
able from the widths appropriate to the ambient tem- 
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Fic. 9. Variation of the observed linewidths in helium at 77°K. 
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peratures. This observation is consistent with the pro. 
posed mechanism for production of the early afterglow 
radiation by electron impact on metastable atoms, since 
the ~3 eV electron energy required to excite to the 
radiating 3 3D state causes negligible recoil in the struck 
atom compared to its thermal speed. 

Tt will be seen in both Figs. 8 and 9 that the observed 
linewidths increase with time in the later afterglow, 
These observations are consistent with the expectation 
that; as the contribution to the radiation by metastable 
atom excitation decreases and that due to dissociative 
recombination increases, the line should increase in 
width, until a maximum width is attained when re- 
combination alone is important. Unfortunately, the 
decrease in intensity with time in the afterglow does not 
permit line width measurements to be carried beyond 
~1 msec at 300°K and ~2 msec at 77°K. 

Although a pronounced line broadening is detected 
in the late afterglow, if one calculates the dissocietion 
kinetic energy required to yield the observed width,” 
then, as the second scale at the right of the figures 
ndicates, the inferred change of potential energy be- 
tween the initial and final states of reaction (13) is 
rather smaller than one would estimate. As mentioned 
in Sec. III C.3, it is difficult to assure radiation of the 
excited, fast moving atom before slowing down by elastic 
collisions or excitation transfer to slow atoms takes 
place. If one argues that the momentum transfer cross 
section for scattering of 3 3D atoms by normal helium 
atoms is the same magnitude as for the metastable 2 88 
atoms, namely, Q,,— 8X 1075 cm?,/5/? then the excess 
energy of the atoms produced by dissociative recombina- 
tion decays at the rate veou~10® sec? at 2 mm Hg 
pressure. Alternatively, if the excitation transfer cross 
section is of the same order of magnitude (~10-“ cm?) 
as the momentum transfer cross section, appreciable 
“narrowing” of the observed line will also occur by 
the transfer process. Excitation transfer cross sections, 
even for nonresonant cases, attain large values. For 
example, Gabriel and Heddle deduce values of 
Qiranster 105 cm? for various nP — n'D transfers 
(n=3, 4, 5, 6) at thermal energy. Thus, it is quite pos- 
sible that in these studies we have been able to observe 
only a small fraction of the initial dissociation kinetic 
energy produced by the recombination process. 

A final point, mentioned briefly in an earlier paper; 
involves accounting for the appearance of such radiation 
as A5876 and M472 as strong afterglow lines resulting 
from dissociative recombination. As will be seen from 
Fig. 10, with an expected binding energy of ~2 eV for 
(Hest),?6 it is necessary to postulate that the molecular 

"It is assumed that the molecular ions have a speed deter- 
mined by the ambient temperature. Then, since the dissociation 
energy is shared equally by the two helium atoms, one may calcu- 
late the line broadening to be expected for various values of the 
dissociation kinetic energy. 

3A H Gabriel and D^ W. O. Heddle, Proc. Roy. Soc. (London) 
A258, 124 (1960). 


(1958) A. Mason and J. T. Vanderslice, J. Chem. Phys. 29, 361 
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ions are in vibrationally excited states as they undergo 
dissociative recombination in order to produce the 
high-lying atomic states which yield the observed lines. 
(The figure is schematic, the vibration levels having 
been calculated for a Morse potential function.) The 
formation of the molecular ions in high-lying vibration 
states is a reasonable consequence of the assumed con- 
version reaction (12), in which the most likely binding 
collision is one in which a minimum loss of kinetic 
energy between two atoms takes place. Recent studies 
of vibration persistence in gases? have indicated that, 
for such well-separated vibration levels as in Hes*, the 
probability of de-excitation by transfer of vibration 
energy to kinetic energy of relative motion at a collision 
with a helium atom is likely to be very small, <10~*. 
Thus, on the millisecond time scale of our measurements, 
the ions should persist in the higher vibration states for 
an appreciable fraction of the observing interval. 

In summary, the initial goal of determining whether 
or not the dissociative process is responsible for the 
large electron-ion recombination losses observed in 
microwave afterglows has not been fully achieved. The 
observations are consistent with a model of the helium 
afterglow in which, at later times, recombination be- 
comes an important source of the afterglow radiation. 
At these times the emitted lines definitely show broaden- 
ing in excess of thermal width. If one were to argue that 
the late afterglow broadening is the result of some process 
other than dissociative recombination, then one is faced 
with the dilemma of explaining how the excited atom 
temperature in the early afterglow sinks to thermal 
values and then increases rather abruptly. Clearly, 
such effects as pressure broadening cannot explain these 
observations, and careful monitoring of molecular band 


27 S, J. Lukasik and J. E. Young, J. Chem. Phys. 27, 1149 (1957). 
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Fic. 10. Schematic representation of the potential curves 
required to explain the observations of strong 44472 and 5876 
afterglow lines in terms of dissociative recombination. It will be 
seen that long-lived vibrationally excited states of the molecular 
ion are involved. 


radiation intensities indicates that such spurious broad- 
ening effects do not account for the observed increase 
in width. Thus, dissociative recombination appears 
the most likely source of the modestly increased line- 
widths in the late afterglow.” Studies of line shapes in 
neon afterglows, where recombination is the predomi- 
nant electron removal process over a wider range of 
experimental variables, are in progress.” 
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The yield cross sections on,+, on+, and eg for production of fast Hs* and H* ions and H atoms resulting 
from the ionization and dissociation of fast H» molecules in single collisions with Hz gas molecules have been 
measured in the 6- to 120-keV energy range (collision velocity 0.7 to 3.2X 108 cm/sec). The neutral primary 
beam contained a fraction of H atoms which was determined by proportional counter analysis. Secondary 
particles produced by the H atoms in the primary beam were determined in separate H beam experiments 
and subtracted from the composite (H,Hs) neutral beam data to yield the production of secondaries due to 
He primaries alone. Cross section on,+ increases monatonically from 0.5 to 2.0 Å? in the energy range 6 to 


120 keV while oy+ increases from 0.28 A? at 7.5 keV 


to a maximum of 0.70 A? at 50 keV. Cross section on, 


measured only at 10 keV, was found to have an approximate value of 2.5 À? based on an assumed shape 


of the H atom angular distribution at small angles. 


The results are compared with data on a number of 


collision processes in the same velocity range involving various species of hydrogen ions and H atoms inci- 


dent on He gas molecules. 


INTRODUCTION 


a is a growing need for experimental data 
concerning ionization and charge exchange in 
collisions of atomic particles at relative velocities of the 
order of 108 cm/sec. Such velocities are encountered in 
fusion machines, ionic propulsion engines, high-voltage 
gaseous dischanges, and nuclear explosions. Experi- 
ments with hydrogen ions and molecules are directly 
applicable to all of these problems and are needed as a 
guide to further development of collision theory. In 
the particular velocity range of interest, theoretical 
calculations of collision cross sections are most difficult 
because the optical electron velocity of the colliding 
particles is about the same as the atomic translational 
velocity. Neither the adiabatic nor the impulsive model 
of a collision is valid under these circumstances. 

The main purpose of this paper is to report measure- 
ments of the cross sections for ionization and dissocia- 
tion of 6- to 120-keV Hs molecules in collision with Hz 
gas molecules. Apparatus used previously for the study 
of H+, H+, and H;* collisions with Hz molecules was 
employed.! The method consists of preparing a mono- 
energetic beam of fast molecules by charge exchange, 
passing the beam through a thin target of H» gas and 
measuring the production of fast H* and Hg* ions and 
H neutral atoms. 

The reactions contributing to the production of the 
observed species are as follows, neglecting reactions in 
which negative ions are formed. 


Reaction 1: H.+H.— H+H-+ (H), 
Reaction 2: H»+H.— Hz*4- (ed- H3), 
Reaction 3: H.+H.— H++H-+ (ed- H5), 
Reaction 4: H+H: — H++H*+ (ed-e4- H5). 


Here the heavy fragments of the primary H» molecules 
are shown immediately following the arrow while elec- 


* This work performed under the auspices of the U. S. Atomic 


Energy Commission. 
1G. W. McClure, Phys. Rev. 130, 1852 (1963). 
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trons removed from the primary and the post-collision 
target molecule are shown in parentheses on the right. 
The target H» molecules may be ionized, excited or dis- 
sociated, and the product particles H;* and H may 
carry off some energy in the form of internal excitation, 
but in the great majority of collisions of the type con- 
sidered the two constituent protons of the primary 
molecules, whether bound or dissociated after the 
collision, suffer very little change of velocity or direc- 
tion of motion. 

The present measurements do not completely resolve 
the above reactions but rather determine the yield 
cross sections c+, on,*, and on for the three types of 
fast heavy particles H+, Hz+, and H. In terms of the 
reaction cross sections i, 9», 73, c4 for the above reac- 
tions, the measured quantities are: 


TH += 02; 
on+=03 t201, 
ou= 2e1-03- 


Particle counting was used to measure the intensities 
of both the primary neutral beams and the secondary 
ion and neutral beams. The counter employed permitted 
Hs; and H neutrals of the same velocity to be clearly 
distinguished by the relative size of electrical impulses 
they produced. Ability of the counting system to resolve 
H and Hp counts placed a lower limit of about 3 keV 
on the present method of measurement. The narrow slit 
aperture on the proportional counter (needed to suppor t 
a thin-film gas barrier) prevented our use of a technique 
devised by Sweetman? which might otherwise have been 
employed to distinguish the reactions 1 to 4. 

The apparatus was designed so that all heavy frag- 
ments of the primary molecule having a post-collision 
angle less than 2.2 deg relative to the direction of the 
Hs beam could be detected. At the 6-keV lower energy 
limit of the present (Hz--H:) measurements the cross 
section for Coulomb scattering through an angle greater 


? D, R. Sweetman, Proc. Roy. Soc. (London) A256, 416 (1960). 
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than 2.2 deg of one of the two protons of the incident 
molecule against one of the two protons of the target 
molecule is 2X 10-7 cm?, neglecting screening. This 
cross section is of the same order of magnitude as the 
measured H* production cross section at 6 keV; hence, 
some error may be introduced into the H* data at this 
energy. However, throughout most of the present 
energy range no significant large angle scattering errors 
are expected due to Coulomb interactions. 

At angles up to a few degrees from the forward 
direction the differential cross section for production of 
dissociation fragments of fast molecules may exceed 
that due to Coulomb scattering because of electronic 
transitions of the incident particle to antibonding states 
in which the two nuclei repel each other. Resulting pro- 
duction of dissociation fragments at angles outside the 
detector aperture may cause the measured cross sec- 
tions to be 10 to 20% below the true total cross section 
at the lowest energies. 

An important problem associated with the present 
measurements was that of producing a primary beam of 
neutral Hs molecules. This was accomplished via the 
H.* on He reactions, studied previously,'? which give 
rise to a mixed beam of Hə molecules and H atoms of 
the same velocity via the processes 


H.t-++H» = H.-+ (H+), 
Hz-4-H» =? H+ H+ (Hs? Jb 
H+H: — H4-H*4- (Ho). 


Reaction 5: 
Reaction 6: 


Reaction 7: 


In order to calculate the background production of H* 
caused by electron loss from the H atoms in the mixed 
neutral beam, it was necessary to know the cross 
section oo,1 for the reaction 
Reaction 8: H--H5— H++ (ed- H5). 

Although this cross section has been measured; a 
serious discrepancy existed between the available meas- 
urements below 10 keV. Therefore, oo,1 was remeasured 
over the entire energy range 2 to 120 keV. At the same 
time, remeasurements of oo,-1, the cross section for the 
reaction 


Reaction 9: H-+H.— H-+ (H+) 


were made to resolve a gross discrepancy in previous 
data on this process. Both measurements involving H 


feds gue, Phys. Rev. 81, 1026 (1951). p 
2 Lh AGG ae E. Barnett, Phys. Rev. 103, 896 (1956). 
5 C. F. Barnett and H. K. Reynolds, Phys. Rev. 109, 355 (1958). 
5T. M. Fogel, V. A. Ankudinov, D. V. Pilipenko, and N. V. 
Topolia, Zh. Eksperim. i Ted cM (1958) [English 
. 7n /S.— E > A 1 3 ¿ 
SS Mq m I. M. Fogel, Zh. Eksperim. i aos ve 
42, 936 (1962) [English transl.: Soviet POS erie D nee 
(1962) ]. Data on oo,1 from this reference (9-30 keV) agree wi 


data from Rel. $^. M. Donahue, Phys, Rev. 118, 1233 (1960). 


? Fred Schwirtzke, Z. Physik 157, 510 (1960). 
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atom primaries are discussed in the first portion of the 
section entitled Results. The second portion of the 
Results section deals with the (H.-+ Hz) measurements. 

All cross sections were measured relative to the cross 
section g1, for the reaction 


Reaction 10: H*-+-H,— H- (H+). 


This reaction” serves as an excellent basis for calibra- 
tion of collision experiments of the present type because 
(1) no excited states are possible in the incident proton 
beam, (2) the angular distribution of the product H 
atoms is very strongly peaked in the forward direction, 
and (3) the absolute value of the cross section has been 
measured in the neighborhood of 10-100 keV by a 
number of investigators whose results are in excellent 
agreement. 

A major problem in the interpretation of any experi- 
ment using fast neutral beams produced by charge 
exchange is that of uncertainty concerning the quantum 
states of the incident particles. In the present work this 
question is not settled; however, a discussion of the 
possible states of excitation of the neutral particles is 
given. Also, the conditions of production of the neutral 
beams are fairly well defined so that future data on the 
initial population and decay of excited states produced 
in charge-exchange collisions may be brought to bear 
on a more refined interpretation of the present work. 

The section entitled Discussion compares the 
(Hz4- H;) collision data, obtained for the first time in 
this investigation, with data on related collision 
processes. 


APPARATUS 


The ion source, ion accelerator, and magnetic beam 
analyzer described in Ref. 1 were used without altera- 
tion to provide a 1-20-keV beam of either H* or Hz* 
ions having an energy spread of approximately one 
percent. Either of these beams could be directed along 
the sequence of apertures Sn S», Sa, and S4 (Fig. 1) 
which formed the entrance and exit slits of two differ- 
entially pumped chambers 7; and T; operated with 
separate gas controls and pressure gages. These cham- 
bers served as a neutralizing chamber and main collision 
chamber, respectively. Electrostatic deflection plates 
D, located between T, and Tz were used in the neutral 
beam experiments to deflect unconverted ions emergent 
from 7, out of the neutral beam, permitting only 
neutrals to enter 7». To change from an ion beam ex- 
periment to a neutral beam experiment required only 
the closing of a bypass valve between T, and the main 
vacuum chamber, the admission of a gas to the neutra- 
lizing chamber 7, and the application of a deflection 
voltage to plates D. 

A second set of electrostatic deflection plates D» at the 


1? S, K. Allison and M. Garcia Munoz, in Atomic and Molecular 
Processes, edited by D. R. Bates (Academic Press Inc. New 
York, 1962), p. 751. 
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Fic. 1. Apparatus diagram. Tı and T5, collision chambers. Vi and Vs, ionization gauges. Gi and G», gas inlets. Ci, Cs, and C3, Faraday 
cups. D; and Ds, deflection plates for deflecting charged particle beams off the main beam axis. Si, S», Ss, and Si, circular apertures 
having diameters 0.01, 0.02, 0.01, and 0.10 in., respectively. Ss, rectangular detector slit 0.001 in.X0.963 in.; long dimension perpen- 
dicular to plane of drawing. A, ionization chamber forming lower section of counter assembly shown in Ref. 1. DP, 400 liter/sec oil 


diffusion pump. B; and B», bellows-type vacuum joints constraine 


d by external gimbals. B; is used to align apertures Sı and S; with 


direction of primary beam incident on 5; from the left. B: is used for mechanically sweeping detector slit Ss across the electrostatically 
analyzed set of particle beams emergent to the right from D». B» permits detector slit Ss (and attached assemblies A and DP) to swing 
through a 2-20? arc in the plane of the drawing about center of rotation Po. 


exit of T; divided the different species of fast charged 
particles emergent from T: into separate beams. Each 
of these beams was measured by moving the entrance 
slit S; of the proportional counter detector across the 
beam while recording with single-channel pulse-height 
analyzers the integrated counts of "single" and “double” 
amplitude pulses. The single-amplitude pulses came 
from H+ or H particles and the double-amplitude 
pulses came from Hz or Hz*. Counts occurred in both 
channels only when the neutral beam from primary He 
neutrals was being scanned. In this case pulse-height 
discrimination was used to distinguish H from He in 
the composite neutral beam emergent from the collision 
chamber. The geometry and properties of the propor- 
tional counter were discussed thoroughly in Ref. 1. A 
256-channel pulse-height analyzer was used in setting 
the pulse-height acceptance limits of the single-channel 
analyzers. 

A refinement of the original apparatus! was the addi- 
tion of a servo mechanism to drive the counter entrance 
slit at a speed proportional to the primary ion beam 
current. This system greatly reduced errors due to 
beam current fluctuations. The servo drive signal was 
derived in the neutral beam measurements from a 
Faraday cup Cz surrounding the entrance of chamber 
T, into which the unconverted ion beam emergent 
from T, was directed. In all cases only a small fraction 
of the beam entering 7; was neutralized and all un- 
converted ions were received by the Faraday cup. 
High stability of the neutralizing chamber pressure 
was obtained with a specially designed gas leak." Con- 
stancy of this pressure insured that the neutral beam 
entering Tı was exactly proportional to the servo drive 
signal once the leak was set. 


1 D, L. Allensworth, Rev. Sci. Instr. 34, 448 (1963). 
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CROSS-SECTION DETERMINATION 
Cross sections were determined from the formula 
c;(j) - L;/ I md, (1) 


where o;(j) is the cross section for production of 
secondary particles of type i by primaries of type j, I; 
is the current of primary particles of type 7, J; is the 
current of secondary particles of type i, and nd is the 
equivalent number of gas molecules in the collision 
chamber per cm? of target area normal to the beam 
direction. 

Because molecular flow conditions were present in the 
collision chamber the following relation holds 


nd=ap, (2) 


where # is the collision chamber pressure gage reading 
and a is a calibration constant having a fixed value for 
a given target gas and a given statistical distribution 
of incident-particle paths through the collision chamber. 
It is assumed that the distribution of ray paths is inde- 
pendent of the type and energy of primary beam used 
in these experiments. 
With the further assumption that 


N./N;=1./1;, (3) 


where N; is the number of counts recorded while the de- 
tector is scanned across the type 7 secondary beam and 
N; is the number of counts recorded while the detector 
is scanned across the type j primary particle beam, Eqs 
(1)- (3) yield the following cross-section formula: 


ei(j)— (N/N) (1/op). (4) 


Secondary particles produced at the edges of the 
entrance slit to chamber T» or "spurious secondaries 
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which are present in the incident beam entering the 
collision chamber are not dependent on the collision 
chamber pressure p. Either of these effects or the 
presence in Tz of a residual gas or vapor having a 
different cross section than the target gas result in 
failure of Eq. (4) to give a correct cross section. AII 
three sources of error can be eliminated by plotting 
N; versus p and by using the slope AN;/Ap of this 
curve in place of N;/5 in evaluating Eq. (4). This pro- 
cedure was used in every cross-section determination of 
the present work. 

To summarize, the general working cross-section 
formula is 


oi(7)= (1/aN 3) (AN /Ap). (5) 
CALIBRATION 


The calibration of the apparatus consisted of deter- 
mining a single value of the gas-dependent constant a 
of Eq. (5) since all measurements were made with H» 
as the target gas. This constant was determined by 
measuring the conversion of protons to H atoms by 
electron capture in He gas (reaction 10) and assuming 
the cross-section value 8.2X 10715 cm? at the calibration 
proton energy of 10 keV. The value of a was found to 
be constant within a range of +5% in several deter- 
minations made during the course of the work. The 
chief cause of variations in successive a determinations 
was beam current fluctuations. The servomechanism 
used to compensate for beam current variations in 
all of the other measurements herein reported could not 


Fic. 2. Cross section eoi for con- 
version of primary H atoms to H* ions 
and cross section go,-1 for conversion 
of primary H atoms to H- ions. 
Present results are compared with pre- 
vious data of Montague (Ref. 3), 
Stier and Barnett (Ref. 4), Barnett 
and Reynolds (Ref. 5), Fogel et al. 
(Refs. 6 and 7), and Curran and 
Donahue (Ref. 8). 
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be used for the calibration because when the H* beam 
current was made low enough that the primary beam 
could be counted directly at the output of Tz, the 
fringing portion of the beam entering Faraday cup C; 
was too weak to provide a monitoring signal to drive 
the servo properly. This difficulty was offset in the 
calibration by taking a large number of points on the 
N; versus p curves. 

The use of reaction 10 for calibration continues the 
practice begun during our earlier work of reducing all 
hydrogen target measurements to a common basis. 
Should the value assumed for this cross section prove 
to be in error in the future, all of our measurements 
can be corrected accordingly by applying the same 
multiplicative correction factor throughout. 


EXPERIMENTAL CONDITIONS 


Hydrogen gas at a pressure between 10-* and 10-? 
mm Hg was used as the neutralizing gas in all of the 
neutral beam measurements. These pressures were 
sufficiently low that less than 10% of the incident ions 
were neutralized. The residual gas pressure in the 
neutralizing chamber was about 10-5 mm Hg with the 
gas inlet valve closed and about 10-* mm Hg with the 
bypass valve open for calibration measurements. 

Hydrogen gas of 99.9% purity was used in the 
collision chamber as the target gas. The residual gas 
pressure in the main collision chamber was 5X10-7 
mm Hg with the hydrogen inlet leak closed and was 
varied between 1075 and 10-* mm Hg in order to obtain 
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the N; versus p curves. The maximum fraction of 
incident particles which suffered either charge changing 
of dissociative collisions in the collision chamber was 
0.7% and the maximum fraction of either the primary 
or secondary particles which underwent charge changing 
or dissociative collisions in the drift space between the 
collision chamber and the detector at the maximum 
drift space pressure of 107^ mm Hg was about 0.07%. 
Under these conditions the attenuation of both the 
primary beam and secondary beams during traversal 
of the collision chamber and drift space could be 
ignored in evaluating the quantities IV; and IV ;. Direct 
evidence of thin-target conditions was seen in the fact 
that all of the N; versus p curves were linear within the 
statistical uncertainties of the particle counts. This un- 
certainty varied from 2 to 10%. 

Ion beam currents entering chamber Tı were generally 
of the order of 10- A throughout the data runs. 


RESULTS 


H Primaries 


By permitting reaction 10 to occur in chamber 7i 
and by applying a suitable deflection voltage to plate 
D; a beam of H atoms cleared of H+ and H~ ions was 
directed into the collision chamber T2. The fast second- 
aries Ht and H~ emergent from Tz due to reactions 8 
and 9 were deflected by plates D» at angles of +3 deg 
from the unconverted H primary beam. Scans of the 
detector across all three beams at several energies 
indicated that a 1-deg scan was sufficient to receive 
essentially all of the particles emergent from the 4.4-deg- 
wide collision chamber exit aperture S4. Consequently, 
1-deg scans were used in all the co,ı and co, 1 Measure- 
ments. For determinations of co,1 the H* and H beams 
were scanned by the detector at several collision 
chamber pressures. Throughout these scans the Ht 
beam emergent from chamber T; was collected at cup 
2 and used as a servo drive signal. Calculation of the 
cross section oo,1 was then accomplished by the direct 
application of Eq. (5). Results are plotted in Fig. 2. 

Except ın the neighborhood of 10 keV the yield of 
H- particles from reaction 9 was found to be so low 
that a large increase in the primary H beam was needed 
for accurate H- yield determinations. In order to avoid 
correcting the recorded H counts for large dead-time 
losses at the required increased beam intensities, cup 
C, was moved into position to collect the H beam 
emergent from Tz and a suitable positive bias was 
applied to the suppressor ring at the entrance to the 
cup to give a large secondary electron signal. This 
signal was then used to drive the servo while the Ht 
and H- beam were alternately scanned at several 
pressures to obtain plots of Nu+ and Nu- versus 
pressure. The ratio o,-1/o0,1 was then determined from 
the formula 


o0,-1/00,:= L[ANu-/Ap ]/[ANu +/Ap | (6) 
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and ao,-1 was determined by multiplying this ratio by 
the values of oo.1 from Fig. 2. The results for co, are 
shown also in Fig. 2. 

Uncertainties in the experimental points are +5% 
except for the points at 2 and 90 keV where larger un- 
certainties are indicated. 

Previous measurements of o0,1 and co,.1 are shown 
in Fig. 2 together with the present results. At all 
energies from 4 to 120 keV except in the vicinity of 6 keV 
our results appear to be consistent with the results of 
Stier and Barnett. Agreement is excellent for both 
70,1 and g0,—1- 

In the energy range 40 to 120 keV our data points 
and those of Stier and Barnett for 0,1 fall about 15% 
above those of Montague.’ Between 15 and 40 keV the 
results of all investigators scatter over a range of +20% 
from the mean, but smooth curves drawn through the 
individual sets of data possess very nearly the same 
shape. It seems likely, therefore, that the discrepancies 
in this region could be accounted for by uncertainties 
in collision chamber pressure determinations. 

Schwirtzke? has determined a quantity he calls øo, 
which we believe is actually 9,14-00, 1. The Schwirtzke 
result, thus interpreted, has been compared with values 
of (co1-7o,:) calculated from our data and is found 
to agree within +15% over the range 10-50 keV. 

Below 10 keV the experimental values of 0,1 fall into 
two divergent groups with relatively high values given 
by the present results and those of Stier and Barnett, 
and relatively low values given by Curran and Donahue® 
and by Fogel ef al. The Fogel measurements do not 
extend below 7 keV where an extremely serious depar- 
ture sets in between the present “high” values and the 
“Jow” values of Curran and Donahue. 

In an attempt to understand the drastic departure 
between the so, results of Curran and Donahue and 
the oo,1 values of the present investigation below 7 keV 
we have hypothesized the presence of the well-known 
excited 2s metastable state in our H atom beam which 
was not completely quenched by the relatively low 
electric field used between deflection plates Dı. This 
proposal seemed worthy of investigation since the 
energy defects in the two reactions 


H(1s)2- Hs =? Ht+ (A+ e), 
H(2s5)+H: > H++ (Hate) 


(13.6 and 3.4 eV, respectively) are such that the latter 
should have a relatively high cross section. Thus even 
if the percentage of atoms in the 2s state were small, the 
effect on the weighted average value of ao,1 could be 
large. 

According to a private communication,” the fields 
used by Curran and Donahue® were large enough to 
give complete quenching of the 2s metastable; however; 
deliberate steps were not taken in our measurements 
to suppress the 2s states completely. The fraction of 2s 


? T. M. Donahue (verbal communication, 1963). 
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atoms surviving after passing through the deflection 
field at D, was calculated from the expression 


sce] - | volish) |, (7) 
6 


where $, the survival probability, is defined as the ratio 
of the 2s atom current leaving the D, deflection field 
region to that entering the D; deflection region from 
chamber T, v(x) is the local decay rate of 2s atoms at 
distance x from slit S», v is the H atom velocity, and b 
is the length of path from S» to the end of the deflection 
plates. The quantity »(x) was calculated from the 
formula v(x)— A[E(x) P where £ is the estimated elec- 
tric field strength at distance x and A is a constant 
equal to 2780 sec! V= cm?.? [Equation (7) is based on 
the assumption that the incremental fraction of 2s atoms 
decaying in time increment dt is equal to the product vdt 
where v is an instantaneous decay rate given by the 
“static-field” formula quoted. ] 

For the set of points plotted in Fig. 2 the calculated 
2s survival probabilities are given in Table I. The 
rather irregular variation of the survival probability 
is caused by the extreme sensitivity of this quantity to 
the deflection voltage coupled with the fact that no 
special attempt was made to choose deflection voltages 
which would lead to a regular variation of ¢. In some 
cases, particularly at low energies, the survival prob- 
abilities were somewhat large; hence a subsidiary ex- 
periment was performed as a specific test of a possible 
2s effect. 

Two measurements of oo,1 were made at 4 keV (not 
plotted in Fig. 2) in which the deflection voltages used 
on plates D; were 85 and 25 V, respectively, and for 
which the survival probabilities were 0.02 and 0.71, 
respectively. The two results were o,1(85 V)=7.92+0.4 
and co,1(25 V) 2 8.282:0.5, indicating that the maximum 
increase in the cross section eo, due to 2s atoms would 
be about 20%. Therefore, a variable population of the 


TABLE I. Calculated survival probabilities [¢, Eq. (7)] of 2s 
metastable atoms traversing deflection plates D; in the coi 
measurements as a function of H atom energy. 


Energy keV Survival probability 
2 0.145 
3 0.206 
4.5 0.276 
6.8 0.095 
9 0.270 
10 0.575 
15 0.056 
22.5 0.10 
40 0.0057 
66 0.4X 1071 
120 107? 


"M H d by 
13H. Beth d E. Salpeter, Handbuch der Physik, editec j 
S. Flägge (Springer-Verlag, Berlin, Germany, 1957), Vol. XXXV, 
p. 373. 
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Fic. 3. Ratio fr=Nu/(Va-+Nn,), where Nu and Nrg are, re- 
spectively, the fluxes of H atoms and H; molecules in the com- 
posite neutral beam entering collision chamber T» during the He 
primary measurements. Ratio Fy}=on/(on-+on,), where on and 
ong are, respectively, the cross sections for production of fast H 
and Hz secondaries in collisions of fast H»*+ ions with Hs molecules, 
F; is calculated from the experimental data of Ref. 1. 


2s level does not seem to afford an explanation for the 
discrepancy with the Curran-Donahue results. 


H: Primaries 


A composite neutral beam comprising fast Hs mole- 
cules and H atoms was formed by passing magnetically 
analyzed H;* ions into the neutralizing chamber. Fast 
Hz and H neutrals produced in reactions 5, 6, and 7 
emerged from the neutralizing chamber and entered 
the collision chamber while the emergent H5* and H* 
ions were deflected out of the beam by deflector D;. 

The intensity and composition of the neutral beam 
emergent from Tz was determined by making detector 
scans 1 deg wide. These scans were shown to be ample 
to cover the entire beam by trial runs at various 
widths. (The maximum angle of divergence of a pri- 
mary neutral from the collision chamber axis was 0.38 
deg as determined by the sizes of apertures Sz and 57.) 
Figure 3 shows the composition of the composite neutral 
beam emergent from Ts as a function of the Hz beam 
energy. The composition is described by the quantity 


J= Nu/[Nnzt-Nu], (8) 


where Vy and Nn, are, respectively, the H and Hz 
counts recorded in a scan of the neutral beam. It may 
be seen that the composition varies considerably with 
beam energy and follows a form quite different from the 
function F; given by 


Fi—en/[vu--cn;] D (9) 


where cy and on, are, respectively, the production cross 
sections for H and Hz secondaries in the reactions 5-7 
calculated from the data of Ref. 1. This difference 
between fı and F, results from the small solid angle 
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primary beam. The cross section a * for H* production 
by the H; primaries alone and the cross section c1 for 
production of H* by H primaries alone are related to 
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Fic. 4. Cross sections for conversion of fast Hz molecules into 
fast secondaries H+, H»*, and H in collisions with H2 molecules. 
out, cn5*, and ox are, respectively, the production cross sections 
of the secondaries H+, H.*, and H. 
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subtended by aperture S5 at the neutralizing chamber 
coupled with the fact that the H» molecules have a more 
strongly forward-peaked differential angular distribu- 
tion than do the H dissociation products. 

Because of the very narrow angular spread of the H» 
beam and the small size of apertures Sı and Ss, it was 
necessary to carefully align the SS; axis in order to 
achieve a minimum value of f; needed for optimum H» 
measurement accuracy. Owing to unexplained slight 
differences in the direction of the Hz* beam entering Sı 
for various energies, care was taken to achieve good 
alignment at every accelerator energy. The slight de- 
parture of the point at 45 keV from the smooth curve f: 
probably resulted from misalignment which slightly 
favored H atoms at this energy. 

To determine the cross sections for H;* and H* pro- 
duction from Hə primaries, a deflection voltage was 
applied to plates D2 which deflected the secondary Hz* 
beam 3 deg from the main axis and the H* beam 6 deg. 
The neutral beam and the H;* and H* beams were 
scanned repeatedly at several collision chamber pres- 
sures, providing curves of Ng,* versus f and Nut 
versus p and values of Nn and Nn. Scan widths were 
adjusted to achieve complete collection of the H* and 
H+ secondaries at each energy. One-deg scans sufficed 
for Hs* at all energies and 2-deg scans were sufficient 
for H* at all but the lowest energies. Below 10 keV the 
full 4.4-deg scans were used and considered to be 
adequate. The cross section ou,* for Hz* production 
was straightforwardly determined from the formula 


1 1 AN x,+ 
0Ha3*7— 
Ap 


a Ng, 
obtained from Eq. (5) with appropriate substitutions. 
‘Cross section ou+ for production of H+ by Hz pri- 


(10) 
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or by the equation ' e 


or=fiortfon*; fs 1— Ri, (12) 
which can be rearranged in the form | 
ont=or/ fa—004]V fe - (13) a 


i 
| 


All the quantities on the right of Eq. (13) are known | 
from the measurements described. Values of so,1 were | 
taken from the present data shown on Fig. 2 using the | 
cross section for H atoms of one-half the H; beam 
energy. Deduction of correct values of on* by use of | 
this formula rests on the assumption that the H atoms 
produced by H:* dissociation in the neutralizing’ 
chamber have the same electron loss cross section as H | 
atoms produced by proton neutralization in the neu- 
tralizing chamber. 

The results of the on+ and eg,* measurements are 
shown in Fig. 4. The uncertainties associated with 
counting statistics (standard deviations) are indicated 
by the error flags. These are believed to dominate all 
other sources of error. The relatively large uncertainties 
in the cg* values are associated with the relatively 


9 Or/f; 


7 
5 
4 ^ Ooi fi/f2 
3 


CROSS SECTION (CM?/MOLECULE) 


gag wu 2 
ENERGY (EV) 


S 45 7 10 2 


Fic. 5. Cross section out for fast proton production by Hs 
primaries and the two quantities or/ fz and eo fi/ fs [see Ea. (133 
which enter into the calculation of em*. The small difference 
between the upper curves at the higher energies leads to the large 
statistical uncertainties in ømt at high energies. Dashed curve — 
is the curve for amt obtained by using the lower values of 2 E 
given by Curran and Donahue (Ref. 8) and Fogel (Ref. 7) instead - 


of the co,1 values from the present work in evaluating Eq. qd is 
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IONIZATION AND 
small cross sections as well as the fact that oy+ is a 
difference between two quantities [Eq. (13)], each of 
which has associated statistical errors. This difference 
became quite small at high energies, leading to quite 
large percentage uncertainties. The two terms appearing 
on the right-hand side of Eq. (13) and the difference 
ou* are plotted in Fig. 5 to show clearly the relative 
magnitude of the quantities as they occurred in the on+ 
determinations. 

The effect on oy+ of assuming the lower set of ao, 
values given by Curran and Donahue (4- to 40-keV H 
atom energy) and Fogel (40- to 50- keV H atom energy) 
is indicated by the dashed curve in Fig. 5. This curve 
is believed to represent a less reliable estimate of ont 
than the solid curve since the “low” values of oo,1 used 
in calculating the dashed curve may have resulted from 
calibration disparities which were eliminated in the 
present set of measurements by our use of a common 
basis of calibration in the oo,1 and ey measurements. 

The cross section oy for fast H atom production in 
reactions 1 and 3 could not be determined accurately 
because the high flux of H atoms in the primary neutral 
beam tended to mask the production of secondary H 
atoms at small angles. However, an estimate of on was 
made at 10 keV as follows. Careful measurements of 
the angular distribution of fast H atoms emergent from 
the collision chamber were made at two collision 
chamber pressures with results shown in Fig. 6. At 
angles greater than j deg a very distinct pressure- 
dependent H atom component is present corresponding 
to an H atom production cross section of 1.25 101^ 


ai Tre rcu | T 


A 215x107 mmHg 
Oo 10x 10° mmHg 


x H FROM HZ 
DISSOCIATION 
3 


[s] 


H COUNTS/0.25 DEGREES 


l o l 
ANGLE (DEGREES) 


. 6. lar distribution of fast H atoms measured in order 
e the cross section for production of fast H atoms 
from Hz primaries. The large peak in the center is due principally 
to H atoms present in the primary beam and, consent ys ow f 
no dependence on collision chamber pressure over the dn cie i 
pressure range 1.0X 1075 to 2.15X 10 * mm Hg. pes nes su 
extrapolations used in estimating the small-angle pr Bee B EH 
atoms due to Hz dissociation. Crosses indicate angus a AS on 
of H atoms from Het dissociation in collision with He. 


tributions are for 10 keV primaries and were obtained with a long 


slit (Ss) as the detector window. The distributions are, therefore, 


not the same as “differential angular distribution” in the usual 
sense of the term. 
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Fic. 7. Curve A: on,* the cross section for conversion of fast 
Ha molecules to H5* ions from Fig. 4. Curve B:en* the cross sec- 
tion for conversion of fast H} molecules to H* ions from Fig. 4. 
Curve C gives the sum of eu* and oy,* which equals the total 
cross section for electron loss from fast Hs molecules. Curves D 
and DX2 are, respectively, 1 and 2 times the cross section 9,1 
for electron loss from H atoms from Fig. 2. Curve E: ionization of 
H: by H primaries, including o;,-; the ion production due to 
electron capture by the primary ions from Schwirtzke (Ref. 9). 
Curve E': curve E minus o;, .1 from Fig. 2. Curve F: cross section 
for free electron production in H.*-+ Ha collisions from Afrosimov 
(Ref. 15). Curve G and G' are, respectively, the results of Guidini 
(Ref. 16) and Sweetman (Ref. 2) for electron loss from H,* ions 
in collision with Hz. Curve J: curve F with curve G subtracted 
to give cross section for electron removal from Hz when struck 
by H;*. Curve K: cross section for production of slow H* ions in 
collision of H2* with Hz from Afrosimov (Ref. 15). A portion of 
curve DX2 between abscissa values 1.5 and 3108 cm/sec was 
deliberately left out to avoid confusion with curve J. 


cm?. The portion of this cross section due to Hs dissocia- 
tion is estimated as 1.17 1071? cm? by subtracting the 
calculated amount expected from Coulomb scattering 
of H atoms in the primary beam. By making a linear 
extrapolation of the large-angle H atom distributions 
to zero angle, the estimated ratio of total H atom 
production to production outside 3 deg was found to 
be 2.14, giving a final estimate of 2.5 10-!^ cm? for the 
value of en. 

The use of the linear extrapolation in estimating the 
small-angle H atom yield could not be rigorously justi- 
fied, but seemed reasonable in view of the shape of the 
angular distribution of H atoms from H;* primaries 
obtained with the same apparatus and shown super- 
imposed in Fig. 6 for comparison. 


DISCUSSION 


No directly applicable theoretical calculations are 
available for comparison with the present experimental 
results; however, it is instructive to compare the pre- 
viously available cross section data on several different 
hydrogen atom and molecule collisions with the new 
data on Hs4-Ha collisions. To facilitate such a com- 
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parison, a number of cross section vs velocity curves 
have been compiled in Fig. 7. 

Curves A and B are the present on,* and on* results 

from Fig. 4. Curve C is the sum of on,* and ex * which 
represents the total cross section for electron loss from 
the H» projectile in an H2+ Hz collision. Curves D and 
DX2 are, respectively, the cross section 0,1 for electron 
loss from H atoms and co, multiplied by two. It may be 
seen from a comparison of curves C and DX2 that the 
Hə molecule does not behave as two separate H atoms 
in regard to electron loss except in a very narrow 
velocity range centered at about 1.7X 10° cm/sec. The 
question of the similarity of H» to two separate H 
atoms in the somewhat different situation of proton 
charge exchange has been discussed by Tuan and 
Gerjuoy.4 They conclude that there are important 
interference effects between the capture amplitudes 
from two interacting H atoms which can make the 
capture cross section quite dependent upon nuclear 
separation. The above noted disparity between curves 
C and DX2 does not seem surprising in view of the 
possibility of analagous interference effects in the elec- 
tron loss process. Curve E is the cross section given by 
Schwirtzke? for ionization of H» by H atoms while 
curve E’ is the same cross section with c1,—1 subtracted 
to eliminate the ionization contribution due to electron 
capture by the H primaries. It may be seen that curve 
C lies about a factor of 2 above curve E’, indicating 
that H» molecules are about twice as effective in re- 
moving electrons from H» molecules as are H atoms of 
the same velocity. 

Curve F is the cross section for slow electron pro- 
duction in collisions of Hs+ and He as determined by 
Afrosimov el al.,5 and curves G and G’ are, respectively, 
the results of Guidini!® and Sweetman? for electron loss 
from fast H;* ions in collisions with Hə. Curve J, the 
difference between curves F and G, represents the 
ionization of H» by Het, not counting loss of electrons 
from the H;* primaries. In the velocity range 1.7 to 
2.2108 cm/sec, curve J lies quite close to curve C, 
indicating that H;* and He are roughly equivalent in 
their power to remove electrons from Ha However, 
at 3X 108 cm/sec there appears to be a real difference 
of about 50% between the two cross sections. 

The cross section for removal of one electron from H» 
to form Hz* (curve A) is about three times larger than 
the cross section for removal of an electron from H2* 
(curve G). This large difference seems reasonable since 
the first ionization potential of Hə is 15 eV while the 


XT. F. Tuan and E. Gerjuoy, Phys. Rev. 117, 756 (1960). 

15V. V. Afrosimov, R. N. Ilin, and N. V. Fedorenko, Zh. 
Eksperim. i Teor. Fiz. 34, 1398 (1958) [English transl.: Soviet 
Phys.—JETP 7, 968 (1958) ]. 

1$ J, Guidini, Compt. Rend. 253, 829 (1961). 

11 Here we compare electron loss from the target Hz molecule in 
the Hs*--H; collision with electron loss from the projectile in the 
H+H: collision. 
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ionization potential of Hz* is 30 eV, indicating that the 
second electron is much more strongly bound than 
the first. 

Curve K is the cross section for slow proton produc- 
tion in collisions of fast Hzt with Hə. This curve lies 
somewhat below curve B, indicating that the fast 
primary in Hz4-H» collisions converts to H* with a 
slightly higher probability than the target particle in 
Hs*-- Hs collisions. 

One of the more interesting results of this study is 
that the fast H atom yield from the pair H+H, 
measured at 10 keV, is only 3 as great as the yield of 
fast H atoms from the pair Hz*-- He from Ref. 1. This 
may perhaps be explained by the fact that the lowest 
repulsive state of H» is the triplet state 1?Z, which can 
only be reached from the ground state of H: by a spin 
flip or by electron exchange. Both of these processes are 
theoretically very unlikely compared with the simple 
near resonant process of electron transfer from an H, 
molecule to an Hs* ion, placing the newly formed 
molecule in a 1 *2, repulsive state. 

It is quite probable that the H» molecules which form 
the primary beam in our Hz primary measurements 
(Fig. 4) are not in their ground vibrational states. This 
follows from the fact that the Hs* ion has a nuclear 
potential minimum at a nuclear spacing somewhat 
larger than that of the H» molecule. This has the effect 
of causing the H;* ions to be formed preferentially in 
their third or fourth vibrational quantum level in which 
the vibrational excursion of nuclear spacing is quite 
large. When the Hz* ions are subsequently converted 
back to Hs by electron capture, the molecule may 
therefore be forced into very high vibrational levels. 
The degree of influence that vibrational excitation 
exerts on the Hy ionization and dissociation cross sec- 
tions has not yet been determined. Some evidence 
exists, however, of a dependence of Hat dissociation 


cross sections on vibrational excitation.^!9?? Recent, 


calculations?! have shown a strong vibrational depend- 
ence of ionization of Ht ions by electron impact. 
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This is a study of some of the properties of the discrete energy levels of an electron in an exponentially 
shielded Coulomb potential, which is known in plasma physics as the Debye-Hückel potentíal and in nuclear 
physics as the Yukawa potential. A system with this potential possesses a finite number of bound states 
and these states are not degenerate with respect to the orbital angular momentum. First-order perturbation 
theory is used to obtain simple, analytical expressions for estimating the energy levels; the approximations 
appear to be quite accurate for large values of the Debye length. The perturbation calculations lead to an 
estimate of the number of bound states as a function of the Debye length. The matrix elements of the 
Hamiltonian for this potential are calculated in the representation of the hydrogen atom wave functions. 
The Hamiltonian matrix may serve as a convenient starting point for several other methods of computing 


the energy levels. 


1. INTRODUCTION 


LTHOUGH the two-component gas with Coulomb 
interactions has not yet been adequately treated, 
certain results are available for various temperature 
and density regions. Specifically, the Debye-Hiickel 
potential is reported to be the effective two-body inter- 
action in the high-temperature, low-density region. 
This result is often interpreted to mean that in a 
hydrogen plasma a bound electron moves in the po- 
tential given by the Debye-Hiickel theory. By virtue 
of their spectra, atoms in a plasma serve as noninter- 
fering probes, and a study of the effects of the Debye 
shielding on the energy levels of atomic hydrogen may 
be of more than academic value. For example, in the 
highly excited bound states of atoms in a plasma the 
Debye shielding may be approximately realized, and 
one wishes to examine the effects of this shielding on 
the atomic partition function. 

It is the purpose of the present paper to estimate 
the energy levels of an electron in a Debye-Hiickel 
potential (which, in nuclear physics, is known as the 
Yukawa potential). Although certain aspects of the 
problem have been investigated by others; it is not 
apparent that the problem has been treated as gen- 
erally as might be desired. Ecker and Weizel? restrict 
their study to s states and their method of calculation 
is open to question.® Harris? has used a variational 
technique for making numerical estimations of several 
energy levels, but it appears that her results are not 
useful for laboratory plasmas, since she chooses mostly 
small values for the Debye length. Her energy levels 
for the larger Debye lengths are in agreement with those 
calculated from analytical expressions obtained in the 


present paper. 
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2. PERTURBATION CALCULATIONS 


The Debye (or Debye-Hückel) potential for an elec- 
tron in the shielded field of a singly-charged ion is ' 


V(r)2 — eletr). (1) 
Here e is the unit of charge and d is the Debye length. 
d=(kT/4rén)', (2) 


where & is the Boltzmann constant, T is the equilibrium 
temperature of the plasma, and wm is the electron 
density. The potential in Eq. (1) possesses a finite 
number of energy levels, and these energy levels are 
not degenerate with respect to the angular momentum, 

Atomic units are employed, so that the unit of dis- 
tance is 7?/ue and the unit of energy is —pe!/2n?, 
where u is the reduced mass of the electron. Instead 
of the radial wave function R(r), it is desirable to use 
P(r)=rR(r) so that the radial Schrédinger equation 
with the potential of Eq. (1) becomes 


dr? 


r 7 


CRD ppc I(H-1) 
( E w)P— 
where / is the angular-momentum quantum number 
and W is the negative of the energy eigenvalue of the 
electron. Note that W>0 for bound states. The Debye 
length for a hydrogen plasma is usually larger than 
10-5 cm, which, in terms of atomic units, means d> 107. 
The potential in Eq. (1) may be written as a Coulomb 
potential plus a perturbation U(r): 


V(r)- 2/r4-U (r) = 2/r+2((e-/4—1)/r). (4) 


It is seen that the perturbing potential U(r) becomes 
comparable to V(r) when r~d. For the unperturbed 
case the average value, (r), of r goes as V*, where N is 
the principal quantum number. Thus, it is expected 
that first-order perturbation theory will be accurate 
as long as Vd and becomes progressively less ac- 
curate as N? approaches d. It is to be noted that the 
Debye potential is not meaningful for the lowest energy 
levels, since the electron orbits will not encompass 


A1235 
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enough charged particles to realize any screening at all. 
Furthermore, for the lower states the orbital frequency 
of the electron may be large compared to the plasma 
frequency so that any static potential will be inap- 
plicable to the problem. 

The first-order energy shift AW; is given by 


AWi- (Prt Pud= | [Pyilr) PU()dr, (5) 


where the Py:(r) are the normalized zero-order eigen- 


229 (N 2-D) SF (—N+/+1, 
N? (014-1) (N —1— 1) Q/ N -1/d Ng? 


AW, 


where F is the hypergeometric function. 


If second and higher order terms in 1/d are neglected, 
AW;£-—2/ (d4-N?). 


RAY 


SMITH 


functions of Eq. (3) and in this case are the radia] 
hydrogen functions 


ee L | 

N m | 
ha 2r H! Le 2r 
—r/N{ — +f __ 

D. i) ae Gr (6) 


where Lye (2r/N) is the associated Laguerre poly- | 
nomial. The integration in Eq. (5) may be performed | 
and yields 


Py l (r) E 


—N--IH-, 214-2, 4d?/N?) 


, (7) 


Eq. (7) reduces to 
(8) 


Hence, to first order in 1/d, first-order perturbation theory provides the following expression for the energy levels 


of Eq. (3): 


Wezi/N:—2/ (d4-N2). 


(9) 


Although Eq. (9) indicates that energy levels of Eq. (3) are not degenerate with respect to the angular momen- 
tum, it is to be noted that the degeneracy still exists if second and higher order terms in 1/d are neglected. 


A useful approximation to AW, is obtained by expanding U(r) in a power series in 


becomes 


mr S 1 2) 4 1 * 
a ae Gone 


1/d. If this is done Eq. (5) 


1 
(Ape, (10) 


60d* 
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For Py;(r) in the form given in Eq. (6) Bethe and Salpeter? have an explicit formula for evaluating (r) through 


v—4. Equation (10) then becomes? 


B. 3 N? 
AWS- BN] 
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ce the J=0 values are obtained for a given N, a 

approximation for AW; with higher / values is 
yp UP w= (Pro, U Pro) 

; -E0D/ 2@\(d—N?). (12) 

at the angular 

d? and higher. 


N? 
oS ESO 1)— 3022) (—1)2-3(02-2) 2-2) 0—1)] 


[63N*—35*Q4-21—3)—5/(1--1) (3P-+3/—10)+12]. (At) 
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calculations are based on Eq. (11). (The lower N 
values are included only for completeness.) E 
If accurate estimates of the eigenvalues of the Debye 
potential become necessary, an efficient metho | 
computing the eigenvaluesis the Rayleigh-Ritz met 
This method involves diagonalizing the Hamil 
matrix computed in some convenient represe 
ccuracy of this method depends on a ji 

base vector: n 


to 


——————————————— c 
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Tase I. Eigenvalues (W) as calculated 
from perturbation theory. 


d W 
(a.u.) N 1-0 l=1 1-2  Unperturbed 
10? 1 0.9980 1.0000 
2 0.2480 0.2480 0.2500 
3 0.1091 0.1091 0.1091 0.1111 
4 0.0605 0.0605 0.0605 0.0625 
5 0.0380 0.0380 0.0380 0.0400 
6 0.0258 0.0258 0.0258 0.0278 
7 0.0185 0.0185 0.0185 0.0204 
8 0.0137 0.0137 0.0137 0.0156 
9 0.0105 0.0105 0.0105 0.0123 
10 0.0081 0.0081 0.0081 0.0100 
11 0.0064 0.0064 0.0064 0.0083 
12 0.0051 0.0051 0.0051 0.0069 
13 0.0041 0.0041 0.0041 0.0059 
10? 1 0.9802 1.0000 
2 0.2306 0.2305 0.2500 
3 0.0924 0.0923 0.0921 0.1111 
4 0.0447 0.0446 0.0444 0.0625 
5 0.0233 0.0232 0.0230 0.0400 


so that / values are not mixed) in this representation is 


(Py1HPyi)=(1/N*)bnnw + (Py1,UPyn), (13) 
where 


: 0 g 1/d—1 
(UP | Pr Jevar. (14) 
0 [4 


With Py: given by Eq. (6) the matrix elements of 
U (r), Eq. (14), become 


Agte-T 
A(QU,N I) — 
Qu—E8*Qu—E)* 


kk’ 
x nt 
Qu—Ek)Qu—£) 
Norte’ 
=A (NC) c Te 
(A2—k)*(ho— &')* 
‘ kk’ (13) 
XF (e a aM) , 
UNS Oak) Oak’) 
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4 142 
=1 — 
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The evaluation of these matrix elements is rather 
tedious but could be rapidly executed on an electronic 
computer. 


3. THE NUMBER OF ENERGY LEVELS FOR A 
DEBYE-HUCKEL POTENTIAL 


If Eq. (9) were exact, then the principal quantum 
number would satisfy the condition 


N*«d (16) 


since it is required that W>0 for bound states. In 
spite of the limited accuracy of first-order perturbation 
theory and the approximation made in arriving at Eq. 
(8), the result in Eq. (16) compares favorably with 
results of others?^ In fact, Eq. (16) is identical to an 
expression obtained by Ecker and Weizel. 

In connection with a hydrogen plasma, such an ex- 
pression as Eq. (16) is of limited usefulness, since 
other perturbations in the plasma must also be con- 
sidered,” especially for N?~d. For example, if the 
l degeneracy is neglected, it is not to be assumed that 
N — d? energy levels must be included in the atomic 
partition function. 


4. CONCLUDING REMARKS 


The principal value of the results of this study is 
that several approximate expressions are made availa- 
ble, which reveal some of the interesting properties of 
the exponentially shielded Coulomb potential. It does 
not appear that these results alone have immediate 
application to the calculation of thermodynamic func- 
tions, since other large perturbative effects must also 
be superposed on the perfect gas behavior of a hydro- 
gen plasma. At this point it is desirable to re-emphasize 
that the preceding calculations are relevant to plasmas 
only for large electron-protron separations and not for 
low-lying states. As already indicated, Eq. (16) should 
not be used in terminating the atomic partition func- 
tion. In this connection it should be mentioned that 
the well-known work of Inglis and Teller? which 
treats the effects of the interatomic Stark broadening, 
likewise should not be used in terminating the partition 
function. Their work is applicable only to estimating 
the number of distinct spectral lines emanating from 
a radiating gas: Even though the higher energy levels 
are Stark broadened to such an extent that the spectral 
lines coalesce into a continuum, these broadened levels 
still represent bound states and must be included in 
the atomic partition function. It is not at all clear how 
one should sum over such broadened levels. 


10D, R. Inglis and E. Teller, Astrophys. J. 90, 439 (1939). 
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Partial differential cross sections for slow neutron scattering from liquid sodium have been measured at 100, 

150, and 200°C. The Materials Testing Reactor phased-chopper velocity selector was used with a 95% i 

transmission sample at incident neutron energies of 0.025, 0.07, and 0.10 eV at eight scattering angles *. 
(16 to 85°). In the scattering law obtained from these cross sections, prominent coherent scattering maxima 
appear for energy transfers AE $0.2 &5T, but disappear beyond A/2—0.3 keT. At each temperature, time- 
dependent widths p(t) of the intermediate scattering function have been obtained by Fourier transformation 
of the scattering law for large momentum transfers. At times «1510755 sec these functions confirm the pre- 
diction of gas-like behavior, but at larger times are solid-like. The p(t)’s are also momentum-dependent at 
larger times indicating that the selí-correlation function is not strictly Gaussian. Measurements of the 
zeroth and first moments of the energy transfer have been obtained from the scattering law at each tem- 


perature for several values of momentum transfer. The zeroth moment is in agreement with neutron dif- 
fraction experiments, but the first moment is as much as a factor of 2.5 larger than predicted theoretically. 


I. INTRODUCTION 


Bl report gives the results of an experimental 
measurement of the partial differential cross 
sections for slow neutron scattering by liquid sodium. 
Besides the cross sections themselves, which are of 
interest to reactor technology, information about the 
dynamics of the motions of atoms in the liquid can 
be obtained from them. Extensive investigations of this 
type have been made for water'? and for liquid lead.? 
Sodium was chosen as the sample for this experi- 
ment because it meets the following requirements: 
(A) The liquid is monatomic so that no molecular effects 
occur. (B) The isotopic abundance is 100% Na? so 
that the scattering from only a single isotope is in- 
volved. (C) It is one of the few elements meetiug the 
first two requirements whose incoherent scattering cross 
section (1.85 b) is a large fraction of the total scattering 
cross section (3.4 b), and which also has a reasonably 
low absorption cross section (0.5 b). A relatively large 
incoherent cross section is desirable since it is only the 
incoherent scattering which yields information about the 
motions of individual atoms. A cold-neutron study of 
sodium has been made by Cocking! who studied the 
inelastic scattering in the range 80 to 200°C. 

The present experiment was performed using the 
Materials Testing Reactor (MTR) slow-neutron phased- 
chopper velocity selector at incident neutron energies 
of 0.025, 0.07, and 0.10 eV at sample temperatures of 
100, 150, and 200°C. The experimental data show that 


* Work performed under the auspices of the U. S. Atomic Energy 
Commission. 

1M. Sakamoto, B. N. Brockhouse, R. G. Johnson, N. K. Pope, 
J. Phys. Soc. Japan Suppl. BII, 370 (1962). 

2K, E. Larson, N. Dahlberg, Proceedings of the Chalk River 
Symposium on Inelastic Scattering of Neutrons im Solids and 
Liquids (International Atomic Energy Association, Vienna (1963), 
Vol. 1, p. 2 

3 B. N. Brockhouse, N. K. Pope, Phys. Rev. Letters 3, 259 


4S; J. Cocking, Proceedings of the Chalk River Symposium on 
Inelastic Scattering of Neutrons in Solids and Liquids (Interna- 
tional Atomic Energy Agency, Vienna, 1963), Vol. 1, p. 227. 
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marked inelastic interference scattering occurs up to 
energy transfers AE~0.2kgT. Time-dependent width 
functions obtained from the data show that the self-part 
of the space-time correlation function is fitted well by 
the Gaussian approximation at very short time, but at 
intermediate times, Gaussian width functions differ 
by as much as a factor of 2 for different momentum 
transfers. We have also measured the first energy 
transfer moment of the scattering law at several fixed 
values of momentum transfer including corrections for 
multiple scattering and find that these are as much as a 
factor of 2.5 larger than predicted. 


Il. EXPERIMENTAL PROCEDURE 


The MTR 4-rotor phased-chopper velocity selector 
used in this experiment has been described elsewhere in 
detail.5 This velocity selector produces bursts of nearly 
monochromatic neutrons of thermal energies by the 
use of two Fermi choppers spinning at the same speed 
(5000 rpm) and phased with each other. The scattering 
target is placed 0.5 m beyond the second chopper and 
the scattered neutrons are detected simultaneously at 
eight scattering angles by BF; counter banks located 
2 m from the sample. These detectors were at scattering 
angles of 16.3, 20.1, 26.0, 36.4, 47.6, 59.5, 72.1, and 
84.7°, and subtended 7.5? in the scattering plane except 
the detector at 20.1? which subtended 3.5°. 

The energy, flux, and time resolution of the incident 
neutrons are determined by a U?** fission chamber 
located in the transmitted beam 2 m from the sample. 
For each data run at a given temperature, the sample 
and empty sample are cycled alternately into the beam 
at 10-min intervals for a total running time of 48 h or 
longer. The data from each of the two holders are 
stored in a separate matrix of the 8192 channel time-of- 
flight analyzer. For this experiment, 256 10-uset 
channels were used to time-analyze the data for each 


( 5 35 M. Brugger, J. E. Evans, Nucl. Instr. Methods 12, 75 
1961). 
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angle and also for the beam monitor. Background cor- 
rections are made by subtracting the empty-holder 
data from the filled sample-holder data. If necessary, 
these data are normalized to the same counting rate 


at long and short flight times. After corrections for 
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RING FROM LIQUID Na  A1239 
detector efficiency, the data are then converted to 
cross section and scattering law. The data of this 
experiment comprise runs made at 100, 150, and 200°C 
at each of the incident energies of 0.025, 0.07, and 0.10 


eV. The runs at 100 and 150°C were taken at the 8 
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TABLE I. Velocity selector energy resolution 
for elastic scattering. 


E(eV) Al(usec)  At/t AE/E, — Ag 


A 0.07 eV 
a 0.1 eV 


dO(o.8)/dQ (barns / steradian atom) 


0.8 


Fic. 4. Diffraction pattern for liquid sodium. The solid curve 
at both temperatures is the crystal spectrometer result at 100°C of 
Gingrich and Heaton (Ref. 7). 


angles mentioned. At 200°C, additional detector angles 
of 84.4, 96, 107, 120, and 133° were used with a chopper 
speed of 6000 rpm. 

The liquid-sodium sample used in this experiment 
was in the form of a rectangular slab having a thickness 
of 0.25 in. It was made as thin as possible in order to 
reduce effects of multiple scattering, and the above 
thickness is a compromise between this criterion and 
reasonable counting rates. The calculated transmission 
for thermal neutrons incident normally on the slab is 
95%. In practice, the sample was oriented at 45° to the 
incident beam and was viewed in transmission by the 
detectors so that the effective transmission for this 
orientation was 91.5%. Multiple scattering in this 
sample is estimated to be less than 10% of the single 
scattering. 

The sample holder was made of two hardened alumi- 
num frames havinga0.25-in.-thick spacer between them. 
The frames sandwiched 0.001-in.-thick stainless steel 
windows against the spacer. The total window area of 
the holder was 54X3 in. Gasketing was accomplished 
by a narrow metal-to-metal contact around the edges 
of the window. No evidence of any leakage occurred 
at any time during the experiment. The difference in 
the thermal expansion coefficients of the aluminum and 
stainless steel kept the windows taut and no evidence 
of bulging of the sample was seen. The holder was made 
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0.025 36 
0.07 36 
0.10 36 
0.025 37.5 
0.07 35 
0.10 33.5 
0.025 31 
0.07 29.5 
0.10 30 


0.0393 
0.0658 
0.0786 
0.0410 
0.0143 
0.0735 
0.0323 
0.0567 
0.0657 


0.079 
0.132 
0.157 
0.082 
0.129 
0.147 
0.0645 
0.113 
0.131 


taller than needed and was filled to less than capacity to 
allow for expansion of the sodium on melting. This fur- 
ther reduced any chance of bulging. The holder was not 
sealed from the atmosphere and in order to prevent ex- 
cessive oxidation, it was operated under an argon purge. 
The purge volume was above the top of the beam so that 
there was no scattering from the argon. The window 
area was made large enough to accommodate the entire 
beam area (4X 1.30 in.) and the frame was covered with 
cadmium to reduce the possibility of secondary scat- 
tering from the frame. Both the filled and empty 
holders were heated by 150-W cartridge heaters in- 
serted into the top and bottom edges of each sample 
holder frame. The temperature was controlled auto- 
matically using a chromel-alumel thermocouple located 
at the edge of the spacer frame. After completion of 
the experiment, the sample holder was disassembled 
and the sodium sample analyzed. Spectrographic analy- 
sis showed only traces of Al, Ca, Cu, Fe, Mg, and Si. 
X-ray reflection patterns showed a thin film (~ 100 x) of 
NaOH and NaO on the surfaces. The NaOH is believed 
not to have been present during the experiment but 
was formed afterward due to H:O contamination of 
the kerosene used to store the disassembled sample. 


II. EXPERIMENTAL RESULTS 


Typical partial differential cross sections for liquid 
sodium are shown in Figs. 1-3 for an Ep of 0.025 eV at 
each of the three temperatures. The counting rate data 
used for these figures were not smoothed. The five- 
point data smoothing" technique® previously used, 
although it works very well for gases, is not satisfactory 
for sodium because the narrow quasielastic peaks 
(especially at forward angles) are comparable in width 
to the number of channels used to smooth the data. 
The result is that the smoothing gives counting rates 
that are too low (or even negative) at the base of the 
peak where the slope is changing rapidly. The error 
flags shown in the figures are counting statistics only. 
The 150 and 200?C data are of somewhat better 
quality than that at 100°C since the run times for these 


5 P. D. Randolph, R. M. Brugger, K. A. Strong, R. E. Schmunk, 
Phys. Rev. 124, 460 (1961). 
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Fic. 5. Reduced partial differential cross sections for sodium at 100°C. 


were each of 64 h, whereas the 100°C data were run 
only for 48 h. 

A measure of the energy resolution for elastic scat- 
tering is given by the burst shapes of the incident beam 
which are also shown in the figures. Table I gives a 
summary of the widths of the bursts A/ (full width at 
half-maximum) measured by the beam monitor 2 m 
from the sample. This is calculated by At/to where to 
is the sample-to-detector time-of-flight at the incident 
energy Eo and AE/Eo=2(At/to). The resolution is also 
expressed in terms of AGB=AE/2ksT, where T is the 
sample temperature. 

The improvement in resolution at 200°C is due 
to the higher chopper speed (6000 rpm) used at this 
temperature. 

The spectra for each temperature show two general 
features, a narrow quasielastic peak and a low broad 
spectrum of inelastically scattered neutrons. At the 
three forward angles, where the momentum transfers 
are small, the quasielastic scattering dominates and the 
inelastic scattering is very small. At 36°, the quasi- 
elastic peak is quite large compared with the other 
angles and the inelastic scattering has become larger. 
The momentum transfer for this quasielastic peak cor- 
responds to the first maximum in the diffraction pattern 


so that this peak has a large contribution from coherent 
scattering. As the scattering angle continues to in- 
crease, the quasielastic peak becomes small or dis- 
appears and the inelastic scattering dominates the 
spectrum. The fact that at 200°C a quasielastic peak is 
seen at all angles is probably due to the improved 
resolution used at this temperature. Except for this, 
the spectra at each angle are qualitatively the same, 
and only small quantitative differences in the inelastic 
spectrum are present. At the 0.07 and 0.1 eV incident 
energies which are not shown, the data cannot be 
easily resolved into a quasielastic and an inelastic part. 
The cross sections shown in these figures are normalized 
to the total cross section value of 3.4 b. 

Figure 4 shows the diffraction pattern obtained at 
100 and 150°C by integrating the observed cross sections 
over the scattered wavelength. These are plotted as 
do (A9)/dQ versus s=sin}9/ho, and each plot includes 
the data from each of the three incident energies. The 
solid curve is the 100?C result of Gingrich and Heaton? 
obtained using a neutron crystal spectrometer and is 
normalized to the data at large s. The agreement is good 
except at the smaller values of s, where the finite 


1N. S. Gingrich, L. Heaton, J. Chem. Phys. 34, 873 (1961). 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


RANDOLPH 


ID 


Pa 


A1242 


EH 
= 


= 23 mn E 
kat P 


M 


LAW {50°C RES 
cimi 


E SODIUM. SCATTERING 


(WojD UDIpDJ8JS A9/SUJ 


0.1 


10 


10 


10 


IKI (angstroms -1 ) 


ED LLLI TL TU DUI Le CA eran S 
el 0.1 


10 10 


pus Lopate 


0.1 


0.01 


0.1 


0.01 


0.1 


0.01 


Fic. 6. Reduced partial differential cross sections for sodium at 150°C. 
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Fic. 7. Reduced partial differential cross sections for sodium at 200°C. 
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angular resolution of the detectors becomes important. 
The two major diffraction maxima are clearly evident. 
A The complete presentation of all the data is given 
in Figs. 5-7 in terms of the scattering law. This form is 
used since it is representative of all the data and is a 
very compact form. The form of the scattering law 
used is the “reduced partial differential cross section?” 
Sr(|x|, ñw) and is defined by 
d? (Eo, E,9)/dEdQ. 
= [E/ Eo]? exp[ — /o/2Rk5T ]Sn 57e), (1) 
where 
Jio E— Eo, 
x=k—ko, 


where Eo, E, ko, and k are the energy and reduced wave 
vectors of the incident and scattered neutron, respec- 
tively, and @ is the scattering angle. This form obeys 
the conditions of detailed balance and is even in the 
energy transfer and in the magnitude of the momentum 
transfer. The results at each temperature are plotted 
as Sp versus « for fixed values of tw (in units of &5T). 
This form has the dimensions of a cross section (b/eV sr 
atom) and is related to the dimensionless form? of 
the scattering law S(@,8) by 


S n (fto) = (0»/4rkeT)S (ab), (2) 
where 
a-—I$ZnMksT, 
b= hw/kgT. 


The mass of the scattering atom is given by M, T 
is the absolute temperature of the sample, and c; is 
the bound-atom cross section. Scales for S(e,8) are 
also shown in the figures. 

The range of momentum transfers spanned at each 
incident energy overlap so that in these regions, the 
scattering law is overdetermined. At any fixed incident 
energy, the smallest scattering angle has the smallest x 
value. At low-energy transfer (80.1) the effects of 
finite energy resolution cause the data from the different 
incident energies to form separate curves at the smaller 
angles, the higher energy curves being below the lower. 
At larger angles where resolution effects are small, the 
different incident energies give a single curve. In spite 
of the energy resolution, the effects of interference 
scattering are clearly present. The interference maxima 
are seen, corresponding to those seen in Fig. 4. These 
maxima are present at each temperature out to energy 
transfer Zio <0.25 ksT, and disappear beyond this. This 
fact demonstrates that in sodium, the convolution 
approximation? is not adequate for energy transfers 
larger than about 0.25 kgT since it predicts contributions 


8R. M. Brugger, Atomic Energy Commission Research and 
DOREM Report No. IDO-16694 (Rev) 1962 (unpublished). 

3 P. A. Egelstatt, Proceedings of the Vienna Symposium on ibe 
elastic Scattering of Neutrons in Solids and Liquids (Internationa! 
Atomic Energy Agency, Vienna 1961), p. 25. 

10 G, H. Vineyard, Phys. Rev. 110, 999 (1958). 
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Fic. 8. Smooth curve scattering law for sodium at 100°C, 


to the coherent scattering that have magnitudes which 
are independent of the energy transfer. For small 
energy transfers at low-momentum transfers, however, 
cold-neutron results! show that the convolution approxi- 
mation is in agreement with the experiment. 

Smooth-curve scattering laws were obtained for each 
temperature by making a hand-drawn fit to the data. 
These are shown in Figs. 8-10. They are plotted as 
reduced partial differential cross sections S$ r(x fuo). 
Also shown are scales for the Egelstaff scattering law 
S(a,@). In drawing these, an attempt was made to 
reduce the effects of energy resolution by drawing the 
curve at low « values for 8<0.3 to follow or lie above 
the lowest incident energy data." Qualitatively, these 
curves are quite similar at each of the three temperatures 
indicating that from 100 to 200°C little change occurs 
in the properties of the liquid. All the smooth S. 1097) 
curves and the data plots are normalized to the total 
cross section using the value s— 3.4 b. 


IV. GAUSSIAN WIDTH FUNCTIONS 


As was first shown by van Hove,” the scattering law 
is related to a time-dependent pair-correlation function 
for the system by a double Fourier transformation. 


u R. M. Brugger, Phys. Rev. 126, 29 (1962). 
11 L. van Hove, Phys. Rev. 95, 249 (1954). 
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Fic. 9. Smooth curve scattering law for sodium at 150°C. 


Various theoretical models for this correlation function 
have been studied for simple liquids. Vineyard” was 
the first to propose a specific model for the self-part of 
the correlation function which describes the self- 
diffusion of the liquid. The model proposed by Vineyard 
is that the self-part of the space-time correlation func- 
tion is Gaussian in its space dependence with a time- 
dependent width. This model was shown to hold for 
a perfect gas, an oscillator, a Debye lattice, and for an 
atom obeying the classical equation of diffusion. This 
model is usually assumed to hold for simple liquids, at 
least as a first approximation. Indeed, it has been 
shown by Rahman et. al., that the Gaussian assump- 
tion enters as the first-order term in an expansion of the 
so-called intermediate scattering function in powers of 
K. 

In order to find out from experiment how well the 
Gaussian assumption holds, it is more convenient to 
work with this intermediate scattering function rather 
than with the correlation function or the scattering law. 
Defining a width function p(/) by the Gaussian self- 
correlation function G,(z,!), 


G.(r,) — [ro (0) ] expire] (3) 


then the intermediate scattering function Z,(«,/), which 


13 A, Rahman, K. S. Singwi, A. Sjölander, Phys. Rev. 126, 986 
(1962). 
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is the Fourier transform of G, over space, may he 
written as a Fourier transform of the scattering law over 
the energy transfer. 


T, (x,t) =expL—p()/4 = af l cos (87)S;.(a,8)dB, (4) 


where in the right-hand member, we have used the 
symmetric form of the scattering law, and r= (keT/h)t, 

The use of the symmetric form of the scattering law 
implies that p(0)=7?/2MkeT rather than p(0)=0. This 
constant term is of purely quantum-mechanical origin 
and is very small. From Eq. (4), if the Gaussian assump- 
tions hold exactly, measurement of S(o,8) at fixed 
a (i.e., fixed i?) in principle allows a determination of 
p(t) as was first pointed out by Vineyard.” Since S(q,8) 
is known for a range of a, then p(t) is overdetermined. 
If the Gaussian assumption is exact, then the p(t) ob- 
tained by Fourier transforming S at different values of 
a should give the same function and, hence, furnishes 
a test of the assumption. 

Fourier transformation of the scattering law for 
liquid sodium has been performed at each of the three 
temperatures for several values of a using a program 
previously written.” Since it is only the noninterference 
portion of S that contains information about the 
diffusive motions, the range of œ at which the trans- 
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Fic. 10. Smooth curve scattering law for sodium at 200*C. 
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formation was computed for «20.1 at cach temperature 
using the smooth curve scattering laws shown in 
Figs. 8-10. It is seen that for «20.1, the strong inter- 
ference effects have disappeared. The advantage of this 
region of æ space is that no approximate method need be 
used to separate the interference and the noninterference 
parts of the scattering. The disadvantage of this is that 
Fourier transformation at large @ restricts the range 
for which p(t) can be obtained to relatively short times. 
In making these transformations, S (v,8) was assumed to 
go linearly to zero beyond B=2. In all cases, this gave a 
zero value for S(af,) at 82.4. 

Figure 11 shows the resulting width functions plotted 
as a function of time for various fixed values of a. 
Also, there is shown for comparison, the slope character- 
istic of a simple diffusion model and the width functions 


for a perfect gas. Generally, the shapes of the various 
p(t) are the same for cach temperature. In none of the 
cases is p(t) independent of æ for £z 1X 107 sec, so 
that in this time range, the Gaussian assumption does 
not hold exactly. At a given time, the experimental 
p(t) values drop by a factor of about 2 in the « range 
covered. For times less than 12€ 10^? sec, the curves 
agree well and follow the gas behavior expected for 
very short times, The apparent large differences in the 
p(L)'s for £<0.1%107 sec are due to the use of the 
logarithmic ordinate; on a linear plot these portions 
of the curves fall very close to each other and the aver- 
age value of p(0) is p(0)22A*/2M ks T. At larger times 
(£2 1X 107? sec.), the slopes of the p(t) curves are in all 
cases much less than that of the simple diffusion model. 
Assuming that the experimental p(/) behavior is ap- 
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proximately Gaussian, and that at sufficiently long 
times it should be proportional to /, characteristic of 
simple diffusion, it is seen that in the time range avail- 
able, such diffusive behavior has not yet occurred. The 
flatness of the curves indicates that for /« 10x 10 
sec, the behavior is more nearly that characteristic of a 
solid. Taking p(/) —0.5X 1079 cm? at /—10X 10? sec 
then the rms value of r obtained assuming Eq. (3) 
is 0.87 A. The nearest neighbor distance ro is 3.83 À so 
that at such a time the atom is still within the cage of 
its nearest neighbors and diffusive behavior is not 
expected. On the other hand at /£0.8X 105, the end 
of the region of gas-like behavior, p(/) - 2X 1075 cm? 
and the rms radius is r=0.55 A. Thus, the sodium 
atoms move as free particles for only 15% of the nearest 
neighbor distance before feeling the effects of binding. 


V. MOMENTS OF THE SCATTERING LAW 


The results obtained in this experiment have also 
been used to obtain the first two moments of the energy 
transfer with respect to the scattering law. Theoretical 
expressions for these moments were first obtained by 
Placzek! and have been discussed for classical systems 
by DeGennes.!5 Rahman et. al.,!* have recently rederived 
the moment relations for the quantum case in a very 
clear and concise fashion. 

Letting (6°) be the zeroth and (8!) the first, then the 
two lowest energy moments are given in terms of the 
symmetric scattering law by 


(92 | cosh (36)5 («,8)48 
T (1/00) koinot con 1+ y (x) J} » © 


(g)=2 | B sinh(6)S(a,6)d8=a, (6) 


14 A brief report on the results of this section has been given in 
Phys. Letters 3, 162 (1963). 

16 G. Placzek, Phys. Rev. 86, 377 (1952). 

16 P, D. deGennes, Physica 25, 825 (1959). 
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where o», Cine and Seon are the total bound, incoherent, 
and coherent scattering cross sections, respectively, and 
1--y(k) is the structure factor for the liquid. These 
relations include the effects of both coherent and in- 
coherent scattering. The integrations are to be per- 
formed at constant momentum transfer (i.e., constant a) 
rather than for a fixed scattering angle. Equation (5) 
is essentially a statement that the integration of the 
double differential cross section over outgoing energies 
for fixed momentum transfer is proportional to the 
diffraction pattern. Equation (6) is a sum rule, the 
longitudinal f-sum rule which is well known in other 
aspects of the many-body problem. It is rigorously cor- 
rect for systems which are reflection invariant and for 
which the interactions between particles of the system 
are velocity-independent." Roughly speaking, it states 
that the mean energy transferred by the neutron is 
equal to the translational recoil kinetic energy of the 
scatterer. 

The integrations in Eq. (5) and (6) were performed, 
using a planimeter, for several fixed values of momen- 
tum transfer in the range 0.004<a<0.2 at each tem- 
perature. Figure 12 shows some representative in- 
tegrands of (8!) at the middle and the extremes of the 
range of logo. The values of S(a,8) were taken from the 
normalized smooth scattering law curves of Figs. 8-10. 
The error flags on the plotted points were estimated at 
each 8 by using the lower limit of the spread of the 
S(o,8) data points of Figs. 6-8. The curve drawn 
through the points indicates the area of intergration. 
In all cases beyond 8—2, the integrand was extrapolated 
to zero in the manner shown. 

The results of the moment measurements for each 
temperature are shown in Fig. 13. These are shown as 
the ratio of the observed experimental moments to 
calculated value at the various values of a. The theoretl- 
cal zeroth-moment ratios [the right-hand side © 
Eq. (5)] at all three temperatures were calculated 
using for 1d-y(x) the experimental diffraction results 


11 D, Pines, in The Many Body Problem, edited by D. Pines 
(W. A. Benjamin Inc., New York, 1961), p. 39. 
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TABLE TI. Ratio of observed-to-theoretical moments of 
the energy transfer. 


Temperature 100°C 150°C 200°C 
£(B)ovs/ (8°) uer Jav 1.06 1.14 14 1 
[48 )os./a Juv 2.76 2.30% 2.25 1.828 1.66 1.428 

2.75 


[48 )obs/a max 2.05 2.649 222» 192 1.80 


^ Values corrected for multiple scattering. 


of Gingrich and Heaton? at 100°C. The results for the 
zeroth-moment ratios cluster near the value 1 in agree- 
ment with Eq. (5). The measured first moments, how- 
ever, are in marked disagreement with the expected 
values, being as much as a factor 3 larger than the 
calculated value. In addition, the discrepancy is tem- 
perature-dependent being largest at the lowest tempera- 
ture. At each temperature, the first-moment ratio also 
exhibits an apparent a dependence, the ratio decreasing 
at large and small o. The decrease at large a would be 
expected since for sufficiently large momentum trans- 
fers, the system would act like an ideal gas, and hence, 
would obey the sum rule. Also, the experimental trend 
indicates that for a sufficiently small, the sum rule may 
also be obeyed. In Table II are shown the mean values 
of the moment ratios plotted in Fig. 13. Also shown are 
the maximum values of the first moment ratio. 

The errors on the first-moment ratio values in Fig. 13 
are estimated to be +35% as obtained from the spread 
in the scattering-law data points at the various beta 
values. It is seen that even on the low-limit side of the 
errors, the first-moment ratios would still disagree with 
the predicted values. The two major sources of system- 
atic error which could lead to high values of (6')obs 
are the finite energy resolution of the velocity selector 
and multiple inelastic scattering in the target. Qualita- 
tive corrections for finite resolution, which were made 
when drawing the scattering law, were checked at 100°C 
by convoluting the known velocity-selector burst shape 
with the partial differential cross section regenerated 
from this smooth-curve scattering law at the smallest 
scattering angle (16.3°) and the highest incident energy 
(0.1 eV), where resolution effects are the largest. The 
resulting cross section agrees within +10% with data 
for that angle. Also, the effects of energy resolution are 
in general most pronounced for the smallest energy 
transfers (8<0.2), and this region contributes little 
to the first moment integral (see Fig. 12). In any case, 
at large scattering angles (large alpha) where the cross 
section peaks are broad compared with the burst width, 
energy resolution effects would be small. 

Since the target is thin, scattering only 8% of the 
incident beam, multiple scattering effects should be 
small. In order to determine to what extent multiple 
scattering affects the first moment, an estimate of the 
multiple scattering was made by fitting S(a,8) to a 
polynomial in œ and assuming that the zeroth-order 
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Fic. 13. Ratios of the observed to the calculated zeroth and 
first energy transfer moments of the scattering law for sodium at 
100, 150 and 200°C. 


term arises from multiple scattering," i.e., 
S (o,8) 5 Ac(8)-- Ax(8)ac- A» (8)e? 4 As (B)? 3- * * - 


Since the first term in a theoretical expression of S(a,8) 
is proportional to a and since multiple scattering is 
expected to be roughly isotropic and hence independent 
of a, it is assumed that the term A o(8) is due to multiple 
scattering. The first moment ratio, corrected for multiple 
scattering is then given by 


(8) core ‘= ((8*)./0)) — (8.2/0), 


where the correction term (8) is obtained from the 
same integral as Eq. (6) with S(a,8) replaced by Ao(@). 
This correction is not expected to be exact, but the 
procedure is a reasonable one and is the best practical 
one available at the present time. Its accuracy is un- 
known and cannot be determined until experimental 
measurements of multiple inelastic scattering are 
available. 

The S(a,8) curves of Figs. 8-10" for 820.25 were 
least-squares fit to a third-order polynomial in æ in 
order to obtain Ao(8). Scattering-law values for B<0.25 
were not used since the extrema in § (aß) caused by 
coherent scattering would necessitate a much higher 
order polynomial. The values of A,(8) for 8<0.25 were 
obtained by extrapolating Ao(8) to 8=0. Table III 
shows the amount of multiple scattering compared to 
the peak value of S(a,8), ie Ao(0)/S(@,0) at each 
temperature together with the value of (82), the cor- 
rection to the observed first moment. Table II gives 
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Tapte II. Multiple scattering correction to the observed 
first energy transfer moment. 


Temperature °C 100 150 200 
A(0)/S (a,0) 0.027 0.059 0.092 
(B) 0.0064 0.0063 0.0072 


the values of the average and maximum of the first- 
moment ratio, corrected for multiple scattering. The 
resulting first-moment ratios corrected for multiple 
scattering are shown in Fig. 13. Since the correction to 
the first-moment ratio is proportional to a, the largest 
effect occurs at small a. At small a the correction is large 
and accounts for most of the measured ratio and may be 
an over correction. At large momentum transfers the 
correction becomes negligible. In general, for alpha 
>0.010 at all temperatures the corrected moments are 
still much larger than predicted theoretically. A con- 
ceivable explanation for this is that velocity-dependent 
interactions might be important in the liquid or else 
that reflection invariance does not hold. However, 
it is extremely unlikely that reflection invariance would 
fail in a monatomic liquid. Also there is little reason to 
expect, nor is there other evidence to support the 
hypothesis that velocity-dependent interactions play 
any great role in a liquid. Additional scattering experi- 
ments of higher precision would be needed to make such 
hypotheses tenable. 


SUMMARY AND CONCLUSIONS 


The scattering law for liquid sodium between 100 
and 200?C has been measured in the range of momentum 
transfer 1 €x € 10 A~ for energy transfers out to about 


CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


RANDOLPH 


0.06 eV. The scattering law shows pronounced maxima 
due to inelastic coherent scattering for energy transfers 
below 0.2 kT, but which disappear for larger energy 
transfer, demonstrating that the convolution approxi- 
mation does not hold in general for liquid Na. Time. 
dependent Gaussian width functions obtained by 
Fourier transformation of the scattering law at severa] 
momentum transfers show that sodium behaves as an 
ideal gas for times less than 1.0X 107? sec corresponding 
to an rms atomic displacement of 15% of the nearest 
neighbor distance, but for larger times behaves more 
like a solid. In the time range available, [1010-3 
sec, there is no evidence that simple diffusive behavior 
has been approached. The width functions for times 
greater than 1X107! sec are momentum-dependent, 
showing that the Gaussian assumption for the self- 
correlation function does not hold rigorously for momen- 
tum transfers larger than K~6 Â~. Measurements of 
the two lowest moments of the energy transfer show that 
the zeroth moment gives the expected results but that 
the first moment is much larger than the theoretical 
values for a wide range of momentum transfers. The 
cause of this discrepancy remains to be explained. 
Whatever the underlying physical reasons for the dis- 
crepancy, it should not be peculiar to sodium, but should 
also appear in other metals. In any case, it would be of 
interest if energy transfer moments were measured in 
other metallic liquids. 
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A quantum-mechanical variational calculation of the energy per particle, the equilibrium nearest- 
neighbor distance, and the pressure-volume relationship for solid neon at 0^K is performed. Use is made of a 
Jastrow-type trial wave function, a phenomenological Mie-Lennard-Jones 12-6 potential, and a cluster 
expansion evaluation of the energy expectation value. It is found that correlation effects are negligible at 
equilibrium, contrary to the speculations of some authors, and that good agreement with the experimental 
data can be achieved if one of the interparticle potential parameters, e, is changed by a small amount from its 
accepted gas-phase value. This parameter change is discussed. 


I. INTRODUCTION 


HE atoms of the inert gases have closed electronic 
shells which minimizes the importance of the 
details of their atomic structure in any discussion of 
the solid phase of these elements. The fact that only 
the gross properties are significant makes these elements 
ideal for testing basic principles of quantum mechanics 
and solid-state theory. Despite the straightforwardness 
of theoretical treatment, some discrepancies have been 
found in comparison of theory and experiment. For 
one, it is found that the predicted cohesive energy of 
solid neon at 0°K is too small? (One writer? has 
predicted a cohesive energy which is in agreement with 
experiment; we will discuss this result in Sec. IV.) 
Some authors'? have speculated that the disagree- 
ment of the theoretical and measured values is caused 
by correlation effects. We will study this conjecture by 
using a Jastrow-type trial wave function‘ in a vari- 
ational calculation. A cluster expansion will be used to 
evaluate the energy expectation value. The Mie- 
Lennard-Jones 12-6 potential is used to describe 
phenomenologically the electron interaction energy. 
We will show that correlation effects are negligible in 
solid neon near the equilibrium density and that theory 
and experiment can be brought into reasonable agree- 
ment by altering a Mie-Lennard-Jones potential 
parameter value by a small amount. 


II. THE SCHRÖDINGER EQUATION AND THE 
TRIAL WAVE FUNCTION 


The Schródinger equation for both electrons and 
nuclei of the solid is 
HY (xx) - EV (xx), (1) 
where 


H=H(x,p,Xe,Pe) = (1/2m) x p2+ (1/2m.) 2 Pej 


+U (x,x.), (2) 


with m, X; pi, the mass, position vector, and momentum 


s Phys. Rev. 112, 1534 (1958). £ 
i Bernardes, ya G. L. Shaw, Phys. Rev. 128, 546 (1962). 
3 N. Bernardes, Phys. Rev. 120, 807 (1960). 
‘R. Jastrow, Phys. Rev. 98, 1479 (1955). 


operator of the ith nucleus, respectively; Me, Xej, Pej, 
the mass, position vector, and momentum operator of 
the jth electron. U (x,x.) is the total electrostatic energy 
of the system, with the symbol x, for example, repre- 
senting the set of vectors Xi, Xo, ---. The Born-Oppen- 
heimer approximation? can be applied to Eq. (1) to 
yield an equation for the nuclear motion alone, 


[(1/2m) x p?4- V (x) W(x) = Fy(x). (3) 


The wave function W(x) is that of the nuclei; V(x) is 
the electronic energy. 

The most widely used form for V(x) is the phe- 
nomenological Mie-Lennard- Jones 12-6 potential 


V (x) 4e 27 [(s/x:)^ — (s/x)*], (4) 


i<j 


where z;-—|x,—x;|. Gas data are normally used to 
evaluate o and e. 
If we now let 
A= A (2ms?*e), (5) 


then the combination of Eqs. (3) and (4) gives 


[-X E VH E (eer W(x) = LV), (6) 


i<j 


where x; and E are now dimensionless, i.e., we have 
made the substitutions x;;/« — x;; and £/e— E. 

Our procedure is to solve Eq. (6) by means of a 
variational technique. Because of the large repulsive 
term, x;7?, in the potential (4), a wave function for 
the solid must tend to zero when any two nuclei ap- 
proach closely. One way to provide for this is to 
approximate the total wave function. (x) by a product 
of single-particle, localized, nonoverlapping wave 
functions, 


p 
V(x) 2 II é(x;— R2. (7) 

i=l 
R; is the position of the ith lattice site. Bernardes'? 
chose ¢(x;—R,) to be the wave function of a particle 
moving freely in a sphere centered at R,. The radius 


5 M. Born and R. Oppenheimer, Ann. Phys. 84, 457 (1927). 
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of the sphere was used as a variational parameter. In 


a 
this theory an atom feels a force exerted on it by each (—344?). By Eq. (12) 
of the other atoms, but otherwise its motion is inde- 
pendent or “uncorrelated” with that of the others. v2S(i,j) = — (C/2)n(n— 1)/xi"** ]. (15) 
A more realistic wave function is one of the Jastrow- 
type Thus Eq. (13) becomes 
N 
x)= x;— R; Eik) 5 8 PRAE Ww 
VG) - IL eG R2 H fen) (8) Ee. b 
where 2—342N bay 
f(xx)—0 if 2.70 (9) E=3ANN+ , (16) b 
f (xix) = if Tjk — 0. esprdx 
The factor f(x.) is the “correlation” function. The " "T HA 
wave function (8) gives each nucleus a larger effective where the "effective" two-body potential is 
potential well than (7) since it allows it to move more PN ; t a 
freely toward and in between neighboring lattice sites. A V (i,j) 9 4x P+ pr mg), (17) 
Divergences in the expectation value of V(x) are with Mor 
avoided, even if the $(x;— R;) are overlapping, by the p= (C/8)Nn(n—1). (18) 
vanishing of f(xj;) as xj — 0. » 
As normalized single-particle wave functions we We evaluate the matrix elements of V;; in Eq. (16) 
choose by means of the cluster expansion technique.5-3 Fol- 
$(x:—R,) = (24/7) exp[—4 (x;-R,*], (10) lowing the method of Iwamoto and Yamada® we write 
where A is a constant. S 1/ (5. Te Sbdx 
We write 2 eS Y (i, j)e5®dx 
Jen) =exp[S G8], (11) =—Inivis)| O 
and take ee 0g B=0 
SG) e —MC/sa?), (12) fewa 
where C, n are positive constants. It is clear that our * A wer - 
f (xx) satisfies the conditions of Eqs. (9). The constants where the generalized normalization integral,” Iw (8), 
C, A, and z are variational parameters. is defined by 
II. THE ENERGY EXPECTATION VALUE AND Iy(8)= | exp(2S-+BV) dx, (20) 
ITS CLUSTER EXPANSION 
The expectation value of the energy is given by and Ü-Y«; fj). Define the “subnorm alization” 
integrals by | 
{| eSe[ —X > V2--V (x) Je59dx š 
E= ; 8) 5 


|| eSHdx 


where 6=][;¢(x:—R,) is real; S=Di<;S(i,j), and 
dx— dxidxs- - -dxy. If we integrate by parts we find 


i be5V7beSdx= Í eS(pv2(82)— (Ve)? ] 


+32 Z véSQg)dx. (14) 

iGAi) 
(See Ref. 6 for details. Note the absence of three- 
particle terms which occur in earlier Jastrow-type 
calculations.) For our Gaussian wave functions, the 


380 W. Jackson and E. Feenberg, Ann. Phys. (N. Y.) 15, 266 
(1961). 
? J. W. Clark and E. Feenberg, Phys. Rev. 113, 388 (1959). 
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term in square brackets of Eq. (14) becomes simply 


T= l= [Gs Rad, (212) 


Ta ilg: | J paR) 


Xg (x;— Rj)g(1,2)dxidxo , (21b) 
Ig Gjk|g(1,2282,3)8(3,0D 558) » (210) 


Inge = (iriz in) TI (5,8) | iia: im) (21d) 
1<y<é<m 
where 


g (mô) = expL25 (v,8) +80 (7,8). (22) 


3 F. Iwamoto and M. Yamada, Progr. Theoret. Phys. (Kyoto) 
17, 543 (1957); 18, 345 (1957). 
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Using the subnormalization integrals we define the 
cluster integrals 


Y;-li, (23a) 
Yj-lj—l;, (23b) 
Yi-ljg—lg—l&i—Iu—Ijj., (230) 
etc. 
We find 
Y,;=1, (24a) 
Y= (1j|h(1,2)|ij), (24b) 
Y i (ijk| (h(1,2)h(2,3)h(3,1)+4(1,2)2(3,1) 
+h(1,2)h(2,3)+h(2,3)h(3,1) ]|izk), (24c) 
where 
h(y,0)— g(y,0) —1. (25) 
In Ref. 8 it is shown that 
Ty (6)=JI Y *, (26) 
where 
G—$2/ yaa DU! 4$—3i 17 yays 
1j 1,7 1,5; k 


"EG Spare (UU 
atk 


and, for example, y;;— Y ;j/ Y;V;. The prime on each of 
the summations of Eq. (27) means omit those terms 
for which two or more indices are equal. Equation 
(27) is an expansion in powers of the function /(y,6). 
We will consider only terms with two or fewer factors 
of A(y,ô). Those terms symbolized by the dots in Eq. 
(27) are of higher order in 4(y,ô) than we wish to 
consider. 
We find from Eqs. (16)-(27) 


E=3ANN+} X! GjlW(,2)5) 
-4z' GIW a2) Gl» 0,2155) 
-S Gjln(12 i3) GRIW (2) 92) 
i,j,k 


+I Gjk|W(,2)n(,,3) E38) ----, (28) 


nk 
where 2 
W (7,8) =25%V (7,8) , (29) 
and 
n(v,8)=h(7,8) | geo (569 — 1). (30) 


Equation (28) includes terms no higher than first order 
in the function (7,6). Succeeding terms will be higher 
order in (7,6). We will discuss the contribution of such 
terms later. Note that the new “effective” potential 


W (7,6) contains a factor 


exp[25 (y,8)]  expL- O 
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which insures convergent results for the matrix elernents 
in Eq. (28). 


IV. REVIEW OF PREVIOUS THEORETICAL 
TREATMENTS 


Bernardes has made two variational calculations, 
cach of which involves the use of a trial wave function 
which is a product of single-particle wave functions 
each spherically symmetric around a lattice site. In 
his first calculation! he found an energy per particle of 
—420 cal/mole compared to the experimental value of 
—450+10 cal/mole.? His second calculation? gave a 
value of —445 cal/mole in good agreement with 
experiment. Nosanow and Shaw,? however, have done 
a Hartree calculation resulting in a theoretical value 
of —431 cal/mole. They used the same potential 
parameters, o and e, in Eq. (4), as did Bernardes. The 
Hartree technique used by Nosanow and Shaw yields 
the best result in a variational computation of the 
energy with a trial wave function which is a product 
of single-particle wave functions each of which is 
spherically symmetric about a lattice site, We may 
speculate that Bernardes’ first result does not agree 
with that of Nosanow and Shaw either because his 
wave function lacks sufficient variational flexibility or 
because an expansion he makes does not converge 
sufficiently rapidly. As he discounts both of these 
possibilities we are unable to resolve the discrepancy. 
Bernardes’ second calculation leads to an energy which 
is actually lower than the result of the Hartree tech- 
nique. Since our results, given in the next section, agree 
with those of the Hartree technique we are inclined to 
believe that Bernardes' agreement with experiment is 
fictitious. Possibly he did not keep a sufficient number 
of terms in an expansion made in powers of À. 


V. NUMERICAL TECHNIQUES AND RESULTS 


As a first approximation we keep only the first two 
terms of Eq. (28), giving 


E=3ANN+E GjUW Q2). (31) 
2 
We have 
Gasp / / d (xR) (4 R) 
XW (x2)dxidxz 


-GY ffc 


x exp[ — 24 (Xo D Rj*] 


x W (x 12) dX 1d X2 - (32) 


* E. R. Dobbs and G. O. Jones, Reports on Progress in Physics 
(The Physical Society, London, 1957), Vol. 20, p. 516. 
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Let z;— X1— Xo, Z2— Xo— R;. Then 


aia atio) | dz W (il) 


T 
xexE- 2422 f dr expl —4A (z+ Rij)” ] 


xexpL—24(z2--22::2:)], (33) 


with R;—R;—R; Carrying out the zə integration 
yields 


(ijl W(1,2)|ij) = QA/2* | daW (ua) 
XexpL—A4 (n-F Ri] 
= (ryan | dz zW (z) 


XtexpL-AG—- Ra] 
—exp[-AG-R4yp, G9 


where R;;— |R;j| and z= |zi]- 

Our procedure is to choose initial values for » and C 
and minimize E as a function of A. We then choose a 
new value for C and again minimize with respect to A. 
In this way we find the minimum of E as a function 
of C. The whole procedure is repeated for a second 
value of n, etc. All of the above is done for various 
values of the nearest-neighbor distance R until we find 
the equilibrium or zero pressure nearest-neighbor 
distance Ry which is the value of R when E is an absolute 
minimum. The procedure is greatly simplified by the 
fact that the energy expectation value has a broad 
minimum as a function of C and 7. The computations 
were performed numerically using an IBM-7090 
computer. 

In computing the lattice sums indicated in Eq. (31) 
we found sufficient accuracy was obtained if we summed 
over the first sixty “shells” of particles, all particles on 
a “shell” being a given distance from the particle at 
the origin. This involved summing over the nearest 
3126 neighbors to each nucleus. Values for the numbers 
of particles on the shells and for the shell distances for 
the face-centered cubic lattice were taken from Ref. 10. 

First we make use of Bernardes’ potential constants 
o=2.74 A and «=50.0X10- erg. (Second virial co- 
efficient gas-phase data give — 2.75 A, e=49.2X 10-18 
erg.) Initially we take the convenient value = 10. 
We find that the value of C which minimizes the energy 
at a nearest-neighbor distance near the experimental 
equilibrium value is less than 10-4. In fact, we find that 


10 J, Hirschfelder, C. Curtis, and R. Bird, Molecular Theory of 
Gases and Liquids (John Wiley & Sons Inc., New York, 1954), 


. 1037. 
Pu J. de Boer, Physica 14, 139 (1948). 
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the lowest energy value given by Eq. (31) occurs if in 
our numerical computations, we set C=0 and begin 
our integration in Eq. (34) at that value of z for which 
expL—4 (z—Ri;)"] is negligibly small, say, less than 
10-5 in comparison with its maximum of unity, Of 
course, C cannot be precisely zero since then the matrix 
element (ij |W (1,2) | i7) would diverge. We will describe 
this situation by saying C=0+. 

A good measure of the importance of correlation jg 
the relative size of the term px; (^*? in comparison 
with the other potential terms in Eq. (17). For n=10 
the ratio of this term to x;j ?? is p=90(NC/8) «10-5. 
Because C is so small the result does not depend on 
the value of n chosen. To demonstrate this statement 
explicitly we consider the case n=4. Again we find 
C — 10-4 and the ratio of px; +? to x;;^9 is less than 
10-5. Thus we have the important result that near 
equilibrium correlation effects are negligible in solid 
neon. Table I summarizes the results. 

Because the value of C minimizing the approximate 
E value of Eq. (31) is small, the neglected terms in the 
expansion, Eq. (28), do not contribute. To see this 
consider the matrix element (i7|(1,2)|7). Using the 
derivation leading to Eq. (34), we can show 


jln, Dli) = (4/7) O/R) 


x dz z(expL—Cz-*]—1) CexpL—A G— Ray] 
i —exp[—-A(z--R3?]). (5) 


The factor exp[—Cz-**?] is unity over the entire 
range of integration for which the Gaussian functions 
are appreciable, so that (ij|(1,2)|7j) is essentially 
zero. Similar arguments hold for the other terms first 
order in 7(y,8) in Eq. (28). Other terms in the expansion 
are of even higher order in (7,6) and also do not 
contribute. 


TABLE I. Energy expectation value as a function of the vari- 
ational parameters C and , using Bernardes’ potential constants" 
o —2.74À, €—50.0X 10-15 erg. All quantities are expressed in re- 
duced units, length in units of c, energy in units of e. Ao is the 
value of the parameter A which yields the minimum energy per 
particle, Z)/N, for a given C and n, at a fixed value R=1.138 of 
the nearest-neighbor distance near equilibrium. (For the sake of 
comparison we have quoted values of Eo/N to four significant 
figures. Although we claim only three figure absolute accuracy; 
the relative accuracy between the values of Eo/N for various 
values of C is four figures.) 


n G Ao = E/N 
10 0+» 94.0 5.984 
107 94.0 5.984 
LORS 94.0 5.982 
107 93.6 5.968 
4 04-* 94.0 5.984 
107 94.0 5.983 
1073 94.0 5.980 
107? 93.7 5.953 


* Reference 1. 
b Explained in Sec. V of text, 


— — 
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"TnrE II. Comparison of various theoretical results with experiment for the energy per particle eE5/.N 


a —2.74 A, 
e= 50.0% 10715 erg 


and the equilibrium nearest-neighbor distance eR». 


Experiment 


Bernardes Bernardes Nosanow-Shaw 
(Ref. 1) (Ref. 3) (Ref. 2) This work This work Refs. 9 and 12 
—«Ey/N 420 445^ 431^ 431 449 450-410 
(cal/mole) 
c Ro 3.14 3.13% — 3.12 3.13 3.13 
(À) 


^ See Sec. IV of text. 
b Computed at the experimental value of Ro only. 


If we transform from our reduced units we find the 
minimum energy per particle at the equilibrium density 
to be eEo/N — —431 cal/mole, in precise agreement 
with the Hartree computation. This result shows that 
we do not need additional variational parameters in 
our wave function. The nearest neighbor distance at 
equilibrium is found to be ¢Ro=3.12 A in comparison 
with the experimental values? of 3.13 A. 

To find the pressure-volume relationship for the 
solid, we use P— —OE/9V, where P is the pressure, 
and V the volume. We analytically differentiate Eq. 
(31) with respect to the nearest-neighbor distance R 
and use 

OE 221 ðE 
RZ- SS (36) 


which holds for the face-centered cubic lattice. In this 
equation P is expressed in units of e/c*. We find 


P= (8A/n)* (1/38?) (Fo—2AF1), (37) 
where 


Fo=5 (1/Rii) | " dz W (Oep -A-R 
i<j 0 
—exp[— 4A (z+R;)]}, (38) 


F=} j dz zW (2){ (z—R;;) expL- A (2—R;;)°] 
E e E (G9) 


Given an R value we find that value of 4, which 
minimizes E (with C—04- and n=10) and compute 
the corresponding pressure by Eq. (37). (Note that by 
taking C=0+, we have assumed correlation to be 
negligible at all pressures.) Our results for Bernardes 
c and e are shown as curve A in Fig. 1, where we have 
plotted pressure in kg/cm? versus V/Vo, Vo being the 
molar volume at equilibrium. The dots are the experi- 
mental data of Stewart. We note that there is some 
disagreement between curve A and the experimental 
points in contrast to the good agreement each of 


z 58). 
1D. G. Henshaw, Phys. Rev. 111, 1470 (1958). 
n JT. W. Stewart, Phys. Chem. Solids 1, 146 (1956). 


Bernardes’ calculations give. (Bernardes’ curves are 
not shown in Fig. 1.) 

The question arises concerning the possibility of 
obtaining agreement between our theory and all of the 
experimental data by merely adjusting the potential 
constants o and e by a few percent. Consider the values 
o=2.75 A and e-51.7X107!5 ergs. The performance of 
the same operations as described above gives a value 
of —449 cal/mole for the energy per particle at an 
equilibrium distance of 3.13 À. Table II compares the 
various theoretical results with experiment. Curve B 
in Fig. 1 is the resulting P-V relationship which is in 
good agreement with the data up to about 1510? 
kg/cm?. Above that pressure it is possible that corre- 
lation effects become nonnegligible. Our discussion of 
correlation applies only to equilibrium. 


VI. DISCUSSION 


Our results show that it is possible to explain the 
properties of solid neon on the basis of a two-body 
(Mie-Lennard-Jones 12-6) potential and a simple 
product trial wave function. Correlation effects are 


v/Vo 


0.8] 


o 5 


19 
PO? kg Aen}. 


Fic. 1. Pressure-volume relationship for solid neon. The dots 
are the experimental values of Stewart (Ref. 13). Curves A and B 
are the theoretical relationships corresponding, respectively, to the 
potential constants of Bernardes, o=2.74 A, e=50.0X10"* erg, 


Pad to those discussed in the text, o=2.75 A, e=51.7X 10 erg. 
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negligible at low pressures. The values of the potential 
constants which give a fit to the data, c—2.75 À, 
€—51.7X 10736 ergs, differ at most by only a few percent 
from those given by gas phase data, o=2.75 
e=49,2X 10718 ergs. 

The fact that different constants are needed to explain 
the gas and solid-phase data of neon should not be at 
all surprising. The Mie-Lennard-Jones potential is a 
phenomenological interaction and should be expected 
to vary somewhat from one type of phenomenon to 
another.4 A similar situation occurs, for example, in 
the treatment of argon. Second virial coefficient data!* 
give, «=165X10—® ergs, gas viscosity data! give 


? 


14 See, for example, Ref. 10, p. 208. 
( " e Whalley and W. G. Schneider, J. Chem. Phys. 23, 1644 
1955). 


( US L. Johnston and E. R. Grilly, J. Phys. Chem. 46, 938 
1942). 


Tho WOE Ie M wi 


e=171X10- ergs, and crystal data!” give e= 169 19-14 
ergs.!8 
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1 C. Domb and I. J. Zucker, Nature 178, 484 (1956). 

18 A small change of potential parameters may not account for 
the experimental data of the heavier inert gas solids. See the P-V 
data and discussion of C. A. Swenson for xenon [International 
Conference on the Physics and Chemistry of High Pressures (Society 
of the Chemical Industry, London, June, 1962)]. It may be 
necessary to include the effects of many-body forces to correlate 
the properties of solid xenon. 
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The question as to whether the existence of a bound positronium-like system in an ionic crystal is energeti- 
cally possible is considered with particular reference to lithium hydride. The crystal field of LiH is simulated 
by a potential expressed as a sum of central field potentials centered on the ion sites. An upper bound is ob- 
tained for the ground-state energy of a system of an electron and a positron in Coulomb interaction with 
each other and moving in the crystal potential, and comparison is made with estimates of the energies of the 
positron and the electron moving independently in the crystal. The result is energetically favorable to the 
formation and persistence of positronium in the crystal through capture by a positron of an electron from 


the valence band. 


I. INTRODUCTION 


ONSIDERATION of possible mechanisms for the 
annihilation of positrons in ionic crystals gives rise 
to the question as to whether positronium, or more 
precisely a positronium-like bound system, can be 
formed and persist for an appreciable time in such a 
crystal. This question has formerly, on the basis of 
qualitative arguments, been answered in the negative.}? 
It would appear desirable to reconsider the matter on a 
more quantitative basis. 

In the following, we consider in Sec. II the simulation 
of the crystal field of LiH by means of a suitable po- 
tential. In Sec. III we consider the problem of a system 
of a positron and an electron in Coulomb interaction 
moving in the constant periodic potential of the crystal, 
the potential energy of the positron as a function of 
position being taken to be simply the negative of that of 


* Supported by the National Research Council of Canada. 

1R. A. Ferrell, Rev. Mod. Phys. 28, 308 (1956). 

2 P. R. Wallace, Solid State Physics (Academic Press Inc., New 
York, 1960), Vol. 10. 
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the electron. By treating the effect of the crystal field as 
a perturbation of the Hamiltonian for free positronium, 
the energy shift of the ground state is calculated to 
second order of perturbation theory. An improved result 
is then obtained in the form of an upper bound to the 
ground-state energy by calculating the average of the 
Hamiltonian with the aid of a wave function given by 
perturbation theory. In Sec. IV the procedure used 1n 
estimating the ground-state energy of a positron in the 
crystal field is outlined. In Sec. V use is made of the 
results obtained in a discussion of the energetics © 
positronium formation in the crystal. Section VI con- 
tains an assessment of the results and their relation to 
experiment. 


II. CRYSTAL POTENTIAL OF LiH 


The lithium hydride crystal, being an ionic crystal, 
has a relatively open structure. We therefore make tne 
approximation of treating the potential as being må t 
up of a sum of spherically symmetric potentials centere 
on the ion sites. The first problem, then, is to obtain an 


Em. 


, 
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approximation to the potential due to a lithium ion and The value of 8 for the best fit is 
that due to a hydride ion. 
We shall take the potential due to a lithium ion to be 8-115 Å=, (10) 


of the form 
U1(R)= (e/ R)-- (2e/ R)e-** , (1) 


where e is the charge on a proton and a is a parameter 
which is to be determined. This potential has the correct 
behavior for both large and small R. It has the ad- 
vantage of being simple in its mathematical form. 
Similarly, the potential of the hydride ion is taken to be 


Uo(R) = (2e/ R)e-8*— (e/ R), (2) 


where 8 is a parameter which is to be determined. 

The parameter a is obtained by observing that the 
potential Uı(R) is effectively the potential in which the 
third electron of a lithium atom moves. The ground- 
state energy of this third electron is known. We may, 
therefore, set up the Schroedinger equation for this 
particle, wherein only the parameter «œ is unknown. 
With a fixed value of a we may solve the equation 
numerically. The parameter a can be varied until we 
obtain a solution which has one radial node and which 
behaves properly as r — o». This procedure was carried 
out on a computer, and the value for œ obtained was 


a=4,152 Ao, (3) 


The value for the parameter 8 in the hydride ion 
potential was obtained in a different fashion. Using a 
wave function for the hydride ion, the charge density is 


oR) - —2ef IQ Re ra (4) 


The potential due to this charge density, v(K), is given 
by 
V3 (R)2 —4rp(R). (5) 


The wave function for the hydride ion was taken to 
be of the form used by G. Darewych.? 


(Rs Rz) = (id-» Ris) (ORR 7m iR2),— (6) 


where 
R= |R:—R:| E) (7) 
and 
10.9033 A“, 
p=2.0315 A+, (8) 
y=0.5898 A“. 


Using this wave function, it is possible to carry out the 
integration in Eq. (4) and to solve the Poisson equation 
to give an explicit form of the potential. This potential 
was approximated with high accuracy by the form 


U2(R) = (2e/ R)e **— (¢/R). (9) 


3 G. Darewych (private communication). 


The crystal potential is now taken to be 


V(R)=X [0:(R—R,)-+U2(R—Rj—a)], (11) 


where R; is the position vector of the jth positive ion 
and a is the position vector of a negative ion with re- 
spect to an origin at the location of a positive ion. 


III. GROUND-STATE ENERGY OF POSITRONIUM 
IN THE CRYSTAL 


We shall now investigate the problem of positronium 
in this potential. The Hamiltonian for the two-particle 
system in this field is 


H=— (?/2m) (V+ V2") — (e2/| ra2— ril) 
+eV(t1)—eV (re), (12) 


where rı and ry are the positions of the positron and 
electron, respectively. It is to be noted that the po- 
tential energies of the electron and of the positron due 
to the crystal field differ only in sign. 

The Hamiltonian may be expressed in terms of the 
center-of-mass coordinate R and the relative coordinate 
r, where we have 


R= (r+ r2)/2 (13) 
and 
r=r— fí. (14) 
The Hamiltonian becomes 
h? i e 
H=——V;2-—V7-— 
4m m r 


r 
+ev(R-*)-er(R+=) . (15) 
2 2 


In accordance with perturbation theory, we write 
this Hamiltonian as the sum of two parts: 


H —Hgr-Hi, (16) 
where 
i? 7? e 
H= -— Vv- (17) 
åm h to 7 
and 


T T 
m=ev(R->)-ev(R+) 5 (18) 

2 2 
H, will be treated as a perturbation on the Hamiltonian 


Ho. 
Hy consists of the sum of two noninteracting parts, 


— (h2/4m)V R? (19) 
— @/m)V2—(e/r). (20) 


and 


WENT 7 
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The first is the Hamiltonian of a free particle and the 
second is the Hamiltonian of the positronium atom. 
Therefore, the eigenfunctions of Ho may be written 


Vi. nin= e Posi (E) , (21) 


where $4 is (r) is a positronium wave function, where the 
quantum numbers 1, l, m have the usual meaning. The 
energy corresponding to this state is 


(h2k?/4m)+ Es, (22) 
where E, is the energy of the nth level of positronium. 

Let us examine the effect of the perturbation on the 
ground-state energy. The first-order perturbation energy 
is 

[¥0,100(r),HsWo,100(1) J. (23) 

Since H is of odd parity in the r space, this vanishes. 
From similar parity arguments, it can be seen that all 
odd-order perturbation energies vanish. Thus the 
second-order perturbation, which we shall now calcu- 
late, is in fact accurate to third order. 

The second-order perturbation energy is 


| [Yo,100(r), E im (1) ||? 


(24) 
aim Ey—[(22/4m)+En] 


The inner product in the above expression involves an 
integration over both the R and the r spaces. The 
functions of R are not quadratically integrable, so we 
must resort to a box quantization procedure. The inner 
product becomes 


mo i r T 
es], obe] 


X$um(r)ei*Rdrgdr,., (25) 


o E 


45:|Cxl* 2 
K 


nim 


orthonormal set of functions, we have 


(31) 


the integral to 


M. NEAMTAN AND R. I. 


ker ker’ 
ii 100% (7) $100(7’ ) bim (x) $1 * (x^) sn(—) so Jona 
ENNIUS de o eee 


sum over z, l, and m. Since (óuim(r)) is a which is equal to 


VERRALL 


where 1/17? is the square of the normalization Const q 
used in the box quantization procedure. By suitapy, 
shifts of origin, expression (25) can be reduced to ds j 
form 


1 
[um | ete (dns | 
Lo [3 a 
| | bf ns ( (COM my, (26) 


Now V (R) has the periodicity of the crystal. It may 
therefore be expanded as a Fourier series. 


V (R)2X. Cue’, 
where K is a vector of the reciprocal lattice. 
The integral 
Lim— | ei**V(R)dzn (28) 


Lo» [3 da 


is equal to C. x;, if k is 27 times a reciprocal lattice vector _ 
and is zero otherwise. Also, it is obvious that Cx= C_x. 
Substituting these results into Eq. (26), we find that the 
second-order perturbation, expression (24), reduces to 


| Í dios" (1)ó sis (1) s (Dos 


WR 
Bi— ( +1) 
4—m 


Now En= Ej/n*. As n increases En quickly becomes — 


(Q9) - 


4»|Cxl* 2 
K 


nim 


small. We set E, equal to zero, in the above, in orderto - | 


facilitate summations. We shall come back later and. 
correct for the first few terms. Using this approximation, - 
(29) may now be written 


[22 
E-— 
4m 


-uUa 


2X |Cx|? 
K B31— (tek? /4—m) 


We must now evaluate Cx. From Eq. (27) we o t 


e 


6) 


ay 


29) 


nes 
r to 
and 
lon, 


»" 
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Substituting Eq. (11) in (34) we obtain 


1 
Cc. | ECoR-R)--o.R- 8-3) 


Xei Rdrpg. 


(35) 


It is easily seen that this is equal to 
1 
x [ORo R-tree, (36) 


where this integration is carried out over all space. We 
now let 

K=mbiı+n:b:+ nbz, (37) 
where b;, bz, and b; are a set of basis vectors in the 
reciprocal lattice, and nı, #2, and n; are integers. On 
substituting the functions V;(R) and V2(R) into the 
integral we obtain 


2r?( 2 1 


+ (—1)mrtaztng 2 T (38) 
pg k V 


where Z is the shortest distance between positive and 
negative ions in the crystal. It is to be remembered that 
k= 2rK= 2r (nıb + nıbz+ nbs) 6 (39) 

Substitution of this expression for Cx into Eq. (33) 
leaves only the summation over K to be performed. 

It is recalled that this value is an approximation to 
expression (29) wherein we took E, to be zero. We shall 
now go back and correct for the first few values of 7. 

From parity considerations it is clear that the integral 
in expression (29) vanishes when 7 is even. We therefore 
need consider only the odd values of / in the correction 
calculations. This implies immediately that we need not 
consider »=1 for which J has only the value 0. The 
correction term to be added to (33) is, therefore, 


4». |Cx|? 2 2 [es testet 
K n=2 odd l,m 
k- k-er’ 
xsin(—*) sn( ered 
2 2 
1 1 
LE , (40) 
x LN 
Pi Ea TTN 
: 4m *o4m 


where the summation over # is terminated at n=5. 
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Consider the sum 
| bio” (7) b100(7/) bn tm (1) hn im * Cr) 


odd l,m 
k- k-r’ 
xsin(—) si dred, (41) 
2 2 


We express sin(k- r/2) in terms of complex exponentials 
and use the relation 


elt =F (21 +1)i" Py (cosb) jv (kr), 


l'-0 


(42) 


where jv(kr) is a spherical Bessel function. We also 
write $415 (r) as 
bnim(7)=Railr)¥ (8,0). (43) 


On summation over m and integration over angle, the 
expression (41) reduces to 


kr 2 
dm 55 (21+ o| festen yen] . (44) 
odd L 
The energy shift given by Eq. (40) therefore reduces 
to 


lór £ £ [Cx]? E I1) 


K ne odd 1 
kr 2 
x| [eminent ha] 


1 
«| ——___-_____ ] (45) 
Ei—E.— (h?k?/4m) E— (h°k?/4m) 


wherein the integrals can be explicitly evaluated. The 
summations in expressions (33) and (45) were evaluated 
on a computer. 

These computations yield a second-order energy shift 
of —4.51 eV. Since the unperturbed energy is — 6.802 
eV, this places the energy of the positronium system at 
—6.802—4.51— — 11.31 eV. 

Since this energy shift is quite large, it was decided to 
obtain a check on the result by calculating an upper 
bound to the energy. This was done by evaluating the 
expected average of the Hamiltonian using the wave 
function given by the perturbation theory. This per- 
turbed ground-state wave function is given by 


* +E (Yo.100, HA wx, a tm) 
parse knim E1— E4— (h?k?/4m) 


ti 


komim (46). A 


Using this wave function, we wish to calculate (Y,H¥)/ 


/———E————————— 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


A1258 S. M. 


(V). The numerator reduces to 


Wx, nim) |? 
(y, Hy) = Ext-2 e | (0,100 Wx,nt ) | 


nim hek? 
[5-5.—— ] 
4m 


fs X LEnt uu (Wo, 10027 Wk, 1») | : (47) 
k,nim [E,— En— (hk? /4m) F 


The denominator reduces to 


| (Wo, 100, H wx nm) | s 
D- IF eS. (48 
(V V) D. [E.—En— (hek? /4m) P ( ) 


The first sum in Eq. (47) is the same as that in 
expression (24). The other two sums in Eqs. (47) and 
(48) are evaluated in much the same fashion. The result 
of this calculation is 


(,HY)/ (4) = —1343 eV. (49) 


It is to be recalled that this is an upper bound to the 
energy. 

It can be shown that the error in the above result, 
arising from the approximation procedure used in the 
summations, is less than 0.01 eV. If, in (47) and (48) we 
attempt to approximate by taking the discrete sum only 
up to n=5, the result obtained for (U,H)/ (Vj) is 
— 7.044 eV. The large discrepancy between this and the 
more accurate value obtained above indicates that there 
is a substantial contribution to the wave function V 
from the higher excited states of positronium. This, in 
turn, implies (since the /=0 states do not contribute) 
that there is a significant departure of the wave function 
from spherical symmetry. 


IV. GROUND-STATE ENERGY OF A POSITRON 
IN THE CRYSTAL 


We must now estimate the ground-state energy of a 
positron in the simulated potential. An upper bound to 
this energy is calculated by making use of the variational 
principle. Since we are seeking the ground state, the 
wave function must have all the symmetries of the 
potential. A trial wave function V is expanded into a 
Fourier series, where the basis functions have the form 


2mniX. 


Y  P(nynans) cos 


P(n1,72,n3) 


2mnoY 
X cos: 


2mnaZ 


(50) 


cos: 
a a 


The X, Y, Z coordinates are chosen along the funda- 
mental lattice vectors, each of length a. The symbol 
P(ninons) represents a permutation of the three 
integers 71, n and ms. The sum is carried out over 
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all six permutations. This function has the complete 
symmetry of the potential. 
The Fourier coefficients in the expansion of y are used 
as the variable parameters in the variational technique 
The Hamiltonian for the positron is à 


H= — (82/2m)V*--eV (R), (51) 


where V (R) is the crystal potential obtained previously. 
It can be easily seen that the inner product (V, Hy) is a 
homogeneous quadratic function of the Fourier coeffi- 


cients, i.e., 
(HJ) - aC C3, (52) 
ij 


where C; is a Fourier coefficient and aj; is a constant, 
Similarly, the inner product (jy) has the form 


WW=L b?C?, (53) 


where the coefficients b,? are constants. 

The variational procedure involves minimizing (y, Hy) 
with respect to the variable parameters. In other words, 
we wish to minimize (V,HV) subject to the condition 
that (,/)- 1. By the introduction of a Lagrange 
undetermined multiplier this problem reduces to finding 
the smallest eigenvalue of the matrix whose (i,7)th 
element is a;;/b:b;. A total of 15 Fourier coefficients were 
chosen, the corresponding values of a,; and b; were 
calculated, and the 15X15 matrix was diagonalized 
with the aid of a computer.* The ground-state energy of 
the positron was found to be 7r 7.317 eV. 


V. POSITRONIUM FORMATION IN THE CRYSTAL 


We consider now the energetics of positronium forma- 
tion following the usual type of argument. Let E, be the 
ground-state energy of the bound positronium system. in 
the crystal, E* the actual energy of the positron, and 
Es* the energy of its ground state, — V ; the energy of an 
electron at the top of the valence band, and — y, the 
energy of an electron at the bottom of the conduction 
band. 

In order for it to be energetically possible for 2 
positron to capture an electron from the valence ban 
to form positronium, the positron energy must be large 
enough to satisfy the inequality; 


E*—V;2 Ep. (54) 


In order for the positronium so formed to persist, energy 
considerations should forbid the separation of the boun 
system with the deposition of the electron at the bottom 
of the conduction band. Thus we require 


Eg—Vi»EC-V.. (9) 
4 The computation of the matrix elements and the diagonal 


tion of the matrix was performed by A. G. Heinicke at tbe 
versity of Manitoba. 
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Values of V ; and V; for LiH are not available, but for 
purposes of the present argument it will turn out that 
rough estimates of these quantities are sufficient. For 
the alkali halides, V ; ranges up to about 9 eV, and V; has 
been estimated of the order of 0.5 eV® or less. For use in 
(54) and (55) we shall set V;=10 eV and Vi=1 eV. 
With the value of E, obtained in Sec. III, (54) becomes 


E*Z —13.44+10=—3.4 eV. (56) 


This is certainly satisfied even if we set E* equal to the 
value of £j*— 7.3 eV obtained in Sec. IV. 

It can, of course, be argued that the value used for 
Eg* is probably much too large, since in obtaining this 
value the positron was considered as moving in a fixed 
potential, no regard being taken of polarization effects. 
In the actual crystal the positron would be expected to 
concentrate the electron cloud about itself, with a 
resultant lowering of its energy. The amount by which 
the energy would thus be lowered would, however, not 
exceed the free positronium binding energy of 6.8 eV. 
Making allowance for this, even if we set £*— Ej* —O0 in 
(55), the inequality is satisfied. Thus, the energy re- 
quirement for positronium formation is satisfied for 
positrons of energies down to the ground-state energy, 
Egt. 


The inequality (55) can be rewritten 
Et— Eg <Vi—Vi~9 eV, (57) 


the right-hand side being now the measure of the width 
of the Ore gap. Thus, we have that a positron in the 
crystal with an energy anywhere within a range of some 
9 eV above the ground-state energy can, by capture of 
an electron from the valence band, form a positronium- 
like bound system, which will be energetically stable 
against dissociation by deposition of the electron in the 
conduction band. 


VI. DISCUSSION AND CONCLUSIONS 


A. number of questions arising out of the preceding 
treatment would seem to merit discussion at this point. 
'These comprise the use of perturbation theory, the 
perhaps surprisingly low value for the upper bound to 
the positronium energy, the mean life for electron- 
positron annihilation from the bound state, and the 
neglect of exchange in treating the electron and positron 
of the bound system as being subject to the same 
effective crystal field. 3 

Perturbation theory is used in the preceding treat- 
ment to the extent that it provides a wave function with 
which to calculate an upper bound to the positronium 
energy. It is not suggested that this function gives 2 
particularly close approximation to the state of ue 
system. The use of a better function would, however, 
result in a lower value of the upper bound, and ts 
would strengthen the energetics argument of Sec. V. 


5 A, J. Dekker, Solid Slale Physics (Prentice-Hall, Inc., Engle- 
wood Cliffs, New Jersey), PP- 371-373. 


For the sake of completeness only, the above wave 
function is used in Appendix A to estimate the mean 
life of the positronium system, but obviously no great 
weight can be attached to the numerical result obtained. 

The result obtained in Sec. II for the upper bound to 
the positronium energy might at first appear to be 
surprisingly low. It would be expected that the posi- 
tronium in the crystal would suffer distortion such as to 
increase its internal energy; and this is indeed the case. 
However, positronium being an extended system, its 
constituent electron and positron will not have the same 
distribution in the crystal, with the result that the two 
particles may on the average occupy (different) posi- 
tions of low respective potential energy. It is in fact the 
very distortion tending to raise the internal energy of 
the system which enables the positronium to "fit" the 
shape of the crystal field so as to lower its potential 
energy. An examination® of the expression (49) for the 
positroniurn energy reveals a very slow convergence of 
the series. This indicates the importance of the terms 
involving large quantum numbers. These higher terms 
do not greatly affect the internal energy, but through 
providing high angular momentum components in the 
wave function they give a distortion of the positronium 
which results in a large downward shift of the potential 
energy. 

The use of the same form of crystal potential for both 
particles of the positronium system implies neglect of 
effects of exchange between the electron of the bound 
system and electrons in Bloch states of the crystal. An 
examination of this question, details of which are given 
in Appendix B, reveals that the exchange energy in the 
interaction between the electron of the positronium and 
an electron of the crystal is zero. This constitutes the 
justification for the neglect of exchange and for the use 
of the same form of crystal potential for the electron and 
the positron of the bound system. 

The present investigation indicates that positronium 
formation is probably an important part of the process 
of positron annihilation in lithium hydride. Pending 
further investigation, one is tempted to speculate that 
this might also be the case in other ionic crystals such as 
other alkali hydrides and alkali halides. If positronium 
is formed in such a crystal in the triplet state, the 
subsequent two-photon annihilation of the positron 
with an electron through a “pick-off” process would be 
expected to contribute to a rz component of the positron 
annihilation radiation. Such a rz component has been 
observed? in a number of ionic crystals. 


APPENDIX A: ANNIHILATION MEAN LIFE 


For a calculation of the mean annihilation rate from 
the singlet state of the positronium-like system with 


$ Details of the calculation are given in R. I. Verrall, M.Sc. 
thesis, University of Manitoba (unpublished). 

7 A. Bisi, A. Fiorentini, and L. Zappa, Phys. Rev. 131, 1023 
(1963). 
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state function V of Eq. (46), it is sufficient to calculate 
for this state the density of the electron at the location 
of the positron (averaged over all positions of the center 
of mass) and compare this with the corresponding 
density in the ground state of free positronium. The 
ratio of the two densities will then be equal to the ratio 
of the corresponding annihilation rates, that of free 
positronium being known. 

It will be recalled that the terms involving excited 
states of positronium which occur in the expansion (46) 
all have / >0, and these terms vanish for r=0. Thus, it is 
only the ground-state term, Yo,100, which contributes to 
the electron density desired, and the contribution of 

l this term differs from that in free positronium only 
through the fact that y is not normalized to unity. Thus 
the electron density at the positron for positronium in 
the crystal is to the corresponding density in the ground 

i state of free positronium in the ratio of 1 / (bw) =1/1.16. 
Thus, for the wave function y the mean life of singlet 
positronium in the crystal is 1.16 times the mean life of 
singlet, ground-state, free positronium. 


APPENDIX B: EXCHANGE EFFECTS 


An examination will now be made of the nature of 
possible exchange effects, hitherto neglected. For this 
purpose we consider the system of positronium in the 
crystal in interaction with an electron in a Bloch state. 

Let the wave function for positronium in the crystal 
be of the form of a superposition of free positronium 
functions similar to (46), i.e., 


¥(1,2)=> A (knim) 

xXexpLziK- (rj 1») ]es1n(ri— 12 D (58) 
where the subscripts 1 and 2 refer to the positron and 
electron, respectively. For the electron in the Bloch 


state we write the wave function (3). For the two 
systems in interaction we take a symmetrized function 


v-—(1/2)(52x()2v(,3x0)]. — 69) 


| examine the effect of exchange, we consider for 
definiteness the Coulomb interaction between the two 
ons. Itsaverage will be proportional to (V, (1/73) V). 


(60) 


NEAMTAN AND R. I. 


VERRALL 


while the exchange term is 


[aD Wa (1/rz3)dr. (61) 


In order to compare these two expressions, we select a 
typical term from the summation which results in each 
from the substitution of the expansion (58). 

Out of the expansion of (60) we take the term 


=| expLzi (K— K) - (14-12) ]enim* (ti— 12) 
X ewm (rı— r2) [x (r3) | "dr. (62) 


By a coordinate transformation this can be rewritten d 


= J ooa — K)- nle (1) ew re C 


1 
x i exp[2mi(K’—K) rs ]-dro 
T2 


x Í IxGro |? exp[2ri(K'—K)-r:]dra. (63) — 


Qut of (61) we take the corresponding term 


J= | exp[mi{K’: (ri-- r5) -K- (1i4-12)) Jesi * (1 09) 
X Qn'lm' (rı— I3)X (r3)x (r2)dr. (64) 
We can, by omitting the exponentials in (64), write 


E i ope 


G 


1 
xx*Gax(r)—dr. (65) 


T23 3 


Because of the nature of the positronium functions € — 
the integral (65) converges, in spite of x not being | 
quadratically integrable. By contrast, the third integral - 
in (63) diverges. Thus, a suitable box normalization to 
ensure the convergence of the direct interaction term I 
causes the exchange term J to vanish. E 
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The oscillatory component of magnetoresistance has been measured for single crystals of n-type gray tin 
at liquid-helium temperatures. The period of oscillation is independent of magnetic field direction, con- 
firming an isotropic effective mass and, therefore, a conduction-band edge at the center of the Brillouin 
zone. Oscillations were also observed in a polycrystalline specimen. Carrier concentrations deduced from 
the oscillatory period are in reasonably good agreement with values obtained from the Hall coefficient over 
a range of almost two orders of magnitude in this parameter. An electron effective mass of 0,024 my was 
obtained from the field and temperature dependence of the oscillatory amplitude for samples having carrier 
concentrations on the order of 10!5 cm^*. Nonthermal damping of the oscillations is attributed mainly to 
inhomogeneities in the impurity distribution. The experimental results are compared with theory. 


I. INTRODUCTION 


I5 recent years oscillatory magnetoresistance meas- 
urements have provided band-structure information 
for several of the compound semiconductors? The 
results are in agreement with those obtained from 
cyclotron resonance and de Haas-van Alphen’ 
experiments. Of the elemental semiconductors, only 
germanium’ and gray tin? have shown an oscillatory 
component of the magnetoresistance. The band struc- 
ture of germanium had already been established through 
cyclotron resonance and other measurements before the 
magnetoresistance oscillations were observed, and 
therefore, this phenomenon was not exploited. In the 
case of gray tin, where cyclotron resonance has not 
been observed, low-temperature band-structure infor- 
mation was lacking. Previous magnetoresistance meas- 
urements at liquid-nitrogen temperature and above had 
indicated a change in symmetry type from that charac- 


* This is a publication of the Materials Research Center, 
Northwestern University. It is based on a dissertation submitted 
by E. D. Hinkley to the Graduate School of Northwestern 
University in partial fulfillment of the requirements for the degree 
of Doctor of Philosophy. The work was supported jointly by the 
U. S. Office of Naval Research and by the Advanced Research 
Projects Agency of the U. S. Department of Defense. P 

+ Present address: Lincoln Laboratory, Massachusetts Institute 
of Technology, Lexington, Massachusetts. 
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teristic of [111] valleys at 273°K to the spherical 
symmetry of a central conduction-band minimum at 
77°K (see Ref. 10). The present work indicates that at 
lower temperatures the electrons remain in the central 
minimum. It also yields a value for the effective mass 
of electrons in this band. 

This paper describes the oscillatory magnetoresist- 
ance measurements made on single crystals (and on one 
polycrystalline specimen) of n-type gray tin. The 
experimental data are compared with magnetoresistance 
theories in terms of oscillatory period, phase, and 
amplitude. The value of electron effective mass and the 
shape of the constant energy surfaces for gray tin as 
deduced from the data are also compared with the 
theoretical predictions. 


II. THEORY 


When a magnetic field is applied to a system of charge 
carriers, the quasicontinuous distribution of states in 
momentum space collapses into a set of highly de- 
generate one-dimensional Landau subbands. Conse- 
quently, under certain conditions, the electrical resist- 
ance of a degenerate (Ep £T) semiconductor is found 
to have an oscillatory dependence on magnetic field, 
periodic in reciprocal field. "The oscillatory period is 
inversely proportional to the extremal cross-sectional 
area of the Fermi surface in a plane perpendicular to 
the magnetic field direction, and is generally a function 
of field orientation. For the special case of a spherical 
Fermi surface, however, the oscillatory period is a 
function only of carrier concentration. 


P= (2e/hc) (312n)-2— 3.18 108-2", (1) 


where n is measured in cm~ and P in reciprocal gauss. 

The usual semiquantitative conditions for the 
occurrence of quantum oscillations require that the 
separation Jte between Landau levels be greater than 
the energy spread associated with the Fermi surface 
electrons. There are two major causes of this energy 
spread: First, a thermal "fuzzing" occurs at any non- 


100, N. Tufte and A. W. Ewald, Phys. Rev. 122, 1431 (1961). 
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Tapte I. Gray tin sample characteristics. 


ngu 04.2 H 

Sample (cm-3)*  (ohm™ cm) o4,2/01.2 (cm?/ V-sec)* 

A-1 4.17X10!5 772 1.00 1.16X 105 

B-1* 1.14 1015 455 1.00 2.50X 105 

B-2° ee 465 1.00 ee 

B-3 ee 375 1.00 

B-5 1.28x 1015 EE ee 

B-6 0.82 1015 

B-7 1.76X 1015 don Oc AE 

C-1 43 1015 240 1.04 3.49105 

C-2 6.6 X10'5 240 1.02 2.28 10® 

D-2 6.4 X104 63 4.5 6.2 X105 


a Obtained from Hall coefficient, R, at 4.2°K as 1/Re. 
b Calculated as the product Rø at 4.2°K. 
* Samples obtained from same single crystal. 


zero temperature since the electrons may deviate from 
the Fermi surface by a few $T; second, collisions limit 
the lifetime 7 of a quantum state, with concomitant 
uncertainty 7/7 in electron energy leading to the 
cyclotron condition er 1. This condition may be 
written more conveniently as pB> 105, where pu is the 
Hall mobility in cm?/V sec and B is the magnetic field 
in gauss. 

In addition to the thermal and collision broadening 
of the electron energy levels described above, the Fermi 
level itself may vary throughout the specimen as a 
result of an inhomogeneous distribution of impurities. 
It is clear from Eq. (1) that such an effect would be 
manifested as a spatial variation of the oscillatory 
period, and partial cancellation of oscillations would 
occur. 

The above conditions, together with their implica- 
tions concerning the observation of quantum oscillations 
at readily obtained steady magnetic fields, are sum- 
marized below: 


(1) Zio kT. Low carrier effective mass, low ambient 
temperature. 

(2) uB 105. High carrier mobility. 

(3) ñw > AE. Homogeneous specimen. 


A recent theory for transverse (BJ) oscillatory 
magnetoresistance proposed by Adams and Holstein" 
embodies all the above conditions, except for inhomo- 
geneity broadening. The latter is taken into account 
by defining a combined nonthermal broadening tem- 
perature T’ as the sum of the collision (Dingle) and 
inhomogeneity broadening temperatures. The analysis 
of Adams and Holstein, while limited to materials with 
spherical Fermi surfaces, shows the oscillatory phe- 
nomenon to be essentially independent of the type of 
scattering. On the basis of their derivation, the oscilla- 
tory part of the transverse magnetoresistance of interest 


i i 7 N. Adams and T. D. Holstein, Phys. Chem. Solids 10, 254: 
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here may be written as 


M AT o Mi2(—1)Me27M kT" Ihu) 
IU VT Das 
FR Eg?) a= sinh[ 27M (&T/ho) 
[77 Ep m 
X cos} == 
ho 1 2 


where po is the zero-field resistivity and M is the 
harmonic of the oscillations. This expression may be 
simplified because only the fundamental (M=1) is 
usually significant. In addition, the hyperbolic sine may 
be replaced by its exponential approximation. Equation 
(2) then reduces to 


Apose kT 
— = —10Vàm—————— 
Po Eg" (fio) ^ 


2g, /m7N (/T+T" 2m r 

xen{ CC C-)] cos(—=-") » (3) 

Bo 7 \mo B PB 4 
where the oscillatory period P, the effective mass m*, 
and Bo(=eh/moc, the double Bohr magneton) have been 
introduced. 

Oscillations in the longitudinal (B||Z) magneto- 
resistance have been treated by Argyres.? The corre- 
sponding equation is identical with Eq. (3) when the 
latter is reduced by a factor of 5. 


IIL EXPERIMENTAL DETAILS 
A. Specimens 


Gray tin crystals were grown from mercury solution 
by the procedure of Ewald and Tufte.” Crystals of 
various impurity concentrations were available as & 
result of different purification processes carried out on 
the commercial zone-refined tin. The different purities 
were reflected in different carrier concentrations at 
4.2°K as determined by the Hall coefficient. The 
samples ranged in carrier concentration from 6.41%" 
cmr? to 4.2X 1016 cm~? and on this basis were divided 
into four categories designated by a letter prefix to the 
sample number as shown in Table I. The donor im- 
purity atoms are probably antimony which is not 
removed during the zone-refining process, and only 
partially removed during subsequent purification. 
Cutting of the crystals into rectangular parallelepipeds 
suitable for electrical measurements was performe« 
with a sand-blast unit. Sample dimensions were approxt- 
mately 0.2X0.4X3 mm. 

Samples were mounted on Bakelite holders at the end 
of long, thin-walled Monel tubes. Current electrodes, 
which also served as support for the sample, wete 9 


No. 36 copper wire and were attached to the ends of the 


12 P. N. Argyres, Phys. Chem. Solids 4, 19 (1958). 
31 A. W. Ewald and O. N. Tufte, J. Appl. Phys. 29, 1007 (1958) 
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sample with gallium-indium (3 parts Ga to 1 part In) 
solder. To reduce adverse effects due to large area 
potential-probe contacts, the following soldering pro- 
cedure was employed. Bare No, 56 (0.0127 mm) copper 
wires, which served as the potential probes, were first 
“tinned” and then brought into contact with the cooled 
sample at the desired locations. A 200 mA current pulse 
was then passed through the potential leads producing 
localized heating which melted the solder. By this means 
the solder contact area was held to less than that of the 
wires. The potential probes were usually attached to the 
side of the sample and (to eliminate “edge” effects) at 
a distance from the ends greater than the sample width. 
In some instances, however, the potential probes were 
mounted at the ends of the specimens with the current 
electrodes. 


B. Apparatus and Procedure 


'The sample was immersed directly in liquid helium. 
Temperatures below 4.2°K were obtained by pumping 
on the helium vapor, and were measured by the vapor 
pressure. The lower tip of the helium Dewar was 
located between the pole pieces of a magnet which could 
be rotated in the horizontal plane. Magnet current was 
supplied by a bank of storage batteries, and regulated 
by water-cooled rheostats. 

A constant current of up to 10 mA was supplied to 
the sample by a high-impedance transistorized current 
source. Maximum sample current was determined by 
the condition that the rise in sample temperature as a 
result of joule heating should not be excessive. Analysis 
of the oscillatory amplitudes for a sample under varying 
conditions of power dissipation showed that if the input 
power was less than 100 „W, sample temperature rose 
by less than 0.1°K. 

The amplified voltage from the potential probes was 
applied to the Y axis of an X-Y recorder, the X axis of 
which was driven by a signal proportional to the current 
through the magnet. Magnetic field values correspond- 
ing to various positions along the X axis were obtained 
from a calibration curve made with a rotating-coil 
fluxmeter. In those cases in which the nonoscillatory 
part of the magnetoresistance was approximately linear 
with field, a bucking voltage proportional to magnet 
current was used to permit greater amplification of the 
oscillations. For precise measurements, €.8., to deter- 
mine Hall voltage or zero-field resistance, the electrom- 
eter amplifier was replaced by a potentiometer. 


IV. DATA REDUCTION TECHNIQUE 
A. Oscillatory Period and Phase 


se of the magnetoresistance oscil- 
btained from a direct analysis of 
es. (In order to determine the 
nce, however, the data 
tive and negative field 


The period and pha 
lations were usually o 
the X-F recorder curv 
nonoscillatory magnetoresistan 
had to be averaged over post 


directions to cancel Hall voltage contributions resulting 
from misalignment of the potential probes.) Envelope 
curves were constructed tangent to the oscillations with 
a nodal line drawn halfway between. The intersections 
of the nodal line with the recorder curve located the 
nodes of the oscillations which were used to determine 
the period and phase. 

Equation (3) may be expressed in terms of an ampli- 
tude A, period P, and phase $ as 


Apos/po= A (B,T) cosL(22/PB--ó--m], (9 


where the minus sign has been absorbed into the 
argument of the cosine. A node occurs whenever the 
argument of the cosine is an odd integer multiple of 
7/2. This condition may be stated as 


Qz/PB,)--ó--7— (2n4-3)r, (5) 


where » takes on both integer and half-integer values 
corresponding to the two nodes per oscillation. Am- 
biguity in the assignment of integers and half-integers 
to the experimentally observed nodes is removed by 
noting that, according to Eqs. (4) and (5), the oscilla- 
tory curve (as a function of B) has positive slope at the 
integer nodes and negative slope at the halí-integer 
nodes. This point is very important to an experimental 
determination of phase. A plot of reciprocal magnetic 
field values at which the nodes occur versus integers 
and half-integers represents a straight line of slope P. 
The intercept 9 with the 1/B=0 axis is related to the 
phase. These relationships become clear upon re- 
arranging Eq. (5) into the following form: 


1/B,= Pn—[($9/2x)--1]P. (6) 
The phase can be deduced from s by the expression 
= (215—1)r. (7) 


If the measured phase agrees with the predicted value 
of —47, then 151. For future reference, an intercept 
no=0 implies a phase of — 4r. Since integral multiples 
of 2r may be added algebraically to ġ without altering 
the situation, zs is likewise arbitrarily defined with 
respect to the integers. 


B. Effective Mass 


Equation (1) shows that, for a semiconductor having 
spherical constant-energy surfaces, the oscillatory 
period is a function only of carrier concentration, and 
is, therefore, independent of the effective mass. The 
effective mass parameter is, however, contained in the 
oscillatory amplitude expression, 


A(B,T) « B3? exp[—v(m*/mo)(T+T)/B], (8) 


where y= 2mk/Bo= 14.68 X 10* gauss/deg If the above 
equation is multiplied by B'?, and the logarithm of the 
resulting expression differentiated with respect to 1/B, 
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Somole B-I (B 111) 
T=1.2°K 
I =10 mA 
W= 3.96 mV 


the following is obtained: 

d \n(AB'?)/d(1/B) = —y (m*/mo) Gar), ©) 
Therefore, a logarithmic plot of AB? versus 1/B 
yields a straight line, the slope of which is proportional 
to the product m*(7+T7’). The unknown quantities 
m* and T' may be evaluated by analyzing the field 
dependence of the oscillatory amplitude for at least 
two different ambient temperatures. For some of the 
samples tested several ambient temperatures were used 
in an attempt to overdetermine the effective mass 
value and (hopefully) increase its accuracy. We found 
it convenient to define a slope function S(T) equal to 
the negative of Eq. (9). The points of an 5 versus 1r 
plot represent a straight line having slope "y (m*/mo) 
and an intercept (at the T=0 axis) of y(m*/mo)T". 


V. RESULTS AND DISCUSSION 
A. Oscillatory Parameters 


Figure 1 shows actual recorder traces of voltage across 
the potential probes of sample B-1 as a function of 


Fic. 2. Field depend- 
ence of transverse mag- 
netoresistance for sam- 
ple B-1 at 1.2°K. This 
curve was obtained by 
averaging data for posi- 
tive and negative field 
directions. A line of 
nodes has been con- 
structed midway be- 
tween envelope lines, 
and was used to deter- 
mine the oscillatory 
period and phase. 
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Fic. 1. Recorder traces of magnet 
resistance voltage versus magnet 
current for gray tin single-crystal 
sample B-1 at 1.2°K. Data in th 
low-field region (inset) were recorded 
point-by-point because the signal was 
comparable to induced voltages occur- 
ring when the magnetic field was 
changed. Magnetic field is parallel to 
sample current. 


magnet current. The continuous curve exhibits oscil- 
lations starting somewhat above 2 kG. Oscillations at 
still lower fields are apparent in the inset curve which 
was taken with one hundred times greater amplification. 
The latter curve was recorded point-by-point to 
eliminate traces of induced voltages due to the quasi- 
continuous change in magnetic field. 

Since sample B-1 exhibited the largest number of 
detectable oscillations of all those tested, it was of 
special interest to our investigation. Figures 2 and 3 
show the transverse and longitudinal magnetoresistance 
field dependences, respectively, for this sample. It is 
noteworthy that the oscillatory amplitudes for the two 
modes are approximately the same, although an order 
of magnitude smaller than predicted by Eq. (3). The 
nonoscillatory magnetoresistance components differ 


Sample B-1 
T*12 °K 
n=1.14 x 10!8/cm* 


o 2 4 6 8 10 12 14 16 


B (kG) 


7 


Fic. 3. Field dependence of longitudinal magnetoresistance fora 


sample B-1 at 1.2°K. Negative longitudinal magnetoresistance 
was observed for all samples in the high-field region. Er 


ce for 
tance 


erc ats 
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from the slopes and | 

intercepts, respectively, a j 
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06 2 4 6 8 10 
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markedly between the transverse and longitudinal 
modes, and the negative longitudinal magnetoresistance 
apparent at high fields was observed at liquid-helium 
temperatures for all samples. Node-integer plots for 
both orientations of magnetic field are shown in Fig. 4. 
The oscillatory periods deduced from the slopes in 
accordance with Eq. (6) are 5.84X10-5 G~ for the 
transverse mode and 5.72 10-5 G^! for the longitudinal 
mode. Since this sample was mounted so that the 
magnetic field for both modes was [110] oriented, this 
agreement is expected. 

A detailed study of the dependence of oscillatory 
period on crystal orientation was made using sample 
B-6. Figure 5 shows the oscillatory period for this 
sample as a function of magnetic field direction. If we 
attribute the 3% scatter of points to errors in the data 
reduction, we conclude that the period is independent 
of crystal orientation. This indifference of oscillatory 
period to magnetic field direction relative to the crystal 
axes was found on all samples and is the basis for con- 
cluding that, at liquid helium temperatures, the elec- 
trons occupy a spherically symmetric band. 

The absence of a crystal orientation dependence of 
the oscillatory period was also verified by the method 


PERIOD of OSCILLATIONS 
s " 
ANGLE BETWEEN "B'a'T* 
9 . v 2 = = 


Fic. 5. Angular de- 
pendence of oscillatory 
period for sample B-6. 
Sample current is in 
the [110] direction, 
with magnetic field in 
the (110) plane. The 
3% scatter of points 
from the average value 
is attributed to errors 
in the graphical data 
reduction procedure. 
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of Sladek? which utilizes a polycrystalline specimen. A 
sample was prepared by transforming a thin white-tin 
foil to the gray phase. (X-ray analysis has shown that 
samples prepared in this manner are highly polycrystal- 
line.) The magnetoresistance field dependence for this 


TABLE II. Results of oscillatory analysis. 


m*/m T (CK) 


Sample ng (cm^?) np (cm?) 
A-1 4.17 X 1015 2.161015 ee EE 
B-1> 1.14 1015 1.30X 1075 0.024* 3.2 
B-2° EE 1.45X 1015 0.025 54 
B-3 ee 1.07 10/5 0.021 9.3 
B-5 1.28X 101^ 0.97X1015 0.023* 5.3 
B-6 0.82X 1015 0.96 1015 0.022 54 
B-7 1.76X 10% 1.28 10% ee tU 
C-1 43 X10 4.2 X10 0.025* 3.9 
C-2 6.6 X10!5 5.0 X10'5 e vee 
D-2 6.4 X104 5.7 X104 


* Combined nonthermal broadening temperature. 
b Samples obtained from same single crystal. k 
* Effective mass value determined on the basis of seven different ambient 

temperatures. Others on basís of two different ambient temperatures, 


sample is shown in Fig. 6, where the presence of oscil- 
lations confirms sphericity of the constant energy 
surfaces. 

Equation (1) may now be used to compare the carrier 
concentration mp as measured by the oscillatory period, 


Fic. 7. Field de- | 
pendence of mag- | 
netoresistance for 
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Fic. 8. Field de- 
pendence of oscilla- 
tory amplitude for 
sample B-1 at am- 
bient temperatures 
of 42 and 1.2°K. 
Slopes are propor- 
tional to the product 
m* (T-- T). 


T*4.2*K 


Sample B-| 

Rei ax 10'Scm? 
o 10 20 30 d 
17B (I0*G) 


40 50 


with that given by the Hall coefficient. For sample B-1, 

np—1.30xX 10 cm, whereas m= 1.14 10'° cm. 

Similar comparisons for other samples are shown in 

d Table II. It should be noted that the my value for any 
particular sample is probably somewhat dependent on 
the location of the Hall probes because of inhomogeneity 
in the impurity distribution. 

Small magnetoresistance oscillations were detected in 
the very pure sample D-2 at 4.2°K. The large decrease 
in conductivity for this sample from 4.2 to 1.2°K 
(cf. Table I) led to a search for a temperature depend- 


Y ence of the oscillatory period, indicative of carrier 
freeze-out. Figure 7 illustrates the magnetoresistance 
i field dependence for sample D-2 at three diflerent 
1 


ambient temperatures. The period is very long (corre- 
sponding to a carrier concentration of only 6.4X 10" 
cm), but a change of period with temperature is quite 
apparent. A shifting of the extrema toward smaller 
magnetic field values with decreasing temperature 


Fic. 9. Tempera- 
ture dependence of 
slope function for 
sample B-1. Slope of 
line through points 
equals y (m*/mo). In- 
tercept with ordinate 
equals y(m*/mo)T’. 


S (10° G) 


Sample B-1 
n*144 xio cm? 


AND A. W. 


CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


EWALD 


50 
FIELD DEPENDENCE 
of 


AMPLITUDE 


Fic. 10. Field de- 
pendence of oscilla. 
tory amplitude for 
sample C-1 at am. 
bient temperatures 
of 4.2 and 1.18?K 
Smaller number of 
data points (cf, l'ig. 
8) reflects longer 
period of this higher 
purity specimen. 


T =1.18 °K 


T#4.2 °K~ 


Sample C-l 
n= 4.3 xloS/cm? ' 
1 


6 10 20 30 40 50 
|/B (I0*G) 


indicates an increasing period, in qualitative agreement 
with Eq. (1). 

For a discussion of oscillatory phase, we return to 
Fig. 4. The intercepts 7 for the transverse and longi- 
tudinal modes are 4 and 0, respectively, which corre- 
spond to phases of —i7 and — ig. Thus, the measured 
transverse phase agrees with theory, but a significant 
discrepancy exists for the longitudinal mode (predicted 
to be —4m also). Several further measurements were 
made on this particular sample over a period of a few 
months (between which the sample was removed from 
its holder), and the above phases were consistently 
obtained. Moreover, a transverse phase of —im was 
observed on all other samples as well. 


B. Electron Effective Mass Determination 


The magnetic field dependence of the oscillatory 
amplitudes for sample B-1 at ambient temperatures of 
1.2 and 4.2°K is shown in Fig. 8. [Deviations from 
linearity at the high-field region stem, at least in part, 


from a breakdown of the hyperbolic sine approximation 
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of Eq. (3).J Five other intermediate temperatures at 
half-degree intervals were also used, but these data were 
omitted for clarity. In Fig. 9 are shown the slopes S(T) 
for all seven ambient temperatures. From the slope and 
intercept of this straight line an effective mass value of 
0.024mo was found, together with a combined non- 
thermal broadening temperature, 7’, of 3.2°K. 

A similar procedure was followed for sample C-1, 
with the results shown in Figs. 10 and 11. Because of 
the fewer detectable oscillations exhibited by this 
higher purity sample, the results are somewhat less 
accurate, as is evidenced by the larger spread of points 
in Fig. 11. Nevertheless, an effective mass value of 
0.0259 was found, with T’=3.9°K. 

In Table II are listed the experimentally obtained 
values for the electron effective mass in gray tin. 
Although T" varies from 3.2? to 9.3°K, the mass values 
themselves are fairly consistent. As mentioned pre- 
viously, T” is the sum of the Dingle and inhomogeneity 
broadening temperatures. Because of the high mobility 
of these specimens, the Dingle temperature is less than 
1?K (cf. Ref. 2). Consequently, we conclude that the 
major contribution to 7" is from inhomogeneities in the 
specimens. 

'The spherical symmetry of the conduction band 
observed experimentally and discussed in the previous 
section was predicted by Herman,“ and Liu and 
Bassani.? However, the calculated effective mass value 
of 0.0036m (see Ref. 16) which is based in part on the 
experimental band gap, is almost an order of magnitude 
smaller than our most accurate value of 0.024. 
Correction of the experimental value for nonparabolicity 
of the band!” reduces it only slightly, to 0.020». To 
account for the relatively large observed mass as well 
as the pressure behavior of the conductivity at tempera- 


14 F, Herman, J. Electron. 1, 103 (1955). 

15 L. Liu and F. Bassani, Bull. Am. Phys. Soc. 8, 51 (1963). 

16 F, Bassani and L. Liu, Phys. Rev. 132, 2047 (1963). 

17 J. Kolodziejczak, Proceedings of the International Conference 
on Semiconductor Physics, Prague, 1960 (Academic Press Inc., 
New York, 1961), p. 950. 


tures below 160°K, Groves and Paul? have recently 
proposed a new model for the band structure of gray 
tin. In this the direct gap is fixed at zero at all tem- 
peratures, and the temperature dependence of the 
conductivity in the intrinsic range is attributed to 
transitions from the uppermost valence band edge to 
[111] valleys. While a detailed quantitative comparison 
between the predictions of this model and all experi- 
mental data has not yet been made, the model appears 
to be more successful than the usual nonzero gap model 
in accounting for the experimental results in the lower 
temperature range. 


VI. CONCLUSION 


An oscillatory component of magnetoresistance has 
been observed at liquid-helium temperatures in single 
crystals of n-type gray tin. Exceptions to the general 
agreement between the experimental results and the 
theories of Adams and Holstein, and Argyres are the 
smaller observed amplitude and a measured longitudinal 
phase of — jm. 

The indifference of oscillatory period to magnetic- 
field orientation relative to the crystal axes indicates 
that the minimum of the conduction band is at the 
center of the Brillouin zone, in agreement with the 
theoretical predictions of Herman, and Liu and Bassani. 
This was also confirmed by detecting oscillations in a 
polycrystalline specimen. A dependence of the oscilla- 
tory period on temperature observed in a very pure 
specimen was attributed to carrier freeze-out. 

On the basis of the field and temperature dependence 
of the oscillatory amplitude we have obtained an elec- 
tron effective mass value of 0.024m5, which does not 
appear to depend on carrier concentration. This is much 
larger than the value 0.0036» calculated by Bassani 
and Liu, and suggests that a major revision of the band 
structure model, such as that recently proposed by 
Groves and Paul, is required. 


18 Steven Groves and William Paul, Phys. Rev. Letters 11, 194 
(1963). 
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Synthetic single crystals of FeAl:04 have been grown from the melt. Measurements of the thermal con- 
ductivity K and optical absorption coefficient o have been made on a combination of synthetic and natural 
single crystals of Mgi-Fe;AlO, for 0<x<1. The a measurements at 300°K in the absorption band of 
tetrahedral Fe?+ between 0.4 and 0.6 eV have been calibrated and then used to measure x. The thermal 
conductivity results between 3 and 300°K show that the heat is conducted by phonons and that tetrahedral 
Fe?* is a very effective phonon scatterer at all temperatures because it possesses a series of low-lying energy 
levels in the 3d shell. This type of scattering is absent in CoAl:04. The phonon scattering by Fe?* is dominant 
for x>0.1, and for these crystals K^ T/4/x. The magnetic susceptibility x of a single crystal of FeA1;0,, 
measured between 2 and 290°K, shows a peak at 9.52-0.5^K. The analysis of the x and K measurements 
suggests that the ~10°K transition is caused by the individual Fe?* ions, and is not associated with a long- 


range, antiferromagnetic ordering of the FeALO.. 


INTRODUCTION 


^S simplest magnetic analog of the basic diamag- 
netic spinel MgAL;O, is one in which all or part 
of the Mg is replaced by a transition-metal ion M. The 
present series of studies concerns crystals in which M is 
divalent Fe. The compound FeAl;0 is a normal spinel"? 
as is$4 MgALO, in which almost all of the Fe (or Mg) 
ions are in the tetrahedral (or A) lattice sites. The 
magnetic properties in a completely normal spinel are 
thus determined by the magnetic A-A interaction. Since 
the A-A separation is 3.52 A in FeAL;O,, this interaction 
is very weak.5 Thus, it is not possible for FeAl;O, to 
become magnetically ordered until very low tempera- 
tures (~ 4°K) are reached. The following measure- 
ments at and below 300°K of the thermal conductivity, 
optical absorption, and magnetic susceptibility of 
synthetic single crystals of FeAl;O; and of natural 
mixed crystals of Mg; Fe;Al;0 give some information 
on the behavior of the individual Fe ions and some on 
the nature of the A-A interaction. 


SINGLE CRYSTALS 


The samples used in the present study were a com- 
bination of synthetic and natural crystals. See Table I. 
High-purity oligocrystallme MgALO, prepared by 
Navias by the vapor reaction of MgO with Al;Os was 
used as a comparison standard. A large sample 1.20.3 
X0.3 cm composed of about five single crystals was 
obtained, and was used for the thermal conductivity 
and optical absorption measurements. A combination 
of chemical and x-ray analyses indicated this sample, 
R68, was a nearly stoichiometric spinel with a composi- 
tion, 1 MgO+1.02 ALO;, slightly rich in Al,O3. The 

17. W. F. Barth and E. Posnjak, Z. Krist. 82, 325 (1932). 

1? F, C. Romeijn, Philips Res. Rept. 8, 304, 321 (1953). 

3G. E. Bacon, Acta Cryst. 5, 684 (1952). 

4S, Hafner and F. Laves, Z. Krist. 115, 321 (1961). 

5W. Roth, Proceedings of the International Colloquium of 
Neutron Diffusion and Diffraction, Grenoble, 1963 (unpublished) ; 
J. Phys. Radium (to be published). 

6L. Navias, J. Am. Ceram. Soc. 44, 434 (1961). 
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crystallite diameter of 0.3 cm is sufficiently large so that 
these results should be representative of single-crystal 
values. 

Natural crystals of Mgi-zFezAl2O., where 0.0005 €« 
«0.3 were obtained through various sources. They 
originated in Australia (Queensland), Burma, and 
Ceylon. For the lowest values of x they were light pink 
in color. This color is caused by the presence of Cr in 
these, as in almost all natural spinel samples. For x— 0.3, 
the crystals were opaque black in the massive state, 
but in thin, polished sections were a yellowish-brown 
color. 

No good natural crystals of hercynite, FeAl;O;, were 
available. Therefore synthetic single crystals of FeAl:04 
were grown from the melt in an iridium crucible under 
a controlled partial pressure of oxygen. 


FeAI;O, Crystals 


Since single crystals of FeAl;O, do not appear to have 
been grown before, it seems worthwhile to describe 
their preparation. The FeO-Al,0; phase diagram at 
high temperatures has been worked out by Fischer and 


Taste I. A list of the single crystals of Mgı-zF'ezAl:04 
used in the present studies. 


Sample ao 


number A ee Source Color 

R68 8.082 3 X105 synthetic colorless in bulk 

R42 8.0866 5 X104 Burma light pink in bulk 

R54 8.0868 6.0X104 Burma light pink in bulk 

R97 8.089 24X102 Ceylon bluish purple in bulk 

R56 8.128  1.9X10-! Australia opaque, yellowish- 
brown thin sections 

R62 8.117  2.7X10- Australia opaque, yellowish- 
brown thin sections 

R67 84138 ~1.0 synthetic opaque, greenish-yellow 
in thin sections 

R75 8.140 ~1.0 synthetic opaque, greenish-yellow 


in thin sections 
ee LLA 
a x fraction of the A sites occupied by Fet* ions as determined from the 


0.6 eV optical absorption peak. For sample R68 the Fe concentration 
determined from an emission spectrograph. 
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Hoffmann’ and by Galakhov.* They show that FeALO, 
is the only ternary compound in the systern, and it has 
& congruent melting point between 1780 and 1800?C. 
The partial pressure of oxygen at the phase boundary 
where FeAI;O, dissociates into Fe, Al;05, and Oz has 
been measured?-? between 900 and 1100?C. When these 
data are extrapolated to the melting point at 1780?C, 
the predicted dissociation pressure is 10-7 atm of 
oxygen. A gas mixture of carbon monoxide to carbon 
dioxide in a molecular ratio of 8 to 1 at a total pressure 
in the vicinity of 1 atm will possess! this partial 
pressure of oxygen at 1780°C. Such an 8 to 1 mixture 
will provide an oxygen pressure about 5 times the 
dissociation pressure at 1000°C. Thus, a CO/CO: gas 
mixture of constant composition is a nearly ideal 
medium to use for growing FeAl,0, since it will supply 
an oxygen partial pressure very close to the one needed. 
If the oxygen partial pressure becomes too high, much 
of the Fe?* will be converted to the unwanted Fe**. The 
major problem in making FeAl,0, is to keep as much 
of the iron as possible in the divalent Fe?* state. In 
analogy with wüstite, “FeO,” it may never be possible 
to have all of the iron in the divalent state." 

The FeAl.O,; crystals were therefore grown in a 
CO/CO: atmosphere with a molecular ratio of 1 part 
CO to 1 part CO». This is slightly more oxidizing than 
necessary in order to provide a margin of safety. 
Following the work of Horn!5 on ferrites, crucibles of 
pure iridium were used. These were in the shape of a 
tapered cylinder with a conical end to facilitate growth 
of only a few nuclei. The crystal growing furnace is 
shown in Fig. 1. The iridium crucible is placed inside of 
an iridium cylinder with a 0.05 cm wall thickness which 
acts as an rf susceptor. The heat is supplied by a 0.5-Mc, 
25-kW radiofrequency oscillator. The rf coil, 20 cm 
long, is wound with a change in diameter and pitch at 
its midpoint, as shown in Fig. 1. The operating power 
level is adjusted so that the FeAl,O, in the crucible is 
liquid when the crucible is in the upper part of the rf 
coil, and is solid when in the lower part. The tempera- 
ture difference between the two parts of the coil is 
approximately 300°C. The motion of the temperature 
gradient in this modified Bridgman technique is pro- 
duced by moving the rf coil upward past the crucible 
at a rate of 1 cm/h. After the melt has solidified, the 


7 W. A. Fischer and A. Hoffmann, Arch. Eisenhüttenw. 27, 


Ye Galakhov, Izvest. Akad. Nauk SSSR, Otd. Khim. 


d . 525. 
NR DLP. Franz, H. Willeke, Z. Anorg. Allgem. Chem. 
184, 1 (1929). 
! V. Cimilli Gazz. Chim. Ital. 76, 339 (1946). 
i E Gl Ce J H. E. Jeffres, G. Withers, J. Iron Steel 
(London) 166, 213 (1950). : 
Tont a eer and J. White, Trans. Brit. Ceram. Soc. 53, 
233 (1954). 
E S 5i Richardson "mu J. H. E. Jeffres, 
261 (1948). 5 
TS iei me RW. Gurry, J. Am. Chem. Soc. 67, 1398 


E PH. Horn, J. Appl- Phys. 32, 900 (1961). 
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Fic. 1. A schematic of 
the apparatus used for 
growing single crystals 
of FeALO, by a modi- 
fied Bridgman tech- 
nique. The rf coil is 
moved upward past the 
stationary crucible. 
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power level of the rf oscillator is slowly reduced to zero 
over a period of hours. The thermal insulation around 
the iridium susceptor was composed of zirconia ceramic, 
Pt-40% Rh radiation shields, alumina ceramic, and 
quartz. Most of this detail has been omitted from 
Fig. 1. for clarity. 

Large, sound single crystals of the order of 1 cm* in 
volume were produced by this technique. Metallo- 
graphic and x-ray analyses indicated a single-phase 
material with the spinel-crystal structure and a lattice 
parameter a; of 8.138+0.001A for the as-grown 
sample R67. This value for ao is in reasonable agreement 
with the literature values*!*—? listed in Table II. It 
appears to be somewhat smaller than the most recent 
values on samples equilibrated at 1000°C, which cluster 
around 8.151 A. If the results of Holgersson'^ on a 
natural crystal are corrected for the small concentration 
of MgALO, in his sample, one obtains 8.155-:0.004 A. 
The variation in as of a nominally pure FeAl,0, sample 
can be caused”? by the formation of mixed crystals 
with Al.Oz, which reduces a, or with FeO, which 
increases ao. It can also be caused by a partial inver- 
sion??? of the spinel from its ideal normal structure, or 
by oxidation of some Fe** to Fe?*. 

A chemical analysis of the as-grown crystal R67 


16S, Holgersson, Lunds Univ. Arsskr. Adv. 2 38, 1 (1927); 
Chem. Abstr. 24, 804 (1930). f 

17 G. L. Clark, A. Ally, A. E. Badger, Am. J. Sci. 22, 539 (1931). 

180. Krause and W. Thiel, Z. Anorg. Allgem. Chem. 203, 120 


1931). 
( 19 "i Tazaki and S. Kuwabara, J. Sci. Hiroshima Univ. A15, 
263 (1952). z 
2 n nana and W. A. Fischer, Z. Physik. Chem. (Frankfurt) 
7, 80 (1956). 
A M. Ars and W. K. Sumida, J. Am. Ceram, Soc. 41, 150 
(1958) 


2A. C. Turnock, Carnegie Inst. Wash. Year Book 58, 134 
(1958). 
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TABLE II. Values for the x-ray lattice constant 
of the spinel FeAl:04. 


ao Ref. 
(A) Sample source Reference No. 
8.146-£0.004" natural crystals, Holgersson (1927) 16 


some Mg 


8.1352:0.002 synthetic powder: Clark, Ally, Badger 17 
(1931) 
8.1002-0.002 synthetic powder ES d Thiel 18 
1931 
8.13 synthetic powder — Tazaki and Kuwabara 19 
(1952) 
8.1254-0.002 synthetic powder Hoffmann and Fischer 20 
(1956) 
8.1522-0.001 synthetic powder, Atlas and Sumida 21 
1000°C (1958) 
8.150-+0.002 synthetic powder, Turnock (1958) 22 
oi o 
8.152+0.002 synthetic powder, Roth (1963) 5 
1200°C 
8.1402-0.001 synthetic single Slack (present work) 


crystals, 1200°C 


a 8.155 -:-0.004 A for pure Fe.M»O4 when corrected for slight amount of 
MgAl2Ox. 


gave an atomic ratio of aluminum to iron of 2.002€0.04. 
Even though the Al to Fe ratio is exactly 2.00, some of 
the Fe is probably in the Fe?* state. Such oxidation of 
the Fe will increase the O to Fe ratio above the ideal 
value of 4. The chemical analysis of R67 gave this 
ratio as 4.140.1, indicating that some of the Fe was 
in the Fe?* state. If the O to Fe ratio is actually 4.1, 
then 20% of the Fe occurs as Fe?*. Except for the 
possibility that some of the Fe is in the Fe?* state, the 
present synthetic crystals are close to being ideal 
FeAl.0,. A heat treatment of the as-grown crystals is 
needed in order to reduce the Fe?* concentration to its 
lowest possible value. Crystal R75 was prepared by 
taking an as-grown sample of FeAl;O,, surrounding it 
with high-purity AlO powder, and sealing it inside a 
capsule of high-purity iron. This capsule was then 
annealed at 1200°C for 21 h in an effort to bring the 
volume of the crystal to equilibrium with the 
reaction??! ; 


FeAl,O—Fe+ Al203+ 202 5 


This equilibration produced a slight increase in ao to 
8.140 +0.002 A and, as will be shown later, a notice- 
able decrease in the optical absorption, and a sharpening 
of the kink in the thermal conductivity curve at 10°K. 
Since the equilibration is a diffusion process, annealing 
times even longer than 21 h may be necessary to bring 
the bulk of the crystal to equilibrium with the oxygen 
atmosphere. A thin sample of FeAl;0, which was given 
a longer anneal had a lattice constant of 8.144 A, which 
is approaching the ideal value of 8.151 A. The fact that 
the ao of the synthetic FeAl;O, is less than 8.151 A is 
attributed to the presence of Fe**. 
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Chemical Purity 


The chemical purity of the various samples has been 
measured in order to determine what foreign elements 
were present. This is especially necessary since many of 
the samples were natural crystals. The impurity content 
of samples R42, R54, R56, and R62 has previously been 
published.” The major impurities in the other four 
samples are listed in Table III. These have been deter. 
mined by spectrographic analysis in all cases except 
for the Fe and Cr of R97. These two impurities were 
determined by aqueous solution colorimetry to + 10%, 
These results show that Fe is the dominant impurity 
in R97. Crystal R97 was in fact chosen from a group of 
several natural spinel crystals just for this reason. 

The synthetic FeAl,O, is reasonably pure. Most of 
the impurities appear to have been introduced during 
the growing process since Johnson-} Jatthey "Specpure" 
oxides were used as the starting material. With more 
care purer samples could be made. 


THERMAL CONDUCTIVITY 


The physical properties that were measured were the 
lattice constants ao (see Table I), the thermal conduc- 
tivity, the optical absorption, and the magnetic 
susceptibility. The interpretation of and the inter- 
relationships between these parameters will be 
discussed. 

Extensive measurements of the thermal conductivity 
K have been made between 3 and 300°K by techniques 
previously described. The K versus T curves for R42, 
R54, R56, and R62 have already been published.” They 
are reproduced again in Fig. 2 along with the new data 
for the other crystals. The main features to notice in 
Fig. 2 are the monotonic decrease in K at any and all 
temperatures with the increasing content of Fe. The 
pure FeAl,0, has the lowest K of all. This crystal is a 
rare example of a good single crystal with no (or very 
little) atomic disorder in which the K decreases con- 
tinuously with decreasing T for all temperatures below 
300°K. The other important feature to notice is the 
“kink” in several of the K versus T curves at 10°K. 


TABLE III. Major metallic impurities in several spinel 
samples in units of 101 atoms/cm*?. 


Sample 
Impurity R68 R67, R75 R97 
Ca 8 «07 <5 
Cr <0.2 3 35 
e 0.4 m? 360 
[4 m 0.3 m 
Si 2 7 4 
Ti 0.5 1 5 
V «04 0.3 8 
Zn «0. «0.8 20 


a m =major constituent at a concentration of about 1.5 X10** cm™. 


233 G. A. Slack, Phys. Rev. 126, 427 (1962). 
^^ G, A. Slack, Phys. Rev. 122, 1451 (1961). 
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Fic. 2. The thermal conductivity K versus temperature for 
several mixed crystals of Mg; Fez Al;O4. The value of x for each 
crystal is given. Note the continuous decrease in K with 
increasing x. 


'The heat transport in all of these crystals is believed 
to be caused by phonons. It does not appear that the 
magnetic moments in any of the crystals become well 
ordered between 3 and 300°K, so that a magnon heat 
transport is not at all likely. This lack of ordering will 
be discussed later. The phonons in the pure MgAL;O; 
with +=0 are scattered by? the crystal boundaries, 
the isotopes, and the other phonons for 3°K<T 
<300°K. The relative strengths of these scattering proc- 
esses vary with temperature, and their combined effect 
gives an upper bound to the K given by the dashed curve 
labeled x=0 in Fig. 2. This curve is computed for a 
specimen diameter of 0.3 cm, which is just the average 
crystallite size in the purest sample R68. The other 
crystals ranged in diameter from 0.2 to 0.4 cm, and in 
length from 0.3 to 1.2 cm. Thus, 0.3 cm is a reasonable 
average diameter for all of the crystals. Only the purest 
sample R68 comes close to the K limit imposed by 
boundary scattering at 3°K. S 

The failure of sample R68 to have a K as high as the 
dashed curve in Fig. 2 may be caused by some disorder 
ia the Mg and Al distribution on the 4 and B sites that 
has been seen in other synthetic spinels.‘ It may also 
be caused by the presence of the various chemical 
impurities listed in Table III. The Fe and Ti are the 


i iti i ities, and are the most 
transition-metal impurities, and art m 
NON the impurities. 


likely source of any phonon scattering by 
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In these crystals containing tetrahedral Fe?* it is 
suggested in this paper that the isolated Fe?* ions are 
very effective phonon scatterers. The scattering mech- 
anism is believed to be the same as that postulated 
for Fe?* in the tetrahedral sites in CdTe,” and is caused 
by phonon assisted transition between the several 
low-lying energy levels of the 3d’ configuration. Thus, 
the phonon scattering and hence the thermal resistivity 
W=K should depend on the Fe?* concentration. 
Notice that in Fig. 2 for a crystal with x=0.1, i.e., 
somewhere between R97 and R62, the K would decrease 
monotonically with decreasing T for 7<300°K. This 
means that for «>0.1, the phonon scattering by the 
Fe** predominates over the normal phonon-phonon 
scattering for all T<300°K. In order to characterize 
the Fe scattering mechanism, we are interested in 
the K as a function of T and of x. 

Values of x for the concentration of tetrahedral Fe** 
in the various crystals have been obtained from the 
strength of the optical absorption at 0.60 eV. The 
details are described later. These x values are given in 
Table I. The x values for R67 and R75 will change 
slightly with heat treatment and the residual concen- 
tration of Fe^*, however, they are probably between 0.9 
and 1.0. Figure 3 shows a plot of the thermal resistivity 
W versus x at 3, 10, 30, 100, and 300°K for the crystals 
studied. The straight lines in Fig. 3 are drawn to weight 
the points for R97 and R75 most heavily since in these 
two crystals Fe?* is by far the most prevalent impurity. 
Crystals R62 and R56 have substantial concentrations 
of Fe, as found in the optical absorption studies. 
Crystals R54 and R68 both have several other im- 
purities whose concentrations are equal to or greater 
than that of the Fe. Figure 3 indicates, however, that 
Fe?* is responsible for a major part of the W in these 
two crystals between 3 and 30°K. The lines in Fig. 3 


W,THERMAL RESISTIVITY, cm deg /wott 


p LÀ 
1, FRACTIONAL CONCENTRATION OF Fe”? 


Fic. 3. The thermal resistivity W versus the Fe** concentration 
x for several temperatures. For any given temperature, w varies 
approximately as 1459. 


35 G. A. Slack and S. Galginaitis, Phys. Rev. 133, A253 (1964). 
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Fic. 4. The reduced thermal conductivity versus temperature 
in the region of 10°K. The similarity of the “kink” at 11°K for 
R75 and R97 is quite plain. 


show that when the Fe?* is the dominant phonon 
scatterer, W varies approximately as «°°°4°%, or K as 
4-955 for all temperatures between 3 and 300°K. The 
K versus T curves in Fig. 2 show that in the temperature 
range where Fe?+ is the dominant phonon scatterer, K 
varies roughly as T+. A detailed calculation of the 
magnitude, temperature, and Fet concentration de- 
pendence of K for the crystals shown in Fig. 2 could be 
made using Callaway's'* approach if one had a good 
idea of the relaxation times for phonon scattering by 
the Fe?* ions and by the normal phonon processes. 
However, neither is very well known at present. The 
scattering from the Fe?*, 3d° configuration in tetrahedral 
sites involves a series of five equally-spaced magnetic 
levels, and is similar to the case of Fe?t in CdTe.” Some 
recent, preliminary measurements by the author on the 
optical absorption of crystal R97 at low temperatures 
places the highest of these 4 excited levels at 120°K 
above the ground state. If this estimate is correct, the 
phonon scattering by the Fe?* will be important for 
temperatures up to a few times 120°K. It is thus not 
difficult to believe that the Fe?* has reduced the K of 
FeAl,0, at 300°K by a factor of 5 compared to that of 
MgAl,0,. The continuous decrease in K with decreasing 
temperature for T<300°K is also not too surprising. 
The effect of the Fe?* on the K of FeAl,O, should 
disappear only when T7»120?K. From Fig. 2 it appears 
that the K of FeAl;0, and MgAL;O, may be nearly equal 
for T>1000°K. A more detailed calculation to explain 
the 4-95, T'* variation of K is not yet feasible. 

The curves for R75 and R97 show that there is some 
structure in the K versus T curves. Both of these 
exhibit a decided “kink” or change in slope at 11-- 1*K. 
In order to demonstrate the similarity between the two 
curves, reduced K values for both of them have been 


26 J, Callaway, Phys. Rev. 122, 787 (1961). 
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replotted side-by-side in Fig. 4. The reduced K is taken 
as 39.55 K in order to correct for different x values. The 
“kink”? for R97 comes at 11+1°K, while that for R75 
comes at 10.5+1°K. An average of 11-E1?K will be 
used. For crystal R97 with x— 0.024 the Fe?* concentra. 
tion is so small that there is no likelihood of a magnetic 
A-A interaction between the various Fe* ions. Thus 
in this crystal, each Fe?* ion is scattering the phonons 
independently of all the other Fe** ions. Since Fig, 4 
shows that the temperature dependence of K is nearly 
the same for R75 and R97, the conclusion is that the 
Fe?* ions in R75 are also scattering the phonons 
independently of one another for x— 1. It is concluded 
that the “kink” in the curves in Fig. 4 at 11°K is not 
produced by any cooperative, long-range magnetic 
ordering among the A site Fe?* ions in FeAL0,, but is 
produced by some change in the scattering of the 
phonons from the individual Fe?* ions at 11°K. The 
K results for R56 in Fig. 3 show a very small “kink” 
at 11°K, but it is rather weak. The presence of a large 
amount of Fet may be responsible for suppressing 
the effect. 

The question as to whether FeAl;O, undergoes a 
transition to an antiferromagnetic state below 11°K is 
not completely answered by this result. Roth? believes 
that the magnetic susceptibility data, which will be 
discussed later, demonstrates the presence of short- 
range antiferromagnetic ordering below 8 to 10°K. His 
neutron results, however, fail to show any long-range 
ordering at these temperatures. The present K results 
show that if a transition to an antiferromagnetic 
state does take place in FeAl;O;, such a transition does 
not affect the K. This would be surprising in view of the 
results?#2728 on MnO, CoO, MnFs, and CoF2; on” 
CuCl.2H;O0 and CoCl::6H:0, and on? UO», where 
there was a noticeable break in the K versus T curve 
on passing through the Néel temperature. More work 
will be necessary before the 11°K transition in FeAl,04 
is understood. 

As a check on the hypothesis that the Fe?* in the 
FeAl,0, is the cause of the anomalously low thermal 
conductivity, some measurements of K were made on 4 
single crystal of CoAlO,. This crystal R106 is a normal 
spinel® and was grown by E. M. Clausen of this labora- 
tory using a flame-fusion technique. The K results for 
a crystal 0.83 cmX 0.20 cmX0.18 cm annealed in air for 
100 h at 1000°C are shown in Fig. 5 together with the 
curves for MgAl.O, (R68) and FeAl;O4 (R75). The K 
results for the as-grown CoAl,0, sample were identi 
to those for the annealed sample. Notice that the K 
values lie between the other two curves except at the 


21 G. A. Slack and R. Newman, Phys. Rev. Letters 1, 359 (1958) 

28G. A. Slack, Proceedings of the International Conference ? 
Semiconductor Physics, Prague, 1960 (Academic Press nC, 
New York, 1962), p. 630. 


? R. H. Donaldson and D. T. Edmonds, Phys. Letters 2 130 


(1962). 253 
%20. Bethoux, P. Thomas, and L. Weil, Compt. Rend. 494; 
2043 (1961). 
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Fic. 5. The thermal conductivity of a single crystal of CoAl:O4 
versus temperature. The results lie between those for MgALO, 
and FeALO,, which are also shown. 


very lowest temperatures. The K of CoAL0, does not 
decrease monotonically with decreasing T. All three 
crystals appear to have nearly identical K for 
T>1000°K. The tetrahedral Co*+ has a 3d" configura- 
tion, and does not possess” any low-lying energy levels 
in the 1 to 1000°K energy range. Thus it should not 
scatter phonons with anywhere near the same effective- 
ness as does the tetrahedral Fe?+. This conclusion is 
demonstrated by the results in Fig. 5, at least for 
temperatures between 4 and 300°K. Solely on the basis 
of the magnetic scattering by the tetrahedral Co?*, the 
K of CoALO, should be nearly as high as that of 
MgAl:04. The failure of the CoAL0, to exhibit such a 
high K may be caused by a partial inversion of the 
CoAL;0,, or by other crystal imperfections. 

From the differences in K of the spinels MgALO,, 
FeAl.0,, and CoALO,, one concludes that the phonon 
scattering is very sensitive to the details of the magnetic 
nature of the A-site ions. ` 


OPTICAL PROPERTIES 


Optical absorption measurements on most of the 
samples listed in Table I and on a sample of commercial 
MgO-3.5 Al;O; spinel have been made at 300°K over 
the photon-energy range from 0.1 to 7.0 eV (12- to 
0.18- wavelength). These measurements were made 
using three different recording spectrophotometers. A 
Perkin-Elmer! model-421 grating instrument was used 


31 Perkin-Elmer Corporation, Norwalk, Connecticut. 
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from 0.1 to 0.5 eV, and a Beckman? DK-2A and a 
Cary? model-14 were used from 0.5 to 7.0 eV. These 
instruments operated well down to about 1% sample 
transmission. The sample thickness ranged from 
5X107 to 3X 107 cm. Samples thinner than 3 10? cm 
were difficult to prepare because they often broke during 
the polishing process. This combination of instruments 
and samples allowed one to measure values of the op- 
tical absorption coefficient a from 107! to 10*? cm. The 
coefficient a is defined by 


I-I(1—R)e-t, (1) 


where To= incident light intensity, /= transmitted light 
intensity, R- reflection loss for a single surface at 
normal incidence, t= sample thickness in cm. 

The two main purposes of these measurements are to 
determine the site symmetry of the Fe?* in the spinel 
crystals and to determine the concentration of the Fett. 
The Fet in the tetrahedral sites produces a strong 
absorption in the infrared over a broad band between 
0.4 and 0.8 eV. The a-versus-photon energy /t» curves 
allow one to make some estimate of the Fe** and the 
Cr+ concentration in the crystal. 


Reflectivity 


Some optical reflectivity measurements were made 
on MgALO, and FeALO, from 0.03 to 2.0 eV in order 
to identify the regions of strong lattice absorption and 
to measure the reflection loss R. These measurements 
were made at 300°K with a Perkin-Elmer! model-83 
monochromator at nearly normal incidence. The strong 
lattice peaks were all found in the energy range between 
0.03 and 0.11 eV. The samples used were a natural spinel 
crystal from Burma which was medium-pink in color, 
and a synthetic crystal of FeAl,O; in the as-grown, 
unannealed condition. The surfaces of the crystals were 
polished so that they were optically flat before the 
reflectivitv was measured. 

The reflectivity results for MgALO, are shown in 
Fig. 6. The main lattice absorption peaks are at 0.066 
and 0.091 eV. The other smaller peaks that were found 
are listed in Table IV along with those found for 
FeAl,O,. The reflectivity values for MgALO, in 
Table IV for normal incidence are reasonably accurate. 
The values for FeAl;O, taken at about 30° from the 
normal are only approximate, but do indicate the 
relative strengths of the various peaks. Table V also 
gives the positions for the absorption maxima by 
transmission found by other authors‘ using pow- 
dered samples compressed in KBr pellets. The Greek 
letters serve to identify the various peaks. Even though 
the peaks in the reflectivity and absorption are not at 


2 Beckman Instruments Inc., Fullerton, California. 

11 Applied Physics Corporation, Monrovia, California. 

uK. A. Wickersheim and R. A. Lefever, J. Opt. Soc. Am. 50, 
831 (1960). 

3 $. Hafner, Z. Krist. 115, 331 (1961). 
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''ABLE IV. Reflectivity maxima found for MgALO, and FeALO, single crystals. 
Designation MgAl.O, (natural) FeAl,0, (synthetic) 
(a) hy %R Character hv %R Character 
a 0.0375 eV 36 sharp peak 0.038 eV ~50 small peak 
8 0.059 89 shoulder 
Y 0.066 100 broad peak 0.061 ~70 broad peak 
ô 0.072 75 shoulder 0.068 ~50 weak shoulder 
€ 0.091 100 sharp peak 0.086 ~80 peak 
G 0.104 50 weak shoulder 0.099 ~40 shoulder 
0 0.176 small see Fig. 7 0.18 low see Fig. 7 


a These Greek letters serve only to identify the various peaks. 


identical energies, the energies are nearly equal. The 
strong reflectivity peaks are associated with the strong 
absorption ones. There is a close similarity between 
these lattice vibration peaks for MgAl:O, and FeAl:04 
since they both have the spinel crystal structure, and 
nearly the same lattice constants. The Fe is a heavier 
atom than the Mg and the reflectivity peaks for F eAl1;04 
occur at an energy about 5% less than those for 
MgAl.O,. In the energy range between 0.12 and 3.0 eV 
(and up to 4 eV actually) the reflectivity R of MgAl:04 
is quite small (2 to 8%), and there is no structure in 
the R versus ky curve in Fig. 6. The same general 
statement can be made for FeAl;O,. Thus, any structure 
found in the optical transmission curves of the single 
crystals in this energy range is caused by variations in 
the absorption coefficient a and the reflection correction 
is small, see Eq. (1). 


Absorption Coefficient 


The % transmission curves that were taken were 
corrected for reflection losses using R values calculated 
from the refractive index?9?" n for MgAl;O, in the range 
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Fic. 6. The optical reflectivity R versus photon energy. These 
results for a natural single crystal of MgALO, at 300°K were taken 
at nearly normal incidence. 


ae 8) Rinne, Neues Jahrb. Mineral. Geol., Beilageband 58A, 43 
1928). 
‘ 31 K. Schlossmacher, Z. Krist. 72, 447 (1930). 
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between 1.6 and 3.0 eV. Beyond these energy limits the 
calculated R values were extrapolated using the results 
shown in Fig. 5. The resultant optical absorption 
coefficients a for the various samples are given in Figs. 7 
and 8. 


1. Pure MgAl:0, 


The highest purity MgAl;O, sample R68 in Fig. 7 
exhibits an optical window between 0.23 and 6.0 eV. 
These are the limits at which a= 1 cm™. This synthetic 
MgAl.O, was grown? in an atmosphere of hydrogen by 
diffusing MgO into Al;O;. It does not show any signs of 
the OH or H:O band at 0.4 eV (34) that has been 
seen*.8 in synthetic spinels grown by the flame-fusion 
process. As a check on the presence of this band a 
synthetic crystal of MgO--3.5 AlsOs from Linde? 
(crystal R53 of Ref. 23) was measured in this region. It 
showed a double-peaked band with peaks at 0.415 and 
0.436 eV with absorption coefficients of 6 cm™ and 
3 cm, respectively. For contrast, the R68 crystal had 
an a«0.05 cm™ in this same energy region. Thus, the 
0.4-eV band is not an intrinsic feature of MgAL;O,, but 
shows up only in the flame-fusion grown crystals. It 
has not been seen in any of the natural MgAl.0s 
crystals. It is necessary to understand the source of this 
band, since the absorption peak for Fe?* ions in the A 
sites occurs at about this same energy. 


TABLE V. Absorption maxima for MgAl;O, and F eAl;04 
powders from the literature. 


Designation MgALbO4 FeALO4 
(a) (b) © (d) (e) 
y 0.0645eV 0.067 0.065 0.063 
à 0.0716 0.070 
e 0.0849 0.086 0.085 0.083 
t 0.0932 0.092 0.097 
7 0.124 


* These Greek letters serve only to identify the various peaks. 
b Natural MgAIsO4, Ref. 4. 

* Synthetic MgAI:O4, Ref. 35. 

d Synthetic MgALO,, Ref. 34. 

e Synthetic FeAI;O4, Ref. 4. 


33 G. Calingert, S. D. Heron, R. Stair, Soc. Autom. Engin. J. 
39, 448T (1936). 
3 Linde Air Products Company, New York, New York. 
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2. Telrahedral Fe?* 


The main feature in all of the æ versus /i» curves in 
Figs. 7 and 8 is the very strong absorption band between 
0.4 and 0.8 eV. This band is assigned to the tetrahedral 
Ve?*, 3d" transition ££ (5D) ^T. (5D). This same transi- 
iion has been seen in several II-VI semiconducting 
compounds doped with Fe. Probably the carliest report 
of it was by Coblentz* in natural crystals of zincblende 
(cubic ZnS), which almost always contain Fe, where he 
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j 1 i ion coefficient a versus the photon 
Fic. 7. The optical absorption eee Corrections 
for ri ion losses have been made. The energies of various 
Paene peaks have been labeled in electron volts. The amar 
curve for R75 indicates that the absorption between 0.4 me 
0.8 eV was too large to be measured. However, the estimate 

behavior is shown. Similarly the ? for R67 indicates that above 
0.8 eV, a was too large to be measured. The main absorption 
peaks at 0.45 and 0.61 eV are caused by tetrahedral Fe**. 


"Ww. W. Coblentz, No. 97, Carnegie Institution of Washington, 
D. C., 1906 (unpublished), Part VI, p- 57. 
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Fic. 8. The optical absorption coefficient a versus the photon 
energy hv for two pleonaste spinels which contain large amounts 
of both Fe?* and Fe?*. Corrections for reflection losses have been 
made. The main peaks are labeled in electron volts. 


found a broad single absorption peak at 0.41 eV. Low 
and Weger“ have also found this peak in natural 
zincblende, Pappalardo and Dietz have seen it in 
Fe-doped synthetic CdS, and the present author? has 
seen it in Fe-doped synthetic CdTe. The energies of the 
two peaks in this double peak are listed in Table VI. In 
some cases only a single, broad peak is seen. This 
absorption peak shifts toward higher energies as the 
Fe-anion distance decreases, see Table VI. 

The strength of the electrostatic crystalline field at 
the Fe?* ion produced by the other surrounding ions is 
given“ by the parameter Dg. The energy hv, at the peak 
in the a versus /t» curve is approximately equal to 10 Dg. 


TABLE VI. Peak energies for the double-peak, infrared optical 
absorption band of tetrahedral Fe?* at 300°K. 


L——————————————————————— 


Peaks Fe-anion 

hy, distance, 
Crystal eV Reference A 
MgaAl,0,, cubic 0.45, 0.61 present work 1.81 
ZnS, cubic 0.41 Coblentz? 2.34 
ZnS, cubic 0.35, 0.50 Low and Weger* 2.34 
CdS, hex. 0.306, 0.387 Pappalazdo and Dietz! 2.53 
CdTe, cubic 0.34* Slack* 2.81 


MmMMM——————M——P MÀ 


» Splits into two peaks when crystal is cooled below 100°K. 
b See Rei. 40. 
* See Ref. 41. 
d See Ref. 42. 
* See Ref. 43. 


a W. Low and M. Weger, Phys. Rev. 118, 1119, 1130 (1960). 

4 R. Pappalardo and R. E. Dietz, Phys. Rev. 123, 1188 (1961). 

4 G. A. Slack (unpublished). 

« D. S. McClure, in Solid State Physics, edited by F. Seitz and 
D. Turnbull (Academic Press Inc., New York, 1959), Vol. 9, 
p. 399, 
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The breadth of the absorption band and the presence 
of the double peak make the determination of Dg rather 
inexact. However, an average energy for the two peaks 
in MgAL;O, is 0.53 eV — 4300 cm-!— 10 Dg. So Dg has 
the not unreasonable value of +430 cm™ for tetra- 
hedral Fe?*. Further understanding of what causes the 
broadening of the absorption peak will yield a better 
value of Dg. It is postulated here that the breadth is 
caused by a strong coupling between these magnetic 
levels and the lattice vibrations. It is just this coupling 
which gives rise to the pronounced effects of Fe?* on 
the thermal conductivity. 


3. Other Transition Metal Ions 


The other peaks in Figs. 7 and 8 also require some 

explanation. The small, sharp peak at 0.176 eV has been 
seen in all of the crystals, and is independent of doping. 
It is probably a lattice vibration band, and is therefore 
listed in Table IV. There are numerous peaks between 
1.0 and 4.0 eV which vary from sample to sample. Since 
pure MgAl,0,, sample R68 has no absorption bands 
between 0.23 and 6.0 eV, these bands are caused by 
various transition-metal impurities. So far none of 
these other bands in Figs. 7 and 8 has been definitely 
associated with transitions in the tetrahedral Fe?* ion. 
They all appear to be caused either by octahedral Cr?* 
in the natural crystals (R54 and R97), by Fe?*, or by 
octahedral Fe?* in the other crystals. The various Cr?* 
peaks in R54 and R97 are dealt with in the Appendix. 
The conclusion is that except for the 1.34-eV peak in 
R97 all the other peaks above 1.0 eV in these two crys- 
tals are due to Cr+. Crystal R54 shows the two peaks at 
2.24 and 3.23 eV characteristic of octahedral Cr+. In 
R97 the simple octahedral Cr+ spectrum has been 
modified by the presence of the much larger concentra- 
tion of tetrahedral Fe?*. 


4. Other Fe Ions 


Let us now consider the possibilities involved in the 
absorption spectra of Fe*+, both octahedral and tetra- 
hedral, and of octahedral Fe?*. If FeALO, were a 
perfectly stoichiometric, normal spinel, then only 
tetrahedral Fe?* would exist. The x-ray data have 
shown? that FeAl;O; is a normal spinel. The recent 
work of Schmalzried*® on other aluminates has shown 
that CoAl;O, is nominally a normal spinel, but when 
equilibrated at 1000°C is partially inverse. In view of 
the similarity of the octahedral preference energies of 
CoAL,0, and FeAl.O, calculated by Miller,4® FeAl;O; 
can be expected to be slightly inverse in the 1000 to 
1500?C range. From neutron scattering data, Roth* has 
found that in his powder sample of synthetic FeAl2O., 
annealed at 1200°C, about 15% of the iron was in the 
octahedral sites. Thus there will probably be some small 


45 H. Schmalzried, Z. Physik. Chem. (Frankfurt) 28, 203 (1961). 
« A. Miller, J. Appl. Phys. 30, 245 (1959). 
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amount (i.e., 1 to 20%) of octahedral Fe** in the prese 
3 o nt 
single crystals annealed at 1200°C. The presence of 
Fe? in the FeAl;O, is less certain. It is well known 
that the iron in wüstite or “FeO” cannot be reduced i 
the point where all of the iron is divalent. Thus one 
B : » he 
might expect some Fet in the FeAl;O; even when itis 
at the low-oxygen pressure dissociation limit of decom. 
posing into Fe, Al;Os, and O». From Miller's! calcula- y 
tions one would predict that this Fe?* would be on the 
tetrahedral sites. Notice that any octahedral Fett that 
is present requires only that one of the many tetrahedra] > 
Fe?* ions in the FeALl;O, transfers an electron to the 
octahedral Fe?* in order to yield an octahedral Fe’ and — | 
a tetrahedral Fe?* ion. For example, this exchange is | 
quite favorable in FesOu, which is an inverse spinel 
where the Fe?* is on the tetrahedral sites in competition ~ 
with Fe?*. The opposite conclusion of Hoffman and 
Fischer? that the first small additions of Fe* go into | 
the octahedral sites is suspect since their lattice con- 
stants at low Fe Os concentrations do not agree with 
the later ones of Atlas and Sumida”! or of Turnock.” It 
seems probable that at low concentrations of Fe* in 
FeAL;0, the Fe?* occurs on the tetrahedral sites. There- — 
fore, in the interpretation of the optical absorption | 
spectra of the mixed FeAl;O:-MgALO, crystals, itis | 
necessary to consider tetrahedrally and octahedralp < 
coordinated Fe?*, and tetrahedrally coordinated Fett; l 
The optical absorption caused by octahedral Fe?* is | 
believed to be the cause of the broad peaks seen in | 
Fig. 6 for R75 at 1.23 and 1.65 eV. Previous work by | 
Holmes and McClure’? has shown that hydrated, | 
octahedral Fe?* in FeSO,: 7H3O crystals possesses only | 
two rather weak, broad absorption bands at 1.04 and 


1.34 eV in the energy range from 0.75 to 4.7 eV, witha 
combined oscillator strength of f~4x10™. These 
authors also found that the absorption of Fe? in water 
solution was very similar. The method of calculating 
the oscillator strengths f from the optical absorption 
curves is that given by Dexter. Octahedrally coordi- 
nated Fe?* is a well-known colorant in glass,“ where X 
produces an absorption band at about 1.2 eV (1) in 
the infrared. Octahedral Fe? in MgO was found by 
Low and Weger“! to have a broad, poorly defined 
absorption peak at 1.24 eV. 

A study of the optical absorption of Fe?* in MgO by 
the present author? shows, that in the energy range 
between 0.5 and 3.0 eV, there is a broad, rather weak, 
double-peaked absorption band with peaks at 1.21 and 
1.46 eV. The derived oscillator strength is f=6X o“ 
for each peak;? close to the value for the hydrated pe k 
The energies for these absorption bands of Fe** in Mg 
and FeSO,-7H,0 are in reasonable agreement with 


m Hines and D: S MeClure, J. Chem. Phys 26.00 
48 D. L. Dexter, in Solid State Physics, edited by F. Seitz am 
D. Turnbull (Academic Press Inc., New York, 1958), Vot. "5 
p- 353, 371. logy: 
4 W. A. Weyl, Coloured Glasses (Society of Glass Techno 
Sheffield, 1951), Chap. 7. $ 
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1.23- and 1.65-eV bands seen in FeALO, in Fig. 6. This 
is the reason for the assignment of these two bands to 
octahedral Fe?*, If one makes the assumption that 15% 
of all the Fe** in the R75 sample of FeALO, is on the 
octahedral sites, and is producing the 1.23-eV band, 
Fig. 6, the calculated oscillator strength for this 0.7-eV 
wide band is {3X 10-4. This is five times higher than 
that found for MgO. Since the actual concentration of 
octahedral Fe?* in R75 may be higher or lower than the 
15% found by Roth, the f value is only approximate. 
If anything, the octahedral Fe?* concentration in R75 
is probably less than 15% of the total, thus making 
f>3X10-4. The optical transition responsible for this 
broad, double-peaked band is believed to be the lowest 
energy one possible,“ that is 5T2(5D) — *E(*D). The 
intensity of this optical transition and hence its f value 
is enhanced by distortion of the local cubic symmetry 
around the Fe?* ion. The f value is small in highly 
symmetric environments such as at a Mg site in MgO. 
In FeAl,0, the oxygen ions surrounding the octahedral 
site form a distorted octahedron. The site symmetry 
may be further reduced by the presence or absence of 
an Al** ion in one or more of the surrounding tetrahedral 
sites in the real crystal. These sites would, of course, be 
occupied only by Fe?* in the pure, normal spinel. These 
effects will all serve to make the oscillator strength f of 
octahedral Fe? in FeAl:O; greater than it is in MgO. 
Therefore, the optical absorption data is not yet of much 
help in determining the absolute concentration of 
octahedral Fe?*, and hence the fractional inversion in 
FeAl;O, However, it seems clear that the 1.23- and 
1.65-eV bands are caused by octahedral Fe?*, and may 
be useful in determining changes in the fractional 
inversion. 

The other absorption peaks seen for R56 and R62 in 
Fig. 7 above 1 eV and for R75 in Fig. 6 above 2 eV are 
probably associated with the Fe** ions. Just what the 
assignments should be are rather uncertain. The lowest 
energy peak for Fe*+ in octahedral coordination with 
oxygen in a-Fe;O; has been found by Bailey? at 1.49 eV. 
A similar peak at the same energy in Y3FesO12(YIG) 
has been found by Dillon? This is probably the 
644(8S) — *T3(4^G) transition. The first transition for 
tetrahedral Fe?* should be at an even higher energy 
since Fe?+ has a 5$ ground state and the crystalline field 
at a tetrahedral site is about half of that at an octahedral 
site. Therefore, the optical absorption characteristic of 
tetrahedral Fe?* should occur for v7 1.5 eV in these 
oxide spinels. The optical absorption of Fe’? tetra- 
hedrally coordinated by Cl" ions has been studied by 
Friedman.? He found a number of peaks starting at 
1.7 eV and extending toward higher energies. It seems 
reasonable to suppose that the 2.25-, 2.40,- and REN 
peaks in R56, R62, and R75 may be caused by tetra- 


s lev. T. Appl. Phys. 31, 39S (1960). 
s T É uen j Phys. Radium 20, 374 (1959), 
2H. F. Friedman, J. Am. Chem. Soc. 74, 5 (1952). 
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hedral Fet. More work is clearly necessary before such 
an assignment is convincing. 


5. Charge Transfer 


The last feature of Figs. 7 and 8 that remains un- 
explained is the hump in the curves for samples R56, 
R62, and R97 at 1.35 eV. Samples R56 and R62 have 
fairly large concentrations of Fe**+ ions in them, as will 
be shown later. These Fe?* ions are probably on tetra- 
hedral sites, as are the more numerous Fe?* ions. When 
the same cation in two different charge states exists on 
the same lattice site in a crystal, it is relatively easy for 
a photon to cause an electron transfer between the two 
differently charged ions. Such a process is believed to. 
be the source of a rather broad, structureless absorption 
in NiO,” Y2FesOy2,% and Fe,O,." It is suggested here 
that this charge transfer absorption commences at 
about 1.35 eV in the present spinel crystals, and 
accounts for the hump in the curves. It would then be 
responsible for the dull, yellowish-brown appearance of 
the pleonaste spinels, R56 and R62, by transmitted 
light. See Table I. 


CONCENTRATION OF TETRAHEDRAL Fe** 


The shape of all of the absorption curves in Figs. 7 
and 8 between 0.3 and 0.9 eV is nearly the same. 
The absorption in this portion of the curve is caused by 
the presence of tetrahedral Fe?*. Since the shapes are 
all similar, the value of @ at, say, the 0.61-eV peak 
should be proportional to the concentration of tetra- 
hedral Fe?*. The Fe concentration in R97 has been 
carefully measured as 3.60.4 10” atoms/cm’. Since 
there is no siga of any optical absorption due to octa- 
hedral Fe?*, and since the 1.34-eV peak, which is the 
only peak that might be associated with the presence 
of Fé*, is very weak, it is concluded that tetrahedral 
Fe?* is by far the dominant species of Fe in crystal R97. 
The peak absorption cross section e; is defined by 


op =N, (2) 


where ap=absorption coefficient at the peak, and 
N — concentration of the absorbing ions. The computed 
value is p= 2.9X 10-7? cm? for the 0.6-eV peak in R97. 
From Fig. 7 the full width of the 0.6-eV peak at a= 0.5a 
is 0.22 eV. This width and the value of øp of 2.95 10 
cm? yields? an oscillator strength of f=4X10. The 
0.45-eV peak gives a somewhat smaller calculated value 
of f=1X10* ‘ 

The concentration of tetrahedral Fe** in the other 
crystals in Figs. 7 and 8 has been calculated on the 
assumption that tetrahedral Fe** concentration is 
proportional to ap at 0.6 eV. The results are given in 
Table VII along with the total concentration of Fe 


53 R. Newman and R. M. Chrenko, Phys. Rev. 114, 1507 (1959). 
an Op and A. B. Chase, J. Chem. Phys. 32, 1575 


nu E. Engeler (private communication). 
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"TABLE VII. A comparison of the concentration o tetrahedral Fe?* 
ions (optical) with the concentration of total Fe (chemical).^ 


Conc. tetra. Fe?* Conc. Fe total 


Sample atoms/cm? atoms/cm? x 
R68 «5x10r 4X 10" s 3X1075 
RS4 0.9X 1019 2X10" s 6.0x 107! 
R97 3.6X 10" a 3.6x 10? w 2.4X107 
R62 2.7107 5.0 107! w 1.9107! 
RS6 4.0 107! 6.2X 107! w 2.7X107 
R67, R75 1.5X 10? w ~i 


a a =assumed to be identical to concentration of total iron. $ - determined 
irom an emission spectrograph to within a factor of two. w — determined 
from a wet chemical analysis to +10%. x —Íraction of tetrahedral sites 
occupied by Fe?*. 


determined chemically. For the FeALlO; crystal the 
absorption was too large to be measured, and the 
expected curve, shown dashed in Fig. 7 between 0.4 and 
0.8 eV, has been estimated from the chemical deter- 
mination. The agreement with the known portion of 
the curve is quite good, showing that the proportionality 
assumption is not too bad. In Table VII the agreement 
between the optical and chemical values for R68 and 
R54 is again satisfactory. However, for samples R62 
and R56 the chemical concentration is higher than the 
concentration of tetrahedral Fe?+. Judging from the 
chemical analyses of these samples in Ref. 23 and the 
optical absorption curves in Fig. 8, the remaining 40% 
of the Fe probably occurs as Fe?*. 


MAGNETIC SUSCEPTIBILITY 


The magnetic sysceptibility x of an as-grown single 
crystal of FeAl.O, quite similar to sample R67 was 
measured from 1.9 to 290°K. The measurements were 
made by Kouvel and Hartelius with an apparatus? 


9.5 £05°K 


— PRESENT RESULTS 
——- ROTH 
—— LOTGERING 


% [ 200 500 


Fic. 9. The magnetic susceptibility x and its reciprocal x 
versus temperature for a single crystal of FeAl;0, similar to R67. 
The x^! curves obtained by other authors for powder samples are 
also shown. 


56 T, S. Kouvel, C. D. Graham, Jr., and J. J. Becker, J. Appl. 
Phys. 29, 518 (1958). ganay I , J. App 
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described previously. The susceptibility was inq 

s e- 
pendent of applied field for values of the magnetic fielg 
H, between 1.5X10° to 8X 10? A/m. (2000 to 10 009 
Oe.) The susceptibility in rationalized mks units js 
shown in Fig. 9. The susceptibility is defined as the 
ratio of the induced magnetic moment per unit volume 
(A/m) to the applied magnetic field (A/m), and js 
dimensionless. These x values can, if desired, be con- 
verted to emu/cm* by dividing by 4v. The x-ray 
density of FeAl;O, is 4.26 g/cm’. There is a very sharp 
peak in the x versus T curve at 9.5-E0.5?K. The y~ 
curve for the single crystal above 9.5°K is also shown 
in Fig. 9 compared with the recent data of Roth® and 
Lotgering®’ on powder samples. The present results lie 
between these two previous curves, and indicate that 
the x is somewhat dependent on the exact nature of the 
sample. Small changes in the fractional inversion and 
the Fet content are probably responsible for the varia- 
bility in x above 10°K. They are presumably responsible 
for slight shifts in the peak temperature. Roth? found a 
peak at 8°K. These measurements have not yet been 
made on annealed single crystals similar to R75. 

The 9.5°K temperature for the peak in the x versus 
T curve closely matches the “kink” in the K versus T 
curve at 11-41°K. It would seem that these two 
features are closely related. Since the “kink” in the K 
curve appears to be independent of the Fe’ concen- 
tration, it is postulated here that the position of the 
peak may also be nearly independent of the Fet 
concentration. Thus the behavior of the x versus T 
curve may be a function of the individual Fe?* ions, 
and may not be associated with a long-range ordering 
of the magnetic moment of the Fe**. A localized Jahn- 
Teller distortion of the Fe?* ions at 11°K could, perhaps, 
account for much of the observed behavior. It is known 
that the normal spinel FeCr:O4 undergoes**-9 such à 
distortion at 135°K, which is rather higher than 1ts 
Curie temperature?! of about 85°K. Even though 
Roth® found no evidence for a crystallographic distor- 
tion in the cubic FeAl,O; at 4°K, it is possible that 
there is a small, local distortion around each Fe?* site. 
This local distortion, however, may not be large enough 
to produce a long-range, cooperative, crystallographic 
distortion. The neutron scattering results of Roth also 
failed to show any evidence of long-range ordering 1” 
FeAl,O, at 4°K. This rather unusual behavior 9 
tetrahedral Fe?* may be associated with the four lona 
lying energy levels, the closest of which is about 305 
above the singlet ground state. Since the ground state 15 
a singlet with no orbital or spin degeneracy; the £e, 
should have no magnetic moment in zero magnetic 
fields at temperatures well below 30°K. Thus, one might 


51 F, K. Lotgering, Phys. Chem. Solids 23, 1153 (1962). 
58 C. H. Shomate, Ind. Eng. Chem. 36, 910 (1944). 962). 
3$ E. Whipple and A. Wold, J. Inorg. Nucl. Chem. 24, 23 (1 h 
wA. Wold, R. J. Arnott, E. Whipple, and J. B. Goodenous™ 
. Appl. Phys. 34, 1085 (1963). 

& F. K. Lotgering, Philips Res. Rept. 11, 190 (1956). 
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expect that signs of a magnetic ordering at these tem- 
peratures would not occur. It would appear desirable to 
make x measurements on mixed crystals of FeALO; 
MgAl.O, in order to test some of these speculations. 
This has not yet been done. 


CONCLUSIONS 


(1) A method of growing good single crystals of 
FeAl,O,; has been devised. A post-growth heat treat- 
ment in the appropriate atmosphere permits some 
control of the concentration of the residual Fe** and the 
fractional inversion. 

(2) The thermal conductivity K of synthetic crystals 
of FeAl;0, and MgALO, and natural crystals of 
Mg; Fe;Al;0; has been measured from 3 to 300°K. At 
all temperatures, the K decreases with increasing Fe?* 
concentration x for O€ x € 1. For x—1, the K exhibits 
the unusual behavior of decreasing continuously with 
decreasing temperature for T<300°K. In all of the 
crystals the heat is transported by phonons. 

(3) The phonon scattering from the tetrahedral Fe?* 
is quite strong. This scattering produces a change in 
the slope of the K versus T curve at 11-E1?K, inde- 
pendent of the Fe?* concentration. These results suggest 
that the anomaly at 11?K is associated with the 
individual Fe?* ions, and is not caused by the occurrence 
of an antiferromagnetic state. 

(4) The magnetic susceptibility x of a single crystal 
of FeAl;O, has been measured from 1.9 to 290°K. The 
x versus T curve shows a peak at 9.50.5°K, in reason- 
able agreement with the 11°K peak found from the K 
measurements. It is suggested that this behavior may 
be characteristic of tetrahedral Fe?*, and is not to be 
associated with the onset of an antiferromagnetic state. 

(5) Thermal conductivity measurements on CoAl,04 
show that tetrahedral Co?* is not as effective a scatterer 
of phonons as is tetrahedral Fe?* in the range from 4 
to 300°K. 

(6) The optical absorption coefficients of crystals of 
Mg;.,Fe.AlO, for 0€xX1 have been measured at 
300°K for photon energies between 0.1 and 7.0 eV. The 
tetrahedral Fe?* produces a broad, double-peaked 
absorption band between 0.4 and 0.6 eV. This absorp- 
tion band has been used to measure the value of x. 


Additional optical absorption bands have been seen 
which are due to octahedral Fe** and octahedral Cr^*. 
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APPENDIX: THE OPTICAL ABSORPTION OF 
Cr IN MgALO, SPINELS 


Almost all natural crystals of MgAl,O, contain traces 
of Cr. The chemical analyses in Table III and Ref. 23 
show that Cr is indeed present in the natural crystals 
R54, R56, and R97. The Cr* ion has a very strong 
preference for the octahedral sites in the spinel, and 
thus occurs only in these sites in MgAl:O4. The optical 
absorption of Cr+ in spinel has been studied by many 
people," and at low concentrations in natural 
crystals of reasonably pure MgALO, it gives rise to two 
broad, intense absorption bands at 2.25 and 3.21 eV. 
Crystal R54 in Fig. 7 shows these two bands with the 
peaks at 2.30 and 3.18 eV, in good agreement with the 
previous literature. The emission spectrographic 
analysis of this sample showed a Cr content of 2 10? 
cm? to within a factor of, say, 2. The calculated 
oscillator strengths are, to within a factor of 2, 
f=2X10~* and 5X 10~ for the 2.30- and 3.18-eV peaks, 
respectively. If the synthetic crystal containing a 
nominal 0.5 weight 95 Cr studied by Schlossmacher is 
considered, the derived f value for his 2.21-eV peak is 
{=0.9X10~ to within a factor of 2. Thus the assign- 
ment of the 2.30-eV peak in R54 to Cr^* is reasonable 
in terms of both the photon energy and the magnitude 
of the absorption coefficient. The optical absorption 
from a known amount of Cr in a-Al;O; has been 
measured by Maiman et al.” Their results for the 223- 
and 3.02-eV absorption bands averaged over the three 
directions in the crystal give f values of approximately: 
210-4 and 6X 10-5, respectively, for the two bands. 
These results for a-Al;O; are in very good agreement 
with those for MgAI;O;. Of the other various references 
already cited for Cr^* in MgAl,0;, none have enough 
data to enable a calculation of the oscillator strengths 
to be made. 

Let us now consider the various bands found in R97 
between 1.5 and 4.0 eV. The 2.24-eV band and the 
double-peaked band at 3.23 and 3.35 eV are believed to 
be the two Cr+ bands that appear prominently in R54. 
The chemical analysis of R97 gave a Cr content of 
3.5-E0.3X 10? cm. From this value the calculated 
oscillator strengths are f 8X 10~ for the 2.24 eV and 
f[—16X 107 for the “3.29-eV” band. These f values are 
about 3 times larger than before. These two bands are 


? Q. Weigel, Neues Jahrb. Mineral. Geol., Beilageband 48, 274 
1923). ; 
: a 9. Weigel and H. Ufer, Neues Jahrb. Mineral. Geol., Beilage- 
band 574, 397 (1928). 
4K. Schlossmacher, Z. Krist. 72, 447 (1930). 
$5 K. Schlossmacher, Z. Krist. 75, 399 (1930). 
tt K. Schlossmacher and I. Meyer, Z. Krist. 76A, 377 (1931). 
€ Q, Deutschbein, Ann. Phys. 14, 712, 729 (1932). 
6 E. Kolbe, Neues Jahrb. Mineral Geol, Beilageband 694, 
183 (1934). p 
© O. Schmitz-Dumont and D. Reinen, Z. Electrochem. 63, 978 
On A. Ford and O. F. Hill, Spectrochim. Acta 16, 1318 (1960). 
71 A. Neuhaus, Z. Krist. 113, 195 (1960). 
7: T. H. Maiman, R. H. Hoskins, I. J. D'Haenens, C. K. Asawa, 
and V. Evtuhov, Phys. Rev. 123, 1151 (1961). 
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TABLE VIII. Comparison of the optical absorption transitions 
of Crt in MgALlO, with those of [Cr(H20)s]**. 


Energy, 4v, in eV 


"Transition R54 R97  [Cr(H;O)c P* 
"SERT s g 
1 d 
TAUF) 2.30 224 223 
—3T,06) 272 2.60 
— 47; (F) 318 323,335 305 
— 341 0G) 37 
— TP) 4.64 


also assigned to Cr?*, and their f values, though larger, 
are reasonable. The somewhat larger f values are 
thought to be caused by the presence of the Fe?* in the 
tetrahedral sites enhancing the transition probabilities 
of the Cr+ in the octahedral sites. The chemically 
determined concentration of Fe in R97 is (3.60.3) 
X109 cm. Thus, 1 out of every 42 tetrahedral sites is 
occupied by Fe. Since each tetrahedral site in the spinel 
lattice has 12 nearest octahedral sites, there will be 
4.3X.10? cm-? octahedral sites, or 1/7 of the total 
which have an Fe atom as a next-nearest neighbor. If 
the Cr atoms are randomly distributed over the octa- 
hedral sites, then 1/7 of them will be affected by the 
presence of the tetrahedral Fe in R97. For low concen- 
trations of Fe and Cr the fraction of the Cr sites that 
are paired with Fe is proportional to the Fe concentra- 
tion. The fraction of the Cr sites in R54 that are paired 
is 0.004. The optical absorption spectrum of the Cr’+ in 


CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


R54 is thus not influenced by the presence of the Fea: 
In R97 the case is different. The absorption of the 2 24 
and “3.29” eV bands is apparently enhanced. Also the 
**3.29" eV band has been split into two components, as 
mentioned previously. In addition there are 3 normally 
very weak octahedral Cr?* peaks which show up in 
Fig. 7 as absorption bands at 1.89, 2.72, and a shoulder 
at 3.7 eV. The 1.89- and 2.72-eV bands have been seen 
before in blue or blue-green, natural spinel crystals*es 
which are known" to contain Fe?*. The 3.7-eV band 
has been seen before by Weigel? in synthetic MgALO, 
containing Cr+. The assignment of these several bands 
to various known transitions in octahedral Cr** is given 
in Table VIII. The corresponding energies found for 
[Cr(HzO)s f* are also shown.” The agreement is good, 
indicating that the several absorption bands in R97 
between 1.5 and 4.0 eV are caused by the Cr*+. None of 
them can be attributed directly to optical transitions 
within the d shell of the tetrahedral Fe?*. This is in 
contrast to the suggestions of Schlossmacher**-99 who 
believed that the absorption spectrum in the visible 
range of the Fe containing MgALO, crystals was 
actually due to the Fe?*. In his crystals as in R97, it is 
caused by the presence of Cr+. The Fe?* has, however, 
strongly modifed the Cr?* absorption spectrum. The 
absorption band at 1.34 eV in R97 is at too low an 
energy to be caused by the octahedral Cr?*. As explained 
before, it is believed to be associated with the presence 
of some tetrahedral Fe?*. 


73 Y. Tanabe and S. Sugano, J. Phys. Soc. Japan 9, 766 (1954). 
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The electron distribution function is calculated for a. many-valley semiconductor in an intense electric 
field for the case of anisotropic scattering by acoustical phonons at low temperatures. 


I. INTRODUCTION 


[5 this paper we shall consider the low-temperature 
galvanomagnetic properties of a semiconductor in 
an intense electric field. In order to calculate the low- 
temperature anisotropic transport properties of semi- 
conductors such as n-type germanium and silicon, it is 
necessary to abandon both the simple picture of 
spherical constant energy surfaces and the assumption 
of phonon equipartition. 

Zylberstejn and Conwell! have studied the deviations 
of the phonon distribution from equilibrium in a semi- 
conductor in an intense electric field at low tempera- 
tures. They show that the deviation is small for a sample 
of small dimensions and small carrier concentration. We 
shall treat this latter case and assume that the phonon 
distribution remains in equilibrium. 

The predominant electron scattering mechanism in a 
pure semiconductor at low temperatures is by acoustical 
phonons and the equilibrium number of such phonons 
of crystal momentum q is given by 


Nq=1/(e!*?—1), (1) 


where s is the velocity of sound, & is Boltzmann's 
constant, and T is the absolute temperature. The 
conservation of energy and crystal momentum require 
that the acoustical phonons with which an electron of 
energy e can interact have a maximum energy of the 
order of 2(ems?)!? where m is the effective electron mass. 
At high temperatures the average electron energy e, 
even in an intense electric field, is such that 


2(ems?)*«&kT and therefore gsN,—RT. (2) 


This is the case of equipartition of energy. 

At low temperatures, on the other hand, an intense 
electric field may result in the average electron energy 
€ being sufficiently large such that 


2(ems?)? 
2(ems?)"2>kT and Noc exp— =- 23) 


Since the probability of phonon emission and absorption 
are proportional to 1+Nq and Na respectively, we 
may in this case neglect Ne compared to one and 
consider only acoustical phonon emussion; this is the 
case of zero-point scattering. 
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Stratton? has calculated the electron distribution 
function in the limit of zero-point scattering for the 
case of spherical constant energy surfaces and isotropic 
scattering, while Paranjape? has calculated the electron 
temperature for a Boltzmann distribution for the same 
case. Koenig, Brown, and Schillinger (K.B.S.)! have 
recently applied Shibuya’s® theory of hot electrons in 
order to describe hot electron phenomena in n-type 
germanium at low temperatures, Shibuya's treatment is 
based on the assumption of phonon equipartition and is 
therefore not applicable under the condition for which 
K.B.S. have used it. We see in fact from Table II 
of their paper that the electron temperatures, or the 
corresponding mean electron energies, do not satisfy 
the equipartition condition (2). 

Stratton's zero-point distribution function and 
Paranjape's calculation predict an electron mobility 
n^-E-95, where E is the electric field. This is in agree- 
ment with the low-temperature conductivity measure- 
ments of Bray and Brown.* 

In this paper we shall extend the previous calcula- 
tions?^? by considering a many-valley semiconductor 
with ellipsoidal constant energy surfaces in an intense 
electric field and a magnetic field. We treat the case of 
acoustical phonon scattering in the zero-point limit, and 
allow for anisotropic scattering in our calculation. 

In Sec. II of this paper we set up the Boltzmann 
equation for a many-valley semiconductor, and in 
Sec. III we derive expressions for the energy and 
momentum relaxation in the case of anisotropic zero- 
point scattering. We solve the Boltzmann equation for 
the distribution function in Sec. IV. 


II. BASIC EQUATIONS 


In a previous publication, the author has shown 
that the Boltzmann equation for the case of ellipsoidal 
constant energy surfaces can be expressed as two 
coupled equations for S and A, the isotropic and aniso- 


? R. Stratton, Proc. Roy. Soc. (London) 242, 355 (1957). 

3 B. V. Paranjape, Proc. Phys. Soc. (London) B70, 628 (1959). 

4S. Koenig, R. Brown, and W. Schillinger, Phys. Rev. 128, 
1668 (1962). 

5 M. Shibuya, Phys. Rev. 99, 1189 (1955). 

6 R. Bray, D. Brown, Proceedings of the International Conference 
on Semiconductor Physics, Prague, 1960 (Czechoslovakian 
Academy of Sciences, Prague, 1961). 

1H. Budd, Phys. Rev. 131, 1520 (1963). 
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tropic parts of the distribution function, respectively, L 


€ ^ 
cE/- Vy S--An(cE'- Vy A) -—D' X B.V, A-C'A, 


mo 
Is(eE’-VpA}=C'S, a 
with p’=ap, E’=aE, B’=RB, e= p?/2mo, | 
mo ug s 
— 0 0 
mz | 
ma WO TE n 
filo m. o A mo 
a=| 0 — 0 = m ; : 
My d (m,mym;) ^ 6) 
0 0 m 
mo 


Mz 


d 
| 
where E and B are the electric and magnetic field vectors, € and p are the electron energy and momentum and | 
Mz, My, m. are the electron masses corresponding to the three principal axes of the ellipsoid. Í 
An(x) and Is{x} represent the anisotropic aud isotropic parts of x respectively, and C" represents the collision | 
operator in p' space. | 
Assuming that '4 can be described by an energy-dependent tensor whose principal axes coincide with thoseof — | 
| 


the ellipsoid: 
T2(€) 0 0 
2l 0 Te) 0 | ; (6) 
0 0 72(€) 


and neglecting the An{eE’-Vp’A} term (see Appendix II), we obtain straightforwardly : 


i dS ; [ c (cE/-- re E x eB! /mo)+ (e2/m?) B/ (B'- cE) 4 d 
A= ge AEE /m) PL à )4- (By?/ 70) + 07/7335 
C'S= AR 2 J2 ( QETE-4-2 (eE 5 B//m)? J | d 
3m? de de \1+ (em?) (:2/72)9- (By’?/ Ty) (17) 


where V/— (p'/mo) and 7?— TzTyTz. - 
We shall consider the solution of this equation after having calculated the energy relaxation term C'S, and the 
relaxation time tensor T. 
II. CALCULATION OF ENERGY RELAXATION TERM AND RELAXATION TIME TENSOR 


In this section we shall derive expressions for 7 and Ó'S for a many-valley semiconductor with anisotropic 
scattering by the zero-point acoustical phonons. Denoting by M 7?, cz and M7”, cr, the matrix elements and elastic 
constants corresponding to the longitudinal and transverse modes respectively, we obtain 


2v 
h (2x5) 


Cf(p)= | [acreroarn -onae ftam rave narret (9) 


Xr r= (qsz.r/26,7)M 1,7°6(e(p+ q)—€(p)—9gsz,7), 


where 
Yı r= (qsz,m/ 261,7) M zr" (e(p— a) — (p) -asz.7) ; 
8 
ps12=C1, psr^—cr and where p is the density. The matrix elements have been calculated by Herring and Mo j 
and we see. from Table V of their paper that Mg? and Mz? are simple polynomials in cos'6, where 0 is the angie 


between the principal axis of the ellipsoid and q. 
Transforming the elliposidal constant energy surfaces to spheres (5) in both the electron and phonon spaces 


8 C. Herring and E. Vogt, Phys. Rev. 101, 944 (1956). 
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8 / dy f(p'-+4/) OHN e) SN eX +X] 


+ J YTS AN fp) (A+ Ne) CY z/-- 2j , (10) 


where 


Xa — (q*s1,7/2cr,7)M r,r? (cos'6*) 
X6(e(p'4- q') — elp) 

Y 1,2 = (q*sz,r/2cr,7) M pr? (cos?0*) 
&(e(p'— q')— e(p^)--g*sz,r). 


The principal axis of the ellipsoid has been taken as 
the polar axis in q' space; and mr and mz are the 
transverse and longitudinal masses, respectively ; 


—q*sz,7), 


g* = (mr/mo)"?q' [1+ (K —1) cos?’ }/2 , 
K? cos6! mr, 


[1+ (K — 1) cos?6^]u ' mr 
where 6' is the cone angle in q' space. 

We first calculate Ó"S, where S is a function of energy 
only. If the scattering were isotropic the matrix elements 
would be angularly independent and Ó^S would depend 
only on energy. In our case, Mz? and Mz?are angularly 
dependent and Oy Ni ll therefore i in general be a function 
of the direction of p'. We shall calculate the average of 
C'S over all directions of p’, i.e., 


(C S). | f C'SdQy J Í J My, (12) 


where dQ,, is the differential solid angle in p' space. 
Assuming that: 


(1) The distribution function varies slowly over an 
energy interval equal to the phonon energy: 


dS (qs)? dis 


S(e+qs)= SO torah 5 25 = (13a) 


(2) The average electron energy € is sufficiently large 
such that JV, is negligible compared to one: 
2 (ems)? kT (13b) 


JijowL wh 


Kx d ( 
Mal —\=zel 1 1+yz( 0.15— 
Mz (=) Za ( T p 


and carrying out the required integrations we obtain 


('s 6A m;m,;? d 4v2 d. 
y gm ya] els mz aga ys — 
V2] pe? Je ( Š 5 pix -) | i 


(14) 


where 


: Kx? Ka? 
MET 
9 R* Ry 
! 1 Kx? Kx 2 
vem | Re satz (= jte Mr? (= )e/z. 
D R? R? 


and 
R-[1--(K —1)x7]?*. 


A relaxation time does not generally exist when the 
matrix elements are angularly dependent; we follow 
the procedure of Herring and Vogt and define two 
principal relaxation times as follows: 


re f fotodan f | [mim 
(15) 
1/7, — i / px C 6,40, / | | p Ny, 


with 
$u— p'-a/f', $-— p b/»', 


where a and b are unit vectors parallel to the major 
and minor axes of the ellipsoid, respectively. We obtain 
the following expressions for the two relaxation times: 


1 48(2x)? 
——— nmi" mM Nun, (16) 
Ti, Sh 
where 
To— TA 
Nu=T, N.=——, 
and 


: SL Kz Sr Kx 
T Í ase Gs ane u Gi |: 
0 cL R?/ cr R? 
The matrix elements and average elastic constants are 


given in Table V of Herring and Vogt’s paper: 


1.5Kz* 


n] 
+ a , 
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KN Kxz2W? Kx? Kx? 2 
M JC) = (s Yr) ; W-—, 
R? R? R? R? Za 
0.375 0.625 
=; (111) valley 
3 1 C44 cs +30" 
CL ast 2c444- 0.6c* ; —-— 
€T 3 2 
———— (100) valley , 
Cue catie" 
MES 9 1 1 
—, (111) valley E =|). (111) valley 
3cr 8 C44 cast 3C* 
TW r= 
z (100) vall 6 : : (100) vall 
——, (100) valley zi = ) ; valley , 
CL cutie” cat C7 


where Sq and Z, are the deformation potential constants for dilation and uniaxial shear. Inserting these matrix 
elements in Eqs. (16) and (14) we obtain the relaxation time tensor and C'S (see Appendix I). 


IV. THE DISTRIBUTION FUNCTION 
We shall now solve Eq. (8) for S, the isotopic part of the distribution function. Substituting Eq. (14) in Eq. (8), 


we obtain 
G He?de 
S=exp— i , (17) 
Q 2 ( gE'7E'-- [(e/mo) E’- B' P7? ) 
3moNL4- (€278/mo2) L(B2?/t2)+ (By2/,)2- (B./72) J 
where 
64 mTmj?m* _ 4&2 
He. nr? Veo. 
V2h4p 5 


We shall consider the case of a small magnetic field stant energy surfaces, Z,—0 and neglect elastic aniso- 
(w’r?1) and we neglect the field-independent term in tropy (c*=0), the last factor in our expression for T. 
the denominator of the integrand of Eq. (17) (see becomes equal to 1, as does X (y), and Eq. (18) becomes 


Appendix II). We then obtain identical with the distribution function calculated by 
S=exp—(¢/kT.)*?, (18) Stratton? Since the low-field mobility for aco 
0. e 


ES B phonon scattering varies as J~*?, our expressi 

o : 

TES j DUO R. independent of temperature. , 

A i Ee TeEX =| (19) Using Eqs. (19) and (20) we obtain the following 
expressions for the mobility u and the low magnetic 


4 


with 
[Eno T3/2/s; 15 324 2/5 field Hall coefficient R. 
T.=X(y) =| , (20) —PE-98. R=B 
[mzsz2/ k }!® 162?*N,srp H= D LM g 
where ve where P and B are complicated functions of the 
o ERA E K= KL orientation of the electric and magnetic fields W1 
v T x ; D respect to the crystallographic axes but are independent 
v2 2/5 of the magnitudes of the applied fields. The field depend- 
= (27) (25) ence of the mobility is in agreement with the expen 
2 2 mental results of Bray and Brown.* A discussion of the 
16 Q4-1/K) ental r E 
: s 3 à orientation dependence will be considered in 4 futu 
and y is the direction cosine of E with respect to the publication. 
longitudinal axis of the ellipsoid. yo is the low-field 
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APPENDIX I 
We shall express all the required integrals of Sec. III in terms of the elementary integrals 
i x"dx 
(m,n)z f — 
o [1+ (K=1)x2 " E^ 
ZU-XÓAK QW) y [0.1—0,45K -15K*(4,2)-2KW (0.05— 15K (4,2)4-1.75K*(6,4)) 
T + GCW)*(0.15(4,2) — 1.5K(6,4-++1 A5K*8,6)) Hr (KW)*(4,2)— K(6,4))},, (A2) 


nc za 020, —3)+2KW(2, —1)4- K*W*(4,1)) -ssKW?*(Q, —1)— K(4,1)) 
tyzs1[0.15(0, —3)—1.5K (Q2, —1)4-1.75K*(4,1)--2KW (0.15(2, — 1) — 1.5K (4,1)4-1.75K*(6,3)) 
+ (KW)*(0.15(4,1) — 1.5K (6,3)+1.75K%8,5)) ]+-yrs7(KW)2((4,1)—K(6,3))} , (A3) 
Tezi, —1)4-2KW (4,1)4- (CW) ($3) KW (0,1) K3) 
n [0456, —1)- 15K ($1) -1.75K*(5,3)2- 2KW (0.156,1)— 1.5K (53)4-1.75 (5,5) 
+ (KWy(0.15($3)— 15K 5) LK (7) Her KWy(,3)— Kgs) . (Ad) 


The integrals (m,n) are all straightforward. For K —1, (m,n)=1/(m-+1), and for K1 one can easily show that 


1 
(m4-2, n4-2)— 


n(K— 


| e] (A5) 


for n7z£0. We have tabulated some of the integrals in Table I. The remaining ones are easily calculated by means 
of Eq. (A5). 


"TABLE I. Integrals. 


K»1 K=1 K«1l 

0, =1) 52 scam ; Kin sin (1—K)12 
| 2 2(K—1)* 2 2(1—XK)^ 

sinh-1(K —1)!2 Jd — sin"t(1— K)" 
"dante einem] amare 

1 [K-4 tan(K-1'" 1 = 
En zs 3 ^ (K-)^ ] ! ü-KyL 3 (0-5 
APPENDIX II 


We see from Eq. (18) that € &T,/T (0.6) &3&T, and therefore using Eq. (3) the zero-point scattering condition 
is satisfied for electric fields large enough such that 
n * 2 


1/2 E all? S 2/8, 354 1/5 
/6 Bs cea e] > T, (A6) 
k [mrs17/ kno 16>°*) ‘Srp. 
where m* is a complicated average of the longitudinal and transverse masses and is given approximately by 
m*=łmr(K*2—1)/(K—1) 


d s* is the average sound velocity. 
B SEE putes energy suríaces and 2,—c*—0, Eq. (20) becomes approximately 


2. A (nst/ EL Ens T**/s 5» T . 


i i à i ivalent to neglecting ms*/kT, 
-i dent term in the denominator of Eq (17) is equivalent to 1 : 
x ee s men s the An{cE’-V,-A} term is a correction to D Mar inp Ar due to 
NE higher order terms than the first-order spherical harmonics, i.e., terms of the form £z ,$), etc. 
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Spin-Lattice Relaxation Time of Conduction Electrons in Sodium Metal* 


FREDERICK VEsCrAL, N. S. VANDERVEN, AND R. T. Scuumacuert 
Department of Physics, Carnegie Institute of Technology, Pittsburgh, Pennsylvania 
(Received 6 January 1964) 


We have measured the temperature dependence of the spin-lattice relaxation time T, in sodium metal 
from 14 to 77°K. Our results agree with those of Feher and Kip, and Levy, in the range 20 to 40*K, but 
disagree with Feher and Kip above 40°K. As a consequence, we find marked deviation from the relation 
Ty — constant even at the highest temperatures of our measurements. The temperature dependence we ob- 
serve lends strong support to the recent calculation of Yafet in the temperature range 35 to TK. 


I. INTRODUCTION 


HE question of the temperature dependence of the 
conduction-electron relaxation time in metals has 
recently been re-examined by Vafet. He finds that in 
the very low-temperature limit, T«&0p, where 05 is an 
appropriate Debye temperature for the lattice, the 
spin-lattice relaxation time T should vary as T=, con- 
trary to the previous expectations that T1T— constant 
over the entire high- and low-temperature range. The 
only metal in which a temperature-dependent T, has 
been observed over an appreciable range of tempera- 
tures is sodium. In that metal Feher and Kip? measured 
T, from 300 to 4°K and concluded that their data 
supported the relation 7;7=3.0X 107° sec-°K over the 
entire range of their measurements. In fact, their 
published data do not support this conclusion below 
55°K, nor do the data of Levy.’ All these investigators 
show longer T's in the range 20 to 40°K than predicted 
by T,T£:3.0X10-5* sec-?K. Narrower lines than 
expected at 20°K were also observed in this laboratory 
during a recent remeasurement of the sodium con- 
duction-electron susceptibility. 

We report here a remeasurement of T,(T) which 
supports the conclusion of Yafet' that Tı(T) deviates 
from the 1/T dependence as T becomes less than an 
appropriate Debye temperature for the sodium lattice. 
We describe those aspects of the apparatus relevant to 
this particular measurement, some precautions taken in 
sample preparation and experimental measuring tech- 


nique, and we compare our results to the conclusions of 
Yafet. 


II. EXPERIMENTAL DETAILS 
A. Microwave Apparatus 


The measurements were made entirely at X band 
with a single klystron superheterodyne spectrometer 


* Supported by grants from the National Science Foundation 
and the Alfred P. Sloan Foundation. From a thesis submitted 
by Frederick Vescial in partial fulfillment of requirements for the 
Ph.D. degree. 
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described previously. The microwave cavity was 
cylindrical, and was operated in the TEon mode, which 
requires that the sample be inserted into the geometric 
center of the cavity. In this configuration the microwave 
field H; is vertical in the laboratory and the de field, Ho, 
is in the horizontal plane. The cavity was matched by a 
variable coupler of the type described by Gordon. 

The samples, which will be described in detail below, 
were contained in the innermost of a series of concentric 
Pyrex tubes. The sample tube was 3 mm o.d., a few 
centimeters long and was completely sealed. The next 
tube, 5 mm o.d., was approximately one meter long. 
Tt was inserted into a tube 8 mm o.d., which is perma- 
nently attached to the microwave system. The 8 mm 
o.d. tube extends halfway into the cavity, is coaxial with 
it and sealed at the lower end, and in the other direction 
extends to the top of the Dewar system, where it forms 
part of an "air-lock" system similar to one described 
by Estle? Samples were changed while cryogenic liquid 
was still in the Dewar system by removing the 5 mm o.d. 
tube. 


B. Cryogenic Apparatus and 
Temperature Measurements 


The relative bulkiness of the cylindrical cavity 
necessitated the use of a Dewar system whose outer 
diameter (that of the nitrogen Dewar) is 5 in. The 
inner diameter of the inner Dewar is greater than 25 in. 
The cryogenic liquid used in our measurements was In 
all cases liquid hydrogen. However, the hydrogen 18 
entirely excluded from the interior of the microwave 
cavity and the inside of the waveguide. The outermost 
pyrex tube is also not in direct contact with the hydro- 
gen, but is separated from it by a cupronickel alloy | 
sleeve which fits snugly around the Pyrex and 5 
soldered to the top of the cavity. The sample is thus 
removed from direct contact with the bath by 8. meta 
sleeve and three Pyrex tubes, all of which nest together 
rather snugly. All the tubes except the sample tube 
were filled with a fraction of an atmosphere of He gas 
to improve the thermal contact between the sample 2n 
the bath. 


J. L. Hall and R. T. Schumacher, Phys. Rev. 127, 1892 qo 
also J. L. Hall, thesis, Carnegie Institute of "Technology; 
SCR Rev. Sci. Instr. 32, 658 (1961) 

. P. Gordon, Rev. Sci. Instr. 32, : D 

1T. L. Estle and G. K. Walters, Rev. Sci. Instr. 32, 1058 (1961). 
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Temperature measurements were made with a 
nominally 390-Q carbon resistance thermometer. This 
thermometer was calibrated from 4.2 to 77°K by a 
platinum resistance thermometer which had been 
calibrated by the National Bureau of Standards. The 
carbon thermometer became rather insensitive above 
50°K, but was still adequate for our measurements. The 
thermometer could not be placed in direct contact 
with the sample, which was in any event sealed in the 
3-mm-o.d. tube. The top of the sample tube projected 
just above the top of the cavity, and the carbon 
thermometer was pressed into direct physical contact 
with the tube at that point. We relied on the physical 
contact plus the helium exchange gas to keep the 
thermometer at the same temperature as the sample. 
In the region from 14 to 20°K there was always adequate 
agreement between the temperature of the bath as 
deduced by vapor pressure measurements and the 
temperature indicated by the thermometer. Since the 
thermometer was not in any poorer contact with the 
bath than was the sample, we have no reason to believe 
that the sample temperature differed in any significant 
way from that registered by the thermometer. 

Above the boiling point of liquid hydrogen data were 
taken by allowing the system to warm gradually after 
the liquid hydrogen level dropped below the bottom 
of the microwave cavity. During this time the field was 
swept back and forth through the resonance while the 
derivative of the resonance line was continuously 
recorded as was the temperature of the carbon resistance 
thermometer. This procedure gave data at about every 
2°K, except for a few intervals during which the micro- 
wave system was being retuned. The warmup procedure 
from 20 to 70°K took about 60 min. Our best estimate 
of the type of error introduced by this procedure is that 
the sample was warming faster than the thermometer, 
and that the error became worse as the temperature 
became higher. Errors in temperature measurement are 
probably responsible for the increased scatter in the 
data above 50?K. 


C. Sample Preparation 


Our sodium samples were prepared by the technique 
introduced by Levy: Briefly, the technique is to prepare 
dilute sodium-ammonia solutions, and then freeze 
them. Metallic sodium precipitates out upon freezing 
in the form of dispersed small particles. We used 
sodium, kindly given to us by Dr. Raymond Bowers of 
Cornell University, which had a resistivity ratio of 
more than 7000. We found it advisable to dissolve this 
sodium in ammonia which had first been cleaned by 
dissolving some normal purity sodium in it. This 
procedure removes any impurities in the ammonia 
which might react with sodium. The Soa mere 
confined to a length in the 3-mm-o.d. tube of a ect 2 
mm in order to confine the sample to the sr ues 
feasible volume to minimize effects of magnetic fie 
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inhomogeneity, After the required amount of ammonia 
had been introduced, the sample was frozen in liquid 
nitrogen and sealed off under vacuum a few centimeters 
above the sample. The 3-mm-o.d. tube was inserted 


in a long 5-mm-o.d. tube and stored under liquid 
nitrogen. 


D. Some Experimental Precautions 


The relaxation times which we measured were in the 
range from 5X10 to 5X10-7 sec. Such relaxation 
times are awkward to measure directly, so we took 
advantage of the equality of the transverse and longi- 
tudinal relaxation times* and measured the linewidth. 
Such a procedure is line shape-dependent, so it will be 
given extended attention below. Here we want to point 
out that at low temperatures, even for relatively short 
relaxation times, a serious danger to accurate measure- 
ment lies in partial saturation of the resonance line. 
The danger arises not from ordinary saturation broaden- 
ing, but from the Overhauser effect.” We have observed 
at temperatures as high as 20°K effects predicted by 
Kaplan? and observed by Gueron and Ryter.! If the 
external field is swept slowly from above resonance 
while the rf field is strong enough to saturate the 
resonance even partially, the sodium nuclei will become 
partially polarized through the Overhauser effect. 
They will in turn contribute to the total field seen by the 
electrons an amount H* which depends on the micro- 
wave power and the size of the sodium particles, Thus, 
the electrons still satisfy the resonance condition with 
the applied field an amount H* below the resonance 
field, and absorption is still observed. 

If large, the effect is readily identifiable, and it is 
simply required, in order to observe the natural 
linewidth, that the microwave power be sufficiently 
small. Our klystron operated at approximately 50 mW. 
We were able to observe Overhauser polarization at 
20°K and below in this manner with powers incident 
on the cavity which were 20 dB down from the klystron 
power. Our linewidth measurements were made with 
powers 40 dB down from 50 mW, and we required 
power-independent linewidths over a 10-dB range of 
powers before the data was considered acceptable. 
Such low power levels required the full sensitivity of 
the superheterodyne system. This system’ stabilizes the 
klystron to the sample cavity, so the single klystron 
superheterodyne system cannot be instantaneously 
retuned as the cavity and sample warm continuously 
above 20°K towards 77°K. Fortunately the relaxation 
time is short enough above 30°K to eliminate the 
saturation problem, and a simpler homodyne detection 

at higher power could be used. In this system the signal 
at the modulation frequency is taken directly from the 


5 T. R. Carver and C. P. Slichter, Phys. Rev. 102, 975 (1956). 
? A W. Overhauser, Phys- Rev. 89, 689 (1953). 3 

1 J, I. Kaplan, Phys. Rev. 99, 1322 (1955). 

11 M. Gueron and C. Ryter, Phys. Rev. Letters 3, 338 (1959), 
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Fic. 1. (a) Derivative of ideal Dyson absorption line in the 
region of the anomalous skin effect. (b) Typical absorption 
derivative observed in our samples. Horizontal scale is not the 
same as in (a). 


microwave crystal detector. Biasing power is provided 
by a small amount of microwave power reflected from 
the cavity, which is deliberately mismatched. The 
: klystron is still locked to the microwave cavity, and 
e retuning the system as the cavity frequency drifts 
r takes only a few seconds. 

We might remark parenthetically that we have made 
intensive measurements of the Overhauser shift in the 
hydrogen range, where the shift H* becomes as large 
as 5 G. The sodium heat capacity in this region is large 
enough to prevent the metal particle temperature from 
rising above the bath temperature because of the 
microwave power absorbed by the spin system. Diffi- 
culties of this type have plagued Overhauser shift 
measurements on sodium in the helium region for 
years.” Unfortunately, direct measurements of the shift 
sufficiently precise to be of interest in themselves 
proved impossible, and our data was only good enough 
to be in qualitative agreement with the magnitude of 
H* as expected from spin susceptibility’ and Knight 
shift? measurements. 


II. RESULTS AND DISCUSSION 


In most of the temperature region over which our 
measurements were made, the conductivity of the 
sample is in the anomalous region; i.e., the electronic 
mean free path A is larger than the classical skin 


"12 Private communications from many investigators. See also 

C. Ryter, Phys. Rev. Letters 5, 10 (1960). 

t E R. Garvey and H. S. Gutowsky, J. Chem. Phys. 21, 
(1953). 
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depth ô. In addition most of the sodium particles were 
large compared to A. The resonance absorption line 
shape for this case has been calculated by Kittel? based 
on the calculation by Dyson. The derivative of the 
power absorbed versus (I — I o)T; in Kittel’s theory 
is plotted in Fig. 1. The line shape is characterized by 
extreme asymmetry, the ratio of the low-field peak to 
the high-field peak being given by A/B=5. The full 
width at half-maximum of the low-field peak is given by 


AH =1.36/y-T1, (1) 


where ye is the electron gyromagnetic ratio. 

The line shape given by our samples is shown in the 
lower half of Fig. 1. The asymmetry is markedly less 
than 5:1, and was in fact always about 2.5:1. We 
believe our lines did not have the Dyson-Kittel shape 
because our samples in fact contained a mixture of 
particle sizes, with a substantial contribution to the 
signal coming from particles whose dimensions are 
comparable to or smaller than A. For these particles 
the line shape would be Lorentz, with 4:B=1. This 
rationalization of our line shape is supported by the 
large amount of data taken on the Overhauser shift 
in the hydrogen temperature region, where we observed 
in detail the diminution of the intensity of the shifted 
line as the shift increased. This effect and its relation 
to particle size is discussed by Ryter.” 

For a symmetrical Lorentz line, the relation between 
Tı and AH is given by 


AH —122/4,T,. (2) 


For simplicity we have used Eq. (1) in converting from 
AH to T. A comparison of the observed and theoretical 
line shapes, along with the relations (1) and (2), leads 
us to believe that use of (1) does not result in T's more 
than 5% high. Since the temperature dependence of Tı 
rather than its absolute value is our main interest, the 
uncertainty is of no importance. 

The results of our measurements on all samples that 
showed a temperature-dependent AH to 14°K are 


ex FOE Lt Ue i ea 
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Fic. 2. AH versus T. O, from Kip and Feher (Ref. 2); +, from 
Levy; e, our data. Solid line and error flags explained in text. 
Dashed line is from the relation T7 —3X10-* sec —°K, con- 
verted to AH by Eq. (1) of text. 


“F, J. Dyson, Phys. Rev. 98, 349 (1955). 
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Fic. 3. The theoretical and experimental T's of Table I on 
a log-log plot. O, Yafet; +, €, experiment as in Fig. 2. The solid 
line is T1T—constant drawn through theory and experiment 
at 200°K. The constant, 2.6X10-® sec —°K, is not the same as 
in Fig. 2. 


shown in Fig. 2, along with the data of Feher and Kip? 
and of Levy? Approximately 25 data points below 20°K 
and 25 points between 30 and 50°K are represented by 
the smooth curve drawn through them and error flags 
giving the limits within which the data fall at several 
temperatures. The somewhat more scattered data above 
50°K (twenty points) are shown as heavy dots. The 
straight line above 30°K is a least-squares fit to the 
data between 35 and 50°K; the equation of this line is 


AH —0.0277T — (0.577+0.010). (3) 


The data above 50°K is omitted from this fit not 
because of its scatter but because of line shape changes 
associated with the change between classical and 
anomalous skin effect regions. 

The data, converted to Tı, is replotted in log-log 
form, in Fig. 3. The solid line is one drawn by Feher and 
Kip; the dots, crosses, and circles are experiment and 
theory as explained below. We note several features of 
the results from Fig. 3. (1) Our data extrapolates at 
high temperatures towards the 7,7—3.0X 10-5 sec-°K 
relation found by Feher and Kip. (2) If the theory of 
Yafet is fit to our data at 40?K it deviates significantly 
from TT constant as high as 150°K. (3) Our data 
fit Yafet's theory from 77°K, our highest temperature, 
to 30°K, where our Ty's begin to fall well below the 
theory; and Yafet’s theory, fit to our data at 40?K, 
agrees with Kip and Feher's data at high temperatures. 

Feature (3) deserves discussion. The spin-lattice 
relaxation mechanism discussed by Yafet is an electron- 
phonon scattering accompanied by a spin flip. The 
mechanism which couples the two spin states is the 
spin-orbit interaction. Yafet’s theory? has three param- 
eters. The Debye phonon spectrum 1s assumed, but two 
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Debye 6’s are used, Oy and 6y. Ow is the Debye tem- 
perature for longitudinal phonons; the NV stands for 
"normal," since electron-longitudinal-phonon scatter- 
ing is the normal resistivity scattering mechanism. 
umklapp processes are important because there is a 
term in the spin-flip matrix element involving only 
transverse phonons which vanishes for small electron 
scattering angle, but which contributes if the electron 
wave vector changes by the phonon wave vector plus a 
reciprocal lattice vector. Use of a different Debye 8 
for processes to which transverse phonons contribute 
corresponds to a modification of the strict Debye model 
which is realistic if @y is chosen less than Oy. At least 
one transverse mode corresponding to a low 6 has been 
observed in neutron diffraction by Woods ef al 

Dr. Yafet was kind enough to furnish us the details 
of his calculation, the results of which for Ox = 250°K, 
6u — 200°K, are shown in Fig. 9 of his paper.' We were 
able to use his work to calculate the points shown in 
Fig. 3. The third adjustable parameter, the strength 
of the spin-flip matrix element, was chosen to fit the 
data at 40°K, and several values of 0y and 6g were 
tried. The results are quite sensitive to 4y, with values of 
the pair (8, 8v) of (250°K, 80°K) and (250°K, 200°K), 
respectively, lying distinctly below and above the data. 
The points shown on Fig. 3 correspond to fy =250°K 
and #y=150°K. For @;=150°K, longitudinal phonons 
contribute only from 7% to 15% to the total relaxation 
rate. It is worth pointing out that the Debye 8 deduced 
from heat-capacity measurements is (156-+2)°K. 
Table I presents the calculated and measured Ty’s at 
several temperatures. 

Finally, we note that below 30°K the experimental 
Tys deviate from theory; the data give (T1)imons 
=$ (T3) ratet at 25°K, and the theoretical T; is beginning 
to rise very sharply just as the data are beginning to 
approach a temperature-independent value. Our line- 


TABLE I. Calculated and measured relaxation times. 


Calculated* Measured 
T in °K TıX 1077 sec TYX107 sec 

20 8.85 

30 3.00 tee 

35 2.13 2.00° 

40 1.47 1.479 

45 1.22 1.12» 

50 1.00 0.97» 

75 0.52 0.52» 
100 0.357 0.28* 
150 0.223 0.22* 
200 0.128 0.13* 


«From Vafet (Ref. 1); fit to data at T=40°K, with parameters ôy 
=150°K, 0x —250?K. È 

è Calculated from Eq. (1), where AH was obtained from a least-squares 
fit of data between 35 and 50°K, as explained in text. Point at 75°K was 
also calculated from (1) and the least-squares AH., 

¢ From Kip and Feher (Ref. 2), their Fig. 13. 


15 A. D. B. Woods, B. N. Brockhouse, R. H. March, and R. - 
Bowers, Proc. Phys. Soc. (London) 79, 440 (1962). 
1$ Douglas L. Martin, Phys. Rev. 124, 438 (1961). 


CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


———————— MM ——— 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


; A1290 VESCIAL, VANDERVEN, AND SCHUMACHER 
width in fact becomes temperature-independent at the order of 16 our present limit. Thus a substantia] v 
AH 0. G. A simple linewidth measurement of the improvement in homogeneity would not extend our 
proton resonance in a water sample of the same dimen- range of valid data very significantly. The calculated 
sions as the sodium sample showed the field inhomoge- relaxation times in the hydrogen temperature region are 
i niety over the sample to be about 0.1 G. The field was long enough to allow a direct measurement of T, by 


so inhomogeneous over the small sample because the transient techniques. 


j ratio of the magnet gap to the pole cap diameter was ACKNOWLEDGMENTS | e 
j 5:12; the large gap was required in order to fit in the : io 

i rather bulky microwave cavity and the glass Dewars. We would like to thank Dr. Y. Yafet for bringing this 

3 'Thus our data may not be compared to Yafet's calcu- problem to our attention, and for continued interest | 

1 lations below about 35°K, where the measured line- and encouragement. We are particularly indebted to ES 
3 widths is 0.4 G. We can say in defense of our poor him for allowing us to use his unpublished numerical 


4 homogeneity that even if Yafet’s calculations are results to obtain the values of 71 quoted in Table I | 
: correct only down to 20°K the linewidth would be on and displayed in Fig. 3. | 
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Absence of Pronounced Quadrupole Effects in the Nuclear Resonance 
of In" in a Noncubic Environment* 


LAWRENCE H. BENNETT AND REX J. SNopGRasst 
Metallurgy Division, National Bureau of Standards, Washington, D. C. 
and 
Department of Physics and Astronomy, University of Maryland, College Park, Maryland 
(Received 23 December 1963) 


The nuclear magnetic resonance of In!!5 has been observed throughout the cubic phase of Pb-In alloys, 
at 77 and 300°K. The Knight shift is 0.86% and is essentially unchanged with composition. The linewidths 
increase rapidly with In concentration, probably mostly due to pseudodipolar interaction. The average 
electric field gradient at a In!!5 nucleus for any configuration of Pb neighbors is about q~3X 10% cm“, l 
about 20 times smaller than for Cu having a Zn second neighbor in the fcc Cu lattice. The small value of the 
field gradient is discussed in terms of the behavior of the charge oscillations, and is related to the small 
change in Knight shift for Pb?" in these same alloys. 


I. INTRODUCTION clear magnetic resonance has been observed in the liquid 


HE nuclear electric-quadrupole moment interacts phase only. However, when Ga is substituted in the face- | 
with the electric field gradient at the site of the centered cubic lattice of Cu, the cubic environment | 
nucleus. In a position of cubic symmetry, the electric substantially reduces the quadrupole coupling for many . ® 
field gradient vanishes and there is no quadrupole inter- of the Ga atoms and the nuclear magnetic resonance is E 
action. In the liquid phase of a metal, atomic motion is observable. Knight suggested that it might be possible 


usually sufficiently rapid to average out the field gradi- to make suitable alloys of other metals for which the * 
ents. Knight! pointed out that, in some cases, strong TeSonances are unobservable in the pure state. 
quadrupole interactions may obliterate the nuclear Following this suggestion, one of the present authors” 


magnetic resonance (NMR) signal of a quadrupolar observed a strong NMR signal of In!!5 substituted in | 
nucleus in the noncubic lattice of the pure metal, but the cubic lattice of a 95% Pb—5% In alloy. If the in- | 
that the nucleus may still exhibit a strong NMR signal dium is randomly distributed throughout the lattice, j 
when substituted in the cubic environment of another bout one-half of the indium atoms have only lead for 
metal. For example, in pure orthorhombic Ga, the nu- first, second, and third nearest neighbors. The intensity 


| 
: of the In!5 resonance compared to the Pb?" in this | 
j S one "of this gon ous s part of ED thesis submitted alloy led to the conclusion that these atoms were ob- 
o the Universit SITAS: i t : E D q 
js he National ae of dM p served together with POE Gi te cut intensity: 
t Guest worker (1963-64), University of Paris, Faculté des An alternative possibility is that all In!!5 nuclei partici- 
Rc uud one W, Service de Physique des Solides, pate in the resonance but that some of the satellite in- 
rance. : D ; 
1W. D. Knight, in Solid State Physics, edited by F. Seitz and tensity 1s smeared out. 
D. Turnbull (Academic Press Inc., New York, 1956), Vol. 2. For In order to distinguish between these possibilities the 
further information on nuclear quadrupole effects in solids, see — ——————— 
_M. H. Cohen and F. Reif, ibid., Vol. 5 (1957). ? L. H. Bennett, Bull. Am. Phys. Soc. 4, 251 (1959). n 
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In"? resonance was looked for in other alloying propor- 
tions. Very surprisingly, the In"? resonance remained 
observable with almost no change in intensity per atorn 
throughout the cubic lead phase.’ Figure 1, for example, 
shows the observed room-temperature Pb"? and [n"* 
NMR absorption derivatives in alloys containing 0.5 
at.% In and 35 at.% In. There is no essential difference 
in the In"? resonance at 77°K in any of the alloys 
studied. : [ 

Since the start of this research, the pure quadrupole 
resonance of In! in tetragonal indium metal has been 
discovered*^ with a room-temperature quadrupole 
coupling constant of approximately 30 Mc/sec. The 
high-field nuclear resonance of In!5 in indium metal 
has also been observed,®’ but with extremely large 
quadrupole effects in agreement with the known 
quadrupole coupling constant in the tetragonal lattice. 
The In" resonance has also been observed in liquid 
indium,* but here the rapid motion washes out quadru- 
pole interactions. Nuclear magnetic resonance studies? 
of In" in an unstrained single crystal of InSb revealed 
negligible quadrupole interactions, due to the high 
symmetry of the zinc blende lattice of InSb. Thus, none 
of these other results is directly comparable with our 
surprising observation of the absence of pronounced 
quadrupole effects in the noncubic environment of the 
nondilute Pb-In alloy. 

In Sec. II we present the experimental data on the 
In» NMR absorption derivatives linewidths, centers 
and relative intensities as a function of concentration of 
In in Pb, magnetic field, and temperature. In Sec. III, 
we discuss the significance of the observation of the 
In" resonance in the ‘‘noncubic” environment of the 
high indium concentration alloys in lead. We relate 
this absence of pronounced quadrupole effects to the 
small changes in Knight shift in the Pb” resonance in 
these same samples discussed in a preceding paper.” 


II. EXPERIMENTAL METHODS AND RESULTS 


Indium, a column III element, is dissolved in lead, a 
column IV element. Lead is a face-centered-cubic metal 
with melting point of 327°C. All measurements were 
made in the cubic phase which persists up to about 68 
at.% In. In!5 is 96% abundant, has a nuclear spin 
[—9/2, a large nuclear magnetic dipole moment 


3 L. H. Bennett and R. J. Snodgrass, Bull. Am. Phys. Soc. 7, 228 


1962). 

‘ Ae R. Hewitt and W. D. Knight, Phys. Rev. Letters 3, 18 
1959). V 
ow. W. Simmons and C. P. Slichter, Phys. Rev. 121, 1580 
(1961). 


23 z A E D Rev. Letters 9, 255 
Oe P. Flynn and E. F. W. Seymour, Proc. Phys. Soc. (London) 
TR R. O Shulman, B. J- Wyluda, and P. W. Anderson, Phys. Rev. 
GER. J. Snodgrass and L. H. Bennett, Phys. Rev. 132, 1465 
1963). 
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(a) 
Pb In 


99.5 Pb -0.5In 
7 KOs 


65 Pb -35 In 
7 kOe 
(c) 
65Pb-35In 
5.8 kOe 


Fic. 1. First derivative of the room-temperature nuclear mag- 
netic resonance absorptions of Ph% and In! in (a) 99.597 
Pb—0.5% In at 7.0 kOe, (b) 65% Pb—3557 In at 7.0 kOe and 
(c) 65% Pb — 35^; In at 5.8 kOe. 


(5.5 n.m.), and a large electric quadrupole moment 
(1.16 b). 

All of the resonances were observed at both 77° and 
300°K with a Varian wide-line spectrometer. The experi- 
mental techniques and the preparation of the alloys 
have been discussed previously.”" Typical resonances 
are shown in Fig. 1. Since the Pb? and In!!* resonance 
differ in shape and width, it is somewhat misleading to 
compare the recorded absorption derivatives in order 
to determine relative intensity. Instead, a point by 
point integration was made of each resonance, and the 
relative areas of the resulting absorption curves were 
compared. The assumption was made that the absorp- 
tion is zero in the wings when the derivative is zero. 
This assumption has the effect of ignoring flat wings, 
and thus underestimating the contribution of In!!5 
satellites. 

If we make the reasonable assumption that we are 
observing all of the Pb? nuclei, then a comparison of 
the relative intensities of the Pb’ and In!!* resonances 
in the same sample gives an indication of the portion of 


n R. J. Snodgrass and L. H. Bennett, Appl. Spectry. 17, 53 
(1963). i 
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In'!3 ABSORPTION LINEWIDTH AT 300*K (kc/sec] 
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Fic. 2. In" absorption linewidth versus In 
concentration at 300°K and 7.0 kOe. 


the possible In!!5 resonance we are observing. This is 
independent of frequency, filling factor, and other ex- 
perimental parameters, provided only that there is no 
modulation broadening. This comparison showed that, 
with the exception of the 68% In sample, there was al- 
most no change in intensity per atom of the In'5 
throughout the cubic lead phase. The absolute intensity 
of the In!5 in any of these samples corresponded to 
about 10 to 20% of that possible, in good agreement 
with the intensity expected from the central transition 
alone. We conclude that we are observing only the cen- 
tral line of essentially all In!5 nuclei in the alloys, at 
both 77 and 300°K. 

There is virtually no second-order quadrupole shift 
at 7.0 kOe or above. For the 35% In sample, Fig. 1[ (b) 
and (c)] shows the room-temperature resonance ab- 
sorption derivatives of In! and Pb?" at 7.5 and at 5.8 
kOe. The In!5 resonance at the lower field shows some 
sign of second-order broadening. Any shift in center be- 
tween these two In! resonances (normalized to 5.8 
kOe) is less than 1 kc/sec. 

The presence of substantial ordering or clustering 
would of course change the main result of this paper, 
that we are observing only small quadrupole effects in 
a noncubic environment. An x-ray study” of the 20% 

— In and the 35% In sample used in this research revealed 
— no short-range order. The short-range order param- 
eter in the ith shell around an A atom is defined as 
;—1—Pap,/ Xs, where Pag; is the probability of a 
B atom in the ith shell around an A atom, and X p is the 
fraction of B. If a, was — —0.05 or greater in 
tude, it would have been detected. Clustering 


Schwartz and J. B. Cohen (private communication). 
3. Rider and P. L. Roney, J. Inst. Metals 91, 328 (1963). 
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short-range order were this high, fewer than 1% of the 
In would be in cubic environments in the 50-50 alloy. 

The In!!^ linewidth as a function of In concentration 
is shown in Fig. 2. The increase in the In"5 linewidth 
with In concentration can be understood if the pseudo- 
dipolar interaction is assumed to be about 10 times the 
ordinary dipolar. The pseudodipolar interaction was 
found to be about 10 times the dipolar for the Pb?" 
linewidth” in these same samples. No careful studies 
of the field dependence of the linewidth was made— 
it is believed that a combination of second-order quad- 
rupolar broadening at the lower fields, and anisotropic 
and inhomogeneous isotropic Knight shift broadening” 
at the higher fields could be important. The 68% In 
point does not fall on the smooth curve in Fig, 2. 
From the Pb?" Knight shifts, we had previously" con- 
cluded that this alloy was in a two-phase region. 

The resonance center is largely independent of con- 
centration, occurring at 6.586-40.001 Mc/sec at 7000 
Oe. This corresponds to a Knight shift of 0.86+0.01%, 
which is close to the value found in the tetragonal phase 
of pure In? (0.82-+0.04% for the isotropic shift), or in 
the liquid phase of In metals (0.792-0.0366, the + tak- 
ing into account the variation with temperature, and 
chemical shifts). There is no a priori reason for expecting 
this close agreement. 


IH. DISCUSSION 


The most important result of this work is the observa- 
tion of the In!5 resonance in the “noncubic” environ- 
ment of nondilute Pb-In alloys. This result was un- 
expected in view of the large quadrupole moment of 
JIn"5 and the observations of Rowland and Bloembergen" 
on the Cu-Zn system. In that case Cu was the solvent, 
Zn the solute. In pure Cu, the Cu® resonance is reduced 
by cold working to 0.4 times the intensity in the well- 
annealed metal. An average electric field gradient of 
q=1.6X 10" cm-? produced by cold working accounts 
for this observation. As Zn is added to Cu, the Cu® in- 
tensity is further reduced until the resonance is unob- 
servable for Zn concentrations of more than about 20%. 
Rowland concluded that no Cu® nuclei were observed 
when Zn destroyed the cubic symmetry in the first or 
second coordination shell. Rowland estimated that a 
gradient q> 1.6X 10? cm-? is needed to account for the 
loss of Cu* nuclear contributions to the NMR line for 
Zn nearest and second-nearest neighbors. In a recent 
direct measurement, Redfield!5 showed that the quad- 
rupole frequency for second neighbors was 1980 kc/sec 
(corresponding to g=7X10% cm-3), and for first 
was probably greater than 5 Mc/sec, consistent with 
Rowland’s estimate. 

In order to contrast the Pb-In results with the Cu-Zn 
work, it may be meaningful to think of In as the solvent, 


^T. J. Rowland, Ph.D. thesis, Harvard University, 1954 
(unpublished); N. Bloembergen and T. J. Rowland, Acta. Met. 
1, 731 (1953); T. J. Rowland, Phys. Rev. 119, 900 (1960). 

15 A. G. Redfield, Phys. Rev. 130, 589 (1963). 


| 
| 
| 
| 
| 


————- oor Oooo 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


». 


i 


4 


wa — 


| 


NUCLEAR RESONANCE OF In!!!" 


TABLE I. Average electric field gradients at nuclei in metals. 


Cu Having 


Cold worked ^ Zn 2d In in Alin Al-Mg or 
Cu neighbors In-Pb — ALZn alloys 
q(cm?) —— 16X10? 70x10? 3x10? 17x102 


Pb as the solute. We do not observe the In in the dilute 
region (high In concentration) due to the tetragonal 
structure. But, consider a hypothetical cold-worked 
cubic In metal in analogy to the cold-worked cubic Cu 
metal. As the solute (Pb) is added, the In"® intensity 
per atom would be expected to decrease. Thus, for ex- 
ample, we would expect the In!5 resonance in the 50% 
In-50% Pb alloy to be unobservable (unless there is 
substantial clustering or ordering, which does not seem 
to be the case). In fact, the resonance intensity in the 
50% In-50% Pb alloy is very large, corresponding to 
the observation of the central transition of all the In! 
nuclei. Using a method described by Rowland, and 
assuming we have no more second-order effects than dis- 
cussed in Sec. II of this paper, we obtain q«4.6X 10? 
cm"? for the average electric field gradient at a In!!5 
nucleus for any configuration of Pb neighbors. From the 
fact that the satellites do not contribute to the observed 
lines, we find g>1.4X10” cm~. Thus we can take 
q73X10? cm? as the average gradient at the In 
nuclei, which is comparable to the field gradient in 
pure cold-worked Cu, and is 20 times smaller than that 
found for Cu having Zn second-nearest neighbors, as 
is shown in Table I. 

Kohn and Vosko” have explained the magnitude of 
the field gradients at the Cu site due to various solute 
atoms" on the basis of a redistribution of the conduction 
electron charge density near the solute.!5 The reduction 
in Cu NMR absorption intensity with increasing solute 
concentration was taken as confirmation of a long-range 
oscillatory behavior of the electron density. This same 
type of oscillatory behavior was shown to be responsible 
for the change in Knight shifts in silver??? cadmium,” 
and the alkali metals.” 

In the case of Pb-In, it has already been pointed out! 
that the changes in Knight shift of the Pb? upon the 


16T, J. Rowland, Progress in Material Sciences (Pergamon 
k, 1961), Vol. 9. 

DS ROSA S: A Vosko, Phys. Rev. 119, 912 (1960. 

18 See also J. Friedel, Phil. Mag. 43, 153 (1952); A. Blandin, 
E. Daniel, and J. Friedel, ibid. 4, 180 (1959); A. Blandin and E. 
Daniel, Phys. Chem. Solids 10, 126 (1959) ; and A. Blandin and J. 
Friedel, J. Phys. Radium 21, 689 (1960). 

3T. J, Rowland, Phys. Rev. 125, 459 (1962). 

? T, E. Drain, Phil. Mag. 4, 484 (1959). s 

2L. Rimai and N. Bloembergen, Phys. Chem. Solids 13, 257 


(1960). 
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addition of In are small. No definite conclusion could 
be reached as to the importance of the charge oscilla- 
tions in the lead-indium system. In any case size effects 
are probably too big to be neglected. We have not yet 
made a quantitative estimate of the field gradients. A 
possible explanation for both the small change in Pb?! 
Knight shift and the absence of In™5 pronounced quad- 
rupole effects is that the charge oscillations are greatly 
damped or that they oscillate rapidly and are effectively 
averaged out. It might then be appropriate to say that 
the In atoms in the cubic Pb phase screen themselves to 
“look almost like” Pb atoms, as far as Knight shifts and 
quadrupole effects are concerned. Of course, the aniso- 
tropic Knight shift broadening? depends on the In 
disrupting the cubic symmetry of the lattice so this 
description is not completely adequate. 

It is also interesting to compare the Pb-In work with 
Webb’s measurements? on Al-Zn and Al-Mg, For both 
Pb”? and AI? only small changes in the Knight shift 
are observed. The average electric field gradient g at the 
site of an aluminum nucleus which is nearest neighbor 
of a solute (Zn or Mg) atom is found by Webb to be 
17X10” cm. This is an intermediate value between 
In-Pb and Cu-Zn. It appears as if the effective valence 
difference between adjacent metals having higher group 
(or atomic) valences is less than for those with lower 
valences. 

The angular correlation of the y-y cascade of In!!! in 
thin layers has been measured? as a function of In 
concentration. This experiment shows a relatively large 
increase in anisotropy in going from almost pure lead to 
Pb-1.5% In. Kaiser attributes this increase to the elec- 
tric field gradients at an indium atom due to its nearness 
to other indium atoms. He then compares his result to 
Bloembergen and Rowland's observation of the rapid 
loss in Cu NMR intensity upon addition of Zn. Our 
results show that the In-Pb result should not be com- 
pared to the Cu-Zn work. However, it is possible that 
an increased quadrupole interaction, reflected in the 
anisotropy, is associated with a first-order splitting of 
the satellites. 
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The Pb® nuclear magnetic 
containing Cd, In, Sn, Sb, Hg, TI, and Bi as solutes. 
portion to the solute concentration and in the oppos 
ing of the Thomas-Fermi type. The fractional ch 


between the impurity atoms and the lead atoms. The 


large widths, and small quadrupole effects 
of electronic charge. However, there is evid 


HE nuclear magnetic resonance (NMR) of Pb? 
has been accurately measured in a series of lead 
base primary solid solutions containing second (Cd, 
In, Sn, Sb) and third (Hg, TI, Bi)long period solutes. 
This is the first report of comprehensive Knight shift 
measurements on a polyvalent nontransition metal 
matrix. It is especially interesting to see if the theory! 
developed by Friedel, Blandin, and Daniel concerning 
the long-range oscillation of electronic charge density 
around impurity atoms in monovalent and divalent 
metals is capable of explaining our data for a polyvalent 
material. The major conclusion is that, in qualitative 
agreement with the Friedel theory, the valence of the 
solute is the controlling factor in the change of Knight 
shift with solute concentration. 
Figure 1 shows the change in Pb?" resonance fre- 
quency from pure lead metal at 7 kOe and 300°K as a 
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Fic. 1. Change in Pb?" Knight shift versus solute 
concentration at 300°K. 
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Astronomy, 


e primary solid solutions 
shifts in pro- 


all solutes is small Ak/k 20.05 c. 


t is not possible to determine if the small shifts, 


function of solute concentration.? The central resonance 
frequency of Pb?" shifts in proportion to the solute 
concentration (c). The shifts are in the opposite 
direction to be explained by impurity charge screening 
of the Thomas-Fermi type. The fractional change in 
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T— 
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Fic. 2. Lead solid-solution lattice spacings at 20°C. 
After Tyzack and Raynor (Ref. 3). 


Knight shift for all solutes is small Ak/~0.05 c. The 
experimental points were omitted for clarity. The 
experimental error is of the order of the separation of 
the In and Tl lines in Fig. 1. 


2'The Pb9-In data was described in detail in R. J. Snodgrass 
and L. H. Bennett, Phys. Rev. 132, 1465 (1963). J 
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NUCLEAR MAGNETIC RESONANCE IN Pb ALLOYS 


The lead solid solution lattice spacings? for the same 
solutes are shown in Fig. 2, A comparison of Figs. 1 
and 2 shows that the frequency shifts partially correlate 
with lattice parameter changes. However, Bi and Sb 
give identical changes in Knight shift while the former 
expands and latter contracts the Pb lattice. Tl and In 
are more widely separated in lattice spacings than in 
resonance frequencies. Sn causes a larger change in 
lattice parameter than Tl, but a much smaller change 
in Knight shift (and in the opposite direction) to TI. 
The correlation of the Knight shift results with Pauling 
or Goldschmidt radii is not good at all. From this we 
conclude that "size effects" are not the controlling 
factor in determining the Knight shift in these alloys. 
It is not surprising that there is partial correlation with 
lattice parameter data since the lattice spacing reflects 
not only size effects, i.e., misfit between solvent and 
solute, but also valence effects. The important parame- 
ter seems to be the difference in valence between the 
impurity atoms and the lead atoms. However, super- 
posed on this may be a slight period dependence 
associated with impurities of equal valence which belong 
to different periods of the periodic table A pure 
valence effect would perhaps predict zero shift for tin. 
The observed shift is indeed small. Similarly, the small 
deviation of Cd and In from Hg and TI, respectively, 
may be a period effect. However, Sb does not deviate 
from Bi. 

The absorption linewidths are shown in Fig. 3. They 
broaden rapidly upon the addition of solute. It was 
shown previously? that the Pb?" resonance depends 
linearly upon the applied magnetic field for the Pb-In 
alloys. Since the line shapes are predominantly 
Lorentzian, especially for the dilute alloys, it is difficult 
to estimate second moments for comparison with theory. 
The anomalously large breadth due to Bi may be 
associated with its large magnetic moment, high-spin 
and high-isotopic abundance, together with a large 
exchange or pseudoexchange coupling constants. It is 
also possible that T, effects enter. 

In conclusion, it seems somewhat surprising that the 
change in Pb Knight shift is so small, while the increase 
in widths is large. We believe the small change in Knight 
shift and the absence of pronounced quadrupole effects* 

3 C. Tyzack and G. V. Raynor, Acta Cryst. 7, 505 (1954). 

1 F. J. Blatt, Phys. Rev. 108, 285 (1957). 

5L. H. Bennett and R. J. Snodgrass, Bull. Am. Phys. Soc. 7, 
228 (1962), and to be published. 
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Fic. 3. Pb?" absorption linewidths versus solute 
concentration at 300°K and 7 kOe. 


at the quadrupolar In'5 nucleus in Pb-In alloys are 
related. In the absence of a quantitative calculation 
it is not possible to determine if the small shifts, large 
widths, and small quadrupole effects of In! are con- 
sistent with the theory of long-range oscillations of 
electronic charge. However, there is evidence that this 
theory may be capable of explaining our data. (1) The 
shifts correlate with valence. (2) The sign of the shifts 
is in the opposite direction to be explained by Thomas- 
Fermi screening. (3) The absorption linewidths are 
field-dependent. This might also be due to anisotropic 
Knight-shift broadening, but we have not excluded 
inhomogeneous isotropic Knight-shift broadening. 
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The partition function is derived for a gas 
uniform magnetic field. From it the steady terms ir 
total electronic magnetic susceptibility are calculated 
additivity of the paramagnetic and diamagnetic sus 
- approximation are discussed. 


I. INTRODUCTION 


EI purpose of this note is to derive expressions 
for the nonoscillatory free energy and magnetic 
susceptibility of a degenerate, nearly free electron gas. 
That is, we consider a gas of noninteracting electrons 
in the presence of a weak potential, due to a lattice of 
positive charges, and a uniform magnetic field. The free 
energy of the system is evaluated to second order in the 
temperature and field parameters with both paramag- 
netic and diamagnetic terms in the Hamiltonian taken 
into account; however, only nonoscillatory behavior 
is considered. The effect of the lattice on the de Haas- 
van Alphen oscillations will be the subject of a future 
M report. 
The calculation was made with two ends in view. 
First, considering the success of the pseudopotential 
concept,’ it appears that the nearly free electron ap- 
A proximation has computational value as well as being 
merely instructive. Second, we wish to go beyond the 
limitations of the effective mass approximation for 
describing lattice effects in the context of magnetic 
properties. A calculation is performed in Sec. ITI using 
available pseudopotential parameters for the alkali 
metals Li, Na, and K. There exists no completely 
satisfactory calculation of the role of correlation in 
metallic properties for the density range into which these 
metals fall and, since this topic is outside the scope of 
the present study, we shall only point out that correc- 
tions can be estimated by interpolation methods. 
Quantitative accord with experiment thus can not be 
'expected, but it is proposed that for metals with small 
nergy gaps the effect of the lattice is qualitatively 
counted for. The criterion suggested is that the ratio 
f the energy gaps to the Fermi level be much less than 
nity. 


] II. CALCULATION 

We consider N noninteracting electrons moving in a 
eriodic potential V(r), which occupies a very 
yolume Q, in the presence of a magnetic field 


tudy was supported in part by U. S. Air Force Office 
c Research Grant No. AF-AFOSR-260-63. 
C. Phillips and L. Kleinman, Phys. Rev. 116, 287 (1959). 
ohen and V. Heine, Phys. Rev. 122, 1821 (1961). 
yid 6, 1013 (1961). 
t v. 130, 1703 (1963). 
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of electrons in the presence of a weak periodic potential and 


1 the free energy and the zero-field-zero-temperature 
. The results are applied to the alkali metals. The non- 
ceptibilities and the inadequacy of the effective mass 


represented by the vector potential A= (— Hoy, 0,0). 
An electron will therefore be described by the 
Hamiltonian 


5C= (1/2m)(P— (e/c)A)?+-2uoH oS z4- V (r) 


mV). (1) 


As is well known, the eigenstates of 3Co are (apart from 
normalization) 
| Eas. o) — ed 079 H (y— yo)ei EH X, , (2a) 
E,(1,kz) = 2uoH o(n+1/2)+i?k2/2m-+ poll oc , (2b) 
where guo— Bohr magneton; X,—spin state, o= +1; 
n= (eH /hic)?y, qo — (ic/eH)?k;; H,— nth Hermite 
polynomial. 


The thermodynamic properties of the system can be 
studied conveniently by means of the partition function 


Z(y)=TrfexplL—v (Sot V) J}, y=1/kT. (3) 


Using the Schwinger trace formula® (the variable of 
integration has been changed to u=ys1) 


Z(y)&Tr(e-*) — -y Tr(V (r) 
1 
TY n. | duV (x)e~*5o('—4) V (r)e- ov 
0 


2Z(Q-ZQoAo. 0 


Since the Zeeman term commutes with the remainder 
of the Hamiltonian, the spin part of the trace is per- 
formed very simply to give 


Z(y)=2 cosh(uoHoy)[20(y)—21(7) +x) J. ©) 


zo(y) is known from the work of Sondheimer and 


5L. Landau and E. Lifshitz, Quantum Mechanics, (Addison- 
Wesley Publishing Company, "Reading, Massachusetts, 1958), 


Sec. 125. 
9 A. Saenz and R. O'Rourke, Rev. Mod. Phys. 27, 381 (1955): 


There is a misprint in Eq. (1.2.8) which should read: 
Tr(Q(s)) =Tr{e*2} —s Tr(be-*2) +s z (—1)”/n 


xTr{ bee if "duos f ast eu | 


l 
| 
:| 
; 


% 


(4) 


y 


e 
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Wilson’: 
2y(y) = Lary? (usl yy) esch (nol gy) , 


(6) 
a= (m/2sh y . we 


zi(y) can be calculated directly if V (r) is represented as 
a Fourier series 


V(r)- *x Veet, (7) 


where the K’s are reciprocal lattice vectors. Represent- 
ing the trace in the form 


gen 3 
Tr{B}= me? fafu > (kank:|B|kank:}, (8) 
LT)“ 


( n=) 


and using the generating function for products of 
Hermite polynomials? we obtain 


ai(y)=yV oly), (9) 


where Vo is the average crystal potential. For simplicity 
it will be assumed that V has a center of symmetry. 

The evaluation of z»(y) is straightforward but tedious. 
We begin with the expression 


Q1/3 1 
ajmara | du | dedki | dka 
Am)! Jo 


X X. | Gn. V(r) | k/n'k,)|? 
Xexp( —,E(n'k/) (12)--E(n,5,u). (10) 


First the spatial integrals over r and r’ in the matrix 
elements can be done, except for the y and y’ integrals, 
making use of (8). The sums over n and w’ are evaluated 
next using the generating function? and the y and y’ 
integrals then performed in terms of the variables 
yzEy'. At this point the remaining integrals, except for 
the « integration, are done without difficulty leading 
to the result 


zx (y) - Y'so()22 Vx? 
K 


1 hic yh 
xf exp | -Kiu Kuult) du. (11) 
0 H 


€ 2m 
K., and Ki, are the components of K, perpendicular and 
parallel to the magnetic field, and 
u= coth (uoHoyu)+cothluoH oy (1—u)]. (12) 
Combining (5)-(11) we have 
Z(y)=Zo(y){1-wWotar Ve 
+37 D' G(K,y) Vx}, (13) 
where G(K,7) is the integral in (11) and 
Zo(y) = 2Qecy*” (uoH oy) coth (poly) (14) 


"E. H. Sondheimer and A. H. Wilson, Proc. Roy. Soc. (London) 
A210, 173 (1951). 
* Eq. (A4) of Ref. 7. 
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is the partition function for free electrons. For conveni- 
ence, the term K=0 has been separated from the sum 
and this is denoted by the prime. 

For comparison the partition function has also been 
calculated for the case where the diamagnetic terms in 
(1) are omitted. This can be carried out easily using 
plane-wave states; the result is 


Z' (y) = Zda)! 1I—y4Vse-Myg 


| 


1 dh P LCD | 
2 (Qn)! «(&—K)— «()] 


where 
Zo (y) = 20ary-?? cosh (uH qy) , 
e(k) - I282/2m. 


It follows from these expressions that to second order 
in the potential the paramagnetic and diamagnetic 
effects of the field are not simply additive. 

The free energy and other thermodynamic properties 
can now be calculated. Since we consider here only the 
steady susceptibility, we may simply disregard the 
poles of (13) lying off the real axis in the inverse 
temperature plane? It is convenient at this point to 
separate Z(y) into field-dependent and independent 
parts retaining terms of no higher than second order in 
the field. Introducing the parameter B= poH we find 
uEPyS[1—3(8y)S] where S—u(1—u) and G(K,y) 
= 31 ol tve(K) JAPP (€(K1)/96)o[47e(K) ], where 


I yet 
A, gm. MISTI (15) 
a (1—5)'* 
Thus, 
Z()-Z0 H+ PZ), 
where 


ZO (y) = Rayt (31 VE tV e 


Tivi. Vell ive(K)]}, (16a) 
ZO (y) 20e?" (5? — My Vo 
Ti Ver YE Vieliye(K)] 
+ (1/192) $^ Ve (K)Pb[1ye(K)]. (16b) 


The quantity P= F—n¢, (n= N/9), will now be calcu- 
lated using Eq. (14) of Ref. 9. Asymptotic expansions in 
powers of the degeneracy parameter (yf) can be ob- 


tained for the contributions of the first three terms of — 


(16a) and (16b) to obtain 4. For the remaining terms 

the procedure described in Ref. 9 would be permitted 

only if y(2^1ye(K); since this is not the case for the 

alkali metals, the relevant parameter for these terms 

is a=4¢/e(K). - 
The calculation as outlined gives 


b= by py OHP (B/O, (17) 


? M. L. Glasser, J. Math. Phys. (to be published). 
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where 
bo = — (2/Vr)af (8/15)p5?—4 Vo Vet 
TYSVRDeéR)]^f(), (18a) 
MOs — p Qa// x) (P Vu LV et 
E! Vig (0) 
f(a) =va+ (a—1) tanh- a, 
g(a) a^ (1—a)7; 
P=- Qa// n) (PP AY P Y 
+ X7 VeLe(C) ?"g(a) 
+20! Vx2(e(Ki)/Le(K)}*")h(a)} , (20a) 
$0 — Ya? (a/r) {V 
— Ve (16/3) D! VaeCe() ^g" (0) 
+40! Vx(K)/Ee(K)]2)i'(a)) , (20b) 
a! (5—3a) 


2(—a? 


(18b) 


(19) 


and 


h(a)=$% tanh-/a4- (21) 


In order to use these expressions, the chemical poten- 
tial must be known to second order in the various 
parameters. It can be calculated by solving the equation 
(055/0t)a-I- — 0. Except in rare cases, the temperature 
and field dependences of ¢ are small enough to be 
neglected. Hence we need retain only o9? in this equa- 
tion and solve for ¢ to second order in the lattice poten- 
tial. This calculation is easily carried out and yields 


te (LEE (/ Vt) at 27^ Vr Lel) 1^7" (ao ], Q2) 


where £o 1 (9rn?) 3 (2rh?/m) and ag—4to/ (K). 

The magnetic susceptibility can now be calculated. 
Putting (17) into the formula x= — (1/H) (9F/8H)|o 
we find x= —2ue [d$ 4- (kT) 9 ]. To second order, 
the temperature-independent susceptibility is 


Xo= Xote [1— (1/809) 27 VitF(a) ], — Q3) 
where 
F (ag) = (ao)? tanh (ag)? 
do 1 (K) 
SS ao], (ao) D (24) 


and Xofree— uo/Ço is the susceptibility for free electrons. 
A similar formula can be obtained for the temperature- 
dependent term. 

An equation, resembling (23), for the paramagnetic 
susceptibility was published recently by Abe.” His 
formula is derived along the lines of this calculation in 
the Appendix. 


IH. DISCUSSION 


In this section the results of the preceding analysis 
are applied to the alkalis Li and Na, these being the 
only metals for which the spin susceptibility has been 


39 R. Abe, Progr. Theoret. Phys. (Kyoto) 29, 23 (1963). 
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"TABLE I. Pseudopotential and free electron 
parameters for the alkali metals. 


to (Ryd) Vx (Ryd) xm" oy" 
Li 0.355 0.112 0.54 025 — 
Na 0.235 0.010 0.43 0.20 
K 0.155 —0.018 0.35 0.18 


a (10° cgs vol. units). 


measured. We shall cast our analysis for these metals 
in the form of a one-parameter model in terms of the 
lowest nonzero Fourier coefficient of the crystal potentia], 
This is determined by the energy gap at the zone sur- 
face and estimates from band calculations??? are listed 
in Table I. An estimate for sodium based on Callaway's! 
pseudopotential showed that the sum in (23) is given 
almost entirely by the contributions fron the reciprocal 
lattice vectors of shortest length. The results obtained 
by summing over only these vectors and using the Four- 
ier coefficients listed in Table I are shown in Table II. 
F (ao) is a universal function for a given lattice; for the 
bcc case F(ao)=—6.33 for the vectors of type 2m/a 
(1, —1, 0) and —4.29 for those of type 2r/a (1,0,1) 
where a is the lattice constant. The calculation for Na 
carried out using Callaway's pseudopotential gave a 
result much larger than that listed in Table IT. This is 
consistent with the observation that the energy gap 
(at the point N) found by Callaway is an order of mag- 
nitude larger than that found in most other calculations. 

Estimates of the correction to the susceptibility due 
to correlation have been made on the basis of Silver- 
stein’s! work and are listed under x* in Table I. The 
entries in Table II denoted by a star were obtained by 
including these adjustments. The starred values of the 
paramagnetic susceptibility are in close agreement with 
Abe's? values, obtained by a Sampson-Seitz procedure. 
Considering that the sum in (23) has been underesti- 
mated and that correlation has been accounted for in à 
very crude fashion, agreement with experiment for Li 
and Na appears to be satisfactory. 

Also listed in Table II are values for the “diamagnetic 
susceptibility” obtained by subtracting Xp (A5) from 


TABLE II. Experimental and'calculated susceptibility 
(X10? cgs vol. units). 


Li Na K 
xe 1.20* 0.64% 0.56" 
0.95 0.44 0.38 
xs 1.94-1-0.05 0.88-+0.03 
Xp! 1.33* 0.84* 0.73* 
1.08 0.66 0.55 
xj 2.082-0.10 0.95-£0.10 
Xalo) —0.13 —0.22 —0.17 
Xa? —0.14+0.15 —0.07+0.11 
EDU A ere: E 
a From Eq. (23). 


) 
b R. T. Schumacher and C. P. Slichter, Phys. Rev. 101, 58 (1950). 
e The ionic diamagnetism has been subtracted. 
4 From Eq. (A5). 


uJ. Callaway, Phys. Rev. 112, 322 (1958). 


= 


= ——»—-———————————————————————— OUEST 


nt 


fa 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


MAGNETIC PROPERTI ES OF 
Xo (23). Because of the nonadditivity of diamagnetic 
and paramagnetic effects, discussed below in more 
detail, these values correspond only roughly to the 
Landau-Peierls diamagnetism. There is some evidence!” 
that the diamagnetic susceptibility is only slightly af- 
fected by correlation; thus, much of the correlation con- 
tribution is subtracted off by this procedure. The close 
agreement between theory and experiment here is 
strong evidence in favor of a pseudopotential theory 
of these metals. 

-In the course of this calculation it was discovered that 
the same basic method has been applied by Samoilovich 
and Rabinovich? to study the diamagnetism of nearly 
free electrons; where comparable, their results agree 
with this calculation. However, due to the nonadditivity 
of the susceptibilities, the result of adding their expres- 
sion to Abe's differs from (23); indeed 


A= [Xa(S R) -+ Xp(Abe) = Xo(23) ]/ xtree 


Kj) 
) [ao(1 ao)? 


E 
—— Vee 


= sa Ph(ao)]. 
16 e(K 


For lithium, A=0.62, while for sodium, A=0.012. 
Therefore care must be taken in separating the para- 
magnetic and diamagnetic susceptibilities for compari- 
son with experiment. Since the diamagnetic suscepti- 
bility is not directly measured, it is believed that the 
estimates in Table II are more significant than those of 
Samoilovich and Rabinovich. The existence of this 
nonadditivity has been discussed in a general way by 
Blount. This feature would appear to be a source of 
considerable error in previous calculations of diamagnetic 
susceptibilities. 

In conclusion, a comparison can be made with the 
effective-mass approximation within the scope of the 
nearly free electron model. The susceptibility is given 
in terms of the effective mass ratio by? 


Xo/ Xote = 3 (m */ m) — 3 (m/m*) 2, (25) 


In the nearly free electron approximation m*/m, aver- 
aged over the Fermi surface, is 


m*/m=1-+ (1/460) 30’ V g(a)! tanh-1(ag)!^. (26) 
Combining (25) and (26) gives 
Xo/ Xo'ree 14 (5/8t9) 27^ V ^ (a9)! ? tanh-!(a)*, 
which differs significantly from (23). 
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APPENDIX 
Using (13’) and proceeding as in the body of the text 
we find 


p! — po 4E, + Pol 4-0", 


(A1) 
where 
Py = —mf y (m/2z h^ PPL Bel (£-FusH)) 

t BibvG— usl))], 
br =r Volm/2rh y) EBay (c+ noH )) 

E Baby (C — us) ], 
Do! = — 97 /1V è (m/ 2 ln YL Bi oy (£-- uH )) 


Bib —uH)) ], 
and 


1 chin 
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is an integral discussed in another paper.’ Itis convenient 
to handle &/", corresponding to the last term in (13^), 
somewhat differently. We first calculate z"(E), the 
inverse Laplace transform of the corresponding term in 
Z' (y)/3?. Then we obtain 


fo 
DESDE vef P Et i H)-F g(K, E—usH))dE, 
ðE 


9 


where fois the Fermi function and 


s) aa | Pk e(k— K)— e(k) J. 


eG) 
In the zero-temperature limit (41) becomes 
—&(T=0)=[1/0 (3) Jm/2zh?yi* 
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—&V ob Cot noH 2? + (£— poH i] 

Tr (45/8) V PL + noH) + (p — uH) 
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where ko is the free electron Fermi momentum. The 
chemical potential is obtained as before. Inserting (22) 
into (A3) leads to Abe's' expression 


x,/x,f*— 1+ (1/868) 


Vep 1-2 aa 
xE li in| — l, (A4) 
ae 
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where X,''**— 3ug'n/2£o, and u=K/2ko=1/+/ao. 
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Electron Paramagnetic Resonance of Fe** in the Strong j 
i Axial Field of PbTiO, Host 
|. 
a Denis J. A. GAINON 
Electronic Research Division, Clevite Corporation, Cleveland, Ohio 
i (Received 16 December 1963; revised manuscript received 10 February 1964) 
An electron paramagnetic resonance investigation has been made of single crystals of tetragonal PbTiO;. P 
A spectrum attributed to Fe?* was found which can be described by the spin Hamiltonain: $6— gug11,5; 
--nB(ITsSd-HyS,) with S=4, gi 2.0092-0.005 and g1=5.97+0.02 at room temperature. These values 
f are explained in terms of a strong tetragonal component of the crystalline field. 
f Lu 
} $ 
$ INTRODUCTION impurity in the three possible orientations of domains 
e (twinning of 180? being magnetically unobservable). 


EAD titanate is a ferroelectric which is isomorphous 
at room temperature with tetragonal barium 
titanate and possesses a Curie point at 490°C, where the 
structure changes to cubic symmetry. The cell dimen- 
sions are a=3.904A and c=4.150A at 20°C. Small 
um single crystals can be grown, but ceramics of PbTiO; 
E tend to fracture when cooled through the Curie point. 
'The purpose of this work has been to study the para- 
magnetic resonance spectrum of PbTiO; and to corre- 
late the results with the known properties of this 
ee substance. 


faa aby. 


MEASUREMENTS AND RESULTS 


Single crystals of PbTiOs, grown from a KF flux melt 
in platinum crucibles, occur in the form of small plates 
or cubes up to several millimeters which can be yellow, 
green, or black. It is not at present possible to grow 
large single domain crystals. 

Measurements were made primarily at room tem- 

perature’ using a Varian V-4502 spectrometer working 
at about 8.8 Gc/sec with a magnetic field modulation 
$ frequency of 400 cps. 
ý All the crystals studied show the same spectrum, 
$ which is essentially composed of three resonance lines. 
i The lines are relatively broad; the absorption derivative 
f peak-to-peak value is about 50 G. Variation of the 
orientation of the magnetic field relative to the crystal- 
lographic axes shows that each of the lines belongs to a 
paramagnetic impurity of effective spin S=} and 
follows the spin-Hamiltonian 


H= guBH SH 2.8 (HS.4-H,Sy) ? 


with 
£n 2.0092-0.005, 
£1—5.974-0.02. 


. The only difference between the three lines is the 
_ orientation of the Z axis which, for each line, corresponds 
_ to one of the crystal axes. It is therefore natural to 
: conclude that the spectrum is produced by the same 


- — YInyestigation of the spectrum up to 300°C and down to 
- —120*C did not produce any fundamental change. In particular, 

E the transition to a multiple cell structure reported at —100°C 
when cooling slowly (see Ref. 9) did not produce a transformation 
of the spectrum. 
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This is confirmed by the fact that the relative intensity 
of the lines is strongly variable from one crystal to 
another, showing a different distribution of the domains, 
The three lines can be roughly equal; but also one or 
two of them can be of weaker intensity or unobserved, 
particularly in the crystals occurring in the form of 
plates. Among the cubes, the most extreme departure 
from random distribution was found in a crystal 
possessing about 80% of the domains in the same 
orientation. 

The identification of g, is given by rotation of the 
magnetic field around one of the crystal axes. In this 
case the domains whose Z axes always remain perpen- 
dicular to the direction of H correspond to the orienta- 
tion independent resonance line. 


DISCUSSION 


The measurements show that the paramagnetic im- 
purity responsible for the spectrum is located in a 
position of local tetragonal symmetry and therefore 
probably substituted for an ion in the lattice. 

Several considerations allow us to identify this im- 
purity as Fe?*. The signal is relatively strong, and 
spectrographic analysis of nominally pure samples 
shows that iron is the impurity present in the highest 
concentration (between 0.05 and 0.5 at. %). Crystals 
intentionally iron-doped gave the same spectrum; no 
additional resonance lines appeared. The resonance of 
Fe* is relatively strong in the other titanium com- 
pounds of perovskite-type structure, BaTiO and 
SrTiOs which are grown basically by the same method. 
No trace was found of the five-line spectrum character- 
istic of Fe?* in tetragonal symmetry in any of the 
PbTiO; samples. The iron could possibly be in a reduced 
state (Fe** is expected to be unobserved at room tem- 
perature). The oxidation of Fe?* to Fe?* was apparently 
done without difficulty in BaTiO3.2 In PbTiOs, heating 
in air for several hours, at temperatures up to 1000°C, 
then in oxygen (flow of 3.25 l/min for 40 h at 800°C) 
did not change the spectrum, suggesting that the iron 
was in the trivalent state prior to the oxidation- 

2 A. W. Hornig, R. C. Rempel, and H. E. Weaver, Phys. Chem. 


Solids 10, 1 (1959). 
3 K. A. Müller, Hely. Phys. Acta 31, 173 (1958). 
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EPR OF Fe?+ 


Although other ions which may be paramagnetic are 
present in the crystal, i.e., titanium, platinum,‘ niobium, 
and copper,” they are not expected to have the observed 
g values and spectral intensity. 

Because of size considerations, Fe^* is expected to 
substitute for Ti'* in the lattice, and its paramagnetic 
spectrum can be explained in terms of a strong ‘axial 
field. The problem was theoretically studied by Griffith. 
Fe in its high-spin form is in the 555,» state. Under 
the action of a tetragonal crystalline field, this ground 
state splits into three twofold degenerate levels and its 
behavior in the presence of a magnetic field can be 
described by an adequate spin-Hamiltonian. If, how- 
ever, the coefficient D of S? in the spin-Hamiltonian, 
proportional to the zero-field splitting, is much larger 
than the magnetic field energy, the lowest doublet is not 
mixed with the highest ones and the spectrum is de- 
scribed by a new spin-Hamiltonian with S=}, gu 2&2, 
and g,'Zz6. This effect was observed by Griffith and 
Bennett el aL, but with relatively low precision in a 
few organic compounds (haemoglobin derivatives). 

Several facts help to explain the large zero-field 
splitting of Fe** in PbTiO;. The work of the above 
authors suggests that a D of at least 1 cm™ is required 
to observe a true g, of 6 if the minimum wavelength 
available is about 3 cm. Although studies at shorter 
wavelengths would provide further information, a 
minimum D of 1 cm: seems consistent with the crystal- 
line structure of PbTiOs. The relation between D and 
the coefficient A? of Y? in the development of the 
crystalline field potential is not completely understood, 
but an extrapolation of the tentative quadratic relation 
given by Nicholson and Burns? leads to an A,’ at least 
two times larger in PbTiO; than in BaTiO;. This is in 
accord with the main feature in, PbTiOs, which is the 
large displacement of the Pb and Ti ions, leading to 
& strong tetragonal component. Pb moves through 
0.47 A (0.05 A for Ba in BaTiO;) and Ti through 0.30 A 
(0.10 A in BaTiO;) relative to the almost undistorted 
framework of oxygen ions. The ratio c/a of the cell 
dimensions is 1.063 in PbTiO;, about six times larger. 
The spontaneous polarization calculated from ionic 


* Z. Sroubek, Z. Zdansky, and E. Simanek, Phys. Stat. Solidi 3, 
165 (1963), observed in BaTiOs at 78°K a signal that they 
attribute to Pt?*. Their crystals, however, were doped with about 
0.4 at. 95 niobium, which acts like a donor, to reduce the valence 
state of diamagnetic Ptf*. The fact that our samples contained 
only about 0.03 at. % niobium is additional evidence that the 
Spectrum is not due to Pt?*. : s 

5 In some crystals a weaker signal is observed, following the 
same spin-Hamiltonian as the strong one but with gi 12.002-001 
and g1—2.244-0.01. This may be eventually attributed to Cu. 

6 J. S. Griffith, The Theory of Transilion-Metal Ions (Cambridge 
University Press, Cambridge, 1961), Chap. 12. E 
* 1J. E. Bennett, J. F. Cibeon, and D. J. E. Ingram, Proc. Roy. 

oc. (London) A240, 67 (1957). 
$ W. le d and = Burns, Phys. Rev. 129, 2490 (1963). 


IN STRONG AXIAL FIELD OF PbTiO; 


H (KGAUSS) 


Fic. 1. Angular dependence of the three resonance lines for 
rotation of the magnetic field around one of the symmetry axes. 
Curve (a) corresponds to domains whose Z axes always remain 
perpendicular to the magnetic field. Curves (b) and (c) correspond 
to domains whose Z axes lie in the plane of rotation of H. 


displacements is about three times larger than for 
DaTiO;? Thus, the observed EPR spectrum of Fett 
appears consistent with the strong tetragonal crystalline 
field of PbTiO;. It should be noted that possible distor- 
tions due to the noncompensated charge of Fe?+ may 
locally modify the crystal field. 

After this work was completed we learned from 
Müller? that similar resonances have been observed in 
cubic SrTiO; which have an intensity about 0.15 times 
that of the central transition of the cubic spectrum. 
Accurate determination of g, at two different frequencies 
allowed these workers to calculate a value of 
|2D| —2.85-:0.15 cm~. They concluded that the only 
possible explanation for such a large D is that local 
charge compensation takes place at a nearest oxygen 
site. An interpretation of the PbTiO; spectrum in terms 
of local charge compensation would imply (1) that this 
charge compensation occurs only along the direction of 
polarization (since the resonance lines are of unequal 
intensities) and (2) that all the Fe^* ions are locally 
compensated in an equivalent manner (since no other 
spectrum due to Fe** is detected). 
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Stimulated Phonon Emission by Supersonic Electrons 
and Collective Phonon Propagations 
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(Received 23 December 1963; revised manuscript received 27 January 1964) 


The emission of phonons by electrons with supersonic drift velocities is examined, and the crystal mo- 
mentum transferred to a piezoelectric phonon band is determined. This momentum is transferred principally 
to a band of phonons which are in a cone centered about the direction of electron drift and of frequencies 
approximately 10-10" cps. This momentum-transfer process is stimulated in the sense that it amplifies 
any variations in the phonon momentum distribution. In particular, it will amplify a collective wave in the 
“phonon gas” which corresponds to a sound wave in a particle gas. The collective wave is described in terms 
of a local particle drift velocity and a local temperature. In a phonon gas, temperature plays the role that 
pressure plays in a particle gas. By the use of energy and momentum conservation equations, it is shown that 
such a collective wave is possible when the usual large losses are overcome. The derived gain and expected 
loss of the collective wave are shown to be comparable in CdS at room temperature. The current saturation 
observed when the electron drift velocity exceeds the sound velocity appears in the self-consistent solutions 


of the electron and phonon systems. 


I. INTRODUCTION 


HE observation of an anomalously slow phonon 
propagation has recently been reported in CdS 
crystals under conditions of acoustic gain.! This has been 
interpreted as a collective phonon propagation similar 
to the"collective phonon wave which describes second- 
sound propagation in super-fluid helium. In nonconduct- 
ing solids the collective phonon wave in a “phonon gas” 
could be considered analogous to a sound wave in a 
molecular gas. The collective wave would then be a 
density fluctuation in the phonon distribution, much 
like a sound wave is a particle density fluctuation in a 
particle gas. In a macroscopic description an increase in 
phonons gives rise to an increase in temperature. Tem- 
perature plays a role in phonon transport similar to 
pressure in particle gas transport. The collective phonon 
wave could be described equally well as a temperature 
fluctuation wave. 
The propagation of such a collective phonon wave in 
a solid has been examined in some detail.? It was found 
that umklapp collisions (which have no counterpart in 
molecular gases) would destroy the crystal momentum 
of the phonons and damp the collective propagation. In 
solids, at all but lowest temperatures, this damping 
term is much larger than the harmonic term, and dif- 
fusion, (i.e., normal thermal conductivity) rather than 
harmonic propagation, occurs. At very low temperatures 
the umklapp collision rate is greatly reduced and collec- 
tive waves may be possible, but the reduction in normal 
collisions would then tend to disrupt the waves. Normal 
collisions are necessary to maintain the collective nature 
of the wave, just as particle collisions are necessary for 
‘sound propagation in particle gases. These two condi- 
tions, long umklapp relaxation time and short normal 


1H. Kroger, E. W. Prohofsky, and R. W. Damon, Phys. Rev. 
Letters 11, 246 (1963). 

*E. W. Prohofsky and J. A. Krumhansl, Phys. Rev. 133, 
A1403 (1964). E. W. Prohofsky, thesis, Cornell University, 1963 


: — (unpublished). 
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relaxation time, are extremely difficult to satisfy 
simultaneously. 

In a semiconducting solid, particularly those with 
piezoelectric phonon modes, one can transfer energy and 
crystal momentum from electrons to the phonons. It has 
been shown by Hutson, McFee, and White? that for 
ultrasonic waves, ie., low-frequency phonons, the 
internal losses of the phonons can be more than com- 
pensated for when v4? v,. Here, va is the electron drift 
velocity and v, the sound velocity. If such a mechanism 
could transfer crystal momentum to phonons capable of 
participating in a collective wave, which would over- 
come the internal losses, it would be possible to propa- 
gate the type of collective phonon wave discussed. 
Whether or not a particular phonon mode could partici- 
pate in a collective wave would depend on how strongly 
coupled the mode was to the bulk of the collective wave 
by the mixing collisions. 

The transfer of momentum or energy to phonons has 
been examined by White* for low frequencies, i.e., 
qA <1, where q is the phonon wave vector and A is the 
electron free path. In this range, the electrons are able 
to equilibrate to their local surroundings. The best 
description of this system is in terms of the classical 
fields and currents used by White. For higher frequency 
phonons, where qA 1, a better description of the system 
involves quantum mechanical electron and phonon 
states which undergo collisions. Pippard® has discussed 
the interaction mechanism for high-frequency phonons 
from a degenerate electron band with imposed drift 
velocity, va? v,. Spector has derived the amplification 
of high- and low-frequency phonons using a free-electron 
gas model for conduction of electrons in metals. Neither 
of these approaches is ideally suited to examine the 
highly nondegenerate-electron distribution which exists 


* A. R. Hutson, J. H. McFee, and D. L. White, Phys. Rev- 
Letters 7, 237 (1961). 

* D. L. White, J. Appl. Phys. 33, 2547 (1962). 

* A. B. Pippard, Phil. Mag. 8, 161 (1963). 
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STIMULATED PHONON EMISSION 


in CdS under conditions of acoustic gain. Paranjape? 
has discussed the amplification of high-frequency sound 
waves using a quantum mechanical approach suitable 
to nondegenerate electron distributions. 

This paper will examine the transfer of momentum 
to high-frequency phonons using a golden-rule scattering 
integral technique suitable to nondegenerate-electron 
distributions in direct semiconductors. This momentum- 
transfer mechanism will, in turn, be used to evaluate 
the momentum transferred to collective phonon dis- 
turbances. It is believed that the high-frequency 
phonons will be most important in the collective wave 
behavior for two reasons: The number of higher fre- 
quency modes is much greater than that of low- 
frequency modes, and the mixing collision rate should 
be much less for low frequencies. This drop-off of mixing 
collision rate will supply a lower limit to the phonon 
frequencies involved in the collective wave. Those modes 
which have a lifetime longer than the period of the 
collective wave cannot be equilibrated within the collec- 
tive wave. 

In Sec. II, the first-order stimulated phonon emission 
and momentum-transfer rates are evaluated for a model 
in which there is a piezoelectric phonon mode and a 
nearly free, nondegenerate-electron band. 

The results of Sec. II show that the electrons are 
strongly coupled to phonons of moderately high fre- 
quencies, but the interaction drops very rapidly beyond 
a certain cutoff frequency due to the mass mismatch be- 
tween electrons and high-energy phonons. Energy and 
momentum transfer, upon collision, is most efficient for 
particles of roughly the same mass. An apparent phonon 
mass is 

my hv/vg , (1.1) 


where / is Planck’s constant, v the frequency, and v, the 
velocity of sound. The frequency most strongly coupled 
to electrons is that for which mp=me, where m, is the 
electron mass. This frequency is roughly y=10" cps. 
The actual peak frequency found in Sec. II is very close 
to this value, but varies with temperature as it is effected 
by such factors as the available phonon and electron 
phase space and the average electron energy. The fre- 
quency dependence of the efficiency of energy and 
momentum transfer is greatly modified when the 
coupling has a frequency dependence, as is the case in 
deformation potential scattering. 

The cutoff of the interaction for high-phonon fre- 
quencies enhances the probability of collective wave 
propagation. The phonon momentum losses rise sharply 
with frequency. Pumping momentum into a low-fre- 
quency band can give rise to a situation similar to that 
encountered in anomalous thermoelectric power.* The 
situation is illustrated in Fig. 1. A low-frequency band 
of phonons becomes “hot” with respect to the bulk of 


* H. N. Spector, Phys. Rev. 127, 1084 (1962). 
* B. V. US Phys. Letters 5, 32 (1963). 
5 C. Herring, Phys. Rev. 96, 1163 (1954). 
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Fic. 1. Crystal momentum relaxation flow chart, Electrons 
accelerated by an electric field lose crystal momentum in collisions 
with optical and low-frequency acoustic phonons. For low electron 
drift velocities the relaxation to optical modes dominates. When 
7427, the coupling to low-frequency acoustic modes becomes 
dominant. This low-frequency band can become hot since the 
momentum flowing into it can be greater than its losses to the 
high-frequency modes. 


the thermal phonons. The losses from this band are slow 
because of a bottleneck which keeps these phonons from 
equilibrating with higher frequencies. The high-fre- 
quency umklapp relaxation time is very fast, and the 
thermal phonons remain uneffected in the propagation 
of the lower-frequency band. 

The emission process for 247» v, is somewhat similar to 
Cerenkov radiation. As described in Sec. II, it, too, is 
limited to a forward cone centered about the direction 
of electron drift. The active phonon band which may 
support collective oscillations is a mediurn-frequency 
band of phonons occupying a cone in phonon-phase 
space centered about va. 

The expression for the crystal-momentum transfer, 
found in Sec. II, is only valid in a region where v42-9,. 
The use of perturbation theory appears valid, as the 
dimensionless coupling parameter described by Froelich’ 


15 
h "fe ep rI? 
“lal Ger" bs] cnm 
2pv d d JL ph 


where p is the lattice density, e the electronic charge, $ 
the proper piezoelectric constant, and d a dielectric con- 
stant. The parameter is certainly small for frequencies 
of the order of 10-10" cps. For low values of electric 
field, the electron relaxation is mainly due to collisions 
with optical modes. This relaxation time is roughly 
10-2—10-? sec, and the inequality gA> 1 is also satisfied 
for phonon frequencies of the order of 10" cps. 

The problem of the normal collision rate is discussed 
in Sec. III. The collisions between phonon states in the 
active band are primarily due to higher-order electron 


9H. Froehlich, Advan. Phys. 3, 325 (1954). 
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Fic. 2. Energy and momentum conserving phonon emission. 
The phonon dispersion relation w= ħv.q is superimposed on the 
electron dispersion relation E= /?&?/2m. A possible phonon (w,q) 
can be emitted in an energy and momentum conserving process if 
(1) E;—ho--Er, and (2) b; q--kr, where i and F refer to the 
initial and final electron states. All such double intersections insure 
that these two conditions are satisfied. The double intersections 
cease when the line is tangent to the curve when E; — Er and the 
velocity of this state is equal to the vleocity of sound. 


—k 


kf 


phonon collisions. The electrons, in this case, play a role 
very similar to that of the rotons in liquid helium. The 
same conditions which bring about large momentum 
transfer (va? »,) cause this second-order collision mecha- 
nism to have a focusing effect on the phonons. That is, 
it tends to make the final phonon state more likely to be 
within the gain cone than the initial phonon state. The 
inability of the electron to transfer large amounts of 
energy to the phonons tends to keep the final state 
within the frequency limit of the active phonon band. 
In Sec. IV, the equations for energy and crystal mo- 
mentum conservation are applied to the active phonon 
band; the equation of conservation of momentum is 
applied to the electrons; and the interaction developed 
in Sec. II is used to couple the equations. A zero-order 
solution exists which describes the equilibrium distor- 
tion in the electron distribution (current), and the 
phonon distribution (heat flow) under the high electric 
field conditions. This solution shows a marked current 
saturation similar to that first reported by Smith. In 
addition, solutions for displacements about this equilib- 
rium exist which correspond to the collective propaga- 
tion discussed. Throughout this section the excess 
energy stored in the electron distribution is ignored 
compared to that stored in the phonon distribution. 
Plasma effects are ignored because the electron density 
-is assumed very small. 
ET In Sec. V, numbers such as electron mass, piezo- 
e ee constant, etc., are substituted into the equations 
_ that have been developed. The gain of the system is then 
‘compared to the expected phonon crystal momentum 
losses. 


P 


R. W. Smith, Phys. Rev. Letters 9, 87 (1962). 


CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


Digitized by Arya Samaj Foundation Chennai and eGangotri 
W. PROHOFSKY 


Il. STIMULATED PHONON EMISSION py 
ELECTRONS IN PIEZOELECTRIC 
MATERIALS 


In the quantum limit, i.e, g-A>1, interactions 
involving electrons and phonons must conserve energy 
and crystal momentum modulo 7G (G being a reciproca] 
lattice vector). The possible initial and final states in 
which an electron emits or absorbs a single phonon in a 
normal collision (i.e., 7G=0) can be determined by 
using a diagram similar to the one in Fig. 2. The curved 
line represents a dispersion relation for an electron jn 
one dimension and is assumed to be a quadratic function 
of the momentum. The curve in Fig. 2 could be related 
to excited electrons in a direct material such as CdS. The 
straight line represents the acoustic phonon-dispersion 
relation; the slope equals the velocity of sound. 

If the origin of the phonon line were placed at any 
point on the electron curve, a second intersection by the 
phonon line would imply conservation of energy and 
momentum between the initial and final electron states 
determined by the two intersections. All possible inter- 
actions correspond to some possible placements of the 
curves that result in two intersections. It can be seen 
that such intersections cease when the phonon line is 
just tangent to the electron curve. At this point, the 
energy and momentum of the emitted phonon have 
decreased to zero. The velocity of this limiting electron 
state is equal to the velocity of sound. For emission, the 
initial electron state must be on the high-energy side of 
the tangent point (i.e., v, v;), and the final state must 
be on the low-energy side (i.e., v,« v;). For absorption, 
the initial and final states are reversed. In the case of a 
quadratic-electron-dispersion relation, the initial and 
final k values are equidistant from k’, the location of the 
tangent point. 

In a three-dimensional solid, the dispersion relation 
for electrons is a bowl-shaped hypersurface in four- 


Fic. 3. Electron population inversion with respect to phonon 
emission. (This mechanism has been discussed by Pippard, Ref. 5.) 
The dashed „Circle represents phenomenologically the electron 
distribution in 2 space. The dashed line at k/ represents those 
states whose velocity is equal to the sound velocity in the z direc- 
tion. Emission or absorption of a phonon with q=— ôk involves 
initial and final states which are symmetrically spaced about the 
plane k,’. In the figure on the left the electron distribution is at rest 
and there are fewer electrons on the high-energy side of k+’. In the 
figure on the right the electron drift velocity is equal to the soun 
velocity and there are as many electrons on the high-energy side 
of Ej. For greater drift velocities there will be more electrons on 
the high-energy side 
respect to emission of phonons in the z direction. 


giving rise to a population inversion with 
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Fic. 4. The extent of population inversion for phonons not 
parallel to vg. The solid-line sphere corresponds to the velocity of 
sound in any direction. The dashed sphere represents the electron 
distribution with a large drift velocity v4 —9,--e;. Phonons oriented 
at an angle 0 will see a population inversion for e(0)>0, where 
€(9) 27a cos0—v. (0). 


dimensional space. Since only one phonon is involved in 
these processes, we may still use a one-dimensional 
phonon-dispersion line to examine the emission of 
phonons with one orientation. The origin of the phonon 
line is placed at a point on the electron-dispersion 
surface, and a second intersection implies a possible 
momentum and energy conserving process. The line and 
hypersurface will have a tangent point which will play 
a role similar to the tangent point in the one-dimensional 
analysis. 

Based on the above discussion, it can be seen that a 
surface could be drawn in electron k space perpendicular 
to the direction of the emitted or absorbed phonon wave 
vector q at k’, where v.(k’)=vy,. All emission processes 
have the initial electron state on the high-energy side 
of this surface and the final state on the low-energy side. 
The change in electron wave vector, Ak— —q, is exactly 
bisected by this surface when the electron dispersion 
relation is quadratic in the momentum. 

Pippard? has pointed out, in arguments based on a 
diagram similar to that of Fig. 3, that the phonon- 
emission rate could be greatly enhanced if the electron- 
drift velocity vg were made greater than the sound 
velocity. In such a case, the centroid of the electron 
distribution in k space is outside the surface described in 
the preceding paragraph. Therefore, more electrons in 
States of high energy are opposite holes on the other side 
of this surface, and electron transition to the lower 
states is very probable. One can speak of a population 
inversion in k space with respect to phonon emission. 

The emission of phonons for a given orientation In 
space will be shown as a first approximation to be 
Proportional to a parameter e, where 


(2.1) 


€— 734—995; 
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where vq is the electron drift velocity and v, the sound 
velocity. For different phonon orientations, as those 
whose wave vector make an angle @ with the direction of 
electron flow, the emission is proportional to 


(0) = va cosd—2,. (2.2) 


One can define an angular cone in phonon q space in 
which phonon emission will be enhanced. This is 
demonstrated in Fig. 4. 

All the collisions discussed are normal, and the effect 
of such collisions is a transfer of momentum from one 
system to the other. The population inversion, when 
747 v,, favors momentum transfer to the phonons in the 
“gain cone” from the electrons. For well-defined elec- 
tron and phonon states, the rate of phonon emission is 


ON 4| 2v : 
anl x PENOJ f(7)] 
! eol 1 RE 
-NOS - (03) 


Xâlk' —q—k)ôlEr—ħo— 8,), (2.3) 


t 


where N (q) is the occupation number of phonon mode 
q, f(k) is the occupation number of the initial electron 
state, and f(k’) is the occupation number of the final 
electron state. The factor g is the coupling constant, or 
matrix element, for the coupling of the electron transi- 
tion and the phonon. This energy can be written as 


xe LN e: 
EN 
2po, d 


where x is a normalized phonon displacement, e the 
charge on the electron, p the appropriate component of 
the piezoelectric tensor parallel to the phonon displace- 
ment, d the appropriate dielectric constant, p the lattice 
density, and w, the frequency of the phonon mode q. 

For high temperatures we assume V(q)>1. This is 
equivalent to ignoring the spontaneous emission in 
comparison to the stimulated emission. 


oN 2c 
—| &e—2XgNDfG)— Sk] 


Ot loo. Å kk 
x 6(k’—q—k)6(6;-—hw— b). (2.5) 


For a quadratic electron dispersion relation, 


k= (m*/h)v.t (q/2)+kzy, (2.6) 
k'= (m*/h)v.— (q/2) +key: (2.7) 

For drift velocities near the velocity of sound let 
T= Urt E. (2.8) 


One may assume that the electron distribution under 
high field is a Fermi distribution centered at 
k= (m*/ħ)va. That is, 


JEln*/h)va]= fo(0), (2.9) 
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where fo(k) is the equilibrium Fermi distribution. 
Substituting Eq. (2.8) into Eq. (2.5), and summing 
over kx’, ky’, ki’ to eliminate the ô(k) function, 


aN (q) 
OL ie 


2:20 


2 gN (q) 


Sr x 
ne) 
(2255) eae 

SEC DEN gt ey 


=l 
Xb(6h—ho— 6,) CS Je gN (q) 


mh k 


ð fo m* 
K— grad, E: = 2e6(6p —ħw— Ex) (2.10) 
OE h 


as fo has reflection symmetry. 
Summing over k, to eliminate (6), 


aN (q) 1 q 
"om in i ENON) 
ONUS 2722 dike 2 
q 1 m* 
| (E+ tn) E- —2edkdky. (2.11) 
2 KT hh 
d After replacing the remaining sums by integrals and 
a; carrying out the integration over dk.dk, we obtain 
aN (q) 11 
| = ARNO 
9t 'col 2m h^ 


hg? Er 
x i-am ( E- )]. (2.12) 
16m*KT 2KT. 


; The total crystal momentum transferred via this process 
Ki is 


i] ð oN (q) 
I —(hq) => hq (2.13) 
ot col a ot col 
Replacing the summation by an integral, Eq. (2.13) 
13 becomes 
9(7iq) 1 vn 
—| = 3^ g*hg cosON (g)e(8 
Ot leor 167524 J Nw) 
ng? Er 
xim S ) ee sinód6dq. (2.14) 
Be 16m*KT 2KT. 
E The interesting range of @ runs from zero to 6’, where 6’ 
j descri es a gain cone in q space, and 
E e(0')= (v, +e) cos&/—v,—0, 
E ; ; (2.15) 
* heres cos = : 
we 1+<¢/v, 
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Itis assumed that the value e/v«1 to be consistent with 
previous approximations, therefore 


0 = (2e/v,)? «1, (2.16) 


and the phonons involved in the interaction do not 
deviate greatly in the direction of propagation. There- 
fore, to first order, one can use a single component of the 
piezoelectric tensor in Eq. (2.4) for all the phonons in 
the cone. 

One can express the effects of the angular integration 
of Eq. (2.14) in terms of an averaged value of e(0). 


6’ e 
¿= anf cosĝe(0) sindo = 2y— , (2.17) 
0 Va 
to lowest order in e/v,. Equation (2.14) then becomes 


à (iq) el «ree? 
li = m*?g? [xan 
Ot 3001 8a? h4 Vs 


hg? Ep 
xfi tanh( ) ea. (2.18) 
1l6m*KT KT. 


For a nondegenerate electron gas the term in the square 
brackets can be simplified as 


h?g? Er 
1—tanh (M ) 
16m*KT 2KT 
Tin 1—tanh (/?g?/16m*K T) 
= T , (249) 
2(2m* KT)? 1-- tanh (J292/16m*K T) 


where z is the electron density. A further simplification 
occurs if we define a temperature T, by 


KT,—im*w; (2.20) 
and Eq. (2.18) becomes 
9(7iq)| ar i e 
— eas P Ur) n [ a) 
ðt col 8 h Us 
1— tanh (22v,?9?/32K T,K T) |g?d. 
xL (hi?0,2q?/ )Ja’dq . Q1) 


1+ tanh (/2v,?9?/32K T,K T) 


We now allow for a distortion in the phonon field 
characterized by the phonons within the gain cone 
having a net drift velocity 2. In this case, the distorted 
phonon probability distribution becomes 


1 
N(q)=No(q)+2.- Ag No (Not1), CW) 


where No refers to the equilibrium Bose distribution. 
It is now possible to separate the momentum transfer 
process into two parts, i.e., 


alha) e € 
pu 


9L logi Vs v,? 


(2.23) 
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a qm epN? / TN 
OQ 
2 pv, \dh Ts 


E ; Ic tenh[ (1/32) (7/722 s 
e*—1 1+ tanh[(1/32)(T/T,)x?] 


m? EA) -) 
Exe em bm (x)dx 
2 po, Nd (= [re , 


q?/epN? m* 7 T \ 3/2 
wil) 
8\dh/ pv, NT, 


xf x 1—tanh[ (1/32) (T/T,)x ]dx: 


(2.24) 


x? csch?- 


2 i-ctanh[ (1/32) (T/T.)a*]) 


m^ /epN? m* 7 TN 312 
-“(<) —(—) n [ rela, (2.25) 
2\dh/ pv, NT, 


x=ħvq/KT. (2.26) 


The first term on the right-hand side of Eq. (2.23) 
describes spontaneous emission—spontaneous in the 
sense that the drifting electron distribution loses 
momentum to a nondrifting phonon distribution. The 
second term is a stimulated-emission term—stimulated 
in the sense that momentum is transferred at a rate 
determined by the net momentum present in the phonon 
system. This stimulated term describes a process whose 
net effect is a linear amplification of the net local mo- 
mentum of the phonon distribution. The function r(x) 
is greater than r;(x), and the “stimulated” emission 
term can be comparable to the “spontaneous” one. 

The function, (T/T,)?r;(v), is plotted in Fig. 5 for 
several values of T/T.. The values of T/T, correspond 
to helium, nitrogen, and room temperature for free 
electron masses, and room temperature for an effective 
mass of 0.2. At low-phonon frequencies gA « 1 this 
quantum approach is not valid. A more valid approach 
at these low frequencies is White's, as previously dis- 
cussed. The increase in the gain for phonons (as T/T, 
increases) reflects the increased stimulation due to the 
presence of greater numbers of phonons. The shift in 
the frequency of maximum gain reflects the fact that the 
electrons have more energy and are more capable of 
emitting higher energy phonons. 


where 


II. NORMAL PHONON COLLISIONS AND 
HIGHER ORDER ELECTRON PHONON 
COLLISIONS 


Phonons undergo collisions with other phonons via 
the anharmonic terms of the interatomic forces. Colli- 
sions in which no net crystal momentum 15 Jost are 
called normal collisions. Such normal collisions are 
Necessary for the propagation of a collective wave aad 
establishing a local temperature. The collective wave 
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Fic. 5. Relative stimulated momentum gain as a function of 
phonon frequency. The function r2(v) is plotted for T —4T,, 78T,, 
300T,, and 1500T,. These temperatures correspond roughly to 
liquid helium, liquid nitrogen, and room temperature for m* 2m, 
and room temperature for m*~0.2m,. 


examined in this paper, however, must be propagated 
within a restricted band of lower-than-thermal fre- 
quencies, since the momentum loss of thermal phonons 
is much greater than the momentum gain available to 
the system. As pointed out in Sec. I, only the slow rate 
of equilibration of these low frequencies to the thermal 
reservoir makes the collective wave possible. Normal 
collisions in which momentum is transferred to phonon 
states outside of this restricted band will dissipate the 
momentum of the collective wave and give rise to 
damping. Instead of colliding among themselves, the 
low-frequency phonons within the restricted band are 
more likely to collide via the anharmonic perturbations 
with thermal-frequency phonons due to the much larger 
number of available states at higher frequencies. These 
collisions play the same role in this restricted-band 
collective wave as do the umklapp collisions in thermal 
phonon collective waves. The rate of these processes 
proceeding in the absence of carriers, i.e., dark attenua- 
tion in photoconductors, will be an estimate of the mo- 
mentum losses from the restricted band to the thermal 
reservoir. 

The slow rate of relaxation of the hot phonon band 
gives rise to a relaxation bottleneck. The phonons 
within this hot band are more likely to undergo further 
collisions with electrons before relaxation can occur. 
Successive electron-phonon collisions or higher order 
electron-phonon collisions are the fastest mixing colli- 
sions in the band and will set up a somewhat distorted 
equilibrium within the hot phonon band. The essential 
mixing collisions necessary for the propagation of the 
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1G, 6. (a) First-order phonon emission. (b) Phonon absorption 
followed by emission. 'The intermediate state need not conserve 
energy if it is short-lived. (c) Phonon emission and absorption. The 
existence of an intermediate state is not specified. 


collective wave will be those diagrammed in Figs. 6(b) 
and 6(c). There will be some contribution from collisions 
[Fig. 6(b)] in which the intermediate state need not 
conserve energy if it is short lived. 

The effect of large electronic drift velocities on these 
second-order electron-phonon processes can be examined 
qualitatively by describing the processes in a manner 
analogous to the emission problem of Sec. II. The 
change in electronic configuration in these second-order 
processes is the same as it would be if the process 
were an emission or absorption of an apparent phonon 
with energy and momentum equal to the differences in 
energy and momentum between the initial and final 
phonon states. The enhancement of a particular inter- 
action occurs where va> Va, where v, is the apparent 
emitted phonon velocity and is 


Aw Vs 


Ag 1+(q:/Ag)[1—cos(y—6)]’ 


where qı, Ag, v, and @ are illustrated in Fig. 7. A gain 
cone extending over an angle y’ for these second-order 
processes can be defined where 


(3.1) 


1 
1-- (p/Ag)E1— cos(&—9)]' 


where cos@’ is given in Eq. (2.15). Because of the elec- 
tron population inversion, all interactions involving 
States qə whose vectors terminate on the c; line inside 
¢ will tend to be apparent emissions, since the net 
interaction is more probable if q2 is the final state and 
qi the initial state. For qz terminating outside e/ the 
met interaction favors transitions with qx as initial 
State and q; as final state. On the basis of this analysis 
_ it is seen that the mixing collisions tend to keep the 
© excess phonon population within the gain cone where it 
— is amplified, rather than outside the gain cone where it 
could be quickly damped. Furthermore, the focusing 
effect of the second-order collisions contribute to the net 

momentum gain of the phonon distribution. 
One can easily obtain a lower limit of the second-order 
scattering from those intermediate states kz [ Fig. 6(b)] 
ich energy is conserved. The second-order matrix 


cosy’ — cos" (3.2) 
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element from second-order perturbation theory is 


(ka| V"| Rigs) (aga | V? | ko) 
V?| Egi) - 22 = 7 , 
k2 E(kı)+hwi— E (k3) —i/21 


(Esq (3.3) 


where T' is an energy related to the finite lifetime of the 
intermediate state. That group of states ko in Which 
energy is conserved has a resonant energy denominator, 
and the matrix element takes on a Breit-Wigner form, 
In the case of strongly enhanced first-order processes, 
the main part of the broadening will come from processes 
like “a” in Fig. 6. In this situation, the contribution to 
the scattering by the energy-conserving intermediate 
states will be approximately equal to the rate of first- 
order absorption processes. The final states of these 
higher order processes will lie in the same frequency 
band as the first-order emission processes. The contribu- 
tion to mixing, from processes in which the intermediate 
state is not an energy conserving one, is expected to be 
small. This is because of the small size of the parameter 
a defined in Eq. (1.2) which determines the relative size 
of terms in this perturbation expansion. 

A better understanding of the relative magnitudes of 
the emission and absorption rates can be achieved by 
examination of Fig. 3. When va= v, the electron distribu- 
tion is symmetrically distributed such that the proba- 
bility of absorption and emission are equal. The rate of 
both processes is very large because the energy and 
momentum in both the electron and phonon systems are 
ideally distributed for such interactions. The net trans- 
fer of momentum or energy is zero as the two processes 
cancel each other. An increase in drift velocity gives rise 


Fic. 7. Higher order electron phonon processes. These processes 
can be described in terms of the emission or absorption of apparent 
phonons. Two such processes are shown which have different 
changes in Wave vector g2—qi or q»/—q: for a fixed energy differ- 
ence wz— w1. This is equivalent to a variable apparent velocity. The 
cone ¢’ is the gain cone for these apparent phonons. Emission or 
absorption of these apparent phonons will be more probable 
depending on whether ge is inside or outside €'. 6’ is the gain cone 
for first-order processes. 
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to a net phonon gain by increasing the emission rate 
relative to the absorption rate. The mixing rate, which 
is limited by the slower of the two processes (absorption 
or emission), would then be decreased due to the de- 
crease in the absorption rate. If successive first-order 
processes are the dominant contribution to the mixing 
collisions, an increase in net gain will be accompanied by 
a decrease in the mixing rate. The ability to propagate 
a collective wave will depend on achieving a delicate 
balance between the gain and the mixing rate. 

Higher temperatures will broaden the electron dis- 
tribution and therefore reduce the sharpness of the 
population inversion for a given electron drift velocity. 
The phonon momentum gain per phonon mode actually 
decreases with increasing temperature [see Eq. (4.35) ]. 
Higher temperatures will therefore tend to increase the 
mixing collision rate; lower temperatures will increase 
the gain. For a given drift velocity there is a temperature 
in which the two conflicting requirements for collective 
propagations may be simultaneously satisfied. Higher 
temperature will reduce the gain, and lower tempera- 
tures will reduce the mixing rate. 


IV. CURRENT SATURATION AND COLLECTIVE WAVES 


An equation for conservation of electron momentum 
can be derived from the Boltzmann transport equation 
and written as 


ð O18 UAE 
(hk) d3- — (effi) — ( Y= an] A 
ðt Ox; h NOR; 


col 


where the brackets refer to expectation values over the 
electron distribution. For a spatially uniform electron 
distribution, the second term is zero. When the drift 
velocity is less than the velocity of sound, the collision 
term can be simplified by a relaxation time approxima- 
tion and becomes 


Atig| eoi (— nmva/ vo) , (4.2) 


where 2m is the total electron mass per unit volume and 
To is the electron relaxation time. For materials such as 
CdS at room temperature, the electron relaxation is 
mainly due to collision with optical phonon modes. That 
part of the relaxation which is to optical modes, should 
remain relatively unaffected when vg exceeds v.. If we 
add the collision terms of Eq. (2.23) to represent the 
greatly enhanced part of the relaxation, Eq. (4.1) 
becomes 


ð Ee jaf nma E EA 2 
“nma —— ) == SR Rr (4.3) 
9t h NOR TO Us Us 


and, using the usual definition of mobility a, this 
becomes 


à nmyuyE — nma e € 
—nmva— =— -R Rr . (44) 
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Equations for the conservation of energy and crystal 
momentum for the “hot” phonon band can be similarly 
derived. The conservation of energy is needed in this 
case, as the phonons are the thermal and inertial 
reservoir of the combined electron-phonon system. The 
conservation equations are 


ð qom. 
—q)-- —(/h9;) = (Migi)on, (4.5) 
ðt Ox; 
ð ð 
—(ho,9)-+—(o;ho,q) = (At. eet. (4.6) 
Ot Ox; 


The variation of energy in the restricted phonon band 
can be specified by an effective temperature 


T=TotTy, (4.7) 


where T, may fluctuate. The distribution function for 
this group of modes then becomes 


N(q)=[exp|fe.q—2-q/K(T+T))|-1}, (4.8) 


where À is the phonon drift described in Sec. II. Using 
Eq. (4.7) and (4.8), Eqs. (4.5) and (4.6) become 


ð ĝ e he M 
—MA-r—YyCT,7 Ri—4- Rz———À, (4.9) 
al dz U^ P ES 


à ð 
a E —vAfg)e. (4.10) 
t 


őz 


Here, 7 refers to collisions which transfer momentum 
out of the active phonon band, C is the specific heat of 
the phonon band, y is the average of cos? over the cone 
of phonons making up the phonon band, M and D are 
constants which depend on the ambient temperature as 


1 ; eene (ENS (Q-- 19" 
M=— | hq cose No(q)ELNo(q)2-1]g 
m MES 2 
Xsin6dqd6d e 
1 y(KTo)* x Joe 
=— ————— ]* cached | sin§dé , (4.11) 
dr? Aiv,* 2 9 
1 y(KTo)* x i 
D= T) cide sin&d8. (4.12) 
dr? hn? 2-545 


It is understood that the integral over x includes only 
those frequencies in the “hot” phonon band. 

A static equilibrium solution can be found by setting 
time and space derivatives equal to zero and solving for 
eo and Ao. Equations (4.4) and (4.9) become 


—nm —nm Rye Roo 
uE-— (Quebec oo EN 
To To bs 2 
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fe ess (4.14) 
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The sum of these equations is 


nm nme M Xo y 
p= E ; (4.15) 
TO To Vs Tr Us 


where 3 
uE 
8-—-1 


Us 


(4.16) 


and is the effective excess field driving the system. The 
righ-hand side of Eq. (4.15) consists of the net loss from 
the combined electron-phonon system due to electrons 
and phonons colliding outside the system. The resulting 
cubic equation for € or Ào can be easily solved in two 
limits: for 0€ 68««1 
«/0,— M8, (417) 
and for 821 
€)/v,— (Rs/2R) "^g! . (4.18) 
The current saturation, observed in CdS crystals!? 
when the electron-drift velocity is comparable to or 
exceeds the sound velocity, is explained by these equa- 
tions. At low electric field (a<0) the coupling between 
Eqs. (4.13) and (4.14) is negligible. The primary 
electron damping is due to the optical phonon modes 
and normal resistive behavior results. When vazv, the 
further current increases Aj for increases in field are 
given by 
Aj=neAeo= ne3uAE. (4.19) 
This causes a knee in the current voltage curves where 
the slope drops to one-half its original value. At very 
high electric fields the saturation increases and Aj be- 
comes proportional to the square root of E. Accompany- 
ing this saturation, a net drift is set up in the phonon 
distribution described by Ao, where 


Ao NMT) 2) 
T; M TO ( Us 

Although the electrons are capable of conforming to 
the instantaneous phonon distribution very quickly, 
it will take a time, roughly equal to the phonon relaxa- 
tion time, for the phonons to achieve their equilibrium 
value. Such a delay in achieving equilibriurn can explain 
the observed transient behavior in CdS. 

It is emphasized that these solutions are valid only in 
bulk material. No attempt has been made to introduce 
the effect of boundary conditions or spatial variations. 
The size of the sample should certainly effect transient 
time, particularly for small crystals. Also, no attempt 
has been made to include the effects of the low-f requency 
qA«1 phonons on saturation. It is felt, however, be- 
cause of the greater number of available high-frequency 
modes, that the effect of the low-frequency phonons will 
not be too important. 

The electron-phonon system can be examined for its 
behavior under slight departures from equilibrium. The 


(4.20) 
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parameters e, à, and T can be allowed to vary in both 
time and space about their equilibrium values 


PS i.e, 
e ent a, A= M, T=To+Ti, where e 


a= eei) 
A= Apei), (421) 
Ti= T Peit), 
Equations (4.4, 4.9, and 4.10) become 
inma? = — AM? — Be?— (nm/ro)e:?, (4.22) 
IQM O —ilyCT L= Bet3- AN9— (M / 7), (4.23) 


iQCT ,9— 4 DAL = VL Be? + A Ap — (M /r3)9] j (4.24) 


where 
p Ra(e9/v,?) 


B=2[ Ri(eo/2;)H- Ro(e9vo/v;?) ]. 


The dispersion relation for simultaneous solution of 
Eqs. (4.22)-(4.24) has roots which describe the collec- 
tive propagation discussed. This dispersion relation is 


(4.25) 
(4.26) 


—qQ2—2:0[ A' — (1/74) H-yo22— 0, (4.27) 
where 
A'—A/M, (4.28) 
B'=B/nm, (4.29) 
and 
0°=D/M (4.30) 


Substituting in Eqs. (4.25), (4.11), and (2.26), this 
becomes 


ep\? Tio, m? 7 TNI 
A'z (E) c) n 
d/ (KT)ypNT, 
x [1— tanh (1/32) (T/T)? d« 
Í x? csch? 
€0 


. 2 1+tanh[(1/32)(7/T.)x*] 


Ue x 
E csch?-d« (4.31) 
2 


where the approximation 


f cos?0 sin6d6z e/v, (4.32) 


has been used. Equation (4.32) holds for small e/v,. A 
simplifying approximation can be made in Eq. (5.1) by 
assuming 


[1—tanhy]= f ? ed ; (4.33) 


0, y>0.5 


and expanding the cschzx/2 functions for small values of 
*. The maximum value of x in the integrals is then 
given by 


max =4(T,/T)'2. (4.34) 


, 


EQ NE pia 


Oth 


1) 
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Carrying out the integral to max, Eq. (4.31) becomes 


, €x? fio, m** / TN? es 
A =3r(2) cm n—, (4.35) 
d/ (KT)pNT, v, 
The damping of the collective wave is 
er $(1/ry— A"), (4.36) 


where A is the stimulated momentum gain rate for a 
phonon distortion due to phonon emission by the 
electrons. When e; «0 a phonon distortion will grow 
rather than be damped. 

In a more detailed treatment of this problem, one 
would retain the q dependence of both A’ and 1/7). The 
gain band would be defined as those phonon modes for 
which 

(1/71) (9) — A' (q) <0. (4.37) 


The loss due to 1/r,(q) will modify the Írequency dis- 
tribution of the gain band. The value of gain at various 
frequencies shown in Fig. 5 is greater than the net gain 
by an amount equal to 1/74 (c), since the losses increase 
with increasing frequency and the density of frequencies 
participating in collective waves will be shifted to lower 
frequencies. The angular distribution of the hot phonons 
will also be affected since the gain cone should be defined 
by that angle at which the net gain is zero. 

An exact determination of the collective wave 
velocity would require a more thorough investigation 
of the role of the mixing collisions as well as detailed 
knowledge of the loss mechanisms which damp the 
wave. The problem is in knowing the extent of the cone 
over which cos? must be averaged in the determination 
of y of Eq. (4.9). The mixing collisions bring about 
transfers of momentum which involve phonon modes 
well outside the gain cone. The effect of the drag of this 
additional specific heat on the modes involved in the 
transport of momentum is not as yet understood. If the 
entire forward half of the phonon distribution is involved 
in the local thermalization process[i.e., determining T; of 
Eq. (4.9) ], the value of y will be 4. A restriction to some 
smaller cone would increase this value. The maximum 
value of y is y= 1, and this would occur when the cone is 
limited to a line in the direction of current flow. The 
absence of mixing collisions would also give rise to an 
apparent value of y— 1. A 

The factor 7. of Eq. (4.9) should be a weighted 
average over all the excitations participating in the 
collective wave. This is important in the case of second- 
sound propagation in He II as the admixture of rotons 
slows the collective wave velocity." In the wave dis- 
cussed in this paper, the electrons participate In the 
collective wave and should be averaged in. The effect of 
the electrons should be very small due to their relatively 
small numbers and small mass. The phonons involved in 
the wave also have varying velocities. This is because 


uK, R. Atkins, Liquid. Helium (Cambridge University Press, 
Cambridge, England, 1959), p. 148- 
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the phonons having piezoelectric fields become impure 
modes in some directions. In CdS the piezoelectric mode 
along the “C” axis is a longitudinal mode. The effect 
of the different velocities of phonons for large angles, off 
the direction of propagation, is reduced by the factors 
of cosó in the constants M and D of Eqs. (4.11) and 
(4.12). This change in velocity with orientation would 
also effect the extent of the gain cone, etc., with 
orientation. 


V. CONCLUSION 


The observation of an anomalously slow acoustic 
propagation in CdS has been recently reported.! The 
effect is directly related to the passage of a piezoelectri- 
cally active shear wave through the crystals under 
conditions of acoustic gain. The anomaly could only be 
explained as a wave traveling in the CdS with a 
velocity just slightly faster than 1/v3v,, where v, is the 
velocity of the piezoelectric shear wave. This shear wave 
is the slowest of the possible sonic waves in CdS. The 
pulses were observable only with applied voltages such 
that the electrons had drift velocities roughly 50 percent 
greater than the velocity of sound. For drift velocities, 
roughly twice the velocity of sound, the pulses were 
damped out. Similarly, the pulses were observed only 
over a small range in temperature near 300°K. Higher 
and lower temperatures caused the wave to be damped. 
At low temperatures the velocity of the wave was 
observed to increase as the wave was damped. These 
observations can be explained on the basis of the 
theoretical development of a collective-wave propaga- 
tion presented above. 

The collective wave consists of a harmonic fluctuation 
in the density of phonons within an active phonon band 
(phonon frequencies ~ 10"°-10" sec-!), This could also 
be described as a harmonic temperature fluctuation in 
the active band. The phonon density (temperature) 
fluctuations in turn, have a frequency equal to that of 
the shear wave which generated them (shear wave fre- 
quency ~ 10" sec). The role of the initial shear wave 
was to introduce a single-frequency perturbation, which 
when amplified, could be coherently detected. 

The propagation of a collective wave requires that 
two conditions be satisfied: (1) The gain in crystal 
momentum of the active phonon band must exceed the 
losses, and (2) mixing collisions among phonons within 
the band must be sufficiently frequent that a local tem- 
perature can be defined within a collective wavelength. 

In Sec. II of this paper, the momentum gain for 
piezoelectrically active phonons has been evaluated. 
The gain mechanism is such that the electrons couple 
only to a band of phonons of frequencies 10°-10" eps. 
In CdS, under conditions approxi thase 
experimental observation, NE a 
with 1—10* and 52:10* D/esu)* the-gain in the b: 
given by Eq. (4.35) is ae 3 


s s k 
A' z 103 (e/v,) hec. ' @.1) 


-— 
hoá 
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"That is, this gain is great enough, at large electric field 
strengths, to overcome active phonon-thermal relaxa- 
tion times of 10-5 sec. 

The losses which this gain would have to overcome to 
satisfy condition (1) are the losses from the active 
phonon band to thermal phonon reservoir. These 
processes are due to the anharmonic perturbations and 
do not involve electrons. The phonon-thermal phonon 
relaxation time for the phonon frequencies expected to 
be of importance is difficult to estimate, and direct 
measurements at these frequencies and temperatures in 
CdS are not available. The difficulty arises because the 
active band lies between very high ultrasonic frequencies 
and very low thermal frequencies. Woodruff and Ehren- 
reich? have shown that the phonon-phonon damping 
increases roughly as w? in the ultrasonic region, and as 
w for higher frequencies. The extrapolation to the fre- 
quencies important in collective wave propagation could 
be approximated by a variation as o^ with 1« «2. 

The dark relaxation time (i.e., no carriers) measured 
by Kroger" in CdS crystals, which have shown collec- 
tive wave propagation, is 10-?-10-* sec! at 107 cps. If 
one extrapolates from 10’ cps to the higher frequencies 
of the active phonon band, a very large part of the 
phonon band which is capable of supporting collective 

waves [satisfying condition (2)] will have stimulated 
gain greater than the losses. The net gain would be the 
difference between two large numbers, roughly the order 
of 10? sec. From Eq. (4.35) it can be seen that the 
momentum gain decreases for decreasing electric field 
and for higher temperatures. This decrease in gain would 
explain the observed damping of the collective wave at 
lower electric fields and higher temperatures. 

As pointed out in Sec. III, the major contribution to 
mixing collisions probably comes from successive ab- 
sorption and emission processes. This combined rate is 
limited by the slower of these two processes. When the 
electron drift velocity is exactly v, the rates of the two 
processes are expected to be equal. For higher drift 
velocities (i.e., higher external fields) the absorption 


12 A. R. Hutson, Phys. Rev. Letters 4, 505 (1960). 
(1961) O. Woodruff and H. Ehrenreich, Phys. Rev. 123, 1553 
“H. Kroger (private communication). 


rate, and therefore the mixing rate decreases. Higher 
temperatures tend to enlarge the spread in the electron 
distribution and would bring the absorption and 
emission rates closer together. Lower temperatures, on 
the other hand, tend to make the electron distribution 
more compact, which would decrease the absorption 
rate relative to the emission rate. The rate of mixing 
collisions then has opposite temperature and voltage 
dependence than the net gain. This loss of coherence, or 
second viscosity damping, would account for the damp- 
ing of the collective wave at high electric fields and low 
temperatures. The increase in velocity with onset of loss 
of coherence or second viscosity is also observed in 
second-sound propagation.!! It comes about because for 
few mixing collisions a large part of the collective wave 
energy travels straight through as unhindered phonons, 
With no normal collisions the energy transfer velocity 
would be vs. 

Another cause for decrease in the observability of the 
collective waves at low temperature is the decrease in 
the energy and momentum content of the effective 
band. Figure 5 shows the total crystal momentum trans- 
ferred to the entire band for various temperatures. It 
can be seen that the total magnitude of the entire 
collective wave decreases with temperature. 

We expect the mixing collision rate to be comparable 
to the rate of Eq. (5.1). Therefore, the highest collective 
frequencies possible at room temperature would be 
roughly 105 cps. For frequencies of 109 cps and lower, 
the collective wavelength would be as large or larger 
than the size of readily available crystals. The experi- 
mentally observed wave was observed at a frequency 
of 107 cps. 
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Second-harmonic generation (SHG) of the Nd-doped CaWO, optical maser beam in ferroelectric BaTiO, 


has been investigated. Each of the three nonlinear coefficients which 


determine the magnitude of the SHG 


have been measured from room temperature to near the Curie temperature, It is found that all three co- 
efficients have the same temperature dependence as the spontaneous ferroelectric polarization. The sym- 


metry of the nonlinear coefficients required for a dispersionless, los 


sless SHG mechanism, namely, d1s2 da 


for BaTiOs, applies over the entire measured temperature range. The effects on SHG due to antiparallel ferro- 


electric domains are described. The circumstances under wh 


ich random as well as special antiparallel do- 


main arrays can produce either an enhancement or a degradation of the second-harmonic intensity are dis- 
cussed. Because of the dependence of SHG on the domain Structure, the determination of the nonlinear 
coefficients requires single-domain crystals. A method for preparing suitable BaTiOs single-domain crystals 
is described. These crystals are also useful for investigations of the characteristics of ferroelectric domain 


growth. 


INTRODUCTION 


la studies of the second-harmonic 
generation! (SHG) of optical maser beams in 
ferroelectric crystals are interesting for several reasons. 
All ferroelectric crystals? have optical, electrical, and 
other properties that undergo large changes with tem- 
perature so that one might also expect the efficiency 
of SHG in these crystals to vary with temperature in 
some pronounced manner. In addition, ferroelectric 
crystals have a domain structure that can be in- 
fluenced with an external electric field; and, as will be 
discussed, the ferroelectric domain structure can have a 
maked effect on the production of the second har- 
monic.’ Neither the domain effects nor the marked 
temperature dependence of SHG will occur with the 
usual nonferroelectric piezoelectric crystal. Ferro- 
electric barium titanate is, in many respects, a good 
choice as a material for an investigation of these 
nonlinear effects. For example, suitable crystals are 
readily available, the point group (Cs) is simple, it is 
among the most efficient nonlinear materials known,’ 
the Curie temperature (120°C) is easily attained, its 
optical properties are well suited to SHG with at least 
one of the commonly used optical masers, and its 
domain dynamics are quite well understood.? 

From the point group of BaTiOs, it can be shown 


! For a review on the subject of SHG, see P. A. Franken and 
J. F. Ward, Rev. Mod. Phys. 35, 23 (1963). 4 

? For a general review of the properties of BaTiO; and other 
ferroelectric crystals, the reader is referred to F. Jona and G. 
Shirane, Ferroelectric Crystals (The Macmillan Company, New 
York, 1302) IRRA 

3 R. C. Miller, quoted in Ref. 1. 

CE: NEIN Bull. Am. Phys. Soc. 8, 62 (1963). : : 

5 J. van der Ziel and N. Bloembergen, Bull. Am. Phys. Soc. s: 
380 (1963), and N. Bloembergen, Proceedings of the S em 
School of Physics *tEnrico Fermi," Varenna, Como, Italy, 


s R Dm A. Kleinman, and A. Savage, Phys. Rev. 


3). 
ERE e J. L. Bjorkstam, Appl. Phys. Letters 3, 


109 (1963). 


that the second-order polarization P, has the form,!.8 


 - £0 Qu a y o 
Pus= | 0) CODO Onan) 
(dst da dy 0 0 Oj 


In this equation, P», is the dielectric polarization at 
twice the maser frequency w, d; are the nonlinear 
coefficients, and Æ; are the optical electric fields in the 
medium. Inspection of Eq. (1) readily shows that 
c-domain ferroelectric plates, i.e. plates with the 
ferroelectric direction (z axis) normal to the major sur- 
faces, are not very useful for a study of SHG. One would 
like to use a-domain plates, i.e., samples with the 
ferroelectric direction in the plane of the major sur- 
faces of the plate, so that with appropriate use polarizers 
and analyzers, one can determine each of the three non- 
linear coefficients. There is some indication in the 
literature that a-domain samples are difficult to prepare; 
and, furthermore, that they may be unstable. In 
addition, since SHG is influenced by antiparallel fer- 
roelectric domains,* one must prepare a-domain samples 
which are free of antiparallel domains, Le., they must 
be single domain crystals. A method for preparing 
samples suitable for these SHG studies will be described. 

The a-domain samples prepared for the SHG studies 
are also useful for another type of investigation. With 
electrodes along two opposite edges so that electric 
fields can be applied along the crystal z axis, one can 
study the domain dynamics of polarization reversal, 
i.e., the nucleation and growth of the antiparallel 
domains. In recent years, studies? of domain dynamics 
through the direct observation of the domains have been 
largely restricted to investigations of the sidewise 
motion of the 180? domain walls—the boundaries be- 
tween antiparallel domains. With the a-domain sam- 


* P. H. Fang and W. S. Brower, J. Appl. Phys. 34, 1516 (1963). 
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ples, one can for the first time observe in detail directly 
with a microscope the forward growth (growth in the 
direction of the spontaneous polarization) of the anti- 
parallel domains under nearly ideal conditions. These 
phenomena, however, will not be described in this 
paper. 

This paper discusses an analysis of some of the 
effects of antiparallel ferroelectric domains on SHG, the 
preparation of single crystals of BaTiO; suitable for 
studies of SHG and the forward growth of antiparallel 
domains, the determination of all the nonlinear coeffici- 
ents as a function of temperature, and a brief discussion 
of these and other related experimental results. 


EFFECTS OF FERROELECTRIC DOMAINS ON SHG 


The effects which will be considered are those due to 
antiparallel domains. Effects which arise with twinned 
crystals due to mixed a- and c-domain configurations 
will not be discussed. Barium titanate will be treated 
in some detail, however, the phenomena to be described 
are not specific to BaTiO;—they will occur with all 
ferroelectric crystals. 

Consider what happens when an optical maser beam 
is at normal incidence on a BaTiO; crystal platelet. 
"The maser beam fundamental produces a second har- 


VACUUM FERROELECTRIC CRYSTAL 


FORCED WAVE 
2k; x -2wt) 
1 


Ean, aloe i 


_ 
MASER BEAM 
ei(kix-wt) 


T7 


lcz 
€ k2-2kj 


SECOND HARMONIC 
INTENSITY, 


ec|E2k, * Ek; | | 


Fic. 1. Illustration of the enhancement of SHG due to anti- 
parallel ferroelectric domains, one coherence length, le, thick 
The forced and free second-harmonic waves are shown at the top 
of the figure and the resultant second-harmonic intensity at the 
bottom of the figure. At each domain wall the forced wave changes 
phase by =, and the free wave increases in amplitude by an amount 
equal to twice that of the forced wave. The dotted, dashed, and 
solid curves at the bottom of the figure show the effect on the 
pecen d harmonic intensity of zero, one, and two 180° domain 
walls. 


ROBERT C. 


MILLER 


monic forced wave?!? in the crystal as indicated in 
Fig. 1. This wave, which will be expressed as 


Emale exp? (2kıx— 2ot) (2) 


is tied to the fundamental maser light wave 
expi(kix—wt). In these expressions, w is the frequency 
of the fundamental light wave, and k, the propagation 
constant in the medium. The quantity /, is the coherence 
length,! which will be described in more detail in the 
next paragraph. The plane of polarization of the forced 
wave is determined by the nonlinear coefficient in- 
volved in the SHG process. For ease of presentation, 
the phenomenon to be described will be illustrated with 
the nonlinear coefficient ds, so that all polarizations, 
including the ferroelectric polarization, are in the same 
plane. 

Boundary conditions imposed by Maxwell's equa- 
tions, namely, the tangential components of Æ and H, 
must be continuous across the surface of the crystal, 
require that a free second-harmonic wave’? also be 
present in the crystal. This wave, which is produced 
at the surface, is expressed by 


114-1 
Eye —l 


expt (kox— 2o) , (3) 


7134-2 


where k» is the propagation constant in the medium 
appropriate to the second-harmonic frequency. The 
quantities 71 and 72 are the indices of refraction at the 
fundamental- and second-harmonic frequencies, re- 
spectively. A reflected second-harmonic wave of small 
amplitude is also generated but will not be discussed 
further.?:? Since dispersion between the fundamental- 
and second-harmonic frequencies is usually present, 
2k, will not in general be equal to kz so that the forced 
and free waves, which are initially out of phase by 
v at x=0, travel with different velocities. The second 
harmonic intensity 7; in the crystal is the result of the 
interference between these two waves, and can be 
approximated by 

Inc | Bop t+ En, 2 (4) 


Since (ni-1)/(74-1) is nearly unity, Is at «—0 is 
essentially zero compared to the maximum value of 
I» which first occurs at x equal to one coherence length 
l. where the two waves are in phase; and then provided 
no domain walls are present, at successive odd mul- 
tiples of l. The minima occur at even multiples of Le 
This oscillating phenomenon is illustrated near the 
bottom of Fig. 1. The coherence length J, is given by 


l= r/ (2) AQ — m), P 


where X is the free-space wavelength of the fundamental. 
Now, consider what happens to these two second- 
harmonic waves when they traverse a 180° domain wall 


?J. A. Armstrong, N. Bloembergen, J. Ducuing, and P. S 
Pershan, Phys. Rev. 127, 1918 CORN RE 
12 D. A. Kleinman, Phys. Rev. 128, 1761 (1962). 
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where the normal to the wall is in the beam propagation 
direction. Since the z axes in two antiparallel dornains 
are in opposite directions, Eq. (1) says that the phase 
of the second-order polarization wave Pas, and hence 
that of Esr, will change by 7 on traversing the wall. 
The effect of the wall on the free wave is determined by 
the requirement that’the tangential components of the 
E and H fields at frequency 2w be continuous across the 
domain wall. For a wall at x= xo, the free-wave changes 
from the form given in Eq. (3) to 


q[— (nı+n) 
i (1154-1) No 


expt (25,— be] 


Xexpi(Rox—2ol), (6) 


for «> xo. Thus, in the case of the free wave, the domain 
wall in general introduces a change in amplitude as well 
as a change in phase. A reflected second-harmonic 
wave is also produced at the wall; however, since its 
amplitude is down from that of the transmitted second- 
harmonic wave by more than three orders of magnitude, 
the reflected wave will not be considered further. Assume 
as shown in Fig. 1, that a wall is at xo—1,, in which case 
the maximum increase in Ez, and hence I; is obtained. 
Setting 72:75, i.e., small dispersion, one sees from 
Eq. (6) that the amplitude of the free wave increases by 
a factor of 3. Therefore, the maxima of the total second- 
harmonic intensity increase by a factor of 4 due to the 
first “properly” placed domain wall. In addition, the 
minima are no longer zero, but are equal to the maxima 
obtained in the absence of the 180? wall. Each properly 
placed 180? wall results in an increase of the amplitude 
of the free wave by (ni4-2:)/1372 times the amplitude 
of the forced wave. With JV properly placed walls, the 
part of 7I» which does not average to zero with 
x can be shown to increase by a factor equal 
[1+(1+2N)?]/2. The largest enhancement one can 
obtain from this process occurs when the 180? walls 
are present at x=/,, 2L, 3L, etc., up to the crystal 
thickness /. The effect is then to increase the coherence 
length from /, to the crystal thickness /. With the Nd 
maser and BaTiOs, /,— 2.1X 10-* cm for d33° so that for 
a crystal of typical thickness, 2X10~? cm, and with 
domain walls spaced /, apart, one has N max= 10? or a 
SHG enhancement of 2X 105, a very large effect. 

For an array of N 180? domain walls situated ran- 
domly in the crystal, the resultant second-harmonic 
electric field is given by 


EcL (— 1)Neitiz— giis (1—2 exp(— ei) 
+2 exp(—imn2/l.)—2 exp (—imxs/L.)--- - - 
42-1)" exp(—ixxy/I.))], (7) 


where the walls are placed at x— x1, v=x», etc., and it is 
assumed that nı=nz except in le Then, for 2NZ,«I, 
Eq. (7) gives a second-harmonic intensity enhancement 
proportional to (1+ 2N), where again only the part of 


1; that does not average to zero with x is considered. 
The restriction that 2NI.<1 is required so that in 
averaging the product of Eq. (7) and its complex con- 
jugate one can permit each phase zx;/L, to vary from 
0 to 2x. Thus, even a random array of antiparallel 
dornains can produce a substantial enhancement of the 
second harmonic in a ferroelectric crystal. 

For ll, i.e., when the free and forced waves pro- 
pagate at very nearly the same velocity, Eq. (7) can 
be used to show that the intensity of the resultant SHG 
is proportional to the square of the difference between 
the total thickness of domains of one sign minus that for 
domains of the opposite sign. Since SHG for Lx, in 
the absence of domains! is proportional to 2, antiparallel 
dornains will have a serious degrading effect on SHG 
under “velocity matched” conditions. 

In the case just treated, the crystal was assumed to 
consist of slabs of antiparallel domains with the normals 
to the walls parallel to the beam direction. If the wall 
normals are perpendicular to the beam direction the 
analysis given above does not apply. In the latter situa- 
tion, one has second-harmonic coherent light beams 
emerging from the crystal with the phase of all the 
beams coming from domains polarized in one direction 
out of phase by r with those beams coming from do- 
mains polarized in the opposite direction. In an un- 
focused beam experiment with a few antiparallel 
domains present so that the cross sections of the radiat- 
ing volumes are much larger than a wavelength, the 
emerging beams will be nearly parallel and will not 
interfere with each other and reduce the second- 
harmonic intensity below that which one would ob- 
serve with a single-domain crystal. However, when 
diffraction effects become large, that is, when the cross 
sections of the radiating areas are comparable to a 
wavelength, the emergent beams are no longer nearly 
parallel to that they can interfere destructively with 
each other and reduce the second-harmonic output be- 
low that which one would observe with one or a few 
domains. 

The optimum domain array for SHG is one where the 
crystal consists of sheets of antiparallel dornains, each 
l. thick, with the wall normals parallel to the beam direc- 
tion. The question arises as to how one can produce 
such domain arrays. One possible method that would 
apply to crystals which undergo second or higher order 
ferroelectric phase transitions,? such as potassium 
dihydrogen phosphate (KDP), Rochelle salt, and 
triglycine sulfate (TGS), involves the formation of a 
domain pattern, as the crystal goes from the para- 
electric to the ferroelectric phase, which is determined 
by the minimum free energy? In the case of Rochelle 
salt, theory and experiment give a domain spacing ob- 
tained in this manner which is proportional to the 

square root of crystal thickness." (This crystal thick- 
ness refers to the direction of the ferroelectric polariza- 


uJ. Mitsui and J. Furuichi, Phys. Rev. 90, 193 (1953). 
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tion.) Thus, one may be able to produce a suitable array 
using this technique. This method will not work for 
crystals such as BaTiO; which undergo first-order phase 
transitions since the domain pattern formed on cooling 
through the phase transition is in this case determined 
by independent nuclei and not by the minimum free 
energy.? 

It may be possible to produce desired domain arrays 
with electric fields. There is some evidence that the 
number of domains per unit area is determined by the 
magnitude of the field producing them,” so that with a 
suitable choice of the magnitude and the duration of 
the field, one may be able to produce useful domain 
patterns. A combination of electrical and thermal treat- 
ment may also be found useful. 

Order of magnitude enhancements of the second- 
harmonic have been observed with both BaTiO; and 
TGS when antiparallel domains were known to be 
present. In these experiments, the ferroelectric domains 
had probably either circular, or approximately square, 
cross sections in the BaTiOs; and probably lenticular, 
or circular, cross sections in TGS.? Thus, in neither case 
did the antiparallel domain configuration approximate 
the ideal slab structure. However, in no case was a 
decrease of the second harmonic observed when anti- 
parallel domains were known to be present which shows 

that the enhancement effects of the 180? walls are 
larger than the degrading effects such as those due to 
small radiating areas. 

A somewhat similar arrangement for enhancing SHG 


in quartz where l<} was described earlier? and con- 


sists of stacking crystal plates in such a way so as to 
produce the effects shown in Fig. 1 and described in the 
text. 


PREPARATION 


Since no method of preparing samples suitable for 
the present experiments has been given in the literature, 
the techniques employed to produce the crystals will be 
described in some detail. Clear, undoped crystal plates 
of the order of 2X 10-? cm thick (grown by the Remeika 
method") with clean smooth surfaces were selected and 
then etched in concentrated H3PO, at 155°C sufficiently 
long to reduce the sample thickness by at least 5X 10? 
cm. The samples were then rinsed in water, alcohol, and 
de poled in distilled water with platinum electrodes. 
After poling, the specimens were examined with a 
polarizing microscope and only those samples which 
had large, unstrained c-domain areas were selected. Of 
these crystals, only those which had areas a millimeter 
or so on a side that were sufficiently plane parallel to 
give a few interference fringes with visible monochro- 
matic light were used. These selected crystals were then 
cleaved, or broken, with a razor blade such that with 
much patience and some luck one obtained a c-domain 


18H. L. Stadler and P. J. Zachmanidis, J. Appl. Phys. 34 
- (1963) J. Appl. Phys. 34, 3255 


^3 J. P. Remeika, J. Am. Chem. Soc. 76, 940 (1954). 
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sample, rectangular in shape, several millimeters on 4 
side, with the sides parallel to the @ axes. Then air- 
drying silver-paste electrodes were painted on the more 
perfect two opposite edges and dc fields of a few kV/cm 
were applied to pole the sample a domain. The poling 
process was monitored with a polarizing microscope 
and the process usually hastened by heating the sample 
with a focused light beam to a temperature below but 
near the Curie temperature. When the poling process 
was complete, the light and field were removed and the 
resultant a-domain sample examined. At this point, the 
samples would in most cases be without antiparallel 
domains. If a few antiparallel domains did appear, 
they were usually readily eliminated with fields of a few 
hundred volts per centimeter. In the studies of SHG, a 
field of this magnitude was always kept on the sample 
while under investigation to insure that no antiparallel 
domains were present. However, even under these condi- 
tions, both twinning and antiparallel domains usually 
occur at temperatures above approximately 110?C. For 
this reason, coupled with the fact that going through 
the Curie temperature is frequently a destructive opera- 
tion, almost all the studies were made at temperatures 
less than about 110?C. There was no indication that 
these a-domain samples were unstable, i.e., over periods 
of months, no changes in the domain structure were 
observed. 


SHG STUDIES 


Figure 2 shows a schematic drawing of the essential 
components of the experimental arrangement for 
studying SHG in BaTiO;. For ease of analysis of the 
data, a careful alignment was made so that the polariza- 
tion of the laser beam was in the xz plane of the crystal 
and at 45? to the x and z crystalline axes. Table I gives 
the nonlinear coefficients d;; involved in SHG as deter- 
mined from Eq. (1) for various orientations of the 
polarizer and analyzer with respect to the crystal z axis. 
As seen in the table, special orientations of the polariz- 
ing elements enable one to observe separately SHG from 
each of the three nonlinear coefficients. The polarizing 
elements in Fig. 2 are set to observe SHG due to dis 

To make quantitative comparisons between nonlinear 
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Fic. 2. Schematic drawing showing the orientation of the BaTiO: 
sample with respect to the maser beam. The polarizing elements 
are set to measure the second harmonic due to dys. The electric 
field is applied to insure that the sample remains single domain: 
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coefficients, it is necessary to consider the interference 
effects that occur between the second-harmonic forced 
wave produced by the maser beam, and the free second- 
harmonic light wave produced at the surface of the 
crystal. As mentioned in an earlier section, this inter- 
ference effect, which was first demonstrated in quartz," 
produces & second-harmonic wave whose amplitude 
varies in a periodic manner along the beam direction 
from essentially zero to some maximum value. For 
ds; in BaTiO; and a crystal of thickness L, the second- 
harmonic intensity, 75; generated by the fundamental 
intensity 7; is given by 


Ij e Ijf(n)dalg sin? (71/2133) , (8) 


where f (n) is a function of the indices of refraction, and 
Jag is the coherence length. The function f (7) is given by 


ne 
(14-1) (2-1)? (032-02) F 


In Eq. (8), the intensities are those measured outside of 
the crystal. Sufficient data on the indices of refraction 
of BaTiO; are not available to calculate the coherence 
length from Eq. (5) so that it must be determined ex- 
perimentally from the data. The second-harmonic in- 
tensity is observed as the crystal is rotated about the 
z axis and then plotted as a function of angle. This 
rotation changes the path length of the beam in the 
crystal so that the periodic variation of the second 
harmonic is observed. These data on the angular de- 
pendence of J;; are then used to determine /;; and are 
extrapolated to normal incidence to obtain relative 
values of d;;. Room temperature determinations of dij 
and /; were given earlier, but are also given here in 
Table II. 

The temperature variation of the nonlinear coeffici- 
ents is calculated from observations of the second har- 
monic intensity versus temperature. Figure 3 shows 
some of the data for the coefficient dsı. The oscillations 
present in the data occur due to changes in the magni- 
tude of /4 through the temperature dependence of 
73—7,.5 Data on both the angular and temperature 
dependence of the second-harmonic intensity are re- 
quired to determine directly the direction in which liş 


f(n)- (9) 


"TABLE I. Orientations of polarizer and analyzer to determine 
nonlinear coefficients in BaTiO;.* 


Polarizer Analyzer Nonlinear coefficient 
0? 0? dis 
90? 0? ds 
90° 90° none 
45° 0° dis and dai 
45? 90° dis 


* The angles are between the transmission direction of the electric field 
in the polarizing element and the s axis in the crystal. 


1 P. D. Maker, R. W. Terhune, M. Nisenoff, and C. M. Savage, 
Phys. Rev. Letters 8, 21 (1962). 
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TABLE II. Room temperature BaTiO, nonlinear 
coefficients and coherence lengths. 


— 197012, EN 
2:00--0.14 a 
2.07 40.04 u 


* The di are relative to dis for KDP where dis 21.00, 


changes with temperature. For example, in the case 
of dıs, it is found that dI1s/d0 is negative when dI1s/dT 
is positive so that lı; must be increasing with tempera- 
ture. Similar considerations show that J; and la 
both decrease with temperature. These experimentally 
determined temperature dependences for ly are in the 
direction one would predict from the limited data on 
the optical properties of the negative uniaxial BaTiO, 
crystal. The published data? show that the index of 
refraction for the ordinary ray is temperature-independ- 
ent while the index of refraction of the extraordinary 
ray increases with temperature and becomes equal to 
the ordinary index of refraction at the transition 
temperature. 

Since f(n) in Eq. (9) is very nearly temperature- 
independent, the temperature variation of the amplitude 
of the maxima shown in Fig. 3 arises from changes in Ja 
and ds1. Since /4; is known from the room temperature 
data on the angular dependence of the second harmonic 
intensity, and each oscillation of the second harmonic 
shown in Fig. 3 represents the introduction of two 
additional coherence lengths, lı can be determined 
as a function of temperature. The temperature variation 
of da can then be calculated from Eq. (8). Figure 4 
shows data obtained in the manner just described on the 
temperature variation of each of the nonlinear coeffici- 
ents for BaTiO;. The relative values of d;; were com- 
puted at the intensity maxima and are normalized so 
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Fic. 3. Second harmonic intensity due to di; as a function of 
temperature. The oscillations are due to changes in the indices of 
refraction with temperature. These data are used to calculate the 
temperature dependence of the nonlinear coefficient. 
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Fic. 4. The temperature dependence of the three nonlinear 
coefficients in BaTiO;. All three have been normalized to 10 at 
20°C. Within the experimental uncertainties of the measurements, 
all three coefficients vary with temperature as the spontaneous 
polarization which is also plotted in the figure and normalized to 
10 at 20°C. The experimental points were calculated from the 
maxima of the second-harmonic intensity versus temperature 
data similar to that shown in Fig. 3. 


that they all equal 10 at room temperature. All three 
coefficients were measured as a function of temperature 
with a crystal 2.48 10~ cm thick, and in addition d31 
was also measured with a second crystal 6.25X 10? cm 
thick. In the case of the coefficient dis, the curve shown 
in Fig. 4 is an average of six temperature runs. Although 
not shown in Fig. 4, it has been found that the di; 
decrease to zero discontinuously at the temperature 
where the crystal goes through a first-order phase 
transition from a ferroelectric phase into a nonpiezo- 
electric phase? The discontinuous decrease of the 
second-harmonic intensity at the phase transition is 
consistent with the first-order phase transition. The 
temperature dependence of the spontaneous polariza- 
tion of BaTiO; (26 uC/cm? at 300°K) normalized to 
10 at room temperature, is also shown in Fig. 4. Note 
that all d;; vary with temperature within a few percent 
of each other and P,. The temperature dependence of 
the d’s will be discussed in the next section. 


DISCUSSION 


Attempts to describe the temperature dependence of 
the d;;s given in Fig. 4 in terms of P,(T)n show that 
n= 1 results in the best over-all fit to the data. Empirical 
fits more “accurate” than that given by d,;(T) « P,(T) 
can be obtained for ds; and das, however, these fits are 
in some cases not physically significant, e.g., d;; does 
not go to zero as P, goes to zero, and furthermore com- 
plicated empirical fits would not seem justified at this 
stage since within the experimental uncertainties of 
the measurements d;(T) and P,(T) have the same 
temperature dependence, Therefore, the temperature 
dependence of each of the three nonlinear coefficients 
for BaTiOs from 20°C to about 105°C can be described 
by 


d; (T)-agP.(T) (10) 


ROBERT C. 
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in which P,(T) is the temperature-dependent spon- 
taneous polarization, and a;; is independent of tempera- 
ture. The question arises as to the significance of Eq. 
(10). At first glance there may be some objection to 
trying to describe a purely optical property such as di; 
in terms of P, which is a dc or at most a low-frequency 
characteristic. However, there is at least one purely 
optical quantity whose temperature dependence js 
described by some power of P,. Namely, in BaTiO; the 
birefringence is proportional to P,’.!° Also in KDP, the 
temperature dependence of the change in birefringence 
consequent on passing through the ferroelectric phase 
transition is proportional to P,?!5 Therefore, at this 
point one can say that it may not be simply fortuitous 
that Eq. (10) describes the temperature dependence of 
the dij. 

There are few other data with which one can compare 
the present results or test the general validity of Eq. 
(10). SHG studies of KDP both above and below the 
Curie temperature, 123°K, have been described in two 
recent publications.9" Since SHG occurs in KDP both 
above and below the Curie temperature (it is piezoelec- 
tric in both phases) the temperature dependence of 
SHG in KDP will be different from that of BaTiO;. 
The onset of ferroelectricity in KDP involves a phase 
change from the point symmetry D», for the paraelectric 
form, to the point symmetry C2, for the ferroelectric 
phase. This lowering of the crystal symmetry results 
in the introduction of one nonlinear coefficient, dis, 
not present in the paraelectric phase. Therefore, the 
temperature dependence of d;; will be different from 
that of the coefficients which are also present in the high- 
temperature phase. If it turns out that Eq. (10) does 
indeed have some general validity, it would be reason- 
able to propose that the temperature dependence of 
the nonlinear coefficients for a ferroelectric crystal which 
is piezoelectric in the paraelectric phase, such as KDP, 
is given by 

d; (T) - dif -oP QD) (11) 


where d; is a temperature-independent nonlinear coef- 
ficient characteristic of the paraelectric phase. The 
change in the nonlinear coefficient in going through the 
phase change is described by a;;P,(T). For the new 
coefficient, das, which appears when the crystals be- 
comes ferroelectric, d3? =0. 

Himbarger and Bjorkstam? observe no significant 
change in the second-harmonic intensity from KDP n 
going from room temperature, through the phase transi- 
tion, to 80°K. The data, which were obtained with a 0 
ruby rod and therefore not a highly polarized maser 
beam, indicate that any discontinuity in the coefficients 
at the transition temperature is small. In terms of Eq. 
(11), these data would imply that d,,>>a.;P, except for 
dss and also that dss all other d,;. The coefficient das 


16 W. J. Merz, Phys. Rev. 76, 1221 (1949). D. Meyerhofer, 
Phys. Rev. 112, 413 (1963). 
18 B. Zwicker and P. Scherrer, Helv. Phys. Acta 17, 346 (1944). 
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is reported by van der Ziel and Bloembergen’ to be at 
least an order of magnitude smaller than the coefficients 
in the paraelectric phase. However, in disagreement with 
the data of Himbarger and Bjorkstam, van der Ziel and 
Bloembergen find that the nonlinear coefficients which 
are present in both phases change by factors ranging 
from about 3 to 0.2 as the crystal symmetry is lowered. 
Furthermore, except for the change in these coefficients 
at the phase transition, they are temperature-independ- 
ent. The temperature dependence of d;; has not been 
determined. The discrepancy between the results of 
these two investigations on KDP could be due to 
effects arising from antiparallel ferroelectric domains; 
however, both groups were clearly aware of this com- 
plication. Discontinuities in the coefficients at the 
Curie temperature coupled with temperature-independ- 
ent coefficients in the ferroelectric phase as reported 
by van der Ziel and Bloemberger are not consistent with 
Eq. (11). 

Kleinman has used a thermodynamic approach to 
show that if SHG arises from a nondispersive, nonlossy 
process, e.g. a high-frequency electronic mechanism, 
symmetry in addition to that required by the point 
group of the crystal will appear in the tensor which 
describes the second-order polarization coefficients. 
Data®!%19 on the Kleinman symmetry condition in a 
wide variety of crystals show that the additional 
symmetry is indeed present in the second-order polariza- 
tion tensors. This symmetry condition requires for 
BaTiO; that dj;=d3;. As is evident in Table I, this 
condition is satisfied at room temperature within 
the approximately +10% experimental uncertainties. 
The data shown in Fig. 4 demonstrate that the Klein- 
man symmetry condition is also satisfied over the 
entire measured temperature range. 

A phenomenological theory? of ferroelectricity in 
which the Helmholtz free energy is expressed as a power 
series in the polarization has been quite successful, 
especially in the case of BaTiO; (Devonshire’s theory), 
in explaining much experimental ferroelectric data. 
The validity of the Kleinman symmetry condition 
mentioned above, and the more general symmetry re- 
lations for nonlinear optical effects in lossless media 
derived using energy considerations by Pershan* both 
suggest that it may prove fruitful to extend the ferro- 


electric thermodynamic approach into the optical 


1 D. A. Kleinman, Phys. Rev. 126, 1977 (1962). 

OTN, Savage and R. C. Miller, Appl. Opt. 1, 661 (1962). 
? R, C. Miller, Phys. Rev. 131, 95 (1963). 

2 p. S. Pershan, Phys. Rev. 130, 919 (1963). 


region. This approach would give d;;(T) expressed as a 
power series in P, with the coefficients of the various 
terms determined from the free-energy expression for 
the paraelectric phase. For BaTiO, only odd powers 
of P, will occur in dy, while dy for KDP will involve all 
powers of P, Therefore Eq. (10) includes only the 
first term for BaTiOs, and Eq. (11) the first two terms 
for KDP. However, dispersion does occur between the 
present optical frequencies and the frequencies at 
which the various coefficients in the free-energy expres- 
sion have been determined. "Therefore, coefficients for 
the free-energy expression at optical frequencies as well 
as clarification of the KDP second harmonic data and/or 
additional SHG measurements with other ferroelectric 
crystals are required before the full significance of Eqs. 
(10) and (11) can be ascertained. 

Another phenomenon which will be discussed briefly 
is the possible effect on the SHG due to a temperature 
dependence of the absorption edge in the BaTiO; 
crystals. As the crystal is warmed, the absorption edge, 
about 4000 A at room temperature, shifts to slightly 
longer wavelengths.” It has been suggested??? that the 
efficiency of SHG should increase as the second- 
harmonic wavelength approaches that of the absorption 
edge; however, to date, no data supporting this sug- 
gestion have been presented. In fact, data? on SHG in 
CdS show that d does not change significantly when the 
absorption edge is moved thermally through the fre- 
quency of the second harmonic. In any event, this edge 
effect would if anything give an increasing dy with 
temperature, which is opposite to what is observed. 
"Therefore, if the proximity of the edge to the second- 
harmonic frequency is affecting the temperature de- 
pendence of the present data, it is being dominated by a 
still larger effect which results in d;; which decrease with 
increasing temperature. 
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Below 3.3°K, the normal-state specific heat of niobium, measured in a magnetic field of 17 kG, can be 
represented by C,=7.79T-+-0.09473 mJ/mole deg. This corresponds to a value of the Debye parameter at 
0°K, O (0), of 275°K. The specific heat in the superconducting state does not appear to be anomalous, as has 


been reported. 


INTRODUCTION 


HEORY predicts that the lattice specific heat of 
a superconductor should be the same in both the 
normal and superconducting states. The normal-state 
specific heat is a sum of an electronic term yT and the 
lattice term Crin. Since for sufficiently low temperatures 
the superconducting electronic contribution Ces be- 
comes negligible, the total superconducting-state specific 
heat C, is equal to the lattice contribution Cys. So, for 
reduced temperatures above T./T=5 one expects 
C,-Cimn. 

Two independent measurements of indium have 
provided an exception to this rule, however.'? The 
superconducting specific heat C, was found to be 
10-20% less than the normal lattice contribution Cin. 
A similar anomaly was reported by Hirshfeld et al. for 
niobium. Recent conversations with Dr. Boorse have 
encouraged independent measurements and he was 
kind enough to offer one of his samples. The results 
are reported below. 


EXPERIMENT 


The apparatus was the He? cryostat described pre- 
viously. A germanium thermometer, similar in com- 
position to the one described in an earlier publication’ 
was used. Resistance range of the thermometer was 
400 at 4.2°K, 1202 at 1.0°K, and 5009 at 0.4°K. The 
thermometer was calibrated in separate runs at zero 
field and at 17 kG. Magnetoresistance of the thermom- 
eter, Ap/p, varied from minus 2% at 4.2°K to plus 8% 
at 0.4°K. The 1962-He? temperature scale of Sydoriak 
and Roberts® for the range 0.4 to 1.2°K was used in 
conjunction with a paramagnetic salt thermometer for 
values below 0.5°K for the zero-field measurements and 
the 1958-He* temperature scale" above 1.0°K. An ex- 


* Work supported by U. S. Army Research Office. 
1 C. A. Bryant and P. H. Keesom, Phys. Rev. 123, 491 (1961). 
2H. R. O'Neal, Ph.D. thesis, University of California, UCRL- 
10426, 1963 (unpublished.) 
3 A. T. Hirshfeld, H. A. Leupold, and H. A. Boorse, Phys. Rev. 
127, 1501 (1962). 
4G. M. Seidel and P. H. Keesom, Rev. Sci. Instr. 29, 606 (1958). 
5 B. J. C. van der Hoeven, Jr. and P. H. Keesom, Phys. Rev. 
130, 1318 (1963). 
6S. G. Sydoriak, T. R. Roberts, and R. H. Sherman, in Eighth 
International Congress on Low Temperature Physics, London, 
1962, p. 297 (unpublished). 
7 F. G. Brickwedde, H. van Dijk, M. Durieux, J. R. Clement, 
and J. K. Logan, J. Res. Natl. Bur. Std. 64A, 1 (1960). 
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pansion of 1/T in powers of In up to (In R)? was done 
on an IBM-7090 digital computer, using the method 
of least squares as described by Moody and Rhodes,® 

Total correction for the addenda (thermometer, 
heater, and glyptal) was less than 10% of the total heat 
capacity of the sample in the superconducting state at 
the lowest temperature. As the heat capacity of the 
addenda is known within +3%, this introduces an 
error of at most 0.3% in the specific heat. Use was made 
of the most recent data of Ho et al? for the constantan 
heater wire. 

Normal-state measurements were made with a new 
set of cans incorporating the use of a superconducting 
Nb,Zr wire solenoid with an inner diameter of 2.5 cm. 
The working volume was 1.5 cm in diameter and 3.5 
cm long. A field of 17 kG was applied. In the super- 
conducting state, a Helmholtz coil arrangement was 
employed to cancel the earth's magnetic field. The 
cylindrical sample of 0.365 moles was the same one 
used by Hirshfeld eż al. with a small piece cut from the 
bottom. Dimensions were 1.3 cm in diameter, and 


TABLE I. Specific heat of niobium. (C is in mJ/mole deg.) 


T CK) Ci T (°K) Cn 
0.372 0.038 0.568 4.52 
0.723 0.077 0.647 5.12 
0.799 0.089 0.736 5.76 
0.849 0.097 0.824 6.42 
0.906 0.107 0.946 7.39 
1.016 0.128 1.011 7.94 
1.152 0.156 1.186 9.39 
1.272 0.242 1.281 10.2 
1.361 0.297 1.370 10.9 
1.499 0.406 1.519 122 
1.635 0.562 1.646 13.2 
1.794 0.822 1.771 14.3 
1.988 1.28 2.016 16.5 
2.221 2.14 2.225 18.4 
2.456 3.40 2.451 20.4 
2.676 5.01 2.734 23.2 
2.998 8.18 3.011 25.9 
3.260 11.4 3.311 29.2 
3.562 16.1 3.616 33.0 
3.784 20.0 3.752 34.8 
4.014 25.2 3.995 38.3 
4.304 32.6 4.298 42.8 


cI 


* D. E. Moody and P. Rhodes, Cryogenics 3, 77 (1963). 
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SPECIFIC 


length 3.5 cm. The estimated experimental accuracy 
is 4:295. 


RESULTS 


The results for the superconducting and normal states 
are listed in Table I. They are also plotted in the form 
C/T versus T* in Fig. 1, together with a few points from 
the smoothed curve of the data of Hirshfeld ef al. The 
two measurements are in very good agreement within 
combined experimental error. Up to 3.3°K the normal 
state data may be represented by 


Cr=7.79T+-0.0947? mJ/mole deg. 
The coefficient of the T? term a corresponds to a Debye 
parameter at 0°K, O(0), equal to 275°K. 
In Fig. 2 is plotted an expanded portion of C,/T 
versus 7? up to 7?— 4. If one plots here the values of 
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Fic. 1. C,/T and C,/T versus 72. 


C,/T found by Hirshfeld et al., no difference is observed. 
Two slopes are drawn representing the O(0) value of 
these measurements and the same quantity given by 
Hirshfeld et al. of 238°K. The small positive intercept 
of C,/T (0.022 mJ/mole deg?) will be discussed later. It 
is clear from this graph that the anomaly disappears 
using the new value of 0(0)— 275?K. 

The data for Ces/YTe can be very well represented by 
a-exp(—6bT,/T). Plotting in this manner yields values 
of a=8.31 and b= 1.53, very close to those predicted by 
the BCS theory of a=8.5 and b= 1.44. 


DISCUSSION 


The lattice specific heat of a metal may be expressed 


i C)= 1944(T/O(T)) J/mole deg , 


where the Debye parameter is a function of tempera- 
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Fic. 2. C,/T versus 72. 


ture. As mentioned before, the agreement between the 
values for the specific heat as measured by Hirshfeld 
et al. and these measurements are in excellent agreement 
within experimental error. However, the additional 
points at lower temperatures and with higher magnetic 
field modify the interpretation of the results. The initial 
value of O is now 275°K, in excellent agreement with the 
value derived from velocity of sound measurements by 
Alers and Waldorf of 277°K.® At 3.3°K, the value of 
the Debye parameter gradually decreases to a value 
of 238°K above 4.5°K, indicating the presence of a T3 
or higher term in the specific heat. 

The increase of the normal-state specific heat at the 
lowest temperature may be a latent heat contribution 
due to insufficient quenching of the superconductivity. 
However, the magnitude of this deviation is within our 
experimental accuracy and could be due to inaccuracies 
in the thermometer calibration curve. 

In Fig. 2 one observes that the specific heat points 
C./T gradually become tangent to the slope corre- 
sponding to 0—275^K at 1°K. The points do not fall 
below this line. Thus, the condition that C,— Cj, is 
met and no anomaly is observed. 

The positive intercept of this curve with the C,/T 
axis of 0.022 mJ/mole deg? indicates a small contribu- 
tion of normal electrons. Comparing this to the value 
of y in the normal state, one deduces that 0.3% of the 
niobium was normal. This effect may be attributed to 
strains or impurities. Trapping of the earth's magnetic 
field would make a contribution an order of magnitude 
too small. However, the lattice specific heat should not 
be influenced by this small contribution. 

In conclusion it appears unnecessary to assume that 
the specific heat of superconducting niobium is 
anomalous. 
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The anomalous specific heat found in a polycrystalline sample of niobium and previously reported from this 
laboratory by Hirshfeld, Leupold, and Boorse (HLB) prompted complete remeasurements both in the nor- * 
mal and superconducting phases on a single crystal with lower tantalum content. The thermal constants 
of the single crystal show only small differences from the polycrystal in the same temperature range, the 
single-crystal values being Te=9.20°K and 0 —241*K between about 10? and 3°K. The normal-state single- 
crystal measurements agree with measurements on the HLB polycrystal made both in this Jaboratory and by a 
van der Hoeven and Keesom at Purdue University. The latter measurements were carried to about 0.4°K 
in a field of 17 kG. As a result of their investigation, they report a Debye © of 275°K in the region below 
3*K. With this value of O, the anomaly in the specific heat disappears. As both the single-crystal results and 
the van der Hoeven and Keesom values agree in the mutually measured ranges, O is taken to be 275°K. The 
corresponding value of y is found to be 7.80 mJ/mole deg?. The usual exponential behavior of the electronic 
specific heat C.s/yT-=ae~*?e!? with a=8.21 and b=1.52, is observed over a restricted temperature range. r 
Below /—5 the data exhibit larger values than predicted by the exponential in agreement with a number of 
other superconductors. The value of the energy gap at 0°K was found to be 3.69 kT. Overall comparisons 
are made with the BCS theory and a modification due to Swihart. 


I. INTRODUCTION 


XTENSIVE measurements of the specific heat of 
niobium in the temperature interval from ap- 
proximately 12 to 1.2°K have been reported previously 
from this laboratory.‘ The latest measurements made 
by Hirshfeld, Leupold, and Boorse? (HLB) on a poly- 
crystalline specimen showed that the normal phase, ob- 
tained by the application of a magnetic field of approxi- 
mately 10000 G, had a specific heat which could be 
represented by the usual sum of an electronic and lattice 
term, viz., C,—T--const(T/0)* with y=7.53 mJ/mole 


the larger values as a consequence of the appearance of 
the mixed phase in the material. Accordingly, the value 
of O appropriate to the normal phase over the whole 
interval to absolute zero was taken as constant and 
equal to 238°K. Previous measurements on annealed 
Nb made by Chou, White, and Johnston? gave values 
varying from about 250°K near the transition tempera- 
ture to approximately 320 near 2°K. The precision of 


* This research was supported in part by the National Science 
Foundation, the U. S. Office of Naval Research, and the Linde 
Air Products Company. 

t Permanent address: Barnard Coll Columbia Uni ity, 
NIU Ua nard College, Columbia University, 
92, 2 oa) M. W. Zemansky, and H. A. Boorse, Phys. Rev. 

2 ^ A. DM Ud H. A. Leupold, 1961 (unpublished). 

3 A. T. Hirshfeld, H. A. Leupold, and H. A. B Phys. Rev. 
127, 1501 (1962). " Sila Been 

4R. M. Bozorth, A. J. Williams, and D. D. Davis, Phys. Rev. 
Letters 5, 148 (1960). eae ae 
d B ES Chou, D. White, and H. L. Johnston, Phys. Rev. 109, 788 


the latter values was apparently much less than those 
at the higher temperatures. Leupold and Boorse? using 
an annealed single crystal found a constant value equal 
to 241?K. Further evidence for a constant value of 0 
approximately equal to that found in the previous 
researches reported from this laboratory has been given 
by Morin and Maita.® Using a field of 18 kG, they 
found © constant and equal to 230°K from above the 
transition to 1.5?K. 

The superconducting-phase specific heat C, in the 
HLB investigation was carefully measured in a number 


on a better specimen. Preliminary measurements? on 
a single-crystal rod, ł in. in diameter, obtained from 
Materials Research Corporation, Orangeburg, New 
York, did not show the anomaly. Accordingly, careful 
measurements were made again on both the polycrystal 
and on another piece of the same single-crystal rod 
mentioned above. The remeasurement of the polycrystal 
yielded identical results within the error of measurement 
with the originally published values. In order to im- 
prove the superconducting properties of the second 
piece of the single-crystal rod, it was subjected to @ 
zone-refining process. This operation was effected 
through the kindness of Dr. J. E. Kunzler and E. 
Buehler of the Bell Telephone Laboratories, Murray 
Hill, New Jersey, where the specimen was subjected to 
four passes in an electron-beam melting apparatus 1n 
high vacuum. The specimen, as subsequently analyzed, 
showed that as compared to the HLB polycrystalline 


* F. K. J. Morin and J. P. Maita, Phys. Rev. 129, 1115 (1963): 
? C. A. Bryant and P. H. Keesom, Phys. Rev. 123, 491 (1961). 
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material,’ the tantalum content was lower, being 750 
parts per million as compared with 2000; No, 54 ppm 
as compared with 248; 0, at 34 ppm, Fe at 25 ppm and 
Si at 75 ppm were all slightly higher. Otherwise, the 
analysis was the same. A series of x-ray photographs 
taken at intervals along the length of the specimen 
showed identical crystal structure. 


II. APPARATUS AND PROCEDURE 


The apparatus used to make the measurements on this 
single crystal was essentially the same as that used in 
the HLB investigation. The principal change was the 
replacement of the inner He-4 bath in the cryostat 
with an He-3 reservoir in order to reach lower tempera- 
tures. In carrying out the investigation, additional 
precautions were taken to insure the reliability of the 
temperature measurements. 

As in previous investigations, temperatures were 
determined by the use of resistance thermometers. To 
measure temperatures in the He-3 range, Allen-Bradley 
0.1-W carbon composition resistors having a room tem- 
perature resistance of approximately 10 2 were used. 
For the He-4 range and up to 20°K, Allen-Bradley 
0.1-W resistors having an approximate room tempera- 
ture resistance of 33 Q were employed. After each run, 
a resistance-vapor pressure calibration of the thermom- 
eter was made so that any errors arising from cycling to 
nitrogen temperatures or higher were avoided. The 
vapor-pressure temperature conversions for He-3 were 
made using the 1962 scale of Roberts, Sydoriak, and 
Sherman,’ for the He-4 region of the 1960 scale? and 
for the hydrogen region the scale of Wooley, Scott, and 
Brickwedde.” 

The resistance-temperature curve for the He-3 region 
joined smoothly to the curve for the He-4 region and in 
the range of overlapping temperatures gave indis- 
tinguishable values. A least-squares fit of the helium 
calibration date was obtained for each run with the aid 
of an IBM 1620 computer using the relation 


B 
Tz, 
InR+K/InR—A 


The temperatures calculated by means of this formula 
agreed with those determined experimentally from 
vapor pressure measurements to within 2 or 3 
millidegrees. 

For each run in which heat capacity data were taken 
above the helium temperature region, the thermometer 
was also calibrated against the vapor pressure of liquid 
hydrogen in the range between its normal boiling nome 
(20.49 K) and triple point (13.957^K). The Hz and He 


8R. H. Sherman, S. G. Sydoriak, and T. R. Roberts, LAMS— 

2 (unpublished). E 
E CUN Uu H. van Dyke, M. Durieux, J. R. Clement, 
and J. K. Logan, J. Res. Natl. Bur. Std. (U. S-) 64A, 1 (1960). 
10 H. W. Wooley, R. B. Scott, and F. G. Brickwedde, J. Res. Natl. 


Bur. Std. (U. S.) 41, 379 (1958). 
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points together were then fitted with the aid of a 1620 
computer to the expression 


InR — A--BT-1?--CT-, 


The agreement between values of T calculated from 
this formula and those obtained by observed vapor 
pressures was excellent, the root-mean-square error 
amounting to about 6 millidegrees over both the helium 
and hydrogen ranges. This formula was used only in 
the specific heat calculation at the temperatures in the 
gap between the He and H; regions where a 6-millidegree 
error is only of the order of 0.1% of the measured 
temperature. 

In the uncalibrated region between liquid helium and 
liquid hydrogen temperatures, it was essential that some 
checks be provided in order that the accuracy of the 
specific-heat determinations as well as that of the transi- 
tion temperature could be estimated. To make such 
checks, gas thermometer measurements of the transi- 
tion temperature of a highly purified lead wire were 
used. The gas thermometer measurements were made 
at Rutgers University through the kindness of Professor 
B. Serin. Two determinations of the transition tempera- 
ture of the lead wire were made, yielding values of 
7.210+0.005° and 7.209--0.005?K. Throughout the 
heat-capacity measurements, the lead wire was therm- 
ally bonded to the niobium sample and readings of 
the resistance thermometer on the sample determined 
when the lead changed from the superconducting to the 
normal phase or vice versa. In three separate com- 
parisons, the reading of the resistance thermometer when 
placed in the InR versus T formula above yielded the 
following temperatures: 7.19, 7.21, and 7.21°K. It was 
therefore concluded that the formula based on calibra- 
tions in the He-4 and hydrogen temperature ranged 
was not in error at 7.21°K by more than a few tenths 
percent. The adoption of 7.21°K as the lead transition 


Taste I. Smoothed values of the molar specific heat of niobium. 


T T? Gm CGT T/T 

CK) CK) (mJ/mole deg?) (mJ/mole deg?) 

0.0 0.00 0.000 7.80 * 
0.5 0.25 0.023 7.82 18.39 
1.0 1.00 0.101 7.89 9.19 
1.5 2.25 0.266 8.00 6.13 
2.0 4.00 0.647 8.17 4.60 
2:5 6.25 1.450 8.38 3.68 
3.0 9.00 2.683 8.64 3.06 
3.5 12.25 4,374 9.09 2.63 
4.0 16.00 6.294 9.61 2.30 
4.5 20.25 8.289 10.20 2.04 
5.0 25.00 10.42 10.87 1.84 
5.5 30.25 12.78 11.60 1.67 
6.0 36.00 15.30 12.40 1.53 
6.5 42.25 17.93 13.27 1.41 
7.0 49.00 20.66 14.22 1.31 
7.5 56.25 23.54 15.23 1.23 
8.0 64.00 26.48 16.31 1.15 
8.5 72.25 29.49 17.46 1.08 
9.0 81.00 32.53 18.68 1.02 
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temperature is 0.02°K higher than the latest determi- 
f nation" but more in line with the average of earlier 
$ measurements.” 


IH. RESULTS 


The superconducting specific heat measured in zero 
field and the normal specific heat measured in a field 
of 10000 G are shown in Fig. 1 and smoothed values 
for the data in the two phases are given in Table I. 
‘The normal data from the transition down to about 
3°K can be calculated from the usual Debye-Sommer- 
feld relation with O—241*K and y=7.38 mJ/mole deg? 
obtained from the extrapolation of the 0— 241?K line 

— to 0°K in substantial agreement with the values previ- 
_ ously reported for the polycrystal. In the present data, 
wever, starting at 7? values between 10 and 9, the 
experimental points at decreasing temperatures rise 
radually above the @=241°K normal line and fall on 

: line whose © value was found to be 275°K with 
corresponding value of equal to 7.80 mJ/mole deg?. 
certainty regarding the proper interpretation of 


P. Franck and D. L. Martin, Can. J. Phys. 39, 1320 
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Fic. 1. C/T versus 
T? plot of the normal 
and superconducting 
specific heats of a 
single crystal of Nb. 
Full lines represent 
the mean values of 
the data. 


Data Runs in 
Zero Field 


Data Runs with 
H=10 000 Gauss 


60 70 80 90 100 


this line suggested that the normal specific heat be 
determined in fields higher than the 10000-G field 
available. Professor P. H. Keesom of Purdue University 
kindly agreed to measure the specific heat of the poly- 
crystalline sample in a field of 17 kG in order to check 
the published data. The results of this investigation 
conducted by van der Hoeven and Keesom (VK) and 
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_ Fis. 2. C/T versus T? plot showing the experimental determina- 
tions of the normal specific heat below T2=20. The data of van 
der Hoeven and Keesom is compared with polycrystal measure- 
ments in this laboratory, and the present data on a single crystal 
(full line) found in this investigation. The change in slope of the 
normal line just below 7?— 10 should be noted. 
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Fic. 3. C/T versus T? plot of the superconducting specific heat 
below T?=5. The lines for Debye © values of 241° and 275° are 
shown for comparison. 


published in this journal show clearly that the 9 — 275?K 
line is characteristic of the normal phase of niobium at 
the lowest temperatures and is not the result of the ap- 
pearance of the superconducting phase: 

In Fig. 2, the VK measurements are compared with 
those made in this laboratory on the same sample as 
well as with the smooth curve of the single-crystal 
measurements. All three sets of measurements agree 
within 1%. The measurements of VK show the same 
rise at 3° and follow the line 09—275 to about 0.7°K 
before undergoing an abrupt upward departure in the 
17 000-G field. As can be seen from Fig. 2, the meas- 
urements made in this laboratory on the sample in a 
10 000-G field rise sharply at about 1.8°K. This suggests 
that the gradual departure between 7?— 10 and 7?—9 
of the normal data from the line 9 —241?K to the line 
@=275°K reflects an inherent property of the crystal 
structure and not a consequence of an insufficient mag- 
netic field as had been originally supposed. On this basis, 
the previously reported specific-heat anomaly disap- 
pears and the slope of the normal line corresponding to 
a Debye O of 275°K agrees with the limiting slope of 
the superconducting data close to the origin (see Fig. 
3). It is interesting also that this value for O now agrees 
with the 277°K value found by Alers and Waldorf 
from ultrasonic measurements at 4.2°K. — 

The transition temperature of the f single-crystal 
sample was taken to be 9.195°K, the midpoint of the 
superconducting specific-heat discontinuity which ex- 
tended from 9.07 to 9.32°K. A further confirmation for 
the view that the phonon spectrum of the metal changes 
in the neighborhood of 3°K is found in the experiment- 


uG, A. Alers and D. L. Waldorf, Phys. Rev. 
(1961). 
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ally determined entropy difference between the normal 
and superconducting phases. A plot of this difference 
versus temperature was found to vanish at 9.19,?K. 
This value is in excellent agreement with the transition 
temperature taken as defined above, since the difference 
between 9.193? and 9.195? produces a disagreement in 
entropy between normal and superconducting phases of 
only 0.1%, less than the experimental error. If it is 
assumed that the change in O is an effect arising from 
imperfectly quenched superconductivity, the entropy 
difference vanishes at 9.13?K with the result that at 
9.195°K the superconducting entropy exceeds the 
normal by about 3%. Thus an increase in Debye © 
seems to be the correct explanation for the change in 
slope of the normal line. 

For a thermodynamically reversible superconductor, 
the critical field Hr at temperature T may be calculated 
from the relation 


Te 
J (S,—S,)dT = — (V Hr?/87). (1) 
T 


The entropy difference (5,—.5,) for the single crystal 
was found by plotting a large scale graph of 
(C,/T—C,/T) against the temperature and then meas- 
uring the appropriate areas with a planimeter. Using 
the values for the entropy difference thus obtained, 
Eq. (1) was integrated graphically and Hr could then 
be calculated after choosing V=10.8 cc/mole. The 
resulting temperature dependence of H7 is shown on 
the graph in Fig. 4 and numerical values of Hr are given 
in Table IT. The critical field Hy at 0°K was found to 


— Coicrimetric volves 
Stromberg & Swenson 


Hy (gauss) 


S—N traesitisn 
N-e S transition 


T (deg K] 


Fic. 4. Critical field versus temperature for the single-crvstal 
specimen of Nb deduced from the specific-heat measurements, 
compared with critical-&eld measurements taken from the data 
of Stromberg and Swenson for transitions from normal to super- 
conducting (open circles) and superconducting to normal (dots). 
The agreement of the latter data with the critical field deduced 
from these specific-heat measurements is obvious. 


15J. W. Edwards, R. Speiser, and H. L. Johnston, J. Appl. 
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Fic. 5. Deviation of the critical field of the single-crystal Nb 
specimen determined from the specific-heat measurements. from 
the parabolic dependence (full line). Dashed curve is the deviation 
according to the BCS theory. Open circles and dots are for com- 
parison with Swihart's calculations. 


be 1994 G. This is in excellent agreement with the field 
deduced from calorimetric measurements by McConville 
and Serin!* who found the value 1990 G. The value 
deduced from direct magnetic measurements by Strom- 
berg and Swenson!” was 1960+40 G. The critical-field 
values of the latter investigation were obtained by 
taking the average of values measured when going from 
the superconducting to the normal phase and when go- 
ing in the reverse direction. It is interesting to note that 


TABLE II. Critical fields and deviation from parabolic law. 


LE St=T/T. Hr h=Hr/H R h— (1—6) 
0.0 0.0000 1994 1.0000 0.0000 0.0000 
0.5 0.0544 1988 0.9972 0.0030 +0.0002 
1.0 0.1088 1971 0.9986 0.0118 +0.0004 
15 0.1632 1942 0.9770 0.0266 +0.0006 
2.0 0.2176 1901 0.9532 0.0473 +-0.0005 
2.5 0.2720 1847 0.9262 0.0740 +0.0001 
3.0 0.3263 1780 0.8926 0.1065 —0.0009 
3.5 0.3807 1700 0.8527 0.1450 —0.0023 
40 0.4351 1608 0.8067 0.1893 — 0.0040 
45 0.4895 1501 0.7546 — 0.2396 — 0.0058 
5.0 — 0.5439 1389 0.6066 0.2958 — 0.0076 
5.5 0.5983 1262 0.6329 — 0.3579 — 0.0092 
6.0 — 0.6527 1124 0.5685 ^ 0.4260 — 0.0105 
65 — 0.7071 974.3 0.4887 — 0.4999 —0.0114 
7.0 — 0.7615 814.8 0.4086 0.5798 —0.0116 
7.5 0,8158 644.6 0.3233 ^ 0.6656 —0.0111 
8.0 0.8702 465.1 0.2333 — 0.7573 — 0.0095 
8.5 0.9426 215.6 0.1382 — 0.8549 — 0.0069 
9.0 — 0.9790 78.19 — 0,0392 0,9585 — 0.0023 

e 


10 T. McConville and B. Serin, International Conference on the 
Science of Superconductivity, Colgate University, August 1963 
(unpublished). 

17T. F. Stromberg and C. A. Swenson, Phys. Rey. Letters 9, 
370 (1962). 
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their superconducting to normal values of the Critica] 
field agree with those determined calorimetrically in 
the present work to within one-half of one percent. 
(See Fig. 4.) 

If the deviations of the reduced critical fields from a 
parabolic relation are plotted against the square of the 
reduced temperature, the curve is unusual because it 
has both positive and negative values. The only other 
superconductor in which comparable behavior has been 
observed is indium. The niobium curve is also atypica] 
in that its maximum negative deviation is only about 
one-third that of most other superconductors and of 
that required by the BCS theory. 

Following Goodman,” the value of the energy gap 
2e at 0?K can be estimated from the relation: 


2e/ ET, (S)m (HV /3ryT 2). 


Using the single-crystal values of y, Zo, Te found in this 
investigation, 2ey/kT,— 3.69, compared with the value 
3.52 required by the BCS theory. 

The value obtained from ultrasonic measurements by 
Dobbs and Perez? depends on the propagation direction 
in the crystal; being 3.77 for the [100] direction, 3.74 for 
the [111], and 3.65 for the [110]. Levy, Kagiwada, and 
Rudnick,” using transverse waves, found 3.7 and 3.5, 
depending on the method of analysis. The value found 
from the tunneling experiments of Giaever? was 3.6, 
from Sherril and Edwards? 3.59, and from Townsend 
and Sutton? 3.84-E0.06. A value of 3.80.2 has been 
found by Mendelssohn? from thermal conductivity 
measurements. The present value lies in the range of 
these determinations. 

The BCS theory predicts an electronic specific heat 
at the transition Ces equal to 2.43 T. The value found 
for C../yT, in these measurements is 2.87. 

Tf the electronic specific heat C,, is calculated in the 
usual way, viz: 


Ca= C Crn FYD; 


using y=7.80 mJ/mole deg? the plot of nCes/YTe 
versus T./T gives a straight line for values 
5.5>T./T>1.7, indicative of the relation 


Cal yT.— ag-tTelT. 


This is shown in Fig. 6. The data obtained from this 
plot yields a—8.21 and 5— 1.52 in fair agreement with 
the predicted BCS values of 8.5 and 1.44. At values of 


18 B. B. Goodman, Compt. Rend. 246, 3031 (1958). 

2 E. R. Dobbs and J. M. Perez, International Conference on the 
Science: of Superconductivity, Colgate University, August 1963 
(unpublished). 

? Moises Levy, Reynold Kagiwada, and Isidore Rudnick, 
Phys. Rev. 132, 2039 (1963). 

^ I. Giaever, Eighth International Conference on Low Tempera- 
Gane (Butterworths Scientific Publications Ltd., London, 


Gan D. Sherril and H. H. Edwards, Phys. Rev. Letters 6, 460 
^5 P. Townsend and J. Sutton, Phys. Rev. 128, 591 (1962). 
“K. Mendelssohn, IBM J. Res. Develop. 6, 27 (1962). 
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Fic. 6. The super- 
conducting electronic 
specific heat Ces of 
a single-crystal speci- 
men of Nb plotted 


on lnio(Ces/y Te) ver- one 
sus T./T axes. Full dls 
line is the graph of E 
o 
af 


Cee/yT = ae Fe!T, 
where a@=8,21 and 
6=1.52. The dots -2 
and crossed circles 

show values com- 

puted from Swihart’s 

model. 


Experimental points 
Swihart Model 2 ¢9/kT¢*3.75 
Swihart Model 2 «g/kTc* 3.84 


.3 


Te/T 


T./T>5.5 the data departs from the exponential as 
will be discussed below. 

Although the BCS theory gives fair agreement with 
the a and 5 values, the critical-field deviation curve, the 
value of the energy gap and the specific-heat jump at 
the transition all show significant disagreement. How- 
ever, the BCS calculations were made under the as- 
sumption of weak coupling, i.e., that T./O is small, of 
the order of 0.003 or less, as for elements such as Zn, Al, 
and Ga. When 7./@ is an order of magnitude greater, 
however, as is the case for Nb (0.033 for 0—275?K 
and 0.039 for @=241°K) and In (0.039), the energy 
gap value increases to about 3.7 and the critical-field 
deviation has a minimum only about one-third to one- 
half as low. (See Fig. 5.) Moreover, the deviation actu- 
ally is positive at the lowest temperatures as was 
mentioned above. For elements with still higher values 
of T./0, e.g., Hg (0.056) and Pb (0.075), the deviation 
becomes positive throughout the entire temperature 
range from 0?K to the transition temperature. 

Calculations of these properties of superconductors 
which give closer agreement with the experimental data 
have been supplied by Swihart in a private communica- 
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tion. He assumed that the energy gap e(7)/«(0) was 
the BCS function of reduced temperature and remained 
constant with energy out to a cut off of 13.7e(0) in- 
stead of to infinity as used in the weak-coupling case 
by BCS. The calculations were made for two cases, one 
assuming the ultrasonic energy gap 2«(0)/kT.— 3.75 
and the other the value 3.84 obtained from tunneling. 
These cases correspond to values of 7,/O of 0.039 and 
0.038, respectively, which are much closer to the values 
0.033 and 0.038 for niobium than the weak-coupling 
values to which the BCS procedure is applicable. 

The electronic specific heats calculated by Swihart 
are compared to the experimental values in Fig. 6. The 
values are seen to be relatively intensitive to the as- 
sumed energy gap values and both sets of calculations 
agree well with the experimental data for T/T «5.5. 
In contrast to the specific-heat curve of BCS which is 
applicable only írom inverse reduced temperatures 
greater than about 2.0, the Swihart model agrees well 
with experiment with T,/T values almost to the transi- 
tion temperature. For values of T./T greater than 5 or 
6, both the BCS theory and Swihart's modification 
predict a downward displacement of C,/^T, from the 
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exponential. As can be seen from Fig. 6, however, the 
experimental points show an upward displacement at 
higher inverse reduced temperatures. A similar upward 
departure has been observed in several other super- 
conductors?*?* and is generally believed to be due to 
anisotropy of the energy gap. Although this rise in 
specific heat is greater than the probable experimental 
error, the electronic component, Ces, is very small at 
these temperatures and approaches the order of magni- 
tude of the scatter in the data. For this reason, no 
accurate quantitative comparison with the ultrasonically 
measured anisotropy is practicable. 

Swihart’s calculated critical field deviations are shown 
together with the experimental curve in Fig. 5. The 
experimental curve lies between the two calculated 
curves except at the low temperatures where the calcu- 
lated positive deviations exceed those observed experi- 
mentally. Qualitative agreement is good as the calcu- 
lations show the required change in sign, and as can be 
seen from the graph, the quantitative results are much 
closer to the experimental values than are those of the 
BCS theory. 


235 H, A. Boorse, Phys. Rev. Letters 2, 391 (1959). 

2 J. Bardeen and J. R. Schrieffer, Progress in Low-Temperature 
AM porte Holland Publishing Company, Amsterdam, 1961), 

ol. 3, p. 170. 
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For (C./yT.)re Swihart’s calculations yield 2.89 
for an energy gap of 3.84 and 2.76 for a gap of 3.75. 
Again, agreement of these values with the experimenta] 
result of 2.87 is better than the predicted BCS value of 
2.43. 
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The migration energies and atomic configurations for mono- and di-interstitials and mono- and di- 
vacancies in a iron have been calculated using a classical model. About 530 atoms surrounding the defect 
were treated as individual particles, each with three degrees of freedom, while the remainder of the crystal 
was treated as an elastic continuum with atoms imbedded in it. A two-body central force was devised which 
matched the elastic moduli, was sharply repulsive at close separation, and which went to zero midway be- 
tween the second and third neighboring atoms. Configurations were found by choosing a starting configura- 
tion roughly approximating the situation under consideration and successively adjusting the value of each 
variable occurring in the energy equation so that the magnitude of the generalized force associated with it 
was zero until equilibrium was reached. The energy calculations include changes in bond energy in the dis- 
crete region, energy in the elastic field, and work done against cohesive forces, but neglect changes due to the 
redistribution of electrons. Calculated activation energies for motion of mono- and di-interstitials and 
mono- and di-vacancies were 0.33, 0.18, 0.68, and 0.66 eV, respectively, and binding energies of di-interstitials 
and di-vacancies were 1.08 and 0.20 eV, respectively. The stable interstitial was a “split” configuration in 
which two atoms were symmetrically split in a (110) direction about a vacant normal lattice site, and the 
stable di-interstitial consisted of two parallel split interstitials at nearest-neighbor lattice sites with their 
axes perpendicular to the line joining their centers. In the vacancy configuration an atom was missing from 
a normal lattice site, and the divacancy consisted of two vacancies at second nearest-neighbor lattice sites. 


INTRODUCTION 


ETAILED and extensive lattice calculations for 

copper have been carried out for the dynamics of 
radiation damage events near the threshold energy for 
damage,' and for the atomic configuration! and the 
energy of motion? of point defects. These studies have 
used a similar approach to lattice calculations: The 
forces between atoms within a crystallite of anywhere 
from 50 to 1000 atoms are treated explicitly, and 
boundary conditions are applied to the crystallite to 
simulate the remainder of the lattice. The development 
of this method of calculation may be traced to 
Huntington! and Tewordt,> and a number of less ex- 
tensive calculations using similar models have been 
performed. Copper was chosen for investigation 
primarily because there were sufficient experimental 
data available, because it was felt that an appropriate 
and more reliable potential was available for copper 
than for other metals, and because the face-centered 
cubic structure is amenable to calculation. 

The dynamic calculations! have recently been ex- 
tended to o iron,’ and the present research was under- 
taken in conjunction with that investigation. Iron was 
selected as the metal of greatest interest with a body- 


* Work performed under the auspices of the U. S. Atomic 
Energy Commission. F 1 
1 J. B. Gibson, A. N. Goland, M. Milgram, and G. H. Vineyard, 
Phys.Rev. 120, 1229 (1960). 
PR. A. Johnson and E. Brown, Phys. Rev. 127, 446 (1962). 
3A. Seeger, E. Mann, and R. v. Jan, Phys. Chem. Solids 23, 
639 (1962). ? 
S £ B. bd Phys. Rev. 91, 1092 (1953). 
5L. Tewordt, Phys. Rev. 109, 61 (1958). E 
6 See, for example: L. A. Girifalco and V. G. Weizer, Phys. 
Chem. Solids 12, 260 (1960). K. H. Bennemann, Phys. Rev. 124, 
669 (1961). P. Hoekstra and D. R. Behrendt, Phys. Rev. 128, 560 
1962). 
: 4 e Erginsoy, G. H. Vineyard, and A. Englert, Phys. Rev. 133, 
A595 (1964). 
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centered cubic structure. Pertinent experimental data 
are lacking for all body-centered cubic metals, but the 
threshold energy for radiation damage for « iron is 
known.® 

Since the present calculation is similar to that 
previously used for copper by Johnson and Brown,? 
the rationale for this method will not be repeated here. 
In the present calculation the atoms near the defect 
are treated as classical particles interacting by means of 
a potential which applies between the first and second 
neighboring atoms in the lattice and which is matched 
to the elastic moduli. 

Atomic configurations are found by choosing a 
starting configuration roughly approximating the situ- 
ation under consideration, and then successively ad- 
justing the value of each variable occurring in the 
equation for energy such that the magnitude of the 
generalized force associated with it is zero, and iterating 
this process many times. The energy in the lattice 
above that for a perfect lattice normally converges in 
the above process. Convergence is not ensured, but no 
difficulties have been encountered in this respect. The 
numerical calculations have been performed by using 
an IBM 7094 computer. 

Iron interstitial atoms, di-interstitial pairs, vacancies, 
and divacancies have been studied, and the activation 
energy for motion as well as the atomic mechanisms of 
migration, as predicted by this model, have been deter- 


mined for these defects. By the application of appro- - x 


priate boundary conditions, the associated activation. 

volumes for motion have also been found. Some pre- 

liminary results have been reported previously.” 
Calculations are also being carried out for interstitial 


* P. G. Lucasson and R. M. Walker, Phys. Rev. 127, 485 (1962). 
? R. A. Johnson and A. C. Damask, Acta Met. 12, 443 ( 
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impurities in æ iron and these results will be reported 
elsewhere. 


THEORY 
Model 


In the present calculation each atom within a 
spherical crystallite containing 531 atoms was treated 
as an independent particle. In the perfect lattice an 
atom was located at the center of the crystallite and 
the surrounding 530 atoms comprised 25 shells of sym- 
metrically equivalent atoms. 

The atoms outside the crystallite were treated as 
though they were imbedded in an elastic continuum. 
The displacement field u used in the present calculation 
for the elastic continuum is given by 


u— —Cv (1/7) - C(r/n), (1) 


where C is the so-called “strength” of the displacement 
field. The displacement for a given atom 7 is given by 


u= C[ro// (r9)? ], 


where the subscript means that the term is to be 
evaluated at the perfect lattice position. Thus, C was 
used as the variable determining the displacement of 
all atoms outside the crystallite. Equation (1) gives 
rise to a spherically symmetric displacement field, for 
which the origin was taken asthe center of thecrystallite. 
The choice of the displacement field given by Eq. (1) 
is discussed in detail by Johnson and Brown. This 
particular u is just one of an infinite number of solutions 
of the static, isotropic, elastic equation, but activation 
energies and atomic configurations near defects were 
found to be insensitive to other solutions. The eflect of 
the displacement field given by Eq. (1) on the energies 
and configurations is relatively minor and this term 
could be eliminated from the calculation (i.e., the atoms 
outside the crystallite frozen at their perfect lattice 
positions) without introducing any serious changes in 
the results, but it was retained because it can be used 
to find activation volumes. 
The energy of the crystallite is given by 


E=3 L D o+ DY e FaC4-bC*, (2) 
G i ok 


where o? is the interaction potential between atoms 
t and j, the 7 summation is over all atoms within the 
crystallite, the 7 summation is over all atoms within 
the crystallite which interact with the ith atom, and 
the k summation is over all atoms outside the crystallite 
which interact with the ith atom. The term aC accounts 
for the work done against the forces required to hold 
the perfect lattice in equilibrium, and the term bC? 
accounts for the energy stored within the elastic field. 
The energy for a particular configuration is given by 
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E— Eo, where Eo is the perfect lattice energy; 
Ey—$ 22 32 (v2 22 vo. 
$2054: SNA 
The derivation of the coefficients a and 5 will be given 


after the discussion of the potential. 
The force on an atom 7 due to an atom is given by 


Fiu= (8 o U/arii) (r*#/r7) D (3) 


where r?—r/—ri (the vector from 7 to j). The total 
force on an atom 7 is then 


Fi= — (0E/0r) 
=D FUE, (4 
i k 
where the summation indices have the same meaning 


as in Eq. (2). The generalized force on the crystallite 
arising for the elastic variable C is given by 


F«— — (8E/C) 
=—5 X F. (ar'/ac)—a—26C. (5) 


The process for finding energy minima and saddle 


points was as follows: Initial vector positions of each 


atom within the crystallite were chosen to approximate 
the configuration of interest. Each coordinate of each 
atom within the crystallite was varied in turn until 
the corresponding force component became zero, and 
then the value of C was adjusted so that F° was zero. 
Usually 10 to 20 such iterations were required for the 
energy and the configuration to converge sufficiently. 
The force on a given atom and the generalized elastic 
force are very nearly linear with displacement for small 
displacements. "Thus, it was possible to find the force 
for a given variable at two values of the variable and 
use linear extrapolation to the value where the force is 
zero. 

The volume expansion associated with a lattice con- 
figuration is easily calculated and is found to be linear 
with C, the elastic variable. Let AV’ be the volume 
expansion of a hypothetical sphere around the defect. 


av'= | fuas 


—42C. (6) 


AV' is seen to be independent of the radius of the 
sphere. For a finite lattice, Eshelby!? has shown that 
the boundary condition of zero stress at the surface 
of the lattice gives rise to an additional term in the 
volume expansion (the so-called *image force" cor- 
rection). This correction may be written as 


AV=AV'{1+[4C19/(3C 9-420.) }, O 
J. D. Eshelby, J. Appl. Phys. 25, 255 (1954). 
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where the superscript indicates that the effective 
isotropic elastic constants, as given by Leibfried,! are 
to be used. 

Cif—$(Cu--4C5— 2C 51) 

C19 = i (Cu— Ci-3C4) . 


Applying these to Eq. (7) leads to 


(8) 


4(C11— C13) +12; 
av-av (1 e ey 


155 
where B, the bulk modulus, is given by 1(C1:i-- 2C12). 


Potential 


Experimental elastic-constant data were used as a 
basis for obtaining an interatomic potential for the 
present calculation. The elastic constants of a metal 
may be thought of as arising primarily from two con- 
tributions: long-range electronic interactions and short 
range ionic interactions. The short-range interactions 
predominate for transition metals, but corrections were 
included for the long-range terms. Analytic expressions 
for the electronic contribution to the elastic constants 
of body-centered cubic metals have been derived by 
Fuchs? using the free-electron model. Since the long- 
range contributions are small and since the free-electron 
model is expected to be a rough approximation, the 
calculated values were used only as a guide. The elastic 
moduli of iron? and the long- and short-range contri- 
butions used in the present calculation, in units of 
eV/À?, are listed in Table I. 

The simplest type of potential that can be used in a 
lattice calculation is one which gives rise to central 
forces between pairs of atoms. The ranges of the 
potential must not be so large as to give rise to an 
interaction between distant pairs or the calculation 
becomes too time consuming. For the body-centered 
cubic lattice, it 1s natural to cut off the potential 
between the second and third nearest neighbors, since 
the ratio of distances to the nearest, second nearest, and 
third nearest neighbors is V3: 2: 2v2, respectively. 
Thus, the first and second nearest neighbor distances 
are comparable, while the third nearest neighbors are 
considerably more distant. The body-centered cubic 


TABLE I. Elastic moduli of iron (units of eV/A%). 


C ls [67 2 Ca 
Electronic contribution 0.057 0.162 0.281 
Tonic contribution 060 oe oson 


Experimental values? 


a See Ref. 13. 


u G. Leibfried, Z. Phys. 135, 23 (1953). __ 5 
D E idi. Proc. Roy. Soc. (London) A153, 622 (1936). 
13 J, A. Rayne and B.S. Chandrasekhar, Phys. Rev. 122, 1714 


(1961). 
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structure is not stable with just a nearest-neighbor 
central force interacting between the lattice atoms, so 
the potential cannot be cut off between the nearest 
and second nearest neighbors. Since the potential is 
meant to describe the short-range ionic interactions, 
the inclusion of only nearest and second nearest- 
neighbor interactions is consistent with the division of 
the elastic-constant data into long-range and short- 
range contributions, 

The conditions which a spherically symmetric 
potential ¢(r) extending through second neighbors 
must fulfill to match the short-range elastic moduli are 


(Cui— Ciz); — 


2 /6 3 
(—o'-+3e'+—e1) ; (9a) 


Ara Ti T2 
TP SUL pene 
(Cm (et mer er) j (9b) 
3ra rı 72 


2 2 2 
(B) = Ha ett er-—e«) , (9c) 


3ra Ti T3 


where the primes indicate differentiation with respect 
to r and the subscripts 1 or 2 mean that the term is to 
be evaluated at the nearest or second nearest-neighbor 
distance, respectively. There are three equations with 
four unknowns. Figure Í indicates a family of possible 
curves of e'(r) which fit the conditions in Eq. (9) and 
which are zero midway between the second and third 
neighbor distances. The value and slope of '(r) are 
determined at rı and rz, and the value is determined at 
the cut off distance re while the lines indicate how 
smooth curves might be fitted to these conditions. 
Curve II, which is roughly parabolic, is the “smoothest” 
of the family of curves, and no reason is seen to prefer 
any other to it. Indeed, it would be difficult to ra- 
tionalize the choice of any of the other curves in 
preference to the parabolic one. Equation (9) contains 


[ Sieh T T T T T T T 
0.8 I 


p’ (r), (eV/A) 
o 
Zaj 


| 


-0.87 


Fic. 1. A family of possible curves of the derivative of the 
potential, e'(r), which fit the short-range elastic constant con- 
ditions and which are zero midway between the second and third. 
nearest neighbor distances. 


> 


Dp 
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3.0-3.44 — 1.115035 (r — 3.066403)3-1- 0.466892 — 1.547967 


three conditions and a parabola has three coefficients, 
so there is one parabola which fits the elastic moduli 
exactly. This parabola does not have a zero value 
midway between the second and third nearest-neighbor 
distances and so, another parabola was joined to it 
just past the second neighbor distance with matching 
value and slope, and zero value midway between the 
second and third neighbors. These parabolas represent 
€'(r), the derivative of the potential, and the actual 
potential was found by integration. The constant of 
integration was determined by the condition that the 
value of the potential be zero at the cutoff distance r.. 

Gibson eż al. and Erginsoy et al.” have shown that 
the threshold energy for radiation damage is determined 
primarily by the form of the potential at separations 
considerably less than the nearest-neighbor distance; 
distances which do not enter into the present calcu- 
lation. It was felt, however, that the potential used 
here should be matched to the radiation damage 
potential for iron,’ and thus, a cubic equation was 
joined with matching value and slope to the potential 
derived above at a distance just less than the nearest- 
neighbor distance, and was joined with matching value 
and slope to the dynamically determined potential at a 
distance less than any equilibrium distances expected 
from the calculation. Thus, the potential was a com- 
posite of the three cubic equations given in Table II 
and is shown in Fig. 2. All distances are given in À, the 
iron lattice constant was taken as 2.86 À, and energies 
are given in eV. 

The coefficients a and b occurring in Eq. (2) may 
now be easily evaluated. From elastic theory, the 
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TABLE IT. Iron potential. 
Range (A) Potential (eV) f 
1.9-2.4  —2.195976(r—3.097910)34-2.704060 — 7.436448 
2.4-3.0 — 0.639230 (r— 3.115829)3--0.477871r — 1.581570 


o.l 


ide 25 n 

dr (he 35 
E Fic. 2. The iron-iron interaction potential, e(r), which was 

J used in the present calculation. 
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pressure required to hold the crystallite in the perfect 
lattice configuration is’ 


P—1(Ci—Cau) 
—3[),,— (C«)«s—$(Cu— Ci) ]. (10) 


For the elastic constants given in Table I for iron 
P=0 and thus, a=0. The value of a may also be found i. 
by an explicit bond calculation in which Eq. (5) jg 
evaluated for the perfect lattice with the requirement | 
that F*—0, i.e., that the perfect lattice is in equilibrium, 


Using the iron potential given in Table II, this calcu- x 
lation yields b 

a=0.0134 eV/A%, 
which value was used in the calculations. The dis- | 


crepancy between this value of a and the value derived 
from elastic theory is negligible when compared with 
the magnitude of the elastic moduli. It arises because 
there is not an exact ratio of 4:3 of nearest to second 
nearest-neighbor bonds crossing the boundary between 
the crystallite and the elastic region. 

The coefficient b was calculated from elastic theory“ 
by setting the energy due to a displacement field u 
stored in the elastic field outside the crystallite equal 
to C°. This leads to | 


b= (64z?/5Nr£)L(Cu)s--3(Cu—Cis)ss], (11) 


where N is the number of atoms in the crystallite. The 
pressure term aC plays a somewhat more important 
role than the elastic term bC?, but for iron both terms 
could have been eliminated from the calculations with | 
no serious effect upon the results. | 


RESULTS 
Interstitials | 


"There are six interstitial configurations which must 
be equilibrium configurations because of symmetry, 
but only one of which is stable. These six configurations 
are denoted by the symbols J;, Zə: --Ie, and are de- 
scribed as follows: J; [ (100) split interstitial, Fig. 3(a)], 
two atoms are symmetrically split in a. (100) direction 
about a vacant normal lattice site; Je [(110) split 
interstitial, Fig. 3(b)], two atoms are symmetrically 
split in a (110) direction about a vacant normal lattice 
site; 7s [ (111) split interstitial, or crowdion, Fig. 3(c)], 
two atoms are symmetrically split in a (111) direction 
about a vacant normal lattice site; I, [activated 
crowdion, Fig. 3(d)], an atom is located at the mid- 
point between two normal nearest-neighbor lattice 
sites; Zs [octahedral interstitial, Fig. 3(e) ], an atom is 
located at the midpoint between two normal second 
nearest-neighbor lattice sites; and J, [tetrahedral 
interstitial, Fig. 3(f)], an atom is located at the mid- 
point between two nearest octahedral interstitial sites. 


“H. B. Huntington, in Solid State Physics, edited by F. Seitz 
and D. Turnbull (Academic Press Inc., New York, 1958), Vol. 7- 
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In the three "split" cases the pairs of atoms are ap- 
proximately 1.5 @ apart (ay=half-lattice constant). 

Configuration Is, the (110) split interstitial, is the 
minimum energy configuration and is, therefore, stable, 
in agreement with the results of Erginsoy el al.” while 
Ts, the (111) split interstitial, is just barely a local 
minimum, i.e., metastable. The energy and volume 
expansion for the various interstitial configurations are 
given in Table III. 7; is the saddle-point configuration 
for motion of the interstitial, and is somewhat awkward 
to describe because it does not have much symmetry. 
If the initial configuration is a [110] split interstitial 
centered at (0,0,0), the next equilibrium configuration 
after one jump might be a [101] split interstitial 
centered at (1,1,1) ao. The saddle point for this step is 
at (0.50, 0.69, 0.31) ao. The migration sequence 7s — 
I;— Iz is shown in Fig. 4., and the activation energy 
for this process was found to be 0.33 eV. The calcu- 
lation also gave an activation volume for motion of 0.1 
atomic volume. 


Fic. 3. Six iron interstitial configurations: (a) J; or (100) split 
interstitial; (b) 7» or (110) split interstitials; (c) Zz, (111) split 
interstitial or crowdion; (d) 74 or activated crowdion; (e) Is or 
octahedral interstitial; and (f) J. or tetrahedral interstitial. 


The crowdion configuration 7; is metastable, but by 
less than 0.01 eV, and the crowdion migration sequence 
I3; I4, I; has an activation energy for motion of 
0.04 eV. The activation energy for rotation of a split 
interstitial, holding the center fixed, was found to be 
0.33 eV, the same as the motion energy. 

In order to make a complete study of interstitials it 
was necessary to check for the possible existence of other 
equilibrium configurations. In a manner similar to that 
used for copper? it was found to be possible to define a 
configuration by three coordinates. The iron calculations 
show that no two atoms ever approach each other 
closer than 1.5 ao, and that there is always one atom 
within a radius of 0.75 ao from each normal lattice site. 
Thus, there is always one atom outside a bcc lattice of 
spheres of radius 0.75 ao, which atom 1s defined as the 
interstitial. If the remaining atoms are at equilibrium 
positions consistent with the interstitial position, the 
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Taere III. Interstitial configurations (Q= atomic volume). 


Configuration Energy above I;(eV) AV (Q) 


I 1.29 1.7 
la ove 1.6 
Ts 0.32 1.7 
Is 0.36 1.7 
Is 1.12 1.4 
Is 0.85 1.5 
I; 0.33 1.7 


crystal energy may be considered as a function of just 
the interstitial coordinates. If two atoms are on opposite 
ends of a sphere diameter, each is considered half in 
and half out of the sphere. These are the "split" con- 
figurations, and if one atom of a split pair enters the 
sphere, the other moves away from the sphere and 
becomes the interstitial. The volume available to the 
interstitial that must be studied is greatly reduced by 
symmetry. The volume in which the interstitial co- 
ordinates have no symmetrically equivalent position is 
1/48 atomic volume, and only about half of this volume 
is outside the spheres, so, in effect, only about 0.01 
atomic volumes must be investigated. 

No further equilibrium configurations of interest 
were found. The general pattern of the interstitial 
energy contours was as follows: low energy pockets at 
Is sites; a plateau region containing 75, Z4 and /7; and 
high energies elsewhere. The plateau region is roughly 
an oblate spheroid with its axis along the line joining 
two nearest neighbors. It is centered at an J; site, has 
two I; sites on its surface where the axis pierces the 
surface and has six J; sites on its outer rim. Not only 
are the configurations within this region at about the 
same energy, but there are no steep contours between 
them. 


Di-Interstitials 


There are many possible di-interstitial configurations 
and the calculations showed that many of them are 
metastable. Di-interstitial configurations consist of two 
split-single interstitials in reasonably close proximity 
to each other: No cases were found which resulted in 
more complex configurations. The most stable di- 
interstitial is shown in Fig. 5(a), and has two split 
interstitials parallel to each other at nearest-neighbor 
lattice sites, with their axes perpendicular to the line 


Fic. 4. The iron interstitial migration process. Configures ies 
(a) and (c) are both / configurations, and (b) is Jz, the saddle 
point configuration. 
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Fic. 5. The iron di-interstitial migration process. Configurations 
(a) and (c) are both stable di-interstitial configurations, and (b) 
is the intermediate step between (a) and (c). 


joining their centers. The binding energy for this con- 
figuration relative to two separated interstitials was 
1.08 eV and the associated volume relaxation was 0.3 
atomic volumes. 

The migration of di-interstitials is by a stepping 
process in which the two split interstitials partially 
dissociate. If the initial di-interstitial configuration is 
an interstitial split in the [110] direction at (0,0,0) ao 
and an interstitial split in the [110] direction at 
(1,1,1) ao, the intermediate configuration might be an 
interstitial split in the [110] direction at (0,0,0) ao 
and an interstitial split in the [011] direction at 
(2,0,2) ao, and the final configuration would then be 
an interstitial split in the [011] direction at (1,1,1) ao 
and an interstitial split in the [011] direction at 
(2,0,2) ao. This two step process is shown in Fig. 5, 
and it should be noted that the interstitial moving in 
each step follows a single interstitial migration path. 
The di-interstitial motion energy was 0.18 eV and the 
activation volume for motion was 0.3 atomic volumes. 

The separation distance at which two split inter- 
stitials are bound as a di-interstitial did not have a 
well defined cutoff, but was calculated to be about 
4.4 ao. This is an average since the value depends upon 
the direction between the interstitials as well as their 
individual orientations. This range corresponds to a 
region of roughly 88 atomic volumes in which split 
interstitials have positive binding energy. 


Vacancies and Di-Vacancies 


The vacancy problem was straightforward compared 
to the interstitial calculations. The stable vacancy was 
the configuration in which an atom was missing from 
a normal lattice site, and the migration process con- 
sisted of a nearest-neighbor atom to the vacancy 
jumping from its normal lattice site to the vacancy 
site, thus filling in the vacancy and leaving a new 
vacancy behind. This process may also be thought of 
as the vacancy migrating by jumping to a nearest- 
neighboring lattice site. The migration energy was 
found to be 0.68 eV and the activation volume for 
motion was negligible. 

The potential barrier for vacancy motion had a slight 
depression at the midpoint, i.e., the symmetric con- 
figuration at the midpoint of the process was not the 
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saddle point but a local minimum. A single migration 
jump, therefore, had two saddle points as is shown jn 
Fig. 6, where the energy of the configuration is plotted 
as a function of the position of the migrating atom, 
The existence of this metastable configuration should 
have little physical significance, however, since it js 
metastable by only about 0.04 eV. 

A di-vacancy consists of two single vacancies in close 
proximity to each other. The most stable di-vacancy 
was that in which two vacancies were at second nearest- 
neighboring lattice sites, and the binding energy was 
0.20 eV. The binding volume, or volume decrease of 
the lattice upon formation of a di-vacancy, was 0.1 
atomic volumes. Vacancies at nearest neighboring sites 
were bound by 0.13 eV and vacancies at fourth nearest- 
neighboring sites were bound by 0.05 eV. No other pairs 
had an appreciable binding energy. 

Di-vacancy migration was by a stepping process, in 
each step of which one of the vacancies of a di-vacancy 
pair moved as a single vacancy. T'wo possible migration 
processes were important; where the configuration 
changes from second nearest neighbor to nearest 
neighbor and back to second nearest neighbor, and from 
second to third and back to second nearest neighbor. 
The energy barrier for half of each of these migration 
paths is shown in Fig. 7, where the solid line is the 
energy barrier for migration via the third nearest- 
neighbor configuration and the dashed line is the energy 
barrier for migration via the nearest-neighbor con- 
figuration. The motion energies are 0.66 and 0.78 eV, 
respectively, and the activation volumes are negligible. 
These curves are seen to bear a strong resemblance to 
the curve in Fig. 6, so that a divacancy step may be 
thought of as a perturbed single vacancy step. 

Motion by the third nearest neighbor mechanism 
always leaves the orientation of the divacancy un- 
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Fic. 6. The vacancy migration energy barrier. The energy of 
the configuration is shown as a function of the position of the 
jumping atom, as the vacancy migrates from (0,0,0) at the left 
to (1,1,1) at the right. The curve does not extend to (1,1,1) and 
(0,0,0) because the atom relaxes towards the vacancy. 
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L INTERSTITIALS AND VACANCIES IN a Fe A1335 
altered, while the orientation may be changed by a Taste IV. Summary of results (energy in eV, 
nearest neighbor process, The nearest-neighbor con- volume in atomic volumes). 
figuration also may act as a trap, since 0.71 eV ate ^ ^ 9 
required for the nearest neighbor to second nearest- Motion Motion Binding ^ Binding 
neighbor divacancy transition. — pli. e EL beu. Nes 

p : A summary of these results is given in Table IV. eo oa 5 
Interstitial motion is not by the same mechanism as Di-interstitial 0,18 0.3 1.08 0.3 
interstitial reorientation, although the energy and Vacancy — 0.68 0.0 
volume are the same. The corresponding di-interstitial RUM Gos vd 0.20 o.t 

9 processes are the same. Vacancies have the symmetry Di-interstitial 0.18 0.3 | Reorientation 
of the lattice and therefore do not have any reorien- Divacancy 0.78 oo | 
tation process associated with them. = M 

DISCUSSION migration, and assuming that the pre-exponential 
ToC Results factor associated with the migration processes of the 
iron and copper interstitials are roughly the same, then 
Very little experimental data are available per- the iron interstitial migration energy is 0.30 eV. The 
taining to the results reported in the present paper, calculated value was 0.33 eV, in very good agreement. 
but some comparisons may be made for interstitial In an internal fraction experiment by Wagenblast 
migration. Lucasson and Walker? have obtained and Damask," which searched for a relaxation asso- 
isochronal resistivity annealing curves for iron and  ciated with the interstitial in irradiated iron, the 
copper after irradiation with electrons at energies irradiation was performed at 140*K and the relaxation 
slightly above threshold. They also found that the was Jooked for at higher temperatures, with negative 
stage I iron and copper annealing curves can be super- results. Using the interpretation that interstitials 
posed if the iron temperature scale is reduced by a anneal in stage I, the work of Lucasson and Walker 
factor of 2.5.5 Corbett, Smith, and Walker!? have just discussed shows that the iron interstitial anneals 
found an energy value of 0.12 eV for annealing of stages out of iron in ten minutes at 120°K. Thus, this inter- 
Ia and I, in copper and have ascribed this annealing to pretation is consistent with the Wagenblast and 

interstitial migration. Assuming that the corresponding Damask findings. 
annealing stages in iron are also due to interstitial Another interpretation of annealing in copper is that 
crowdions migrate in stages 74 and J, and interstitials 
E migrate in stage III. The present calculations of 
crowdion stability and motion energy do not support 
ese J this interpretation for iron. The Wagenblast and 
Damask experiment also does not support this inter- 
pretation since they found no indication of an oriented 
9S defect over a wide temperature range (140° to 385^K) 

E in iron. 

» a No direct experimental evidence is available for the 

| z motion energy of vacancies in iron. The activation 

, á energy for self-diffusion in æ iron is 2.6 eV, which is 

j the sum of formation and motion energies. Theoretical 

> 2:2 estimates by Brooks? and by Mehl, Swanson, and 

Pound?! indicate that the vacancy formation energy is 

| considerably larger than the vacancy motion energy 
Cd eee MeL UE T L4 for bcc crystals, so that the motion energy of 0.68 eV 
CONFIGURATION reported in the present paper is not unreasonable com- 


Fic. 7. The divacancy migration energy barrier. The energy pared to the self-diffusion energy. However, Damask 


of the configuration is plotted as a function of a configuration 
coordinate (similar to the coordinate of the jumping atom) as 


the divacancy transforms from a second nearest neighbor con- "7 H. Wagenblast and A. C. Damask, Acta Met. 10, 333 (1962). 
figuration on the left to a third nearest neighbor configuration 13 C. J. Meechan, A. Sosin, and J. A. Brinkman, Phys. Rev. 


along the solid line and a first nearest-neighbor configuration 120, 411 (1960). A. Seeger, in Proceedings of the Symposium on 
along the dashed line. Both curves have a mirror image repeat Radiation Damage in Solids (International Atomic Energy Agency, 
to the right of the drawn curves by which the migration process Vienna, 1962), Vol. 1, p. 101. 
is completed. E tise dus I. D. Bakalar, and M. Cohen, J. Metals 4, 
52). 

2 H. Brooks, in Impurities and Imperfections (American Society * 
for Metals, Cleveland, 1955), p. 1. 

2 R, F. Mehl, M. Swanson, and G. M. Pound, Acta Met. 9, 
256 (1961). 


15 P, G. Lucasson and R. M. Walker, Phys. Rev. 127, 1130 


Oe Pw. Corbett, R. B. Smith, and R. M. Walker, Phys. Rev. 


114, 1460 (1959). 
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et al? in an extensive study of carbon in a iron did 
not find a vacancy annealing stage at temperatures 
which would correspond to a motion energy of 0.68 eV, 
and the model which they proposed to explain their 
results assumes that vacancies move at a temperature 
corresponding to a motion energy of greater than about 
1.0 eV. 

The present results indicate that none of the defects 
studied are amenable to internal friction studies. 
Vacancies have the symmetry of the lattice and are, 
therefore, eliminated from consideration. The energy 
to reorient di-vacancies is greater than their motion 
energy, and so they will anneal before being able to 
reorient. Both interstitials and di-interstitials require 
the same energy to migrate as to reorient (migration 
and reorientation are the same process for di-inter- 
stitials), and thus, internal friction experiments would 
be very difficult to perform as the concentration of 
defects would be decreasing at a temperature where 
internal friction would be large. This does not take into 
account the possibility that the relaxation strength 
associated with these defects might be sufficiently 
small so that internal friction could not be measured 
at experimentally obtainable concentrations. A detailed 
calculation of this effect was made for copper? for an 
oriented interstitial with the result that even if the 
defect were present in heavy concentrations, it would 
be very difficult to detect by internal friction. 


Model 


"The results of the present calculations are considered 
to accurately represent the model: Increasing the size 
of the region in which the atoms are allowed full 
freedom to relax, using greater precision within the 
calculation, using a different relaxation scheme, etc., 


? H. Wagenblast and A. C. Damask, Phys. Chem. Solids 23, 
22 (1962). F. E. Fujita and A. C. Damask, Acta Met. 12, 331 
ene R. A. Arndt and A. C. Damask, ‘Acta Met. 12, 341 
1964). H. hose F. E. Fujita, and JN C Damask, "Acta. 
Met d D 347 (1964 

D s and R. A. Johnson, Acta Met. 10, 281 
(1962). 


would not sensibly affect the results. "The important 
question is whether or not the model satisfactorily 
represents the real crystal so that the results are 
meaningful. 

This model completely neglects any contribution 
from the so-called electron redistribution energy, ie., 
it does not account for the drastic alteration of the 
electron wave functions near the defect. The author 
knows of no method currently available to obtain any 
sensible estimale of how this term enters into the 
differences in energy between various configurations, 
It is felt, however, that in all probability the calculated 
vacancy migration energy would be increased. It is 
also felt that the configurations are determined pri- 
marily by the close repulsions, so that the electron 
redistribution would have very little affect upon the 
configurations. 

The energy in this model is not a sensitive function 
of the volume, and thus, the volume changes for the 
various configurations should only be considered as a 
rough approximation. 

The choice of a potential is a critical part of any 
calculation, and it is felt that the potential used in the 
present calculation is as good an approximation as may 
be made. It matches the experimental elastic moduli 
and is fitted to a number of other not unreasonable 
criteria. A major objection might be that it is cut off 
after the second nearest neighbor. Aside from the fact 
that increasing the range of the potential would greatly 
increase the complexity of the calculation, it is desirable 
to test the simplest reasonable model first and only 
incorporate more complex features where necessary. 
The assumption in using a short range potential is that 
longer range effects average out to give rise to the 
volume-dependent binding. 


ACKNOWLEDGMENTS 


The author wishes to thank Dr. C. Erginsoy, Dr. 
A. C. Damask, Dr. G. J. Dienes, and Dr. G. H. Vineyard 


for many discussions and suggestions during the course 
of this work. 


CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


- 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


PHYSICAL REVIEW 


D VOLUME 134, NUMBER 5A 1 JUNE 1964 
Electronic Spectra of Crystalline Germanium and Silicon*t 
Davip Bevsrt 
Argonne National Laboratory, Argonne, Ilinois 
(Received 9 December 1963) 
le A detailed calculation of the energy bands of germanium and silicon has been performed by use of the 
pseudopotential method. The first three potential coefficients have been determined empirically, and all 
higher ones set equal to zero. This potential was used to compute the energy eigenvalues at ~50 (00 points 
throughout the Brillouin zone. By use of this sample, we calculated the imaginary part of the dielectric 
R constant in the optical and near ultraviolet where direct transitions between the valence and low-lying 
conduction bands dominate the response. Photoelectric yield curves were obtained for comparison with re- 
cent experiments. In all cases agreement of theory and experiment was reasonable. Energy contours were 
constructed in several of the principal symmetry planes. These were used to explain the structure in the 
optical properties of Ge and Si in terms of transitions near certain important critical points, Effective 
masses and the static dielectric constant were also computed. 
I. INTRODUCTION values (or rather the differences between them) is 
ECENTLY, very precise reflectivity measurements Pecessary to undertake the present computations. 
have been taken on! Ge and? Si in the optical and For our purposes the energy levels can be put into two 
near ultraviolet region of the electromagnetic spectrum. Cêtegories: levels which posed small changes in 
When the reflectance, |7(w)|2, is plotted as a function of the crystal potential and those which are not. The 
w, the resulting curves show detailed structure. The former have, in the past, been computed ab inilio with 
energies involved (1.5 eV</iw<10 eV) are such that 2” uncertainty of order 3 eV, while the latter were ob- 
direct interband transitions between states lying near to tained to within ^j eV. The principal feature eae 
the forbidden gap are expected to dominate the die- and edges) of the optical curves are separated by less 
lectric properties of these materials, than 1 eV. It is necessary, therefore, to know the levels 
Before attempting an explanation of the observed with a considerably greater accuracy than the current 
o er L . H . 
data, it is necessary to know the behavior of the 4 Priori calculations permit. Fortunately, a good deal of 
valence and low-lying conduction bands throughout the YEY fine expeumental Info MUS beers 
Brillouin zone. Prior theoretical work on the energy when combined with the results of band theory, allows 
bands ok Ge and o Si hastheamconnnerd to calul- a sharp definition of both the sensitive and insensitive 
i ‘ ) ; L 2 an NR E 
tions at points and along lines of particularly high levels. Ehrenreich, Philipp, and Loue Have. 
symmetry. This is inadequate for our purposes; we find good EVE of most of the maY EneTey EALA di e 
it necessary to extend the energy-band calculations soas SYTmetry ponts: Table hai dM E 
to sample all of the zone. work 
The previous energy-band work will, however, be m E men i m 
k i 1 n 7, 2 ABLE I. some oi the major energy gaps at the symmetry points 
used as a starting point. First of all, we shall employ a T= (2z/a) (0.0,0), L- Qx/1) QM). and X= (2x/a)(1,0,0). The 
simplified version of the orthogonalized plane-wave labeling of states is after Bouckaert, Smoluchowski, and Wigner.* 
> method (OPW method) utilized in Refs. 3-9. Secondly The experimental reference list is by no means exhaustive. Fur- 
" fee l . <? thermore, energy-band calculations, not included in the table, 
| - we shall make use of the eigenvalues at symmetry points — played an important role in making the identifications. 
d in order to deduce the numerical values of the pseudo- pe T 
| potential matrix elements used as computational pa- Energy Energy 
a rameters in this simplified approach (see Sec. III of this (eV) (eV) Refs. Refs. 
paper). A reasonably precise knowledge of these eigen- Gap es $i oe Be 
Sse Jan 2.1 37 1, 13, 16 17, 18, 19 
* Based on work performed under the auspices of the U. S. 2.3 
Atomic Energy Commission. E 3 EE 
i + Submitted in partial fulfillment of the requirements for the Ly Ls 2 py 1 2 
| degree of Ph.D. in Physics, University of Chicago, Chicago, x 
Illinois. im Lese ~ 7 
| 1 Present address: Department of Physics, Purdue University, memi E cee HUP E 
Lafayette, Indiana. z Ta >T 3.1 3.5 19, 22 2,21 
| VEL R. Philipp and E. A. Taft, Phys. Rev. 113, 1002 (1959). utn , 
| 2H. R. Philipp and E. A. Taft, Phys. Rev. 120, 37 (1960). X> X 45 45 1 2 


3 F. Herman and J. Callaway, Phys. Rev. 89, 518 (1952). 
| ^F, Herman, Physica 20, 801 (1954). 

| 5 F. Herman, Phys. Rev. 93, 1214 (1954). 

| €T. Woodruff, Phys. Rev. 103, 1159 (1956). 

j * F, Bassani, Phys. Rev. 108, 263 (1957). 

] 3 F, Bassani, Nuovo Cimento 13, 244 (1959). 

I 

r 


* L. Kleinman and J. C. Phillips, Phys. Rev. 118, 1152 (1960). 
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aL, P. Bouckaert, R. Smoluchowski, and E. Wigner, Phys. Rev. 50, 58 


(1936). 


10 F, Bassani and M. Yoshimine, Phys. Rev. 130, 20 (1963). 


u H. Ehrenreich, H. R. Philipp, and J. C. Phillips, Phys. Rev. 


Letters 8, 59 (1962). 
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It should be emphasized at this point that Ehrenreich 
el al. have deduced some of the energy gaps of Table I 
with the aid of several natural assumptions. They as- 
sume, for example, that the main features of the optical 
curves can be associated with direct transitions between 
levels at symmetry points. One of the primary purposes 
of the present work is to attempt a justification of such 
assumptions and working hypotheses. In order to illus- 
trate further, Ehrenreich ef al! make use of certain 
characteristic properties of the reflectivity, as discussed 
by Phillips,” in order to make their assignments in a 
definite way. For example, the edge near 2.2 eV in Ge 
has been resolved into a doublet? (see Table I). This 
corresponds to the expected 0.2 eV spin-orbit splitting 
of the Ly level“ provided the optical edge is taken as 
arising from transitions near Ly — Lı. This assignment 
was considered as reasonable since both band theory and 
g-factor informationP/!5 indicate an Ly — Lı splitting 
72 eV. 

With the Ly — Lı gap determined in Ge it was 
possible to deduce it in Si.? This was done by ex- 
trapolating the indirect gap (les — Lı) in Ge-rich 
Ge-Si alloys'!5 to its value in pure Si, and also by 
extrapolating the Ly — L optical edge!’ to give a value 
for the Ly — Lı splitting in Si. 

The T25 — T» splitting was found in Ge by observing 
the threshold for direct optical transitions.!^*? Its value 
could be deduced in Si by the extrapolation of alloy 
data!’ and also (see Ref. 12) by cyclotron resonance 
measurements in strained p-type Si.2 The last-men- 
tioned experiments as well as optical reflectivity were 
used to find a value for T2» > T's in Si. Again cyclotron 
resonance? and extrapolation techniques!® were used 
to get Pæ >T in GeJ^? Finally, X,— X, and 
Ly — L; were determined by comparing band-theory 
results (these two gaps are both insensitive to the 
potential) with reflectivity data. 

To sum up, the energy levels of Table I provide a 
model on the basis of which we can begin our detailed 
calculations. Hopefully we will be able to explain the 
frequency dependence of the optical properties. This 
will enable us to establish the validity of Table I and 
provide us with a much broader understanding of the 
band structure than has heretofore been possible. 
Furthermore, the present work in the course of checking 
the level assignments of Ehrenreich ef al." and Phillips? 
will confirm or redefine their diagnostic techniques. 


12 J. C. Phillips, Phys. Rev. 125, 1931 (1962). 
18 J. Tauc and A. Abraham, Phys. Chem. Solids 20, 190 (1961). 
ML. Liu, Phys. Rev. 126, 1317 (1962). 
a L. Roth and B. Lax, Phys. Rev. Letters 3, 217 (1959). 
2 (095 5 fa z K. Wilson, and E. A. Gere, Phys. Rev. Letters 3, 
eos (o ord A. R. Moore, and F. Herman, Phys. Rev. 109, 
- R. Johnson and S. M. Christian, Phys. Rev. 95, 560 (1954). 
1 J. Tauc and A. Abraham, Phys. Rev. Letters 5, 253 (1900). ) 
#2 S. Zwerdling and B. Lax, Phys. Rev. 106, 51 (1956). 
21 J. C. Hensel and G. Feher, Phys. Rev. Letters 5, 307 (1960). 
(1955 Dresselhaus, A. F. Kip, and C. Kittel, Phys. Rev. 98, 368. 
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Spin-orbit effects will not be accounted for in this 
paper. They are observable in Ge, as mentioned, but are 
<10% of the band separations. These effects, if desireq 
can be treated as a perturbation, and their influence On 
the line shape determined. 


II. OPTICAL PROPERTIES 
A. General Discussion 


If €2(w) is used to denote the imaginary part of the 
complex dielectric constant, we have from Eq. (9) of 
Ref. 23 


— Am 2 
€:(w) = x | 
Imaw? n.s J p,z, (27)? 


Xô (wn, (kK) —w) | M n,a (K) |k. (1) 


The subscripts » and s refer to filled and unfilled bands, 
respectively, and wn,(k)= CE.(k) — E,(k) )/h. | Mn,s(k)|? 
=|(Uxjn] V |Ux,.)|?, where Ux,» and Uy, represent the 
periodic parts of Bloch functions. Expression (1) neg- 
lects lifetime broadening effects such as those resulting 
from phonon and impurity scattering. If we ignore for a 
moment the matrix element in the integral (and the 
factor 41°e7//3m*w*) then the contribution from a given 
pair of bands to ¢2(w) is simply 


2 
Jn s(w)= | T nelle) 7o dk. (2) 
B.Z 


T 


The quantity Jn,s(w) is the joint density of states for the 
two bands indexed by n and s. Jn,s(w)Aw is equal to the 
number of pairs of states in bands n and s with 
h(w—Aw/2)< (E,(k) —E.(k)) «Ah (o4-Ac/2). Later in 
our discussion of momentum matrix elements we shall 
see that |M,,,,(k)|? can, to a good approximation, be 
treated as a constant. This implies that a knowledge of 
the relevant J,,,(v) is all that one requires to under- 
stand the properties of e»(c). 

In the Introduction it was stated that the reflectivity 
data were used as an aid in establishing the model term 
scheme of Table I. Actually the reflectance is a function 
of both &(w) and ex»). However, by examining the 
figures of Refs. 1 and 2 and comparing them with e2(w) 
for Ge and Si presented in Sec. VIII of this paper one 
sees that R(w)~e2(w). All of the principal features in 
|7(w)|? are reproduced at the same energies in €2(w). 


B. Theory of Optical Structure 


At this point it is clear that to understand e2(w) (we 
shall direct our attention to the imaginary part of the 
dielectric constant hereafter), we must carefully ex- 
amine the functions J,,.(w). Expression (2) can be 


5 M. H. Cohen, Phil. Mag. 3, 762 (1958). 
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transformed to 


2 
Uo) Í 
(29) J, 


where the integral extends over the surface defined by 
@n,«(K)=w and ds is an infinitesimal element of area on 
that surface. The points ky where |V pwn, a(k) | y..,— 0 
are significant and are called critical points (c.p.). 
Van Hove” wrote the following expansion, valid about a 
normal c.p., 


ds/|wses(k)|, (3) 


n, (Kj =o 


3 
On,s(k)=we+ 3, ase Ak, (4) 


a=! 


where e,—-E1, Ak=k—kp. Here w.=wn,,(ko), and a 
coordinate transformation which preserves d'k has been 
made so as to get the functional dependence expressed 
by (4). He found that the critical points produce 
analytic singularities in what we call J,,.(w). The 
mathematical behavior of J,,,(w) near the “Van Hove" 
singularities is described by the following list: 


Mo £17 €3— €5—1, 
when a<w, 


(e 
Jf CI | (5) 
lc+4 (w—w,)!?+O(w—w,) when w>we, 


Mı €1— e= — e3— 1, 
C—A(w.—w)'?-+O(w-—w) when w<we 
1.4) | (6) 
C+0(w—w.) when w>we, 
M» €1— e= e=], 


when w<w, ( 
(7 
C—A(@—w,)!?+0(w—w,) when w>we, 


C--O(o—o.) 
Inve -| 


M3 €1— €?— €5— —1, 
CHA (w-—w)!?+0O@—w.) when w<we (8) 


Jn s(w) a | 
C+0lw— we) 


when w>we. 


In each of these expressions C and 4 represent con- 
stants. The symbol Mo designates a minimum in 
@n,s(K), Mz denotes a maximum, and M; and M? refer 
to saddle points. In Figs. 1(a)-1(d) we plot the behavior 
of Jn (œ) near the Van Hove singularities. One can 
already see the way in which a critical point can produce 
edges in the joint density of states and hence in ex(c). In 
Fig. 1(e) we show the possibility of two nearly de- 
generate critical points of the M: and M2 type producing 
a peak-like structure in J n,s(w). Later work by Phillips? 
showed that the periodicity of the energy bands requires 
the existence of a minimal set of critical points. From 
group theoretical arguments he demonstrated that 


51 L, Van Hove, Phys. Rev. 89, 1189 (1953). 
EIE c Phillips, Phys. Rev. 104, 1263 (1956). 
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sity of states near a 
normal critical point 
(a) Mo, (b) Mi, (c) 
Ms, (d) Ma (e) 
a nearly degenerate 
pair M; and Ma. It 
should be noted that 
the slope of the linear 
portion is not deter- 
mined by the nature 
of the critical point. 


critical points occur at symmetry points. In the simplest 
cases (e.g. lattice vibration spectra), almost all the 
critical points may occur at symmetry points. In the 
absence of detailed information about E(k) throughout 
the Brillouin zone, the empirical analysis of the previous 
section was confined to symmetry points. For the com- 
plicated band structures of Ge and Si this simplification 
(which has also been made in order to interpret the 
ultraviolet spectra of the noble metals) is certainly not 
valid. Moreover, the constants in (5)-(8) which indicate 
the strength of a given edge can only be obtained from 
values of E(k) throughout the Brillouin zone. 


C. Band-Structure Approach to Optical Properties 


We now wish to do a band-structure calculation with 
the view of deriving e2(w). A description of the logical 
framework to be employed is the following. First a 
method of computing the bands will be proposed. The 
method should have a small number of disposable 
parameters. In order to determine these parameters, the 
term scheme discussed in Sec. I will be used as a 
starting point. Having set the parameters, we shall then 
proceed to find the bands throughout the Brillouin zone. 
From the band structure e2(w) can be computed. If the 
dielectric function agrees with experiment, we shall 
conclude that both the model of Sec. I and the computed 
band structure are correct. Should there be substantial 
disagreement then a consistent alternate to the starting 


` term scheme will be tried, and a corrected band struc- 


ture found. A schematic diagram of this is shown in 
Fig. 2. 

Since the optical structure is associated with critical 
points, we must find them in the computed bands. Of 
course, we know that they will appear at the symmetry 
points, and they will have energies which correspond to 
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Fic. 2. Logical flow diagram. 


Disagreement 


optical edges. We cannot be sure in advance that they 
will be of the right analytical class or will contribute 
edges of the correct strength. Furthermore, there may 
be critical points away from the principal symmetry 
points, in which case they will have to be located. 


III. PSEUDOPOTENTIAL METHOD 


We mentioned earlier that for certain levels the OPW 
technique is only accurate to ~3 eV.” Furthermore, the 
unwieldy nature of the orthogonalization terms makes 
it unsuitable for extensive calculations. These con- 
siderations imply that the OPW method is unsuited for 
our needs. 

Research workers using the OPW method noticed 
that the effect of orthogonalizing the plane waves to 
core orbitals was to greatly diminish the magnitude of 
the off-diagonal matrix elements in the secular equation. 
This led Phillips and Kleinman?* to write the wave 
functions in the form 


Va, x(r) 9 on, (1) — oe (04, (r) |ósx(r))6,x(r). (9) 


Here 6:,x is the core function constructed from the /th 
core orbital, and ¢n,x is a smooth wave function 
satisfying the wave equation 


(P2/2m+-Ve+V 2)bn,x= En(k)bn,x, (10) 


where V. is the crystal potential. The operator V has 
the character of a repulsive potential. Cohen and 
Heine? show that $4; and Vp are not unique. By a 
variational argument they demonstrated that the latter 
can be chosen so as to minimize 


Ver= Vet Vn, (11) 
in which case Vz has the form 
V ron x= — > 1(81,x] V bn,x)01,%- (12) 


36 Ts Phillips and L. Kleinman, Phys. Rev. 116, 287 (1959). 
ae Hy Cohen and V. Heine, Phys, Rev, 122, 1821 (1961) ; also, 
see Ker. SU. 
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From (11) it is apparent that if the set of core orbitals 
were complete, perfect cancellation of (V.$,,) by 
(Vrobn,x) would result (the bands would then be com. 
pletely free electron). The core orbitals form a sur- 
prisingly good basis set in the core region. This implies 
that the Fourier coefficients of VerrS'= VS! V pS are 
small for large values of G;= (a/27)K; where K; is a 
reciprocal lattice vector.” This suggests introducing a 
pseudopotential with Fourier coefficients V ,6:ey „qS: 
which are zero for |G;| greater than the first two or 
three values. A pseudopotential of this form was first 
introduced by Phillips? He, however, chose V,Gi 
— const for large G;. Bassani and Celli,*! on the other 
hand, took V,°'=0 for G; such that |G;|*>11 which 
materially improved the convergence in agreement with 
the variational arguments of Cohen and Heine. 

In this paper we shall adopt a pseudopotential having 
the form used by Bassani and Celli. For both Ge and Si 
this will be of the form 


V =E V 5% exp Qni/a)G;-r], 
i 


T 
Visi V,(3) cos] (Gja+6) +63) | if |G; 2-3, 


=V,(8) cos GrH GaGa) | if |G;|?=8, 


T 
=V (11) coq (6-6, 6,37] if |G;|*- 11, 


=0 if |G;|/?>11. 

Here G;,1 is the projection of G; on the (1,0,0) direction 
(direction normal to a square face). There are then three 
adjustable parameters V,(3), V4(8), and V,(11). We 
should point out that V. is both / and k dependent. 
That is, it is different depending on whether ¢n,x has s 
or p symmetry, and also varies throughout the Brillouin 
zone. This nonlocal character is a consequence of the 
OPW method. We notice, however, that V, is a local 
(function of r only) potential. This simplifying feature 
cannot be regarded as deleterious. The energy eigen- 
values which result from using V, are generally within 
2 eV of those deriving from the use of Vert. The latter, 
one remembers, gives energies certain to only 3 eV 
furthermore, V, will be adjusted at the outset to agree 
with what we believe to be a large number of symmetry 
term values. We would then expect V, to give eigen- 
values at other points of the zone with an error much 


28 In these units the center of the hexagonal face is located at 
(3,3,3) and the shortest nonzero G; has length — v3. 2 

* We are here distinguishing between Verr which is uniquely 
defined by (7) and (8) and V; which is an approximation to Vett: 
Inserting Verr into (6) gives exactly the same results as the OPW 
method from which it is derived, whereas V, does not. 

9 J. C. Phillips, Phys. Rev. 112, 685 (1958). 

?! F, Bassani and V. Celli, Phys. Chem. Solids 20, 64 (1961). 
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less than the OPW method. The pseudopotential in a 
sense circumvents the task of finding a crystal potential 
replete with a correct exchange interaction, Coulomb 
interaction, etc. It fits, instead, the most important 
terms in the potential to experiment. 

One more point should be made. Bassani and Celli do 
distinguish between s- and p-like states by allowing 
V (0) to differ from V,(0). In this way they can raise 
s-like states with respect to p-like states. We have found 
that this is not necessary to get an adequate fit to the 
experimental interpretation of the levels. 


IV. DIAGONALIZATION PROCEDURE 


We seek to determine the energy eigenvalues of our 
model wave equation 


H ybs x En xbn.x; (13) 


where H,— (—/?/2m)V?4- V p. To do this we must in 
principle solve the infinite secular equation 


|H ,5— Eb; ;| -0. (14) 


Here Hp*=(b.4x;|Hplbxrx,) and $xzx— eK). 
That is, we use a plane-wave representation in which to 
expand the $,,.. The lowest four levels will represent 
valence states and the higher one's conduction states 
(one remembers that the core levels have been elimi- 
nated from the problem by the orthogonalization pro- 
cedure). Only the low-lying conduction levels are of 
interest so that we shall ask for only the first eight 
eigenvalues of H, (four valence and four conduction). 

Assuming that the ¢x+x; have been ordered so that for 
i» j, |k+K:|?> [k+K,|?, we may truncate the secular 
Eq. (14) so as to include contributions from only those 
plane waves for which 7, ; € N (N some integer). We 
expect the low-lying levels to converge to their final 
values as JV is made large. If we take as our convergence 
criterion that |E,,-**— E,,,"-^| «0.1 eV then No 
would have to be ~50. The time required by the IBM 
704 to diagonalize a 50X50 matrix at a large number of 
points was judged as too great to make this approach 
feasible. By augmenting the calculation with perturba- 
tion theory it is possible, however, to reduce greatly the 
computing time. 

A form of perturbation theory, e.g., Lówdin? was 
used instead of the Rayleigh-Schródinger method. We 
seek the eigenvalues of the secular equation 


| U pr "— E0,,5] =0, (15) 


Jet EE oos 


r 
n,m— JT n,m —————— (16) 
U, i tE, E—HgQr* 


with n, m<N and y» N but €T where N and T are 
integers. The indexes » and m refer to plane waves being 
treated exactly; whereas T refers to higher plane waves 
being treated only through perturbation theory. This 


2 p. Löwdin, J. Chem. Phys. 19, 1396 (1951). 
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Fic. 3. The approximate relationship of the quantities entering 
the band calculation. E represents an average energy for the 
valence and conduction bands. Ey is a cutoff energy such that ail 
plane waves with kinetic energy (#°/2m)|k+Ki|?< Ew are 
treated exactly, whereas those having a kinetic energy between Ew 
and Er are accounted for by perturbation theory. 


procedure is especially convenient for handling de- 
generate and quasidegenerate cases as it is not necessary 
to find linear combinations of zero-order degenerate 
states as in the standard method. This is particularly 
advantageous when automatic computing machinery is 
employed. 

Before proceeding it was necessary to eliminate the 
eigenvalue dependence of the matrix elements in (15). 
If this were not done, we should have to solve tor each 
of the eigenvalues separately and to use an iteration 
procedure for each of them. We wished, instead, to 
diagonalize (15), and obtain all eight interesting levels 
at once. 

In handling the matrix elements the following substi- 
tutions were made: off the diagonal )°,(H,*7H,7™)/ 
(E—H 77) — XH "^H, 7)/(E— Hn"), that is, 
E — E. Here É is an average of the eight energy levels 
at each point in the reduced zone. On the diago- 
nal 254 (H pH y" "(E= Hy") am Poo TH 
(H,"^—H,rT, ie, E—H,""-(/2m)|k-- K.]*. 


e, emt n ma )— 


Fic. 4. Shows the convergence properties of the worst cases 
among those tested. G; and G; refer to states at the general point 
of the zone with k= (0.60,0.35,0.10). The energies (in eV) at the 
left are an estimate of the convergence for Ex-7.0 (the value 
actually used). They were gotten by comparing with the results for 
Ex 9.0. 
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We here, in essence, replace E by the kinetic energy of 
the principal plane waves making up the expansion of 
$nx. This allows a closer representation of E than any 
one choice of E permits. It was felt that the more accu- 
rate representation of the diagonal matrix elements 
would improve the convergence. 

The error AU,” in a matrix element resulting from 
these approximations is 


H "YH 17 
AU ime y, — —— ôE 
Y CRUI 


H 


z (17) 
6E=E—E, n¥m, 


6E=H,"™—E, n=m. 


Since E refers to the energy of one of the lowest eight 
eigenvalues, one sees that by increasing V and hence the 
minimum value of Hp”? that AU ,"."» — 0. 

Now suppose that the plane waves $x., x, being treated 
exactly are those obeying the condition (f2/2m)|k+K, |2 
S Ey. Here Ey is a cutoff energy which we shall choose 
below. By this definition N will be somewhat dependent 
on k. In a like manner the plane waves ox+x, entering 
the calculation through perturbation theory satisfy 
7" . Ew« (h?/2m)|k+K,|?< Er. = 
ES. The actual choice of Ey, Er, and E depend on an 
intelligent use of trial and error. The values ultimately 
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(a) 


Fic. 5. Variation of some of the principal 
band gaps with the potential coefficients, In 
eV: A= (X1—X4)—3.0, A2= (L5— Ly) —4,5, 
Asc (Li— Ly)—1.5, A= (Tv —Ta); (a) 
V(3)=—0.21 Ry, V(8)=0.0 Ry; (b) V(3) 
— —0.23 Ry, V (11) 20.06 Ry; (c) V(8)-0.0 
Ry, V (11) 20.06 Ry. 


chosen were 


Ey 1.0(22?12/ma?)--35 eV, 
Er= 19.0(22?/22/ma?)-- 85 eV, 
E-— 2.0 (22282/ma?) 10 eV. 
With this choice IV was ^20 and T—90. In Fig. 3 we 
show these values in comparison with the band energies. 


The question of convergence was examined by ob- 
serving the change in the eigenvalues as Ey was in- 


creased. In Fig. 4 the levels showing the poorest con- : 


vergence among the test cases are plotted as a function 
of Ex. One should note that convergence is good to 
~0.05 eV which is within the limits of the accuracy 
being sought. 


V. PSEUDOPOTENTIAL PARAMETERS 


We now turn to a brief discussion concerning the 
choice of the pseudopotential coefficients. The work of 
Bassani and Celli’! on Ge was found to be a convenient 
starting point. Since the present work makes no dis- 
tinction between s and p symmetry, it was necessary tO 
redefine their potential slightly. In determining the 
Fourier coefficients particular emphasis was placed 00 
getting good agreement with the proposed interpreta- 
tion of the optical experiments. That means we sought 
to make the direct splittings such as Ly — Ls, Ly — Lu 
etc., come as close as possible to the experimen 


we 


€ 
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TABLE II. Energy of principal gaps (in eV). 


: jassani jrust Brust 
s Gap (Ge) (Ge) (Si) 
Pay IY 0.8 0.6 3.8 
Toy > Du 3.4 3.6 3.4 
Ly —> Lı 2.0 1.8 3.1 
Ly — 1 5.4 5.4 5.4 
X. X, 4.4 3.6 4.0 


identification. Since at our first trial we sought to test 
the proposed energy model by attempting to get the 
correct structure in the optical properties, it was natural 
to make the splittings agree as closely as possible with 
the model. 

In order to reduce the labor required to get a good set 
of pseudopotential parameters the graph shown in 
Fig. 5 was prepared. This indicates how the principal 
band gaps vary with changes in the potential coeffi- 
cients. By using this a good fit could be obtained with 
only one or two trials for both Ge and Si. In this way we 
found for Ge: V(3)=—0.23 Ry, V(8)=0.0 Ry, and 
V(11)=0.06 Ry; while for Si: V(3)=—0.21 Ry, V(8) 
=0.04 Ry, and V(11)=0.08 Ry. The term values re- 
sulting from these choices are given in Table II where 
our Ge results are compared with those of Bassani and 
Celli. 

The values of Table II are somewhat different from 
those of Table I. It is not possible to get an exact fit 
with a three-parameter potential. If one required a 
better set of values, additional coefficients would have 
to be introduced into the potential. 

It was of interest to compute the energy bands along 
symmetry directions. These are shown in Fig. 6, and 
should be compared with earlier results obtained by 
Herman on Ge and by Kleinman and Phillips for Si. We 
should point out that the good behavior of our bands 
was used as an indication of both the convergence of our 
computational procedure and of the correctness of the 
pseudopotential approach. 

We are now in a position to go on to a calculation of 
€2(w). Before doing this, however, we shall examine the 
result of computing effective masses according to the 
present scheme. 


VI. EFFECTIVE MASSES 


We wish to determine the effective masses of the 
conduction electrons within the framework of the 
pseudopotential method. Doing this for Ge by the 
formulas of k-p perturbation theory we have for Ge, 


e.g., Liu“ 


2 \(Ly| Py | Ls”)? 
(t). (18) 
m,* m  (Ein-Ery) 


Here P, is a momentum operator in a direction trans- 
verse to the (1,1,1) direction. Using our pseudopotential 


woo doráio ooa 


Ix 


fb) 


Fic. 6. Pseudopotential energy bands along A, A, and X sym- 
metry directions. Some of the principal transitions have been 
marked. 


wave functions the momentum matrix element was 
found to be 0.737? (2z/a)*. The energy denominator is 
1.8 eV. Putting these values into (18) one finds m,* 
=0.12m, compared with the experimental value of 
0.082m,. Considering the approximations, used to get 
m,* the result is reasonably close to experiment. This 
value was checked by directly computing E(k) in a 
transverse direction and using 


me/ (m,*) — [2m.CE(Ak3- k ;) — E(kz) ) (Ak). 


This also gave us 0.12m., which justifies the use of a 
two-band model in the k-p calculation. 

For Si where the minima correspond to A, states, we 
have from Dresselhaus? 


Me GAL Py| As‘)? z 
( aee e. (19) 
m*. 1 (Ea, — Es) 


3G. F. Dresselhaus, Ph.D. thesis, University of California 


(unpublished). 
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(b) 


Fic. 7. Scheme for sampling zone (a) mesh of sampling points in 
two dimensions, (b) section of zone actually sampled. 


In this case we must use a 3-band model taking two As 
states (one valence and one conduction). We now get 
m4*— 0.17m, compared with 0.195, from experiment. If 
only the A; state in the valence band is used, we get 
mi*=0.14m,. By examining E(k)m.*=0.17m. was 
found, showing that a 3-band k-p calculation is ade- 
quate for our purposes. 


VII. SAMPLING PROCEDURES 


With a view toward finding ew) we discuss the 
calculation of the joint density-of-states functions 
Jn.(@) defined by Eq. (2). Since in practice it is possible 
to solve for the eigenvalues of H, at only a finite number 
of points in the reduced zone, we must replace the 
integral in (2) by a finite sum. This was done by defining 
the quantity 


Kns(w;) 2$; 05*(o,, (ki) —o:). 


kı 


(20) 


Here k; is a set of uniformly spaced sampling points 
— lying within the first Brillouin zone. The delta function 
= is defined by 

^ (o,,—c»)- 1 if |wn.—wo| € (Aw/2)=0 otherwise. 
£2 (21) 
— The function Kns (w:) is then defined for a set of values 

- €; such that w441:=0,;+Aw. The meaning of K,,(w;) is 
ear. It measures the number of pairs of states in bands 
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found in the sample. The relationship between Kn, s(w;) a 


and Jns(w) is 

eye @) Aw (2x)? 
Ke (1) ——- 

Ak 2 


Jn(o=a), —— 


where Ak is the elementary volume surrounding the 
sampling points. 

In making a choice for Aw we were guided by the 
consideration that most of the experimental structure jp 
ew) has a half-peak width ~z eV. We consequently 
took Aw=0.1 eV/f as a reasonable value. to 

The reciprocal lattice vectors of the diamond struc- 
ture form a bcc lattice. The most natural choice for the 
sampling vectors k; is a mesh of points forming a bec 
array. This was reduced in size from the host lattice by 
a scaling factor of 36. That is, the k; are determined by 
k,=K,/36 where K, are reciprocal lattice vectors. This — | 
idea is demonstrated for a two-dimensional body | 
centered structure in Fig. 7(a). With this reduction in | 
scale the calculation gives results for 36'=46 656 
~50 000 points in the reduced zone. 

Advantage was taken of the 48-fold symmetry of the 
Brillouin zone of the diamond structure. It is then 
necessary to examine only that 1/48th part of the zone — 
indicated in Fig. 7(b). This region is defined by the | 
conditions 

0€ (k), € (k),& (kdz, (23) 
and k; contained within the first Brillouin zone. The 
contribution to Kn.(w;) from a point k;, in the interior 
of the sampling region is then taken to be 48. The 
number 48 is referred to as the weighting factor W of the 
point k;. 

For points not in the interior of the sampling region 
W is easily found. We simply find the number of regions 
between which the point k; is shared. If we call this 
number C, then W=48/C. For points not on the 
Brillouin zone surface C is found as follows: 


M ooo E 

Nx ? 1 ( 1) 27% ( Dy ( x ) | 
C ~0 
EREMO ' 
p T (24) | 
TE |, a 
i) i 9:5 0,40. |) an 


EE 
= (48) ' 
If k; lies on the first Brillouin zone surface then it i5 
shared with adjoining zones, and the above value of 
is multiplied by 2, 3, or 4 according to whether ki is on 
a face, edge, or corner. For example, for the point^; | 


C=48 and W=1. With this technique H p had only tobe - 
diagonalized at ~1200 points. 


if (k k m 
it k= k= (ede 


(24) 


it is 
X C 
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When the data first came from the computer, it was 
observed that the scatter in the 0.1-eV histograms, 
Knl), was excessive. We found, however, that by 
smoothing Kn, (w;) according to f 


Ko) iLK noin) HK nlo) K ns(wi-1) |; (25) 


» the scatter was materially reduced. This method is dis- 
cussed by Hartree.” When the smoothing procedure is 
combined with the results of critical point analysis as in 

E the next section, excellent results are achieved. In the 

| Appendix we examine the effects of smoothing as well as 
the sources of the statistical noise. 


VUI. DISCUSSION OF LINE SHAPES 


- In discussing the results we shall begin by examining 
the histogram for bands 4 and 5 (the bands are ordered 


Mort 
For 
3x 
` 
' 
2000— i 
{ . 
/ Y 
( . 4 
I / 
= uf 
jg 1000— / ^ 
4 i 
2] 
| Maced E WT S 
| fs 
Li Li » - : 
| nd we 
Olerrttho d f ! 1 
2.0 3.0 4.0 5.0 
E (ev) 
(a) 
| 
| l ef M. 


i ji 
| A TA 
] Bx "d 
|] 2 
l zI 007 / 
Ix f 
/ QS 
Y I 45 35 [37 
i Elev} —— 
j (b) 
f Tic. 8. 4-5 histograms. Volume effects refer to structure which 
| could not be traced to critical points. (a) Ge, (b) Si. 
31 D, R. Hartree, Numerical Analysis (Oxford University Press, 
i London, 1952), pp. 249-251. 
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Fic. 9. (a) Sampling planes (dashed lines) are parallel to energy 
contours (solid lines); (b) resulting density of states showing the 
spurious peaks. 


in the conventional way with n=4 the highest valence 
band, and s=5 the lowest conduction band). From 
energy considerations it can be guessed that this will 
make the most important contribution to elw). Figure 8 
shows A ,5(w) for both Ge and Si. One notices that the 
graphs show a group of periodic peaks (dashed lines) 
beginning at 4.5 eV in Ge and at 5.6 eV in Si. A close 
study of the data disclosed that these arise in an arti- 
ficial way as boundary sampling errors. Near the point 
W one finds that the planes of sampling points lie 
parallel to the surfaces on which £4 a(k) is constant. 
This condition is drawn for a two-dimensional example 
in Fig. 9, where one easily sees how these artificial 
sampling peaks can be formed. Since they were judged 
to be spurious, the peaks have been eliminated from the 
joint density of states (solid lines in Fig. 8). 

The most striking result is the large central peak. We 
remember that in our starting model the large optical 
peak was associated with transitions of the type 
Xı—> X,. To understand how it arises in A4, we inspect 
Fig. 10 where /;.5(k) is plotted in the basal plane 
(TKX plane). By looking at the energy contours near 
the point X in Fig. 10(b), one discovers that there 
exists a critical point of the M, class with a direct 
energy gap of 3.6 eV. Furthermore, on the X axis near 
the point k= (0.6,0.6,0.0) there occurs another critical 
point. This belongs to the M^» class, and has an energy 
gap of 3.8 eV. The large peak in &,,5 of Ge, results then 
from these two nearly degenerate critical points corre- 
sponding to X,— X; and X;— X; transitions. This idea 
has been demonstrated previously in Fig. 1(e). The 
occurrence of a critical point on = is a new result. : 

The analysis of Fig. 10(b) for Si leads to the same 
general conclusions. At the point X there again occurs 
an M, critical point with energy of 4.0 eV. On the Z axis - 
near k= (0.4,0.4,0.0) we find an M» critical point 
time with an energy gap of 4.4 eV. $ 

In Fig. 8 the reader may notice that sharp comers 
have been drawn into the data. The analysis discussed 
earlier indicates that this is the correct behavior near a 
critical point. 

Returning to Fig. 8(a), we see a sharp corner of | 
M, type in K,,; of Ge at 2.0 eV. It is clear that t 
identifiable with the spin-orbit split optical edge 
eV. Figure 11(a) shows the contours of E,5(k) i 
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Fic. 10. E, s(k) cont 
in TKX plane. The elegi 
indicated on graph are al] in 
eV. A few of the principa] 
contours are drawn show. 
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plane defined by PKL. At L there is an Mo critical point 
with a gap of 1.8 eV. Looking along A, we detect another 
critical point near k=(0.17,0.17,0.17). This one is 
classified as an M, with an energy of 2.0 eV. The edge 
results from As — A, transitions, while the threshold is 
produced by Ly — L, transitions. This interpretation is 
again a new one. 

Both A;— A; and Ly— Lı are spin-orbit split by 
approximately the same amount. Although the pre- 
dicted breadth of the 2-eV edge given by the separation 
of the Mo and M, singularities (0.2 eV) is in good 
agreement with experiment, more decisive confirmation 
of the predicted Mo threshold can be obtained in ma- 
terials where the spin-orbit splitting and the M; peak 
are somewhat larger. This is the case for GaAs* and 
still more clearly ZnTe. The Mo thresholds are well 
resolved approximately 0.2 eV below the M; edges. It 
is striking that such fine details appear to carry over not 
only to slightly ionic crystals such as GaAs but also to 
strongly ionic ones such as ZnTe. 

In the 4—5 histogram of Si there is an edge at 3.5 eV. 
This is identified with the peak at 3.5 eV in e(w). 
Figure 11(b) shows that the threshold for this edge is 
once more an Mp critical point at L. The energy of the 
Ly — Ly gap is here 3.1 eV (in Si this appears to be the 


35 D. L. Greenaway, Phys. Rev. Letters 9, 97 (1962). 
15 M. Cardona and D. L. Greenaway, Phys. Rev. 131, 98 (1963). 
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minimum vertical separation between the valence and 
conduction bands). The edge itself is a bit more difficult 
to classify. The energy contours indicate that it comes 
from points near the zone center. There appear to be 
several critical points clustered about T degenerate to 
within 0.04 eV. Our convergence, however, is not re- 
liable for such small energy differences. We prefer to say 
simply that the edge is due to a critical point at 
(Tes — Tis) with an energy gap of 3.5 eV. 

Finally returning to the Ge histogram of Fig. 8(a), we 
notice a shoulder near 3.2 eV. This is presumably caused 
by a point of inflection in E;;(k) near the point 
k= (0.4,0,0). This has the same effect as a weak pair of 
critical points of the Mo and M, category. We mention 
this as a possible explanation of a similar shoulder in 
€2(w) of Ge near the same energy. In our data it is less 
pronounced than in experiment owing to the nearness of 
the large peak which masks the shoulder. 

That exhausts all of the interesting structure in the 
4-5 histograms. We wish to check that our critical-point 
analysis is complete. To do this we use a rule discussed 
by Phillips,?® which says that 


N(M)—-N(M)TN(Mj-N(Msj-0. Q9 
Here N(Mo)=number of Mo critical points; N (M) 


Fic. 11. 2£4,5(k) in TLK 
plane. (a) Ge, (b) Si. 
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Fic. 12. 4-6 histogram. (a) Ge, (b) Si. 


—number of M, critical points; etc. In Ge, V (M) has 
contributions from three sources: T (the absolute mini- 
mum I; — I»), L, and A (the Mo contribution of the 
kink). Putting in the weighting factors (IV) for each of 
the points we have N(Mo)=1+4+6=11. For N(M) 
the contributions are from A, X, and A (the M, saddle 
point arising from the kink). This gives V(M1)— 84-3 
-F6- 17. The contributions to JV (M?) are all from X and 
N (M3) 212. The contributions to M all come from the 
maximum in E;,;(k) which occurs at W and N (M3) — 6. 
Forming the sum implied by (26) we get 11—174-12—6 
=0 as required. We have not applied the rule to Si 
because of the complexities discussed earlier. We can 
nevertheless, be confident that all the important critical 
points have been found since the energy contours in Ge 
and Si are very similar. 

Next we examine the contribution of the 4-6 histo- 
gram. In Fig. 12, K4,6(w) is drawn. The main feature in 
the Ge data [Fig. 12(a)] is the large peak at 5.3 eV. 
From the energy contours of Fig. 13(a), one can see that 
the structure is associated with L. In this case the 
transition being Ly — L, which agrees with the inter- 
pretation of the corresponding optical peak. 


Fic. 13. Exe(k) in TLK 
plane. (a) Ge, (b) Si. 


y ta) 
31 L, Kleinman and J. C. Phillips, Phys. Rev. 118, 1153 (1960). 
38 The effective masses are determined by mom: 
well with experiment, we naturally 


The Si data are understood similarly. The Ly — Ls 
transition is responsible for the peak at 5.3 eV. The 
somewhat large peak at 5.8 eV is apparently a boundary 
sampling error of the type discussed in connection with 
the 4—5 histogram. After matrix elements are accounted 
for, the remaining structure in K4, is too weak to make 
an observable contribution to the optical properties. 
For the same reason the remaining histograms con- 
tribute nothing to e2(w). 

We are now in a position to use these results to con- 
struct ez(w). First we shall give a very brief discussion 
of how the matrix elements were determined. 

In order to develop a reasonable way of handling 
|4,,s(k)|? we computed matrix elements for n=4, 
s=5, and for 1—4, s=6, at a number of points in the 
reduced zone. These were gotten by using the pseudo- 
potential wave functions $4,«. Such a procedure corre- 
sponds to using the smooth part of the OPW wave 
functions which according to Ref. 37 gives accuracy of 
better than 20%.*8 The results are shown in Table III. 
The Si wave functions were used; however, we would 
get almost identical results with the Ge wave functions. 
In obtaining the matrix elements for the symmetry 


K r K 


entum matrix elements. Since our computed values for the effective masses agree 
expect the computed matrix elements also to be correct. 
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points, we did the calculations at neighboring points. 
"This was done so that degeneracies would be lifted, and 
the resulting matrix elements would be those of the 
bands being studied. If this were not done then, for 
example, the wave functions of bands 5 and 6 would mix 
at X.» 

Looking at the values for | M4,;(k) |? in Table III we 
see that it is roughly constant throughout the zone 


TABLE III. Matrix elements in units of (27/a)?. 


k | M, s (K) |? | Ma, «(E) |? 
(0.04,0.04,0.02) 0.90 0.01 
(1.00,0.04,0.04) 1.51 0.00 
(0.72,0.39,0.33) 1.33 0.09 
(0.6,0.6,0.0) 1.41 0.00 
(0.58,0.25,0.14) 0.94 0.22 
(0.50,0.48,0.52) 0.85 0.63 


varying by less than a factor of 2. We could then 
replace the matrix element in Eq. (1) by a k-inde- 
pendent quantity |M4,s|?. To determine this value we 
took the weighted average of the values in Table III. 
Then |M;;|?—1.2(27/a). As Table III points out, 
| M4, «(k)|? is small except near the point L, precisely 
where the 4-6 peak originates. It is a bit difficult to 
settle on a way to handle the matrix element. We have 


zone on an equal footing. 


CC-0. In 


Public Domain. Gurukul 
-S D 


The program did not construct sýmmetrized combinations of plane waves as it was a 


rust, J. C. Phillips, and F. Bassani, Phys. Rev. Letters 9, 94 (1962). 
it, M. L. Cohen, and J. C. Phillips, Phys. Rey. Letters 9, 389 (1962), 


taken |.M.,;|*—0.6(27/a)*. That is we take a constant 
value equal to | M4,;(k— (3,3,3))|*. In Figs. 14 and 15 
we show the contributions to e2(w) of the 4-5 and 4-6 
histograms. In Fig. 16 we take €2(w) = e25 (w) + ex4:8(w), 
that is, as the sum of the contributions from the 4-5 and 
4—6 histograms, and compare with experiment. In previ- 
ous work (see Refs. 40 and 41) e2(w) was also taken as 
= e5 (w)+ es^ 9(w). Figure 6 shows our energy bands 
with the principal transitions marked. 

To check our hypothesis regarding matrix elements it 
was decided to do Si with matrix elements included. 
|M,.,.(k)|? was computed along with the energy eigen- 
values at each point of the zone, and put in Eq. (1). 
This added about 3 h of machine time to a total of about 
15 h. The result for e2(w) shown in Fig. 16(b) includes all 
of the separate histograms. 

As Fig. 16 indicates, our line shapes appear satis- 
factory. The two exceptions are the Ly — Ls peaks in 
Ge and Si, as well as the T25 — Ts peak in Si. The main 
difficulty in the first case centers around the strength. 
We have seen that our treatment probably overesti- 
mates the matrix element so that it is not surprising that 
the computed strength is too large. In Fig. 17 we ex- 
amine the experimental line shape derived by taking the 
shaded portion of ez(w) as shown by the inset. With 
suitable magnification the theoretical and experimental 
line shapes are in good agreement. The T25 — I1; peak 
is not well understood. Since it is so narrow, there is à 
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Fic. 16. Experimental result for e2(w) (solid linc) compared with our computed value (a) Ge €2(w) = ext (5) -- e3*5 (o) (dashed line, 
with edges emphasized to account for critical points), (b) Si €2(w) = ext5(w)+es44(w) (dashed line), ¢2{w) taken as sum of contribution 


from all bands with dipole matrix elements included (dotted line). 


chance that it is a density-of-states effect which we 
did not detect. The reader should seea further discussion 
concerning this point by Phillips.?? 

'The results of this section are summarized in 
Table IV. 


IX. STATIC DIELECTRIC CONSTANT 


From a knowledge of e2(w) it is possible to calculate 
the real part of the dielectric constant €;(w) in the limit 
«€ — 0. From the Kramers-Kronig relation the following 
expression can be obtained? 


2 f? ew) 
a= | do. 
TJO 


(5) 


TABLE IV. Important critical points below 6 eV. In some cases 
they have been given to 0.01 eV with a reliability ~0.03 eV. 


Theo- Experi- 
retical mental 
Type Location Bands Symbol energy energy 
Germanium 
Meo (0,0,0) 4>5 Iw-—Ie 06 08 
Mo  (0.5,0.5,0.5) 455 L»y—lL 1.78 21 
Mi (0.17,0.17,0.17) 45 Aic A; 201 23 
Mi — (0.3,0.0,0.0) 45 A;cAL 3.17 3.0 (?) 
Mi —(1.0,0.0,0.0) 4—55 X >Xı 36 43 
Ms» (0.50.0,0.0) 455 Apo A 321 34 (9) 
M. (0.61,0.61,0.0) | 45 24x20 0 59:844 
Moy  (0.0,0.0,0.0) 46 Ts-Is 3.6 3.2 (?) 


M,  (0.560.560030 46 sec 5.33 5.6 


Ms —(0.50.50.5) AJ 6 Jesi C EA EMT 

Silicon x 

mete 0.0 0.0,0.0) 4—5 T25 > Tis $5 35 

Mo (oes 4-s5-. Decl 315 30 

M,  (1.00.0,0.0) 425. XX i 43 
2 .4,0.4,0.0 4-5 2> $. 3 

Ae Od AG Xy y 5D 


Ma — (0.5,0.5,0.5) 


e ‘ips, Phys. Rev. Letters 10, 329 (1963). —— 
a E H Wc Phys. Soc. (London) B66, 141 (1953). 
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Using our results for e2(w), we can now derive e,(0). It is 
realized, however, that the value calculated in this way 
is directly proportional to the choice of matrix elements. 
In the case of the 4-5 bands we took this as 1.2(27/a)? 
with an assigned error ~25%. For the other pairs of 
bands where the matrix element is considerably smaller - 
we do not have a sufficiently reliable value for this 


purpose. Therefore, only the contribution from es'*(u) 


Fic. 17. Comparison of the theoretical and experimental line 
sbapes for the Ly — L, transition in Ge. The solid line is the 
shaded portion of the experimental curve indicated by the inset. 
The shaded region has been moved 0.5 eV to the left and magnified 
seven times. The dashed line is a similar region under the theo- 
retical curve. 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


> 
pa 
ow 
wn 
D 


PHOTOEMISSIVE YIELD(ELECTRON/PHOTON ABSORBED) 


$0 34 38 42 46 50 54 58 
PHOTON ENERGY (eV) 


Fic. 18. Photoemissive yield: electrons/absorbed photons, as a 
function of photon energy, with vacuum W, eV, above the top of 
the valence band, as computed in present work. 


was evaluated. For Ge this gave e4:5(0)=12.4 and for 
Si e°(0)=7.6, which are to be compared with the 
experimental values of 16.0 and 12.0, respectively. As 
expected &*5(0) « &(0) due to the neglect of the higher 
bands. The 4-5 bands of Ge appear from these values to 
give a better result than those of Si. This, however, 
arises from the fact that the large peak in Ge has been 
erroneously shifted to a lower energy, and thus gives a 
bigger contribution to the integral. These values show 
that the 4-5 transitions, which account for most of the 
structure in €2(w), also contribute more than 50% of the 
value of €,(0), the macroscopic static dielectric constant. 


X. g FACTOR AND SPIN-ORBIT EFFECTS 


According to Roth and Lax,!5 the longitudinal com- 
ponent of the g tensor is 


e Aso mo 
£u M 1) . 

(Er,— Ey) \m: 
Here A; is the spin-orbit splitting of the Ly states. We 
use Lukës and Schmidt’s value for Ge, A,o=0.195 
£0.02 eV, and take the Mo threshold in the optical 
response as the L;— Lı transition, which gives 
(Er,—Ery)—2.1 eV. gu is then found to be =0.96 
(m, 0.082m;). This is compared with the experimental 
value of 0.87. With the old interpretation that the edge 
itself is to be identified with Ly — Lı we would use 
2.35 eV for (Er,—Er,) and get gu=1.07. Hence the 
new interpretation is seen to give better agreement with 

the g-factor data. 

The identification of the edge with A; — A, transitions 
allows us to resolve a problem in the spin-orbit splitting 


* F. Lukés and E. Schmidt, Proceedings of the International 
$ Conference on ihe Physics of Semiconductors, Exeter (Institute of 
__ Physics and the Physical Society, London, 1962), p. 389. 
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of GaAs. In this material Aso is observed to be =0.26 
eV, whereas from theory one would expect it to be 0.29 
eV. The latter value is based on the assumption that the 
Ly — Lı transition is responsible for the edge. In that 
case A.o(L)=3A,0(I’). The paradoxical result is resolveq 
by the conclusion that the transition is associated with 
a point on the A axis where A,o=0.26 eV, that is, falls 
between the extremes at I' and L. 


XI. PHOTOEMISSION RESULTS 


The present work provides an interpretation not only 
of the dielectric properties of Ge and Si, but also of 
recent studies on the photoemissive properties of Si, 

This work may be divided into two parts. Very precise 
studies! of weak photoelectric emission from Si near 
threshold on atomically clean surfaces reveal direct and 
indirect thresholds similar to the infrared absorption 
edges discussed by Hall, Bardeen, and Blatt.‘® The in- 
direct threshold is at 5.15 eV, the direct threshold at 
5.45 eV. The crystal surface is a (111) plane, and the 
thresholds, according to Kane,“ are associated with the 
symmetry line A, where V is directed normal to the (111) 
plane. The direct transition responsible for the direct 
threshold therefore starts from As at an energy 0.3 eV 
below T25. The transition marked D in Fig. 6(b) starts 
from k= (0.12,0.12,0.12) and requires H=5.5 eV. The 
agreement with Kane's theory and Allen and Gobelli's 
experiment is satisfying. It should be emphasized that 
this agreement is a by-product of our calculations for 
the ultraviolet absorption. 

The second part of the photoelectric data concerns the 
broad yield curves obtained when the effective work 
function is reduced by covering the surface with up to 
one monolayer of Cs. In doing the analysis we assume 
that electrons which are excited into the conduction 
bands have an escape probability P,” (k). Then P," (k) 
=0 if E,(k) «W and P,” =P if E,(k)>W. Here E,(k) 
is the energy of the electron state with respect to the 
top of the valence band and W is the vacuum energy 
also measured relative to I'35. With these assumptions 
the photoemissive yield function (number of electrons 
emitted/number of photons absorbed) is given by 


» Í PRP (k)8 (on a (k)—0) | Maa? 


Yw(»)— D 
T ERB (cn, s (k)—w)| Mn, (Kk) |? 


mesS BZ. 


(27) 


in which it is assumed that the electron escapes with 4 
negligible loss of energy. 


*5 G. W. Gobelli and F. G. Allen, Phys. Rev. 127, 141 (1962)... 
(1955 H. Hall, J. Bardeen, and F. J. Blatt, Phys. Rev. 95, 5 
* E. O. Kane, Phys. Rev. 127, 131 (1962). 
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i for tw =3.6+0.1 eV and 5.30.1 eV with W — 1.5 eV, as computed; 
is (b) Spicer's results for P(E), W=1.5 eV. 
12 
dio with the experimental results obtained by Spicer“ and 
k by Gobeli and Allen? as shown in Fig. 19. The value of 
? É Š w is experimentally varied, as we said, by covering the 
p6 Si surface with a fraction of a monolayer of Cs. The 
à over-all agreement appears satisfactory. The principal 
7) features in the W=2.7-eV plot are the peaks at 3.4 and 
2 5.3 eV due to T's — T'15 and Ly — La transitions. The 
1a 6 = = — X,— Xi dip at 4.3 eV arises since E(X1)— E(Tzy) 
| GOES ea thency.e XI =1.1 eV «W. The secondary peaks at 3.8 and 4.8 eV : 
| s (b) 4 W., E. Spicer and R. E. Simon, Phys. Rev. Letters 9, 385 
- R xperimental results for the photoemissive yield (1962). — 5 2 
559 Fic. 19. Pees (b) Gobeli and Allen. 9$ G. W. Gobeli and F. G. Allen (to be published). 
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have been identified as due to transitions near As > Ai 
and points in the volume of the Brillouin zone, re- 
spectively. As W is increased the behavior of the curves 
is in quantitative agreement with the experiments. — 

In Fig. 20(a) we have drawn the energy distribution 
P(E) for the emitted electrons with W= 1.5 eV for the 
cases iw=3.5 and 5.3 eV. Since our sample includes 
considerably fewer points, we take an energy interval of 
0.3 eV. The experimental results of Spicer are drawn for 
comparison in Fig. 20(b). Again our results provide an 
interpretation of the experiment. The peak at 1.2 eV in 
the 7:9—3.5-eV distribution is centered around the 
Tas — T's transition. The peak at 2.4 eV in the 7io— 5.3- 
eV curve arises from Ly — Ls transitions. 


XII. DISCUSSION OF RESULTS 


By starting with a model term scheme for the levels at 
T, X, and L, we have constructed a pseudopotential 
which has given the bands throughout the zone. The re- 
sults we have found for e(w) explain the line shapes 
which are seen in experiment. As an additional feature 
we can explain the results of recent studies on photo- 
emission from Si.9 Although our starting model had 
errors in it, they were subsequently straightened out by 

the band calculations. We also found several new 
critical points. This demonstrates the value of the 
optical data as a guide to theory. It gives us a way to 
start our calculation and a way to check it upon 
completion. 

We have seen that the calculation was limited by 
convergence and sampling noise to errors ~0.05 eV. 
Further improvements of these quantities can be 
achieved by increasing the number of plane waves 
admitted to the expansion, and by taking more points 
into the zone sample. The next step would require the 
use of a more accurate potential. 

By analyzing the optical properties of Ge and Si 
using a very simple model we have succeeded in ex- 
ploring the electronic structure of these materials over 
an energy range of order 10 eV. This range is an order of 
magnitude larger than that studied by infrared experi- 
ments. The precision of our study enables us not only 
to explain all the observed structure but also to predict 
new features, such as Mo thresholds. 

Comparing the results for Ge and Si we find that 
alterations in the crystal potential are small and lead to 
much smaller changes in the over-all band structure 
than might have been supposed from infrared data. 
"This agrees with the ultraviolet experimental results. 
One may say that in Ge and Si the energy surfaces in 

k space are determined primarily by the shape of the 
Brillouin zone, i.e., by symmetry considerations, and by 
V(3), which measures the strength of the covalent bond. 


wA further application has allowed us to explain some of the 
properties of Ge-Si alloys as well as high-pressure effects on the 
oo [F. Bassani and D. Brust, Phys. Rev. 131, 1524 
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APPENDIX 


In Fig. 21 we plot K4,s(w:). This should be compared 
with K4,s(w:) shown in Fig. 8(a) of the text, which 
shows the effect of smoothing. 

'To do a rigorous statistical analysis of the smoothing 
process would be exceedingly complex and not particu- 
larly rewarding. By a few heuristic arguments we can, 
nevertheless, estimate the random errors entering the 
problem. Consider first a one-dimensional situation. 
Suppose we have in this model a monotonically in- 
creasing E,,(k) curve. In Fig. 22(a) imagine that the 
solid lines divide the energy range into intervals of 0.1 
eV, that is, E;,(ki1) 2 E, (k;)--0.1 eV. Next suppose 
that a uniform net of sampling points is laid over the 
line. The result of this sample is used to approximate the 
true joint density of states. Suppose also that in 
computing E,,(k) we had convergence noise ~0.05 eV. 
There are then two sources of error in the computed 
joint density of states. The first results from the fact 
that the last sampling point in an interval is not 
significant. For example, in Fig. 22(b) shifting the 
boundary at 0.2 eV very slightly to the left will, put the 
last point in the 0.1-0.2 eV interval into the 0.2-0.3 eV 
interval. The second source of error arises because our 
calculation does not locate the boundary energies ex- 
actly. The idea is sketched in Fig. 22(c). One should 
note that the error in the energy eigenvalues is a dis- 


‘Fic. 21. K4,5(0;), compare with K4 5(w). 
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continuous function of k. This occurs because the num- 
ber of plane waves in our expansion depends on k. 

If we take a sampling mesh of 1000 points and suppose 
that Ens(k)max=5 eV, then our one-dimensional sample 
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contains ~20 points per interval. These numbers 
roughly correspond to those of the actual problem. Then 
the error from boundary corrections is about 1 point per 
interval or 5%. The error arising from convergence 
noise can be ~ 10 points per interval or 50%. The effect 
of the smoothing is clear. We take three times as many 
points while leaving the absolute error constant. Hence 
the dominant error is reduced from about 50% to 
about 20%. 

The situation in three dimensions is not so readily 
analyzed. The same sort of arguments ought to apply, 
however, we now have a two-dimensional surface for the 
boundaries of the energy intervals. The scatter resulting 
from boundary corrections and from convergence noise 
before smoothing is now estimated to be ~20%, and 
after, ~10%. 
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Redetermination of the Temperature Dependence of X Rays 
Anomalously Transmitted through Germanium 


Bonis W. BATTERMAN 
Bell Telephone Laboratories, Murray Hill, New Jersey 
(Received 2 December 1963) 


We have redetermined the temperature dependence of the anomalously transmitted x rays through single 
crystals of germanium of effective thicknesses, uo = 20 and 30. We have established, as suggested by Okkerse, 
that the earlier measurements were subject to error due to a thermal gradient perpendicular to the diffracting 
planes. By suitable rearrangement of the heating geometry this type of gradient was substantially removed, 
The current results are in agreement with those of Okkerse and indicate that the data can be explained by 
allowing the imaginary part of the scattering factor to depend on temperature as exp( —M) where M is 
based on a Debye temperature of 291°K. It is felt that there is insufficient evidence at present to conclude in 
general that the temperature dependence in dynamical diffraction is exp( — M). 


[S a previous publication! we reported measurements 
on the temperature dependence of the anomalous 
transmission of x rays in germanium single crystals. 
We obtained a linear dependence of the logarithm of the 
integrated intensity with temperature ranging over an 
order of magnitude in intensity. [ Reference to this work 
will be abbreviated as (I).],The data in (I) were fitted 
to theory assuming that e(— Fz"/Fo"), the normalized 
angular dependence of the imaginary part of the scat- 
tering factor, could be expressed as e= e» exp[ — (aM) ] 
where eo is the static value, M is the exponent in the 
usual Debye-Waller factor e~™ and a is an adjustable 
constant. It was found that the data for the (220) and 
(400) reflections were consistent with a single a= 1.32. 

In a symposium associated with the 50th anniversary 
celebration of the discovery of x-ray diffraction, 
Okkerse? presented experimental data indicating a 
similar logarithmic dependence but with an o value of 
1.0 based upon a Debye temperature of 291°K. 

To resolve the experimental discrepancy, we repeated 
our past measurements with several variations. In the 
first series of experiments the crystals were mounted in 
the same manner as before (crystal in the shape of the 
letter T, held by the vertical arm) as in (I), Fig. 4, but 
with the crystal extended in the furnace so that both 
the forward diffracted (anomalously transmitted) and 
diffracted beams could be measured. (Previously only 
the diffracted beam was measured.) The experiment was 
repeated with the x-ray beam striking different portions 
of the surface of the crystal. The results of these meas- 


QUARTZ CEMENT 


DIFFRACTEO 
BEAM 


QUARTZ x INCIDENT BEAM 


Fic. 1. Details of crystal mount. The crystal is 
attached on only one arm of the yoke. 


1B. W. Batterman, Phys. Rev. 126, 1461 (1962). 
2B. Okkerse, Philips Res. Rept. 17, 464 (1962). 
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urements were: (1) the slopes of log intensity versus 
temperature of the diffracted beams were constant for 
different points on the crystal surface to within 5%, but 
did not agree in value with the previous measurements 
for a crystal of the same thickness, (2) the curves of the 
forward diffracted beam were not linear but curved 
toward lower intensities at higher temperatures such 
that at sufficiently high temperatures, the forward beam 
had less intensity than the diffracted beam. The amount 
of curvature depended upon where the beam struck the 
crystal surface. 

These results strongly indicated the presence of 
thermal gradients even though the slopes of the dif- 
fracted beam curves (which were measured in our 
previous paper) did not vary with beam position on the 
crystal. 

We subsequently repeated our previously reported 
measurements on the (400) reflection on the same crys- 
tal and experimental setup as in (I). The results re- 
produced those in Fig. 4, Ref. 1, i.e., they showed the 
curvature when the beam was near the supported part 
of the crystal, and a linear plot of the same slope as 
before, when the beam was at the center of the tee- 
shaped crystal. However, when the beam was moved 
still further from the support, the plot remained linear, 
but had a progressively smaller slope the further the 
beam was from the quartz-supporting rod. This defi- 
nitely proved that the results in (I) were in error be- 
cause of a thermal gradient due to heat flow down the 
supporting rod. This substantiates Okkerse's? hypoth- 
esis concerning the results in (I) that a linear de- 
pendence of the log of intensity versus temperature is 
not the correct criterion for a strain gradient-free 
crystal. In this arrangement heat flow down the quartz- 
support rod produced a gradient perpendicular to the 
diffracting planes which strongly affected the inte- 
grated intensities. 

To avoid this difficulty we now mounted the crystal 
as in Fig. 1. Any gradient caused by heat flow down the 
supporting rod would now be parallel to the diffracting 
planes and should have a minimal effect on the inte- 
grated intensity. As before, the germanium (220) and 
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lif- as described in (I). The (400) used in (I) was still 
T. available and made the fourth sample. The crystal o 
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The quantitative measurements were performed in 
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Z L : y g 
y new manner in which the crystals were held (Fig. 1). 
ge Í The data for the two reflections are presented graphi- T aussi 
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"TABLE I. Experimental slopes from Figs. 2 and 3 
and corresponding values of a. 


4 I MER s MEIN 
3 Reflection nul cw Slope : "i 
f Eon 7 —248X 105 0.99 
E 220 20,08 —3.39, X 1073 1.02; 
= 400 20.92 —4.654X 107? 1.02; 
$ 400 30.95 —6.85X 10-8 1.01; 
H 


It can be seen from the table that a single value of 
a= 1.010.015 will fit the measurements for the four 
crystals. This is in contradiction to the results in (1) 
which were subject to error due to thermal gradients in 
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on the tungsten was also recorded. 


SE of cesium as propellant in ion propulsion has 
increased the interest in the interaction of me- 
dium-energy cesium ions with solids. One of the 
interactions in which there has been a basic as well as a 
practical interest is that of the kinetic ejection of elec- 
trons from metals due to the bombardment of cesium 
ions. In the energy range of interest, 1-20 keV, there is 
little published data. Waters,! measured secondary elec- 
tron yields for low-energy (150-1500 eV) cesium and 
lithium ions on tungsten. Arifov e al., and Petrov’ have 
published data for alkali ions up to 10 keV impinging on 
various metals. In the energy range of 1-4 keV Brunnee* 
has measured y; for the alkali ions on molybdenum. 
— No close agreement exists among published values. 
— This paper presents the measured yields of secondary 
- electron emission from clean polycrystalline tungsten by 
normally incident cesium ions of energies of 1-21 keV. 
* This work supported by the U. S. Air Force, Aero Propulsion 
aboratory, WPAFB, Dayton, Ohio. 
3 P, M. Waters, Phys. Rev. 111, 1053 (1958). 

A. Arif Rakhimoy, Transactions of the Ninth 
"Union Conference on Cathode Electronics, Moscow, 1959 
(un) blished), P. 666. 

J. N. Petrov, Fiz. Tverd. Tela 2, 949 (1960) [English transl. : 
et Phys.—Solid State 2, 865 Ste 
C. Brunnee, Z. Physik 147, 161 (1957). 
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the specimens perpendicular to the diffracting planes 
These results now substantially agree with Okkerse' 
and indicate that Debye-Waller factor for germanium 
diffracting in the symmetric Laue case is very close to 
e-™, This, coupled with the same experimental tem. 
perature dependence for symmetric Bragg reflection, 
suggests that in general the Debye-Waller factor for 
perfect crystals is €^". This should not, however, be 
taken as proof of its general validity without further 
experiments and the development of a sound theoretica] 
treatment of thermal vibrations in an absorbing perfect 
crystal. 


5 B. W. Batterman, Phys. Rev. 127, 686 (1962). 
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Kinetic Secondary Electron Ejection from Tungsten by Cesium Ions* 


SaxuxL H. Boscu AND GUNTIS KUSKEVICS 
Electro-Optical Systems, Inc., Pasadena, California 
(Received 23 December 1963) 


Initial measurements of the number of electrons ejected per incident ion, y;, have been made for medium- 
energy (1-21 keV) cesium ions at-normal incidence on a clean polycrystalline tungsten surface. The residual 
gas pressure was 5X 10^? Torr. The tungsten surface was cleaned by prolonged bakeouts and by flashing 
before each measurement. A dc ion-beam pulse method of measurement was used to prevent cesium coverage 
of the tungsten surface. In the energy range of 3-21 keV ys is a linear function of the ion kinetic energy with 
Qy,/0£0.04 electron per ion per keV. At 3 and 21 keV vy; is 0.02 and 0.74, respectively. If the data are 
extrapolated to y;—0, a threshold energy of 2.5 keV is obtained. Below this energy the measured values of 
yi were equal to zero within the accuracy of the measurement. The change of y; with cesium adsorption 


Since the ionization energy of cesium, 3.87 eV, is less 
than the work function of clean tungsten, about 4.5 eV, 
the electron yields presented are due to the kinetic 
ejection process only and not to the Auger or potential- 
ejection® process. The apparatus used is shown in Fig. 1. 
The vacuum chamber is a stainless-steel cylinder, 3 ft 
in length with a diameter of 1 ft. It is pumped by a 
liquid-nitrogen trapped, 6-in. silicon oil diffusion pump. 
The chamber can be baked to 300°C with either an 
oven or strap heaters. Copper gaskets are used for all 
flanges. Within the tank a liquid-nitrogen liner is used 
to freeze out all condensibles. It can be baked out by 
passing hot gas through it. Pressure is measured with à 
Bayard-Alpert type ionization gauge. The system i$ 
capable of pressures lower than 1X 10-? Torr if proper 
care and technique are used. 
_ The cesium ions are produced by a surface-ionization 
lon source using a porous tungsten ionizer operating at 
1000-1200°C.* This source can generate up to 5 mA of 


cesium ion current with a maximum current density at 


the target of approximately 1-2 mA/cm?. The fraction 
of neutral cesium atoms for the currents used in this 


iH. Hagstrum, Phys. Rev. 96, 336 (1954). 
G. Kuskevics and B. Thompson, AIAA J. 2, 284 (1964). 
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ELECTRON EJECTION 
experiment was less than 0.01. This represents neutral 
cesium incidence rates corresponding to monolayer 
adsorption times of greater than five hours. All meas- 
urements were taken with low ion-current densities of 
approximately 10 uA/cm?. The singly charged cesium 
ions, accelerated by the three electrode structures are 
monoenergetic except for their thermal velocity dis- 
tribution. 

The secondary electron emission detector consists of 
the target made of 0.003-in.-thick tungsten filament, 
and a cylinder collector with slits in front and rear. The 
latter allows higher pumping speed around the filament. 
The target can be flashed by direct resistance heating. 
Biased slits are situated in front of the collector. They 
collimate the beam, prevent the escape of secondary 
electrons that are ejected toward the front slit, and 
extract electrons that may be in the ion beam. The 
edges of the slits are chamfered and the width of the 
slits becomes progressingly larger from the first slit to 
the collector. The values of the incident ion and 
secondary electron currents are obtained by measuring 
the voltage drop across precision resistors. These are 
read out on a Westronic two-channel chart recorder. 
The collector potential is provided by a dc voltage 
supply in conjunction with a polarity switch. 

During the bakeout and pump down of the vacuum 
system to the 10? Torr range (approximately 24 h) the 
tungsten filament is heated continuously at about 
1600°K. The procedure used in measuring is a modifica- 
tion of Petrov's dc pulse method.’ With the ion beam 
deflected off of the detector the target is flashed for 15 
sec at approximately 2300?K with the collector at a 
negative potential so that it collects any positive ions 
evaporated from the surface. Three to four seconds 
after flashing when the target has cooled below thermi- 
onic emission temperatures, the collector potential is 
changed to a +45 V and the chart recorder is started. 


To 6" 
Vacuum 


Cs Reservoir 
lonizer 


Beam Deflection 


Fic. 1. Schematic diagram of apparatus. 
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Fic. 2. Typical primary and secondary currents as functions of 
time. 6 keV Cs* on W. 


About 2 sec later the cesium-ion beam is pulsed onto 
the target. The primary ion and secondary electron 
currents are recorded continuously. If desired, the ion 
beam can periodically be deflected off of the target to 
insure the zeros of the ion and electron currents have not 
shifted. 

The instant the beam strikes the target the values of 

I; and I. rise from their zero values to their initial 
values. These incident ion and secondary electron zero- 
value currents are due to insulator leakage caused by a 
cesium film which builds up on them as measurements 
are taken. The ratio of these initial value increments 
was taken to be +; for clean tungsten. There is little 
observed variation of 7; with time but J. immediately 
begins to increase steadily until a saturation value is 
obtained (see Fig. 2). Not only is the initial y; for clean 
tungsten reproducible but so is the final +; for cesiated 
tungsten if enough time is allowed. Since all measure- 
ments were taken with background pressures <5 10-? 
Torr the authors believe that the 5-10 sec delay from 
the end of the flash to the time of the initial measure- 
ment does not allow tíme for ambient gases to ap- 
preciably adsorb on the target. Exact monolayer ad- 
sorption times have been measured for these pressures? 
and they are of the order of 10 min. A quick check 
showed the adsorption time to be greater than 5 min for 
conditions under which these data were taken. This 
gives a value of surface coverage by ambient gases less 
than 397. 

The results of these measurements are shown in Fig. 
3. In agreement with previous measurements a linear 
increase in y; was obtained for an increase in incident 
ion energy, dy;/0E~0.04 electron/ion per keV. But 
contrarily, a higher kinetic ejection threshold energy for 
cesium on tungsten was obtained. This threshold, 2.5 


7 P. Mahadevan, J. Layton, and D. Medved, Phys. Rev. 129, 79 
(1963). 
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Fic. 3. Secondary electron yields as a function of cesium-ion 
energy for clean and cesiated tungsten. 


keV is arrived at by extrapolating the data down to a 
y:=0. Also, y; was zero within the accuracy of the 
measurement below this energy for what the authors 
believe to be clean tungsten. The equipment and 
method of detection used would be capable of detecting 
a y; down to about 0.005. For cesiated tungsten, 
nonzero values of y; were observed for energies down to 
1 keV. 

In Fig. 4 the data presented in this paper are shown in 
comparison with other measurements. Because of the 
measurement of a threshold energy for y; for cesium on 
tungsten and because of the lower values of y; obtained, 
the authors believe the previous measurements of 
Waters and Petrov are in error. This error is probably 
due to cesium or adsorbed gas coverage on the target. In 
the energy range common to both measurements, the 
values of Brunnee agree very well with those presented 
here. There is a difference in the dy;/0Z’s and also in the 

threshold energy. This could be due in part to the differ- 
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Fic. 4. Comparison of measured values of y; as functions of 
cesium-ion energy. 1—Waters, Cs on W; 2—Petrov, Cs on Mo; 
3—Brunnee, Cs on Mo; 4— present data. 


ence in properties of tungsten and molybdenum. The 
threshold energy of 2.5 keV corresponds to a cesium-ion 
velocity of approximately 0.610’ cm/sec. This is in 
excellent agreement with the theoretical value of the 
velocity threshold for kinetic ejection of electrons 
calculated by Parilis and Kishinevskii.? 
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Defects in Irradiated Silicon: Electron Paramagnetic Resonance and 
Electron-Nuclear Double Resonance of the Si-E Center 


G. D. Watkins AND J. W. CORBETT 
General Electric Research Laboratory, Schenectady, New York 
(Received 7 January 1964) 


The Si-E center is one of the dominant defects produced by electron irradiation in phosphorus-doped 
vacuum floating zone silicon. It introduces an acceptor level at ^(/2,—0.4)eV and gives rise to an electron 
paramagnetic resonance when this level does not contain an electron. As a result of electron paramagnetic 

e resonance (EPR) and electron-nuclear double resonance (ENDOR) studies described in this paper, we 
conclude that the defect is a lattice vacancy trapped next to a substitutional phosphorus atom, with EPR 
arising from the neutral charge state. The observed hyperfine interactions with P® and neighboring Si? 
nuclei, as well as the observed g-tensor anisotropy, are discussed in terms of a simple linear combination of 


6 
atomic orbitals (LCAO) molecular orbital treatment. In addition to the anisotropy associated with the 
phosphorus-vacancy direction in the lattice, an additional distortion of the defect occurs which is identified 
MS D^ in the LCAO treatment as a manifestation of the Jahn-Teller effect. Thermally activated reorientation from 
) one Jahn-Teller distortion to another causes motional broadening and narrowing effects upon the EPR 


spectrum in the temperature region 60-150°K. The motion is also studied by stress-induced alignment at 
lower temperatures and the activation energy for this process is determined to be ~0.06 eV. Alignment 
The of the phosphorus-vacancy direction in the lattice is also achieved by stressing at elevated temperatures, 
The activation energy for this motion is 0.93--0.05 eV. The magnitude and sense of the alignment in both 


ion . 1 : : : 
ag kinds of stress experiments are consistent with the microscopic model of the defect. The role of the 
| In phosphorus-vacancy interaction in the diffusion of phosphorus in unirradiated silicon is discussed. Using 
the the published value for the diffusion activation energy for phosphorus in silicon, we estimate the appropriate 
ons value for silicon self-diffusion to be 3.94-£0.33 eV and the formation energy for the lattice vacancy in silicon 
to be 3.6+0.5 eV. These are quantities for which no direct experimental values are available. Also included 
is an appendix which gives estimates of |V3,(0) |? and (rsp~*) for the 3 atoms aluminum through chlorine. 
nee I. INTRODUCTION Il. EXPERIMENTAL PROCEDURE 
ren HIS is the third! in a series giving detailed descrip- The silicon samples? studied were » type (phosphorus 
ta, tions of defects produced by irradiation in silicon. — 2«10/5— 10!5/cm-?) which had been grown in vacuum 
tta Í In this paper, we describe electron paramagnetic reso- by the floating zone technique. The samples were 
jon nance (EPR) studies of the Si-E center? This centerhas approximately 0.1 in.X0.1 in.X0.5 in. and were ir- 


radiated equally on opposite sides at room temperature 
by 1.5 MeV electrons from a resonant transformer 
accelerator. The bombardment current was 22.5 
pA/cm?, the maximum temperature rise in the sample 
being about 25?C. 

The samples were mounted with their long dimension 
along the axis of a TEn microwave cavity and EPR 
was studied using a balanced bolometer spectrometer 
at 20 kMc/sec. Magnetic field modulation and lock-in 


been discussed briefly in previous publications^* where 

it was tentatively identified as a phosphorus-vacancy 
DI | pair. In this paper the arguments leading to this identi- 

fication are presented, and a model of the microscopic 
configuration of the defect as well as its electronic 
structure is presented. Such physical properties as its 
activation energy for diffusion, its binding energy, etc., 
are estimated. 


LOS. afl 


E 


Also discussed is the role that this defect plays in 
the high-temperature diffusion of phosphorus in un- 
irradiated material. From some of the properties of the 
E center deduced here, we can estimate the activation 
energy for self-diffusion in silicon as well as the forma- 
tion energy for the lattice vacancy, quantities for which 
no direct experimental values are available. 


1 The first two are (I) G. D. Watkins and J. W. Corbett, Phys. 
Rev. 121, 1001 (1961) and (II) J. W. Corbett, G. D. Watkins, 
R. S. MacDonald, and R. M. Chrenko, ibid. 121, 1015 (1961). 
Tn the remainder of this article they will be referred to as (I) and 


, respectively. Ye 
E ae remainder of the paper, the prefix “Si-” will be dropped 


for convenience. à 
3G. D. Watkins, J. W. Corbett, and R. M. Walker, J. Appl. 


. 30, 1198 (1959). : 
AG D. dne and J. W. Corbett, Discussions Faraday Soc. 


31, 86 (1961). 


detection at 94 cps were employed. Most measurements 
were made in dispersion. At <20°K, relaxation times 
are long and rapid passage effects? gave absorption-like 
spectra, 180 deg out of phase with respect to the modula- 
tion. Studies at higher temperatures gave the usual 
derivative of dispersion curves. 

For electron-nuclear double resonance (ENDOR)? 
studies, a single turn coil on the end of a coaxial line 
was introduced into the resonance cavity, as shown in 
Fig. 1, the open loop at the bottom of the coil allowing 
it to slip over the previously mounted sample. The coil 
was driven by a General Radio 1211B Unit Oscillator 


5 Procured from Merck Chemical Company. 

* A. M. Portis, Phys. Rev. 100, 1219 (1955), Technical Note 
No. 1, Sarah Mellon Scientific Radiation Laboratory, University 
of Pittsburgh, November, 1955 (unpublished); M. Weger, Bell 
System Tech. J. 39, 1015 (1960). 

7G. Feher, Phys. Rev. 103, 834 (1956). 
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Fyc. 1. Single-turn coil assembly 
introduced into the microwave cavity 
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and supplied the radio-frequency magnetic field neces- 
sary for the nuclear transitions. ENDOR studies were 
made at 4.2°K by observing a strongly saturated elec- 
tronic transition in dispersion under normal conditions 
of magnetic field modulation. The nuclear transition 
frequencies were detected as a transient change in the 
electron resonance signal as the radio-frequency oscilla- 
tor was swept through the nuclear resonance frequency. 

The method for applying uniaxial stress to the crystal 
has been previously described in (I). Briefly, a simple 
lever and weights outside the cryostat supplies a force 
to a stainless-steel rod (inserted into the entry tube in 
place of the ENDOR coaxial line) which in turn trans- 
mits this force directly to the end of the crystal, thus 
compressing it along its long dimension. 

Studies versus temperature were made below 77°K by 
first cooling to 20°K, expelling the liquid Hs coolant, and 
observing as the cavity plus crystal warmed up to the 
77°K of the cryostat heat shield. For higher tempera- 
tures, a heater connected to the wave guide approxi- 
mately 6 in. above the cavity was used. Temperature 
was monitored by a copper-constantan thermocouple 
on the cavity. 


HI. GENERAL RESULTS AND DISCUSSION 
A. Experimental Results 
1. Summary of Previous Work 
Previous studies? have shown the following: 


(a) The E center is the dominant spin-resonance 
center produced in phosphorus-doped vacuum. floating 
zone silicon by 1.5 MeV electron irradiation at room 
- temperature. It does not appear in the initial stages of 
the irradiation but only after the Fermi level has 
receded to = (E,—0.4)eV as a result of the irradiation. 
The Æ center is thus associated with a net acceptor level 
at ~ (E.—0.4)eV* and is observed only when this level 
; not contain an electron. 


We found this level to be at 0,43 eV. Other investigators have 


WATKINS AND J. W. 


.... CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


CORBETT 


(b) Hyperfine interaction with 4.7% abundant Si» 
has shown that the /-center resonance arises from an 
unpaired electron that is highly localized on one silicon 
atom. An additional hyperfine interaction with an 
impurity atom with a 10095 abundant nuclear isotope 
of I —1 is also observed. This was suggested to be due 
to a neighboring phosphorus atom. 

(c) The E center has not been observed in pulled n- 
type silicon (containing oxygen 7 10/5 cm~™). Instead 
in this material the dominant defect observed is the 
Si-A center,! which we have identified as a substitutional 
oxygen atom, presumably formed when a mobile lattice 
vacancy is trapped by an interstitial oxygen impurity. 
This reciprocal behavior between Si-4 and Si-E centers, 
depending upon the concentration of oxygen, led us to 
suggest that the Æ center also results from a trapped 
vacancy, in this case trapped by a phosphorus atom. 


2. EPR Spectrum 


Figure 2 shows a recording of the E-center spectrum. 
There is a strong central group made up of doublets 
and, at higher gain, satellites are observed showing the 
same doublet structure. Figure 3(a) shows the angular 
variation of the central group. The satellites are found 
to be composed of pairs symmetrically located around 
each of the central lines and with intensity (per pair) 
~5% of the corresponding central component. 

The complete spectrum can be described as arising 
from several identical defects, but differently oriented 
in the lattice, each with the spin Hamiltonian 


3- 8H. g- S--»; L-[A;j- S— (u/17H ], (1) 


with S=%. 
The second term describes magnetic hyperfine inter- 
actions with nuclei neighboring the defect. In these 
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Fic. 2. E-center spectrum at two gain settings with H]|(111)- 
The arrows indicate weak shoulders on the satellite lines, whi 
are referred to in the text. 


reported the dominant acceptor level in phosphorus-doped floatin£ 
zone silicon to be at (i) 0.38 eV "S. K. Wertheim and D. E- 
Buchanan, J. Appl. Phys. 30, 1232 (1959) ], (ii) 0.4 eV [H. Saito, 
M. Hirata, and T. Horiuchi, J. Phys. Soc. Japan 10, Suppl. m, 
246 (1963)], and (iii) 0.47 eV [E. Sonder and L. C. Templeton, 
J. Appl. Phys. 34, 3295 (1963) ]. These discrepancies may reflect 
the presence of other defects with energies near to that of the 
E center (see Ref. 4). 
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interactions, there are two distinct nuclear species in- 
volved. One is a nucleus with a 100% abundant isotope 
of J=4. (We will demonstrate in the next section that 
this nucleus is that of a phosphorus atom near the 
defect, hence we will refer to it here as phosphorus.) 
There is always one and only one such phosphorus 
nucleus near the defect, giving rise to the doublet 
structure on all lines shown in Figs. 2 and 3. The other 
nuclear species is Si? (4.7% abundant, 7 — 3) associated 
with neighboring silicon atoms. The satellites of Fig. 2 
arise from strong interaction with the Si? nucleus of 
one nearby silicon atom, their amplitudes relative to 
the central lines reflecting the isotopic abundance of 
Si?. Hyperfine interactions with other silicon atoms in 
the vicinty of the defect are an order of magnitude 
smaller and are only partially resolved, contributing 
mainly to the breadth of the central lines. 

Equation (1) gives for the energy levels 


E— gBHM 4-Y, Eni (Mm), (2) 
i 
with 
g- gne ernst gins? , (3) 


where 7, 115, 71; are the direction cosines of the principal 
axes of the g tensor with respect to H. Ignoring the small 
anisotropy in g, the hyperfine energy terms in (2) are 
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esp Thes 
Fic. 3. (a) Angular dependence of the central lines in 

centen ue with H in the (011) plane (see Fig. 4). The 

microwave frequency was 20.029 kMe/sec. (b) Corresponding 

g values. The labeling scheme is described in the text. 
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Fic. 4. Spin Hamiltonian constants for the E center with axes 
indicated for one of the twelve equivalent defect orientations in 
the lattice. With the magnetic field H in the (011) plane as shown, 
this set of axes gives rise to the g values given by cb in Fig. 3(h). 


given to first order in 4/g8H, u/gBI by 
baM H 
Fur tnr IUS [MA (us/ T) HP ng}. (4) 


Here ja are the direction cosines of the a=1, 2, 3 
principal axes of the A; tensor with respect to H. 
Equation (4) has been written in a form convenient for 
use when |MA;4| > |uj/1;j| H, the case for the hyper- 
fine interactions studied here. In so doing, the selection 
rules for the allowed EPR transitions are the usual 
AM — 4-1, Am;=0,° giving 


88H = hy,—3. [Er Gm) — Eriam]. 5) 
; 


The analysis is summarized in Fig. 4, where the 
principal values of the $ and A tensors are given along 
with their principal axes for one of the possible defect 
orientations.” From Fig. 4, it is seen that there must be 
24 equivalent orientations available for the defect in 
the cubic silicon lattice. That is, for each of the twelve 
possible (110) axes for gz, there are two equivalent 


? If | MA ad < |nj/ I| H, omission of the term M/|M | in Eq. (3) 
will allow the use of the same selection rules. 

10 These values for Ap were determined also by ENDOR meas- 
urements to be described in the next section. They are therefore 
more accurate than the values determined by analysis of the 
doublet splitting in the EPR alone. These results are within the 
stated error of our pene published results (Refs. 3, 4) but 
reveal in addition the tilting of the (4p): axis by ¢=16.5 deg 
away from [011], a fact not previously noted. 
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Fic. 5. Angular dependence of the P ENDOR frequencies with 
Hin the (011) plane. The labels denote the corresponding branches 
of the spectrum in Fig. 3(b) for which these transitions are 
observed. 


choices for gi, one being generated from the other by a 
180-deg rotation around go. In the EPR spectrum, such 
a rotation, or inversion, of coordinates cannot be dis- 
tinguished and the number of distinguishable spectra is 
reduced to twelve. For most of the studies described 
here, the spectra were studied with H in a {110} plane 
(i.e, crystal rotated around the corresponding (110) 
axis). In this case ten of the spectra become equivalent 
by pairs, giving only seven distinct spectra as shown in 
Fig. 3, five of double intensity, and two of unit intensity. 
We label (for later use) the transitions of Fig. 3 as 
referring to a specific defect orientation. This has been 
done as follows: In Fig. 4, the four (111) axes are labeled 
“a,” “b,” “c,” and “d.” (We do not distinguish here 
the sign of the axis direction.) We label a defect by two 
letters. The first letter specifies the (111) axis closest 
to (Ap); and the second specifies the direction of 
(AsQn. 


3. ENDOR Spectrum 


ENDOR studies were initiated primarily in order to 
identify the impurity giving rise to the 100% abundant 
I=% hyperfine splitting.” From Eq. (4), the nuclear 
resonance frequencies (AM —0, Am;— 1) are 


1 
v, (M =e [MA jo—(u;/I;)HPnj2}'?. (6) 


2 


‘The study was performed only on the higher frequency 


11 The sensitivity, although adequate for the study of this 100% 
E abundant nucleus, was not sufficient to study the 4.7% abundant 
i Si? hf interactions. As a result this interesting part of the problem, 

F i.e., probing out the wave function on more distant silicon neigh- 


bors, has yet to be done. 
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transition, the lower frequency transition usually being 
too weak to detect. 

The frequencies of the ENDOR transitions versus 
magnetic field orientation in the (110) plane are shown 
in Fig. 5 for each of the branches of the spectrum shown 
in Fig. 3, and are labeled accordingly. Also indicated 
are the positions of the principal hyperfine axes as 
determined from these measurements. (The splitting 
of the ba-ca, bd-cd, ab-ac, db-dc, and bc-cb pairs arises 
from «1.5 deg misalignment of the (110) axis of the 
crystal around which the orientation studies were made 
and would not exist if the crystal were perfectly 
aligned.) 

The Aja can be determined from the doublet splitting 
in the EPR spectrum, as described in the previous 
section. With these, we may solve (6) for uj 
using the ENDOR frequencies of Fig. 5. The result 
is |u;|=1.13-40.06. Phosphorus (up=+1.1305¥) is the 
only atom in the periodic table with a 100% abundant 
I=% nucleus in this range and the atom involved has 
therefore been proven to be phosphorus. 

Having identified the atom as phosphorus, we may 
now use the more accurate value," u= -- 1.1305, to deter- 
mine the values of Ap along the principal axes and these 
are given in Fig. 4. Also, for greater accuracy, we modify 
Eq. (6) to include terms second order in 4/g8H. This 
gives for the ENDOR transition with H along a 
principal axis (a) of A;, 


(Avi). | Aj M — (u5/ 15) H — (A jg jy 2hv) 


Ag Aj. 
XESG-4- at] 2 7 way — 1). (7) 


12V 


Here v is the EPR frequency, and 4;s and Ajy are 
the other two principal values of A. These second-order 


2H. Kopfermann, Nuclear Moments (Academic Press, Inc., 
New York, 1958), p. 451. 6 

18 In assuming this value for u, we are ignoring possible contribu- 
tions to the magnetic field at the nucleus arising from the orbit 
angular momentum induced into the wave function by the extern 
magnetic field—the equivalent of the “chemical shift” in NM! 
studies. Using arguments similar to those given by G. W. Ludwig 
and H. H. Woodbury [Phys. Rev. 117, 1286 (1960) ], we conclude, 
from the small g shifts encountered here, that the effect is neg- 
ligible with respect to the experimental errors indicated in Fig. + 


: Fic. 6. Simple model of th 
BD center as the neutral charge die 
MOO) poA of a vacancy next to a substitu- 
f \ tional phosphorus atom. 
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Fic. 7. Simple LCAO molecular 
orbital model of the electronic struc- 
ture of the E center. (i) Four de- 
generate (ignoring overlap) orbitals 
of the isolated vacancy. (ii) The 
lowering of the c orbital due to the 
extra nuclear charge of the phosphorus 
atom at C. (iii) Lifting of the de- 
. generacy of the a,b,d orbitals by a 

Jahn-Teller distortion. (iv) Modifica- 
tions to the wave functions when small 
overlap between the atomic orbitals 
is considered. (These wave functions 
are orthogonal and normalized only 
through first order in the Aş.) 


a,b,c,d ———— 


(i) 


corrections are small, and were therefore used only in 
determining the principal values given in Fig. 4. Equa- 
tion (6) was otherwise quite satisfactory for fitting the 
observed spectra. The absolute signs of Aja could not 
be determined, but fitting the data of Fig. 5 with Eq. 
(6) requires that the signs of all principal values be 
the same. 


B. Discussion 
1. Model 


In order to interpret the experimental results, it will 
be convenient to present an approximate description 
of the model at the outset. This is shown in Fig. 6 for 
the defect giving rise to the specific sets of axes shown 
in Fig. 4. It is the neutral charge state of a lattice 
vacancy trapped next to a substitutional phosphorus 
atom impurity. We can visualize the construction of 
the defect as follows: Initially, there are four broken 
bonds around the vacancy, one each for the four atoms 
surrounding it. Because of the extra nuclear charge of 
the phosphorus atom, two electrons are accommodated 
in its broken orbital with their spins paired off. Two of 
the remaining silicon atoms pull together to form an 
electron pair bond leaving an unpaired electron in the 
orbital of the remaining silicon atom. EPR arises from 
this unpaired electron explaining the large hyperfine 
interaction with one silicon atom. The As; and g axes 
reflect primarily the (111) axis of the broken bond 
orbital on this silicon atom. 

A somewhat more detailed consideration of the 
electronic structure of the defect is shown in Fig. 7. 
In the absence of overlap, the dangling orbitals of an 
isolated lattice vacancy (a,b,c,d) associated with the 
four neighboring silicon atoms (4,B,C,D) give four 
degenerate one-electron orbitals. Replacing atom C 
by a phosphorus atom is equivalent to adding an extra 
positive charge at this site. This partially removes the 
degeneracy and orbital c is lowered in energy. In the 
neutral charge state, five electrons are accommodated in 
these orbitals, two paired in the c orbital and the re- 
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(iii) 


maining three in the a,5,d set. Because of the degeneracy 
associated with the partially filled a,b,d orbitals, a 
spontaneous distortion takes place to remove this degen- 
eracy, as shown. Viewed this way, the “pair bonding” 
of silicon atoms 2 and 3 in Fig. 6 can be considered sim- 
ply as a manifestation of the Jahn-Teller" effect. 

If we now consider the effect of small additional over- 
lap between the remaining orbitals, the wave functions 
will be modified slightly as shown in panel (iv) of Fig. 7. 
In this scheme, the unpaired electron is primarily lo- 
cated in orbital b of silicon atom B but has a small per- 
centage (X;2/2) on silicon atoms A and D and (A£) on 
phosphorus atom C. Because we expect the Jahn- 
Teller splitting (E") to be smaller than the Coulomb 
energy associated with the lowering of the c orbital 
(E’), we expect X32 Mf. 

This is still a considerable oversimplification, of 
course, because the electronic wave functions will also 
spill over somewhat onto the more distant neighbors. 
However, analysis of the hyperfine interactions in the 
following section shows that ~ 70% of the wave function 
can be accounted for on these four atoms alone and this 
simple localized "molecule" should therefore be a 
reasonable first approximation. 

In the next two sections the hyperfine interactions 
and the g tensor will be considered quantitatively in 
terms of this model. In the remaining sections, other 
consequences of the model will be explored as quantita- 
tive tests for the model. 


2. Hyperfine Interactions 


Following the arguments of the previous section, let 
us construct the wave function for the unpaired electron 
as a linear combination of atomic orbitals centered on 
the atom sites near the defect 


=E ws. (8) 


“H. A. Jahn and E. Teller, Proc. Roy. Soc. (London) A161, 
220 (1937). 
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"TABLE I. Hyperfine parameters (2; and bi) and the correspond- 
ing molecular wave function coefficients (a°, 8;*, and »;?) calcu- 
lated from the observed hyperfine interaction constants for the 
atom sites in the vicinity of the Æ center. 


Atom aj b; 1 n A 
site (107* cm7?) — (107* cm7?) Cis Bi ae 
wu 7 CORSUNNSRCOTUD 014 086 059 
So =) 124 DE (0.30) (070) 003 
Sis (—) 12:4 iem (0.30) — (0.70) — 0.03 
P (4) 932 Caoa 0.29 O71 0.01 


At each atom site j, we approximate y; as a hybrid 
353p orbital given by 


y; e Uas);-B;(V2p);- (9) 


For the atoms adjacent to the vacancy, we take the 
p function as directed approximately along the (111) 
direction from the site to the center of the vacancy. To 
a good first approximation, the hyperfine interaction at 
the jth nuclear site is determined solely by y;;, i.e., that 
part of the wave function close to the nucleus. In this 
approximation, the hyperfine interaction is axially 
symmetric along the p-orbital axis and can be written 


Ay=a+2b, 


4A,— a— b. 
(10) 


Here the isotropic term a arises from the Fermi contact 
interaction 


aj (167/3) (uj/15)8a?7 | Va. (0) | 7, (11) 


where u; is the magnetic moment and J; the spin of the 
jth nucleus. The anisotropic term b results from the 
dipole-dipole interaction averaged over the electronic 
wave function and is given by 


b;— (4/5) (uj/ 15)BB?n? ras); (12) 


In Appendix A, we have estimated |V5,(0) |? and (35-3) 
for neutral silicon and phosphorus, using the recent 
Hartree-Fock calculations of Freeman and Watson.!® 
These values are given in Table III in the Appendix. 

Table I gives the results of analysis for a;7, 87, n; for 
each of the atom sites of Fig. 6, using Eqs. (8)-(12), 
the hyperfine constants given in Fig. 4 and the estimates 
of Table III. Equations (11) and (12) predict that a; 
and 5; should have the same sign as uj. With this assump- 
tion, there is no ambiguity in determining the a; and 
b; from (10) even though the absolute values of Ay, 
and 4; were not determined. The signs of the aj's 
and bs in Table I are in parenthesis to indicate that 
they were not determined experimentally. In treat- 
ing the phosphorus hyperfine interaction we have 
ignored the small departure from axial symmetry 
[(Ap)2¥#(Ar)s] and have ignored the additional con- 


= _ tribution to 5; that arises from dipole-dipole interaction 


T ET R. E. Watson and A. J. Freeman, Phys. Rev. 123, 521 (1961). 
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with the unpaired electron centered mainly on the 
neighboring Si; site. A crude calculation suggests that 
this interaction contributes ~0.1—0.2 (10-4 cm?) to the 
anisotropic term and it should be included in a more 
sophisticated treatment. It appears small enough, how- 
ever, that it cannot account by itself for the tilting of 
the hf axis away from the (111) “normal bond” direc. 
tion and we must conclude that the orbital on the 
phosphorus is itself significantly tilted in this direction, 

The estimates of a; for the Si» and Si; sites comes 
indirectly from an experiment to be described in a later 
section (Sec. IVA), and is included here at this time 
for completeness. Only a; can be estimated in this ex- 
periment and we have had to assume o and fj, i.e., 
percentage s and $, in order to compute nj. The choice 
of a?~0.30, 6?~0.70 was made to agree with that 
observed in the A center,!® where the unpaired electron 
is incorporated in a similar two atom bond between 
next neighbors across a vacancy. 

We are thus able to account for ~60% of the wave 
function as localized in a dangling orbital on a single 
silicon atom (Si, of Fig. 6, atom B of Fig. 7) neighboring 
the vacancy. The orbital is approximated by ~86% 
3p character and is directed toward the vacancy. The 
enhanced $-like character over a 25%s—75%p, (sp), 
tetrahedral orbital!" could be caused by the relaxing 
of the Si; atom away from the vacancy. Since the orbital 
on this atom is not involved in bonding to the other 
atoms across from the vacancy (see Fig. 7), we would 
expect the Si; atom to be pulled away from the vacancy 
by the strong bonds to its three remaining nearest 
neighbors (not shown in Figs. 6 and 7). Si; and its 
three nearest neighbors would then tend to approach 
a planar configuration for which the bonding orbitals 
would be s? and the dangling orbital pure 7.18 

An additional ~7% of the wave function is accounted 
for on the three other atoms surrounding the vacancy, 
~3% each on the two silicon atoms (Siz and Sis) and 
a still smaller amount (~1%), as anticipated in the 
previous section, on the phosphorus. The remaining 
~30% of the wave function is presumably spread over 
more distant silicon neighbors. [The hyperfine inter- 
action with some of these sites is still large enough to 
give partially resolved structure on the shoulders of the 
lines in the spectra (see Fig. 2). The Si; and Sis inter- 
actions contribute to these shoulders, but the total 
intensity of the structure indicates that there must also 


16 Our previous analysis of the A center (Refs. 1,4) with some- 
what different estimates of |/3,(0)|? and (755-3), indicated that, 
for it, the wave function was 37% s and 60% p on the main two 
silicon atoms next to the vacancy and with 70% of the total wave 
function accounted for by them. A reanalysis, using the new 
estimates of these quantities given in Table III, gives ~3070 $ 
and ~70% p, but with essentially the same fraction of the wave 
function accounted for. Similar changes, in the fraction s an VÀ 
canaries only, are required for the other centers analyzed !n 


Ei Ib, Pauling, The Nature of the Chemical Bond (Cornell Univer- 
sity Press, Ithaca, New York, 1948), 2nd ed., Chap. 3. 4 
18 The arguments are developed somewhat more fully in Ref. 4. 
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be three or four additional sites for which the hyperfine 
interaction is of comparable magnitude. These then 
could account for an additional 10-15%, of the wave 
function leaving only 15-20% to be further spread out. 
The weaker hyperfine interaction on these more distant 
sites 1s unresolved and contributes only to the breadth 
(~2.5 G between half-maximum points) of the central 
part of the line. ] 


3. g Tensor 


We have determined that the wave function consists 
primarily of a dangling bond on a single silicon atom. 
Let us therefore consider the g tensor to be expected for 
such a single broken-bond orbital. We start with the 
Hamiltonian for a bound electron in a magnetic field. 


eh € 
c 238-84 S Ex( p+ A) | 
2mc C 


2 


e 


€ 
+—(p-A+A-p)+ 


2mc 2mc 


A2. (13) 


"The second term gives rise to the spin-orbit interaction 
where E is the electric field seen by the electron, p, its 
momentum and A, the vector potential. The third term 
gives the interaction between the motion of the electron 
and the external magnetic field, and the last term is the 
usual diamagnetic interaction. 

We choose a gauge for A with the atom involved 
as center, i.e., 


A-1Hoxr. (14) 


With this choice, the third term becomes Ho: L. 
Equation (13) then leads to the following expression for 
the g shift, accurate to first order in the spin-orbit 


interaction, TRETTIAIS 
(0| (Veo) «| 0) (n| L;] 0 
Agu- —2X; ceo 
n En Eo 


Vo= (8/mc)E xp. 


The sum is over all excited states. 

The form of the g tensor to be expected is apparent 
from Eq. (15). It will be axially symmetric around the 
direction of the broken bond orbital axis and the g 
shift along the axis will be zero.” There can be a non- 
zero g shift perpendicular to this axis, however. 

We note that the observed g tensor (Fig. 4) is approxi- 
mately axially symmetric around the 1 axis (which is 
approximately the bond axis, see Fig. 6), as expected. 
Also, the g shift along this axis, although not zero, is 


where 


19 C. P. Slichter, Principles of Magnetic Resonance (Harper and 
Row, Publishers, Inc., New York, 1963), Chap. T 

2 The orbital may be made up of a linear combination of several 
atomic orbitals (s,p,¢, etc.), but by symmetry, all will have m=0 
along the bond axis. As a result, there are no matrix elements for 
the component of angular momentum along this axis giving 


Agu=0. 
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Fic. 8. Localized “molecule” considered in the g-shift calculation 
(see Appendix B). (a) Atom A with its broken bond Y (A) next 
to a vacancy. (b) Corresponding one-electron molecular states. 
The matrix elements involved in the g shift are indicated along 
with the sign of their contribution. 


significantly smaller than that along the 2 and 3 axes. 
If we consider the small negative shift along the bond 
axis as arising from the 40% of the wave function mot 
accounted for in this oversimplified treatment, we are 
left with Agu=0 and Ag.=+0.01 to be contributed by 
the dangling bond. Since ~60% of the wave function is 
in this bond, this requires Agu=0, Ags=+0.017 fora 
fully occupied dangling bond. 

Experimentally, we can take the view that these 
values are reasonable in that other defect structures in 
silicon which we have studied and interpreted in terms 
of dangling bonds also indicate Agi =0 and with positive 
values for Ag: of this magnitude.'* Theoretically, how- 
ever, the calculation of these quantities represents a 
very complex problem. Treated properly, the excited 
states of Eq. (15) are the continuum of all valence and 
conduction band states of the solid. 

In previous publications, ^* we have carried out a 
simplified treatment of this problem in which we con- 
sidered only the localized “molecule” made up of the 
atom in question and its three nearest neighbors as 
shown in Fig. 8. In this treatment, we used simple 
one-electron LCAO bonding [¥+(4.X)] and antibond- 
ing [¥-(AX)] molecular orbitals between the atom 
and its neighbors as approximations to the excited 
localized valence and conduction band states of 
(15). The matrix elements to the bonding orbitals, 
being “‘hole”-like, were shown to give rise to a positive 

g shift, while those to the antibonding ones, being 
“electron’-like, gave rise to a negative shift. The net 
shift was shown to be the difference between these two 
competing effects. However, it has been pointed out to 
us that in the derivation, the spin-orbit matrix elements 
were not handled correctly, leading to an error in the 
final result. We take this opportunity to correct this 
error. The result (rederived in Appendix B) should 


2 We are grateful to R. H. Silsbee for pointing out this error to 
us. Our error involved replacing V. by AL, with A the spin-orbit 
constant for the free atom. For the molecular excited states con- 
sidered here, the more general form of (15) should have been used. 
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have been 

Ag mL E»— (1/E))84, (16) 
where À is the spin-orbit interaction constant for the 3p 
electron of atomic silicon, Ba? is the percentage 3p 
character for the dangling bond, and E» and Æa are the 
energies to the bonding and antibonding orbitals, 
respectively, as shown in the figure. The overlap 
integral between the atomic orbitals on the atom and 
its neighbors no longer appears in the result. 

In order to estimate Ey and Ea, we recognize that 
making up a localized state in the band picture re- 
quires summation over all k. E; and E» must therefore 
represent suitable averages over the available states 
in the bands. Present estimates of the band structure 
of silicon? would suggest that these average excited 
state energies would be ~1-2 V into the respective 
bands from the band edges. We do not know the energy- 
level position associated with the dangling bond of the 
E center but we do know that an additional electron 
can be accommodated at (/2,— 0.4)eV. This means that 
the single electron level is lower in energy and, as a 
guess, we take it as ~Z,, the valence band edge. This 
gives Eyz1.5 eV, Ea™2.5 eV. With \=0.02 eV,” 
Ba?=0.86 (Table I), this gives 


Agi~-+0.005. 


This is the correct sign but is a factor of ~3 too low. 
There was some arbitrariness in the estimates of E; 
and Ea, to be sure, but it is not felt that a realistic 
adjustment of the values could make up this large a 
factor. (This is particularly evident in the Si-A center,! 
for instance, where the energy level is known to be close 
to the conduction band edge and the g shifts are still 
positive and of comparable magnitude.) 

We thus conclude that the large g shifts must result, 
in significant part, as a result of stronger matrix ele- 
ments of (15) to the bonding orbitals (valence band) 
than to the antibonding ones (conduction band). 
Within the framework of our localized molecule approxi- 
mation, there are two effects that we did not previously 
treat that tend in this direction. One results from the 
fact that the molecular obritals used in (I) were not 
orthogonal to the 1s, 2s, 2p atom cores. We have 
modified the treatment to take this properly into ac- 
count (see Appendix B) and we find that (16) is re- 

placed by 


Age M (1--)/ Ev— (1—y)/ E)84*. (17) 


In Appendix B, we estimate y~+0.17, giving, with 
E,~1.5 eV, Ea™2.5 eV, 


Agi~+0.008. 


SEG, Phillips, Phys. Rey. 125, 1931 (1962). 

233 The spin orbit Anteraction constant À is two-thirds the free 
atom spin-orbit splitting as given in C. E. Moore, Atomic F 
Levels, Natl. Bur. Std. (U. $.) Circular No. 467 (U. Send 
ment Printing Office, Washington, D. C., 1949). 
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This is an increase in the right direction but it js still 
not enough. 

Another effect is a nuclear shielding effect. The effect 
of the positive charge of the neighboring atom core 
should be to enhance the charge density near the core 
of the atom to which it is bound for the bonding orbitals 
but to deplete it for the antibonding ones. This effect 
is well established in the H molecule, for instance 
where the contraction of the 1s atomic orbitals in the 
bonding state is equivalent to an increase of nuclear 
charge by 20%.” Such a contraction in our case would 
enhance the spin-orbit matrix element of (15) ang 
further increase the “contact” to the valence band. 

It would be difficult to include this effect properly in 
our simple LCAO treatment. However, a measure of 
its importance is seen in the fact that the observed 
spin-orbit splitting in the valence band of silicon 
(0.044 eV)?* is actually ~50% greater than that for the 
free atom, (0.03 eV).?* On the other hand, if we were to 
calculate the spin-orbit splitting using the LCAO 
molecular bonding orbitals in (I), we would get a 
reduction. For instance, if we ignore the 2p core terms, 
the effective spin-orbit constant would be reduced 
directly by the normalization factor (1+S)=1.7 [see 
(I) ]. By a calculation similar to that for the g shift, 
we find that inclusion of the 2 core terms recovers 
some of this loss by the factor 1+2y=1.34, but we 
would still predict a reduction of ~1.25. This would 
indicate that the “contraction” of the valence band 
wave functions is therefore of considerable importance. 

It is interesting to note that in our simple treatment, 
it would require an effective y~0.8, to account for the 
observed 50% increase in valence band spin-orbit inter- 
action. Such a value in (17) would give Ag.~+0.017! 
This agreement is clearly not entirely coincidental in 
that both the g shift and the enhanced valence band 
spin-orbit interaction reflect the same phenomenon— 
the contraction of the wave function around the cores 
in bonding states.?® 

We conclude that the indicated values Agu=0, 
Agi~-+0.017 are reasonable ones for a single broken 
bond orbital in silicon. In the case of the E center, with 
~60% of the wave function in such an orbital, this 
accounts for the gross features of the g tensor, L6» 
Agi~0, Ago~Age+0.01. The remainder of the wave 


*4See C. A. Coulson, Trans. Faraday Soc. 33, 1479 (1937) for 
a discussion of these effects. 

35 S. Zwerdling, K. J. Button, B. Lax, and L. M. Roth, Phys: 
Rev. Letters 4, 173 (1960). : 

?*'The correct characterization of molecular wave functions 10 
the vicinity of the atomic cores is a subject of considerable current 
controversy [W. Marshall and R. Stuart, Phys. Rev. 123, 2 z 
(1961); R. G. Shulman and S. Sugano, ibid. 130, 506 (1963); 
J. C. Phillips and L. Liu, Phys. Rev. Letters, 8, 94 (1962) ]. In the 
case of substantial ionic character, the situation is not so cleat 
In the case here of strong covalent bonds, however, it appears tha! 
substantial contraction and expansion of the bonding and anti- 
bonding orbitals, respectively, must occur around the atoms. The 
arie by Phillips and Liu, and Ref. 54 treat this point in some 

etail, 
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function (74097) presumably can account for the 
remaining g shifts. 


IV. MOTIONAL EFFECTS 


In the previous sections, we have presented a model 
for the E center, and we have shown that it is compati- 
ble with the observed hyperfine interaction and the g 
tensor. There is always the question, however, as to 
whether this model is unique. Are there other possible 
microscopic configurations that might also be compati- 
ble with these quantities? We desire to test the validity 
of the model in as many ways as possible. 

The model of Fig. 6 predicts a number of motional 
effects which we will test in the following sections. For 
instance, for the particular phosphorus-vacancy direc- 
tion shown in the figure, there are two other entirely 
equivalent configurations, corresponding to bonding 
Si; and Si; together and leaving the unpaired electron 
on Si», or Si; and Sis together, with the unpaired elec- 
tron on Siz. These together with the configuration 
depicted in Fig. 6, represent the three equivalent Jahn- 
Teller distortions of Fig. 7. The barrier for reorientation 
from one distortion direction to another should not be 
large (being comparable to the Jahn-Teller energy) and 
we expect thermally activated reorientation to manifest 
itself at relatively low temperatures. This motion can 
be studied in the resonance in two ways: (1) If the cor- 
relation time for the reorientation process is short 
enough (rS 10-7 sec), the individual lines in the EPR 
spectrum become lifetime broadened and give a direct 
means of studying the nature and kinetics of the motion. 
Such a study is described in Sec. IVA; (2) in IVB, the 
motion is revealed by studying the preferential align- 
ment of the defects induced by the application of 
uniaxial stress to the crystal. In addition to revealing 
the nature and kinetics of the reorientation process, 
the magnitude and sense of the stress induced alignment 
also confirm other important features of the model. 

The vacancy-phosphorus direction is another "degree 
of freedom" that can be probed by stress. In Sec. IVC, 
preferential alignment of this axis of the defect is 
achieved by stress at elevated temperatures. The study 
of the magnitude and sign of the alignment plus the 
kinetics of its recovery gives a further test of the model. 


A. Low-Temperature Linewidth Studies 


In the temperature region ~60-150°K, the E center 
spectrum is observed to change drastically as shown in 
Fig. 927 Initially, some of the lines appear to broaden 


27 Also noticed in Fig. 9. are weaker lines, not associated with 
the E center, which do not change throughout this temperature 
region. Some of these arise from another center which appears to 
have roughly the same g variations as the E center and which m 
displays a phosphorous hyperfine splitting, but of approximately 
half the magnitude. The interference with the E-center spectrum, 
and its weaker intensity, has prevented a detailed analysis. We 
have no model for this defect but presumably it could be an E 
center paired off with another defect such as oxygen or another 


vacancy, etc. 


IRRADIATED Si 
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and disappear as the temperature is raised and at the 
higher temperature end a simplified spectrum emerges. 

This behavior suggests thermally activated reorienta- 
tion of the defects. In general, if a defect with an 
anisotropic g tensor is jumping from one orientation in 
the lattice to another, its g value will change abruptly 
with each orientation. In the resonance experiment, this 
means that, if we are tuned to a particular multiplet in 
the spectrum (which corresponds to the g value of a 
particular orientation for the defect), a given defect is 
alternately brought into resonance, then out, as it 
executes this random reorientation. This will give rise 
to a lifetime broadening of the multiplets. As the tem- 
perature is raised and the jumping rate increases, the 
multiplets will tend to broaden out and disappear. At 
higher temperatures and higher jump rates, motional 
narrowing will eventually begin and new sets of lines 
will emerge at the average position of the original 
multiplets. 

Now let us consider the E center in detail and, in 
particular, determine what we expect if the reorientation 
motion is from one Jahn-Teller distortion to another. 
Figure 3(b) shows the variation of the g value versus 
crystalline orientation for the differently oriented de- 
fects. By reference to Figs. 4 and 6, we recognize that 
the labels on the various curves of Fig. 3(b) have a 
simple interpretation in terms of the model for the 
defect. Labeling the four different (111) axes a through 
d as in Fig. 4, the first letter of the label specifies the 
vacancy-phosphorus direction and the second letter 
specifies the direction from the vacancy to the silicon 
atom with the unpaired electron. Stated more simply, 
for the nearest neighbor sites of the vacancy depicted in 
Fig. 7, the first letter of the designation gives the phosphorus 
site and the second letter gives the silicon site for the un- 
paired electron. (The defect orientation of Fig. 6 is “cb.”) 

Now consider the thermally activated reorientation 
from one Jahn-Teller distortion direction to another. 
In terms of the defect labeling scheme, each time a 
defect executes a reorientation, its second letter changes, 
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but its first does not. The corresponding g-value change 
can be read directly from Fig. 3(b). 

We now treat the particular case when the magnetic 
field is parallel to a (111) axis (the conditions of Fig. 9). 
Referring to Fig. 3(b), the number of distinguishable 
g values reduces to three, labeled 4, B, and C. With 
the phosphorus hyperfine interaction, this gives rise 
to three pairs of lines for the central group spectrum as 
shown in Fig. 3(a) and as designated in Fig. 9 as A, 
A', B, B', and C, C’. Consider now that the magnetic 
field is tuned to resonance for multiplet C (or C"). For 
the defects giving rise to this multiplet, the particular 
reorientation we are considering has no effect because 
each of the three Jahn-Teller distortions gives rise to 
the identical g value. (This multiplet arises from the de- 
fects for which the magnetic field is pointing along the 
phosphorus-vacancy direction. By symmetry, it is 
apparent that each of the three distortions is equivalent 
with respect to this axis.) Therefore the motion predicts 
no effect on the CC’ multiplets. (There will be a slight 
narrowing due to the partial motional averaging of Si? 
hf broadening by neighbors, but we ignore this in this 
discussion.) In Fig. 9, it is noted that these multiplets 

(C’ is easiest to observe) do indeed stay sharp through- 
out this temperature range in agreement with the 
predictions. 

The defects with the other three phosphorus-vacancy 
directions give rise to resonance at A (4^) or B(B’) and, 
for them, reorientation causes a jumping between the 
two resonance values. Gutowsky and Saika? have 
treated a problem which is formally similar to this and 
we can apply their results directly to this problem. 
Their results are shown schematically in Fig. 10, in 
which we consider hopping between two resonance 
fields A and B separated by dw= (g8/#)AH, and with 
relative probabilities pa and pz of being at each of these 
two fields. (For our case $4— 3, ?5—3.) At low tempera- 


nu 


(T2), TPg T 
(T2), = T/P4 
8 


rie n 


B A 


Fic. 10. Effect of thermally activated “hopping” between two 
resonance positions A and B, with relative probabilities pa and 
pp of being at each site, according to Gutowsky and Saika (Ref. 
28). As the temperature increases and the lifetime 7 for the 
‘hopping’ process decreases, the spectrum changes from (I) low- 
temperature multiplets at A and B, to (II) broadening of multi- 
plets, to (III) disappearance of multiplets and emergence of a line 


= the average position AB, to (IV) narrowing of the averaged 
ne. 


28 H, S. Gutowsky and A. Saika, J. Chem. Phys. 21, 1688 (195. 
The formulas in Fig. 10 are derived from Eq. (18) of this UE 
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tures, we have two lines (I) of relative amplitudes Pa 
and pr and with widths determined by static inter. 
actions (in our case, Si? hf interactions with distant 
neighbors). As the characteristic reorientation time 7 
becomes shorter (II), the motion contributes a Lor. 
entzian broadening 

(To/m) 


OT worl? E. 


to each of the multiplets, where 7» for each multiplet 
is given in the figure. At higher temperatures where 
rowK1, (III), motional narrowing is manifested by the 
emergence of a line at the weighted average between A 
and B, which is again Lorentzian broadened with the 
(T2) az given in the figure. As r becomes shorter the 
central line narrows further ultimately going over into 
a temperature-independent width now determined by 
static interactions that have not been averaged out. 

Inspection of Fig. 9 shows that the AA’ and BB! 
multiplets broaden as predicted (4^ and B show clearly 
the broadening and final disappearance in the first 
panel with A’ twice the width of B as expected through- 
out), along with the ultimate emergence and narrowing 
of the AB—A’B’ pair at the weighted average position 
in the second panel. Similar studies with H||(100) 
and H||((110), also confirm the expectations of the 
model. The motionally averaged spectrum at elevated 
temperature is axially symmetric reflecting the phos- 
phorus-vacancy direction around which the averaging 
has taken place. The g values within experimental 
accuracy agree with those predicted from the weighted 
averages of the values in Fig. 3(b). 

Using the formulas for T% in Fig. 10, we can estimate 
7 from a study of the linewidths versus temperature 
in both the early broadening and final narrowing 
regions. The results are given in Fig. 11. {In measure- 
ment, the linewidths Aw= (g8/4)AH were taken as the 
width between half-maximum points observed in the 
derivative of dispersion recordings as in Fig. 9. For a 
line of Lorentzian shape, T,=1/Aw. The points plotted 
at each temperature represent the average of two 
different methods for subtracting out the contribution 
due to the static width, (Aw)o. One method assumed 
T2=[Aw—(Aw)o}“, the result expected for the con- 
volution of two Lorentzian shapes. The other took 
Ta [ Ac?— (Aw) ] "2, somewhat characteristic of two 
Gaussians. Since we anticipate the correct behavior to 
be somewhere in between these two simple approaches, 
we have simply averaged the two estimates. The 
accuracy of the data (limited by overlapping of lines, 
presence of unrelated spectra, temperature uncer- 
tainties, etc.) does not warrant a more detailed treat- 
ment. We recognize also that a detailed treatment i$ not 
trivial. For instance, the static contributions are not 
strictly constant, being themselves presumably partially 
motionally narrowed throughout the same region.} 

In spite of the uncertainties, we could make 4 


& 


& 
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reasonably good estimate of 7 because the broadening 
and marrowing regions are separated by about two 
decades. However, we will defer this until the next 
section, where additional experimental data allow a 
more accurate estimate. We note only that the activa- 
tion energy is 70.06 eV. 

Study of motional effects on the Si?’ satellites is also 
informative. This is shown in Fig. 12 for H||(100}). At 
20*K, (observed under fast passage conditions in dis- 
persion), we see the Si?’ satellites with a total splitting 
of 126 G. In the motionally averaged state (displayed 
in the figure as derivative of absorption, explaining the 
apparent difference in the character of the resonance), 
the Si? satellites are found to be ~ three times as intense 
relative to the central group but with a splitting re- 
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Fic. 11. Lifetime 7 of the Jahn-Teller reorientation versus 
temperature. The two curves represent fits to two different 
theories (see text). 


duced to 51 G. This is just what we expect because in 
the motionally averaged state, the defect becomes a 
three silicon center. That is, in the model of Fig. 6, the 
unpaired electron is hopping between the Sit, Siz, and 
Si, sites so fast as to be indistinguishable from a defect 
in which the electron is in a LCAO molecular orbital 
spread over all three. The hyperfine satellites now arise 
when any one of the three equivalent silicon sites is 
Si?, which is three times as probable as one, and there- 
fore three times as intense. The hyperfine splitting is 


determined now by 
A' ((100)) = 44 1((100)) 
+44 2((100))+343({100)), (19) 


where 43((100)) is the hf term for the defect configura- 
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tion of Fig. 6 when Si, is Si”, and As and A, are the 
correspondingly smaller terms when Si» and Si; are 
Si”. The meaning of Eq. (19) is that if one of the three 
silicon sites is a Si?? there is an interaction of 4, for the 
third of the time the electron is located mostly on that 
atom but there is also 42 and A; for each of the other 
one-thirds when the electron is primarily located on the 
other two silicon atoms. 

With | 4'(/1007)| =51 G, | 41,((100))| = 126 G, we can 
solve Eq. (19) for Aa and Ay, which must be equal by 
symmetry. This gives 


Ag 3((100)) = 413.2 G=£12.4 (10-5)em7! , 


where the sign must be the same as A;((100)). Assuming 
the interaction to be axially symmetric along a (111) 
direction, this gives the isotropic constants a» and a; 
which are entered in Table I. 

Summary: Forgetting the model for a moment, we 
have determined the following about the center as a 
result of the linewidth studies: The center undergoes 
thermally activated reorientation around a particular 
(111) axis associated with the defect. The activation 
energy is low (~0.06 eV), and from the hyperfine studies 
we have determined that the motion is an electronic one. 
That is, this is not the reorientation of a molecular unit 
but is rather an electronic bond switching in which the 
unpaired electron hops between three equivalent atoms 
which are for all purposes fixed in the lattice. 

In terms of the model, this is precisely the behavior 
expected for thermally activated reorientation of the 
defect from one Jahn-Teller distortion direction to the 
other. The 0.06 eV we feel is a reasonably value for the 
activation energy of the process. This must be con- 
sidered an important confirmation of the model. 


B. Low-Temperature Stress Studies 


All of the possible orientations of the E center are 
equally probable in the cubic silicon lattice and, at 


? In a simple model, the three silicon atoms go through the 
symmetrical undistorted position in reorienting. In this case, the 
0.06 eV would be equal to the Jabn-Teller stabilization energy. 
Remembering that this energy is the difference between the total 
electronic stabilization energy and the increased energy stored in 
the lattice, which in turn is half the electronic energy, we get for 
the electronic stabilization energy 0.12 eV. In Fig. $, this ener 
is associated with two electrons in the (a--d) orbital so that the 
energy per electron E” «0.06 eV. 


CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


A1370 Gs. (Ds 


0 Ibs/in* 


12500 Ibs/in" 


Fic. 13. Change in the E-center spectrum (H||[0117), under 
[011] compressional stress. The insets show a typical defect 
orientation for each multiplet. 


thermodynamical equilibrium, we have equal numbers 
of defects in each of these orientations. If we now strain 
the crystal, and distort it from cubic symmetry, the 
various orientations are no longer equivalent and, if the 
defects are free to reorient, they will do so, tending to 
seek out the lowest energy orientation. This stress in- 
duced alignment is the static analog of the internal 
friction experiment. 

EPR gives a unique way of observing the alignment 
in that the amplitudes of the multiplets in the spectrum 
give a direct measure of the relative number of defects 
in each of the allowed orientations, allowing a micro- 
scopic study of the process.? We here apply this to 
study the same “bond-switching” motion that was 
described in the previous section. 

In Fig. 13, the effects on the spectrum of applying a 
uniaxial compressional stress along a (110) axis of the 
crystal at 20.4°K is observed. (The spectrum is observed 
with the magnetic field along another (110) axis, per- 
pendicular to the stress direction.) The insets identify 
a representative set of defects—and their corresponding 
letter designation—to be associated with each multiplet. 
It is seen that there is a large effect, the dc and db 
orientations being strongly suppressed and the da being 
proportionally enhanced. This is again consistent with 
the “bond-switching” motion, the phosphorus-vacancy 
direction remaining unchanged. (The intensity simply 
"sloshes" between the indicated multiplets with no 
change in total area.) The sense of the alignment is an 
additional confirmation of the model in that the defect 

prefers to align in such a way that the applied com- 
Pression serves to push the two bonding silicon atoms 
closer together as expected. In terms of Fig. 7, the 


% A detailed study of another EPR center (the Si-A center) 
using this technique has been previously described by us in (I). 
See also W. Kanzig, Phys. Chem. Solids 23, 479 (1962). 
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applied stress adds to the Jahn-Teller distortion anq 
lowers the electronic energy of the defect. We will 
return to the magnitude of the alignment but first let 
us consider the kinetics. 

Upon removing the stress at 20.4°K, the recovery 
to the original unstressed spectrum is observed to 
follow a single exponential law with a time constant 
7—1.32:0.2(10?) sec. This 7 is the same characteristic 
reorientation time that appears in the formulas of 
Fig. 10 for the linewidth studies and is therefore also 
plotted in Fig. 11. This value together with the line- 
width study estimates span 13 decades in 7, allowing a 
very accurate determination of r. The points fall well 
on a straight line giving 


771— 1.6 (102)e-£r?T , (20) 


with E=0.0622+0.0015 eV. [There is some disagree- 
ment in the literature as to whether (20) is the correct 
form for such a process or whether T should appear in 
the pre-exponential factor.*! For this reason, we have 
also shown by the dashed line the best fit with the 
alternate form. We do not feel our data are accurate 
enough in their present state to distinguish between 
the two although the fit using Eq. (20) appears some- 
what better. We point this out because with these large 
ranges of 7 and 1/T, our experimental errors in the 
estimated activation energies become less than that 
imposed by the uncertainty in the theory. It is possible 
that a careful study in this or similar systems? could 
serve as a critical test of the two theories. ] 
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„Fic. 14. Boltzmann factors governing the populations of the 
different defect orientations for the (a) (110) and (b) (100) stress 
experiments. The subscripts 4j refer to the defect orientation 
according to the convention established in the text. 


31 See R. H. Doremus, J. Chem. Phys. 34, 2186 (1961) for a dis- 
cussion of this point and for further references to the literature, 3 
2 The silicon divacancy [G. D. Watkins and J. W. Corbett ( 

be published) also has two spectra which are suitable for SU 
a study. 
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A quantitative study of the degree oí alignment has 
also been performed versus stress and temperature. For 
this, the effects of stress along both (100) and (110) 
axes were investigated. By symmetry, the (100) stress 
creates two nonequivalent sets of orientations while 
the (110) stress creates three (sce Fig. 14). Assuming a 
Boltzmann distribution between the sets, this defines 
a single energy difference T. for the (100) stress and 
two, kT, and ETs, for the (110) stress as indicated in 
the figure. 

Fit to a Boltzmann distribution given by the for- 
mulas in Fig. 14, the alignment in all cases shows Ta, 
Ts, and T, linearly proportional to stress within experi- 
mental accuracy. The exponential dependence upon 
1/T has also been verified. For this study, estimates 
in the range 20-50°K were made from the relative 
intensities of the multiplets as described above. Àn 
estimate was also made at 160°K from the shift in the 
resonance field of the motionally averaged spectrum 
(reflecting the relative change of pa and pz in Fig. 10). 
The fact that shifts in the positions of the resonance 
multiplets at high temperatures occur, and that they 
can be accurately predicted by measurements on the 
relative amplitudes of the multiplets in the low-tempera- 
ture spectrum confirms again that the high-temperature 
phase is properly characterized as a motional averaging 
of the low-temperature configurations. 

In Table II, the measured values for Ta, Tg, and T, 
determined at a stress of 12 360 Ib/in? are given. To 
interpret these, let us take a very simple view of the 
defect. We assume that the change in energy of a 
particular defect orientation is determined solely by 
the component of strain along the line joining the two 
bonding silicon atoms (Sis and Si; of Fig. 6). 

We define a change in the defect energy per unit strain 
along this direction as 


M = (dE/de)si-si- (21) 


Defining eaa as the strain along the line joining the ad 
sites, etc., we obtain 


kT a= (€aa— €ac)M , 
kT p= (€be— €ca)M , (22) 
kT y= (€aa—€ac)M. 

These follow directly from the formulas of Fig. 14 
when the change in energy of the dc site is set equal to 
Me, etc. In (I) these strains have been calculated in 
terms of the elastic moduli and applied stress and, using 


these formulas with Eq. (22), we can solve for M using 
the experimental values of Table IT. Selection of M 


for the best fit gives 
M — 4-16.8 eV/unit strain.” (23) 


i i bilization 

33 Tn footnote 29, we concluded that the electronic sta! T 

A associated with the Jahn-Teller distortion was 70.12 Y 
This, with Eq. (23) serves to define an "effective" strain associate: 

with the Jahn-Teller distortion of ~0.7%. Whether this in turn 
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"TABLE IT. Comparison of the observed and calculated (see text) 
energy differences (in ^K) for the differently oriented defects 
under stress (see Fig. 14). The applied stress is 12 360 psi. 


r Observed Calculated 

Stress T; (°K) (°K) 

(110) T. "EEUU 
Ts —90 -94 


(100) T -+85 7 481 


The values computed from Eq. (22) using this constant 
are also given in Table II. 

The agreement in Table II is quite good considering 
the oversimplified model that we have taken in the 
analysis. The positive value of M means that the energy 
is lowered when the atoms are pushed together as we 
would expect for two normally next nearest neighbors 
with rather weaker overlap. The magnitude of M is 
close to the value +16.0 eV/unit strain deduced in (I) 
for a similar Si-Si pair bond in the 4 center. As in the 
A center, we argue that the magnitude is not un- 
reasonable being of the same order of magnitude as 
deformation potentials in silicon. We thus conclude 
that the stress experiments give substantial additional 
confirmation to the model. 


C. High-Temperature Stress Studies 


There is another “degree of freedom" for the defect, 
the phosphorus-vacancy axis, which has not been 
probed as yet. In the low-temperature studies this 
axis remained fixed. In terms of the model, it should be 
more difficult for this axis to change because atoms 
must rearrange rather than the easier electronic re- 
arrangement involved in the low-temperature motion. 

This phosphorus-vacancy axis reorientation can be 
revealed by stress alignment at elevated temperatures. 
For this a crystal was compressed along a (110) axis 
to ~26000 psi at room temperature in an external 
squeezing apparatus for ~1 h, then removed and 
quickly inserted into the microwave cavity and cooled 
to 20.4°K for study. A significant alignment was 
achieved as can be seen in Fig. 15. By reference to 
Figs. 3(b) and 14, we see that the preferred direction 
for the phosphorus-vacancy axis is in the plane per- 
pendicular to the stress direction. 

From Fig. 15, we estimate the degree of alignment? 


ni Had Has ab i 
E E N s- (24) 


nu Nye Nat Hood 


Analvsis of Eq. (24), similar to that in the previous 
section where we consider only the M=16.8 eV/unit 


bears any simple relation to the actual motions of the atoms 
neighboring the vacancy is not clear, however. 

31 C, Herring and E. Vogt, Phys. Rev. 101, 944 (1956). 

35 We estimate the alignment in Fig. 15 to be 1.5. We increase 
this value by ~15% to take account of partial recovery during 
the ~75 sec at room temperature between removal of stress and 
eventual cooling of the sample (see Fig. 16). 
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Fic. 15. E-center spectrum (H||[0117) (a) before and (b) after 
26 000 psi [011] compressional stress at room temperature. The 
defect orientations corresponding to the lines in the spectrum are 
indicated. 


strain associated with the Si-Si bond, predicts 
(ni/ny) = 1.34. The alignment is therefore in the same 
sense as that dictated by the preference of the defect 
to align its Si-Si bond along the compressed direc- 
tion. If we include another parameter for the change in 
energy per umit strain at right angles to this, along 
the phosphorus-vacancy direction, 


N= (dE/de)p-v«£—9 eV/unitstram, (25) 
the value of 1.75 is obtained. The sign is consistent with 
an outward relaxation of the phosphorus atom from 
the vacancy in its normal configuration which is to 
be expected since there is no bonding to hold it in (see 
Fig. 7). It is not possible to make a value judgment on 
the magnitude, but it does not seem out of line with 
the value for M, for instance. 


105 


104 


T (SECONDS) 


l =1.6 (103) e7 E/T 
103 E=0.93 7 .05ev 


33 34 35 36 37 3.8 3.9 


1/T(x103) 


Fic. 16. Characteristic recovery time versus temperature 
for the phosphorous-vacancy axis reorientation. 


acm 


WATKINS AND J. W. 


CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


CORBETT 


The kinetics of the reorientation process was studied 
by monitoring the recovery from the stress induceq 
alignment versus annealing. During each anneal, the 
sample remained in the cavity which in turn was held 
at the annealing temperature for a specified time inter- 
val. At the end of each annealing interval, the Cavity 
was plunged back into the cryostat and the remaining 
alignment monitored from the spectrum at 20.4°K. The 
results are shown in Fig. 16. We obtain for the char. 
acteristic relaxation time of the reorietnation process, 


t= 1.6(10") exp(— E/kT), 
with 
E=0.93+0.05 eV. 


Let us now inquire as to the microscopic process by 
which the defect reorients. That is, of what process are 
we measuring the kinetics? Interchange of the phos- 
phorus and vacancy serves only to “invert” the defect 
and does not achieve the reorientation we are observing, 
The most likely remaining process is the one shown in 
Fig. 17 in which the vacancy must make four consecu- 
tive jumps, two away from the phosphorus and then 
two back. The fact that the defect must partially dis- 
associate in order to reorient has some interesting 
consequences. 

First let us note that the combination of reorientation 
plus the phosphorus-vacancy interchange allows the 
defect to diffuse through the lattice as an entity. We 
anticipate the activation energy for the vacancy- 
phosphorus interchange to be small, being comparable 
to the value for the isolated vacancy motion (c5 eV). 
The limiting process for diffusion should therefore be 
the reorientation. We thus conclude that in measuring 
the activation energy for reorientation, we are also 
measuring the activation energy for the diffusion of the 
phosphorus-vacancy pair in the lattice. 

The fact that the vacancy must separate partially 
from the phosphorus in order for the defect to reorient 
gives a simple explanation as to why the activation 
energy is larger than that associated with the isolated 
vacancy motion. The difference (~0.6 eV) can be 
associated with binding energy that must be overcome 
in order for the vacancy to separate to the next-next- 
nearest neighbor position (position 3 in Fig. 17). This 
is not the fola] binding energy, however, because we do 
not detect any loss in E centers in the process. That 1s, 
there is still sufficient binding at this remote position 
to assure that most of the vacancies return. 2 

There are several bits of experimental information 
that give us guides in estimating the remaining binding 
energy at this next-next-nearest site. For instance, Wê 
note that the E centers decay upon standing at room 
temperature at a rate suggestive of a time constant 
S6 weeks." On the other hand, from Fig. 16, we deter- 


3G. D. Watkins, J. Phys. Soc. Japan 18, Suppl. II, 22 (1963): 
_ *' The actual rate may be slower than this. "The loss of Æ centers 
in the spectrum can also be caused by decay of other centers W 5 
in turn alters the Fermi level so that E centers still present are ™ 


bm 
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mine that the defect is reorienting once every ~400 
sec at this temperature (~25°C). We may therefore 
conclude that the vacancy gets away only once in 
$10 opportunities at the next-next-nearest site. This 
in turn, can be related to a Boltzmann factor weighting 
the separation and recapture processes at this site, x 


a exp( — AE/kT) 210-5, (26) 


where a is a geometric factor (7-1). Taking a=1, we 
obtain AES0.25 eV. This quantity is not strictly 
equal to the binding energy at this site but rather gives 
a difference in barrier heights as shown in Fig. 18. How- 
ever, it is clearly related (being perhaps somewhat 
larger) and serves as an indication of its magnitude. 

It is interesting to note that a calculation of the 
binding energy between a single positive and a single 
negative charge separated by the same distance and 
immersed in a dielectric constant equal to that of 
silicon (11.6) gives 0.28 eV. This is certainly suggestive 


Fic. 17. The mechanism for the phosphorus-vacancy axis reorien- 
tation. The vacancy in the normal nearest position (No. 1) must 
make two jumps away from the phosphorus to site No. 3 and 
then return to a new (No. 1’) nearest site. 


and perhaps not an unreasonable way to view the defect 
at this separation. The over-all defect is neutral and, 
in separating into components, the phosphorus will 
tend to be positively charged, its normal ionized state 
when isolated. : 
The indication that $10 reorientations are occurring 
before disassociation means that the defect is diffusing 
through the lattice as an entity for considerable dis- 
tances. Considering the approximate nature of these 
estimates it is not clear then whether the annealing of 
the E center actually is a result of its dissociation or 
whether it is the result of long-range migration as an 
entity with subsequent trapping by other defects. ce 
experiment that may bear upon this is that reported by 
Saito et al. They have studied by electrical measure- 


state for resonance. This complication 


longer in the proper charge aking detailed annealing studies for 


has discouraged us from m 
w Ht Saito, M. Hirata, and T. Horiuchi, J. Phys. Soc. Japan 18, 
Suppl. III, 246 (1963). 
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Fic. 18. The potential energy (in eV) of a vacancy in the vicinity 
of a substitutional phosphorus atom, as determined from the 
experiments on the Æ center. The abscissa denotes the various 
neighbor positions in order of increasing separation, and are not 
necessarily in a straight line. The quantity AE was estimated from 
annealing studies by (a) EPR and (b) electrical measurements. 


ments the annealing kinetics of a deep acceptor at 
(E.—0.4)eV in floating zone silicon produced by 
irradiation. Considering the conditions of their experi- 
ment, it is possible that they are monitoring the Æ 
center anneal, as they have suggested. They observe 
first-order kinetics with a characteristic recovery time 
7 given by? 
71-1£10* exp(—0.94/RT). 


The close agreement of this activation energy to that 
which we estimate for diffusion suggests that the anneal- 
ing they are studying is the diffusion to traps. This is 
also consistent with the pre-exponential factor of 10* 
indicating, with our pre-exponential factor of ^ 109 
for the single reorientation process, ~10° diffusional 
jumps before trapping. Analysis similar to that leading 
to (26), but using $10* jumps without separation and 
at the higher temperatures of anneal used by Saito 
et al.(~175°C) gives AEZ 0.45 eV. This value is also 
indicated in Fig. 18. 

On the other hand, there is also evidence for dis- 
sociation in that some Si-4 centers (oxygen-vacancy 
pairs) are observed to form as the Si-E centers dis- 
appear. This has been observed in electrical measure- 
ments? and has also been observed by us in resonance. 
These vacancies could be liberated of course, after 
trapping. In any event, we recognize from our estimates 
here that it is touch and go as to whether the vacancy 
and phosphorus stay together long enough to reach a 
trap as an entity or whether they dissociate first, and 
that probably both processes are occurring. To in- 
vestigate this in detail one would require studies over 
a wide range of temperatures and defect concentrations. 


V. SUMMARY AND CONCLUSIONS 


We conclude that the Si-Æ center resonance arises 
from the neutral charge state of a lattice vacancy ad- 


3? We have estimated the pre-exponential factor from the curves 
of Fig. 6 of their paper. (See Ref. 38.) 

€ G. K. Wertheim and D. N. E. Buchanan, J. Appl. Phys. 30, 
1232 (1959). 
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jacent to a substitutional phosphorus atom. It is pre- 
sumably formed when a lattice vacancy, formed at 
random in the lattice by irradiation, diffuses through the 
lattice, and is trapped by the phosphorus impurity. It 
appears to be the dominant defect formed in phos- 
phorus-doped floating zone silicon by electron irradia- 
tion at room temperature. In pulled crystals, containing 
oxygen ~10!7—10'8 cm™, this center is not observed, 
the oxygen now serving as the dominant vacancy trap 
to form the Si-4 center.! The Si-E center introduces an 
acceptor level at ^ (E.—0.4)eV. With the Fermi level 
position above this level, the defect is negatively 
charged and is not observed in the resonance. 

'The model of the center deduced from the studies in 
this paper is one in which two of the three silicon atoms 
adjacent to the vacancy pull together to form a pair 
bond with the unpaired electron observed in the res- 
onance left primarily to the third silicon atom. This 
distortion can be viewed as a manifestation of the 
Jahn-Teller effect. The primary confirming features in 
the spin resonance are summarized as follows: 


(1) Analysis of the resolved Si? hyperfine interactions 
indicates that ~60% of the wave function is localized 
in a dangling bond on a single silicon atom. An addi- 
tional 20-25% of the wave function can be accounted 
for by partially resolved hyperfine interactions on a 
few (5-6) near-neighbor sites, with the remaining 
15-20% presumably spread out over more distant sites. 

(2) The spectrum also reveals resolved hyperfine 
interaction with a 100% abundant I=} impurity. 
ENDOR measurements confirm that this impurity is 
phosphorus. Only ~1% of the wave function is localized 
on this atom. 

(3) The g tensor is nearly axially symmetric reflecting 
the axis of the dangling bond which makes up most of 
the wave function. Its anisotropy is similar to other 
defects which we have studied and have identified as 
made up of dangling bonds associated with vacancies. 
Also, a simple “localized molecule" treatment of the 
defect has been given which indicates that the magni- 
tude and sign of the g shifts are reasonable. 

(4) Motional broadening and narrowing effects are 
observed in the spectrum in the temperature region 
60-150°K and are attributed to the thermally activated 
reorientation from one Jahn-Teller distortion direction 
to the other. Analysis confirms that the motion is that of 
an electronic “bond switching" (not atom motion) and 
that the motion is around the phosphorus-vacancy axis 
as predicted by the model. 

(5) This motion is also studied by stress induced 
alignment at 20.4°K. Combined with the linewidth 
studies, the kinetics of the reorientation process can 
be obtained with very high precision. The observed 
activation energy ~0.06 eV appears reasonable for this 

_ process. The sense of the alignment is consistent with 
the natural preference of the defect to align so that the 
applied stress aids the Jahn-Teller distortion. The 
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degree of alignment can be accounted for by a simple 
model which only considers the change in energy per 
unit strain along the axis joining the bonding Si-Si pair. 
The magnitude deduced for this parameter is reasonable 
being of the order of observed deformation potentials, 

(6) Stress-induced alignment of the phosphorus. 
vacancy axis is also studied. The sense and magnitude 
can also be accounted for in terms of the model. The 
kinetics of this process, with an activation energy of 
0.932-0.05 eV, appear reasonable for the atom re. 
arrangements required by the model. 


The E center, in addition to being an important 
defect in radiation damage studies, may play an 
important role in unirradiated silicon. High-temperature 
diffusion of substitutional impurity atoms, as well as 
self-diffusion, is generally believed to be via lattice 
vacancies. As a result, such impurity-vacancy inter- 
actions are important in understanding the mechanics 
of impurity diffusion. The Æ center represents a unique 
opportunity to probe this interaction in detail. Being 
the neutral charge state appropriate for the Fermi level 
in the middle of the gap, it is in the correct charge 
state relevant to the high-temperature experiments. In 
Fig. 18, we summarize what we have been able to learn 
about this interaction. The abscissa denotes schemati- 
cally the different neighboring atom positions as one 
moves out from the phosphorus. They are not neces- 
sarily in a straight line. We don’t know the exact shape 
to be expected but it seems reasonable to assume it is 
something as shown. From the kinetics of the stress- 
orientation of the phosphorus-vacancy axis we have 
determined the height of the second maximum to be 
0.93+0.05 eV, as indicated. We have also made two 
estimates for the indicated difference in barrier heights 
AE from annealing studies. The smaller estimate comes 
from EPR measurements and the larger one comes from 
electrical measurements. From previous studies? we 
estimate that the activation energy for the isolated 
vacancy motion ist! Ej4Y—0.332-0.20 eV. The in- 
dicated estimate of the binding energy Eco~0.28 eV at 
the third-neighbor site comes from a simple calculation 
of the electrostatic interaction between a megative 
vacancy at this site and a positive phosphorus atom. 
This value is seen to be generally consistent with the 
other experimentally determined estimates. 

We have therefore been able to deduce quite a bit 
about the details of the phosphorus-vacancy interaction. 
It is hoped that future experiments can narrow further 


“ The value for p-type silicon has been determined to be (Ref. 
36) 0.332-0.03 eV. However, in annealing studies in n-type silicon 
(Refs. 3, 36), it is found that both A and E centers emerge in two 
discrete temperature regions. One of these is presumably to be 
associated with long-range motion of the vacancy, and, depending 
upon which we select, we either deduce ~0.1—0.2 eV or ~0.4—0.5 
eV for the activation energy of the vacancy motion. This difference 
from the p-type value presumably reflects the role of charge state 
in the mobility of the vacancy. Since we do not know the chargé 
state appropriate to Fig. 18, we indicate an error +0.2 eV to 
bracket these extremes. 
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the errors in £y" and AE with the result that this 
interaction will eventually be rather precisely mapped 
out. However, even with the present uncertainties we 
can draw certain conclusions: 

From Fig. 18, we see that there is a sizeable binding 
energy (~1.0 eV) between the phosphorus atom and a 
vacancy in their closest bound state.” Because of this 
attractive interaction there is an enhanced probability 
of finding a vacancy near the phosphorus at elevated 
temperatures and we therefore expect the diffusion of 
the phosphorus to be enhanced over that for silicon 
self-diffusion. However, we recognize that it is only the 
binding at the third-atom site that is operative in enhancing 
the diffusion because the vacancy must go through this 
site in order for the phosphorus atom to diffuse. 
For this binding energy, we take the Coulomb estimate 
Ec=0.28 eV. We estimate the error in this quantity 
somewhat arbitrarily as +0.15 eV, which we feel, in 
view of the general consistency of the values in Fig. 18, 
is overly cautious. 

Phosphorus diffusion in silicon has been measured 
and the activation energy has been determined to be# 
3.66+0.18 eV. However, there has been no correspond- 
ing measurement of self-diffusion in silicon. Using the 
value 0.282-0.15 eV for the lowering of the phosphorus 
activation energy, we therefore estimate the activation 
energy for silicon self-diffusion 


Eescit_aitt, = 3.94 40.33 eV. (27) 


Since this quantity has not been measured directly, we 
feel that this probably represents the best present 
estimate that can be made for this important quantity. 

Combining this with the activation energy for 
vacancy motion of 0.33-£0.20 eV, we obtain for the 
formation energy of the lattice vacancy“ 


W=3.6+0.5 eV. (28) 


ACKNOWLEDGMENTS 


It is a pleasure to acknowledge the assistance of 
W. Colliton in all phases of the measurements. Helpful 
conversations with Dr. F. Ham, Dr. B. Segall, Dr. R. 
W. Redington, and Dr. R. M. Walker are also gratefully 


acknowledged. 


4 We remind the reader that these arguments (and the values 
in Fig. 18) apply only to the neutral charge state of the defect and 
therefore when the Fermi level is near the middle of the gap. This 
should apply to the high temperatures used in diffusion studies 
for anything but the most heavily doped materials. “ 

43 This important fact (see Sec. IE and Fig. 17) has been 

enerally overlooked by previous authors. - 
3 ae Š. Fuller and J. A. Ditzenberger, J. Appl. Phys. 27, 5: 
1956). diens " 
f £9 ut have, of course, assumed throughout this discussion that 
both phosphorus and self-diffusion proceed by the vacancy 
mechanism. If other mechanisms are also involved, the formation 


energy will be even larger. 
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APPENDIX A: ESTIMATE OF |15,/(0)|? AND (72,74) 
FOR THE NEUTRAL 3p ATOMS 


In previous publications,-* we have used |2.0) |*si 
= 24(10) cm and (r35-)3i:17(10) cm” in the 
analysis of resonance centers. These estimates were 
originally made using various Hartree functions, 
available only for higher ionization states of silicon, as 
guides. Since then, Hartree-Fock calculations for neutral 
silicon, as well as all of the other neutral 3p atoms, have 
been reported by Watson and Freeman" and a revised 
estimate of these quantities is in order. 

In Table III, the first two columns show |;,(0)|? 
and (r35 *) determined from the Watson and Freeman 
wave functions for the atoms aluminum through 
chlorine. Consider first |V5,(0)|*. As was pointed out 
by Kohn and Luttinger,” direct comparison with experi- 
ment in the case of the alkalies shows that Hartree- 
Fock wave functions generally underestimate |y,(0)|*, 
and the true values will probably be somewhat larger 
than these values. In the case of silicon, Kohn and 
Luttinger estimated an appropriate scaling factor by 
comparing the Hartree-Fock estimate of |;.(0)|* for 
the alkali-like Si** ion* with that predicted by the 
Goudsmit formula? for such ions and assuming the 
same scaling factor to apply for Si. Their result was 
that the Hartree-Fock value should be increased by 
~19%. Applying the same correction to the Watson 
and Freeman value for Si’, we obtain the final estimate 
of 31.5(10?*) cm-? shown in the table. 

Now consider (757?) Barnes and Smith? have 
estimated this quantity from an analysis of optical fine 
structure splittings and their values are given in column 
three of Table III for each of the 3p atoms. In the case 
of aluminum?! and chlorine,” atomic beam hyperfine 
measurements give accurate determinations of (ripis 
also shown in the table, to which we can compare the 
estimates of Barnes and Smith. We see that the agree- 
ment is very good, the atomic beam values lying 7367 
higher. We thus increase the Barnes and Smith estimates 
by 3% for the remaining atoms to give the final estimate 
of (r5?) shown in the last column. 

We note that the (rap °) determined from the Watson 
and Freeman wave functions are uniformly too low. 
It is interesting to note that for Si’, the value must be 
increased by 19% to agree with our final estimate. 
Although this agreement with the scaling factor for 


45 We are indebted to B. Segall for supplying us with tabulated 
values of the 3p Watson and Freeman wave functions from which 
(rap) were calculated. 

4 W. Kohn and J. M. Luttinger, Phys. Rev. 97, 883 (1961). 

€ W. Hartree, D. R. Hartree, and M. F. Manning, Phys. Rev. 
60, 857 (1941). 

49S. Goudsmit, Phys. Rev. 43, 636 (1933). 

aR, G. Barnes and W. V. Smith, Phys. Rev. 93, 95 (1954). 

51 H. Lew and G. Wessel, Phys. Rev. 90, 1 (1953). 

5$? V, Jaccarino and J. G. King, m Rev. $3, 471 (1951); 
J. G. King and V. Jaccarino, ibid. 84, 852 (1951). (rsp) was 
determined from their results by analysis similar to that in the. 
preceding Ref. 51. 
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TABLE IIT. Various estimates of [Ya (0)|? and (rar?) for the neutral 3p atoms. The indicated values were determined from (HF) 
Hartree-Fock wave functions, (FS) experimental fine structure constants, and (AB) atomic-beam hyperfine measurements. See text 


for the method of determining the final estimates. 


[Ya (0) |? 


Or Estimates 
3. (0) |nr CROSS 3e: rap 3) Scaling [Yas (0) |? (rap ?) 
Atom de ium dos red (o a5 dos em) factor (10% cm?) — (10% cm») 
Al 1595 — 694^ 8.645 8.95¢ 1.29 20.4 8.95 
Si 26.42 13.59 15.6> eee 1.19 31.5 16.1 
P 37.88 22.09 23.5b 1.10 41.6 24.2 
S 53.38 31.98 33.99 o38 1.09 58.4 34.9 
Cl 71.8" 44.28 48.3b 49.51 1.12 80.4 49.5 


c Calculated from wavefunctions in R. E, Watson and A. J. Freeman, Phys. Rev. 123, 521 (1961). 


R. G. Barnes and W. V. Smith, Phys. Rev. 93, 95 (1954). 
* H. Lew and G. Wessel, Phys. Rev. 90, 1 (1953) 


a V. Jaccarino and J. G. King, Phys. Rev. 83, 471 (1951); J. G. King and V. Jaccarino, ibid. 84, 852 (1951). 


|Va,(0)|? must be considered somewhat fortuitous, it 
does suggest that both (rsp °) and |v3,(0) |? are under- 
estimated by a comparable amount in these Hartree- 
Fock wave functions. This then suggests a consistent 
way to determine the proper |¥3,(0) |? scaling factor for 
the other atoms: For the atoms other than Si, the 
scaling factor was determined as being equal to the 
ratio of our final estimate of (rp °) to that determined 
from the Freeman-Watson wave functions. The scaling 
factor thus determined is shown in Table III along with 
the corresponding final estimate of |y5,(0)|? for all 
the 35 neutral atoms.5 

We will not attempt to set limits of accuracy on the 
values of Table III. There are still sufficient uncer- 
tainties in the applicability of these free atom values 
to those of a bonded atom surrounding a defect in a 
solid to make such speculations somewhat academic 
for our use, anyway. However, Table III does give a 
set of reasonable values, consistently arrived at, for 
the complete 35 series of neutral atoms. 

In this paper we have used only the values for 
Silicon and phosphorus. In subsequent papers in this 
series, we will have occasion to use the estimates for 
others of the atoms in the table. 


APPENDIX B: APPROXIMATE LCAO 
TREATMENT OF THE g SHIFT 


We approximate the one-electron LCAO molecular 
orbitals of Fig. 8 as: 


V/(A)— es! (4) c ana (A)2- Bap: (4) , 
V*(A Xe) = N#[o3!4 (A)2:o3-1! (X5) 


FL e()25/;0€02)], (B1) 


with similar expressions for the other orbitals. Here 


5 The scaling factor is not constant throughout the series but 
is largest at the beginning. A further check that this variation is 
not unreasonable is given by noting that an even larger scaling 
factor of 1.42 is required in the case of neutral sodium (which 
starts this row in the periodic table), in order to raise the value 
determined from the Hartree-Fock calculations [D. R. Hartree 
and W. Hartree, Proc. Roy Soc. (London) A193, 299 (1948)] 
(3.56X 10% cm™) to that observed experimentally [P. Kusch and 
H. Taub, Phys. Rev. 75, 1477 (1949)] (5.05 10% cm™). 
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a3}, o1, o5, and c3!" represent four orthogonal cylin- 
drical orbitals made up of 3s-3 atomic wave functions, 
where the superscript represents the axis of the directed 
orbital as shown in Fig. 8. (By ey ™!, we denote an 
orbital pointed in“ the opposite direction from s.) 
We have included small admixtures of the core wave 
functions y;, (1s, 2s, and 25), in order to orthogonalize 
the total wave function to the core. The small admixture 
coefficients are given by 


e— (c; (Xe) |Yi(A)), ete. 
The normalization coefficients are 


N+=[2(14S—D e2)]?^, 


with 
S= (e; (A) | os (X2). 


A typical orbital angular momentum matrix element 
of (15) is 


GAX AILI (4) 
-X (HAX) |o: (0) rv GL Lilas (A): 


Substituting the expression for ¥+(AX») given by Eq. 
(B1), and using the fact that only c3 (A) has significant 
overlap with any of the terms for J*(AX;), we obtain 


EAX) Ly] V (4)833 (W*) (e (A) | Lj] Y (4))- 
(B2) 


Now consider the corresponding matrix element in 
(15) involving Vao 
(A) | (:9:195 (4X3) 
= (8/mo) (s (4)| (E x p)s\¥*(AX2)). (B3) 
The electric field E is large only near the nuclei of 
atoms A and X; and therefore, in these matrix elements, 
tends to weight strongly only the parts of the Wave 
functions near the atomic cores. To a good approxima- 


tion we may therefore retain in Eq. (B1) only the 
atomic terms centered on atom A giving 


&(8/mc)N*(ss (A) | (E x p);| es (4) — essor (4))- 


E ^— 
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(The terms centered on atom X», though not zero near 
atom A, are slowly varying there, and therefore do not 
contribute significantly to the spin-orbit interaction.) 
Having reduced the matrix elements to those on atom A 
only, we may now make the substitution E- [r/r]E(r) 
for the central force field near the nucleus of atom 4 


: ul H 7 
leading to the familiar free atom expressions 


; (nian | (V4) wm!) = durntwi(lm| Lim), 
with 
Natri (8/me)(nl| LE(r)/r]| ul). 

This gives, upon straightforward manipulation, 
Q/C4)] (V). P5 (A X3) 
ENS, (€opAop,3p)/Br Ko? (A) | L,|o%(A ))- (B3’) 


Substituting (B2) and (B3’) into (15), and summing 
over-all excited orbitals in (B1), 


Agu=0, 
Une? loi 
Ag, ar |; (B4) 
Ey Ba 
with 
Aap, 3p€2p D 
cer d (B5) 
AspbI 


IRRADIATED Si A1377 
Here, y is a positive number because Azp,2p and ézp will 
always be of opposite sign, a consequence of the fact 
that Yap has one more radial node than Pap [In our 
previous treatment (I), the core overlap terms were 
not included in the wave functions. For these wave 
functions, y=0, and (B+) leads to Eq. (16) in the text 
of the present paper. ] 

We may estimate y, as follows: We approximate 
A2p.3p 48 — (A445)? using the values A2p=0.40 eV 
estimated by Liu* from x-ray emission data" and 
^:5—0.02 eV. We calculate e from Eq. (19) using 
Slater atomic orbitals’ and standard overlap integral 
formulas? in conjunction with certain required integrals 
tabulated by Rosen.5 For the normal Si-Si lattice 
distance and with the c‘(A) taken as the tetrahedral 
5f? hybrids (25% 3s, 75% 3p), we obtain &55 0.033. 
With r= (3)/2/2 for the tetrahedral hybrid, this gives 
y=0.17. 


*5 L. Liu, Phys. Rev. 126, 1317 (1962). 

55 D. H. Tombulian and W. M. Cadis, Phys. Rev. 59, 422 (1944). 

55 J. C. Slater, Phys. Rev. 36, 57 (1930). 

5 R. S. Mulliken, C. A. Rieke, D. Orloff, and H. Orloff, J. 
Chem. Phys. 17, 1248 (1949). 

95 N, Rosen, Phys. Rev. 38, 255 (1931). 
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Experimental investigations of the phenomenon of single carrier, space-charge limited injection in silicon 


at liquid-helium temperatures have been performed in 


limited currents (SCLC) in an insulator with traps. 


characteristics were performed on p,pps. devices fabricated by 


an attempt to validate the model for space-charge 
Comparisons between predicted and experimental 
diffusion techniques. Pulse and dc measure- 


ments were made to obtain steady-state current-voltage characteristics for the above devices. Bie ciications 
were observed in the SCLC characteristics due to field-dependent carrier mobilities, the exact nature of 
which could be determined from the experimental steady-state characteristics. The density of deep trapping 


centers in the above material at liquid-helium tempe 
tained by measurements of the traps-filled voltage. An 


rature, corresponding to the compensation, was ob- 
independent value for the compensation in the same 


material was obtained by Hall measurements in the temperature range 20°K «T <77°K. Transient meas- 


urements of the single-carrier current were performed 


, from which values for the capture probability and 


capture cross section for the deep trapping center could be obtained. A comparison was made between the 


capture cross section obtained in this manner and the 


INTRODUCTION 


HE theory of space-charge limited currents 
(SCLC) in insulators with traps as proposed by 
Lampert! has been employed frequently in recent years 
to explain the current-voltage characteristics of insul- 
ator structures. In some instances, however, uncertainty 
arose in the interpretation of the insulator character- 
istics because of apparent disagreement between 
material parameters obtained from the single injection 
interpretation and independent material measurements. 
For example, in CdS, single-carrier SCLC was investi- 
gated by Bube in an attempt to correlate the trap 
density with the traps filled voltage (Verr) as defined 
by Lampert. However, his determinations of trap 
density obtained from measurements of the apparent 
traps filled voltage differed by as much as three orders 
of magnitude from independently obtained trap-density 
values. 

This experimental uncertainty in the model for 
SCLC in insulators with traps prompted the work in 
this paper. It was observed, originally by Lampert, 
that the purest insulator in which to test the model 
would be a high-purity semiconductor at low tempera- 
tures (sufficiently low to cause the equilibrium carrier 
density to be completely negligible). Since the dopant 
densities in the semiconductor would be known (or 
could be determined by Hall effect measurements), one 
should be able to quantitatively compare the predictions 
of the theoretical model with the experimental charac- 
teristics. Other important results expected from 
investigation of single injection at low temperatures are: 


(1) the capture probability, hence capture cross 


*This work was supported in part by the National Science 
Foundation. 

f Present address: Sandia 
Albuquerque, New Mexico. 

1M. A. Lampert, Phys. Rev. 103, 1648 (1956); J. Appl. Phys. 
29, 1082 (1958). 
? R. H. Bube, J. Appl. Phys. 33, 1733 (1962). 
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theoretical prediction of the "giant trap" model. 


section, of the deep trapping levels present due to 
compensation could be measured directly by pulse 
techniques; 

(2) the field dependence of the drift mobility could 
be determined from the SCLC characteristic in the 
Child's law region ; 

(3) the injecting nature of contacts at low tempera- 
tures could be explored; and 

(4) the nature of scattering mechanisms at low tem- 
peratures could be better understood by interpreting 
the field-dependent mobility data. 


D.bb, SILICON DEVICES AT LOW TEMPERATURES 


As mentioned above, the ideal material in which to 
examine the model for single-carrier SCLC should be 
a high-purity semiconductor at low temperatures. In 
actuality, however, one is limited to germanium or 
silicon since no other sufficiently pure semiconductors 
are available. Furthermore, high-purity germanium is 
unsuitable for these investigations since at low tempera- 
tures impact ionization occurs at quite low field 
strengths,-4 destroying its insulating nature. Thus, 
presently only silicon is suitable for investigating SCLC 
at low temperature; furthermore, since highest purity 
silicon is invariably p type due to the presence of boron, 
p-type material was chosen for the experiment in this 
work. 

As the temperature of a sample is reduced to below 
the extrinsic range (T <50°K in lightly doped material) 
the free holes, produced by thermal ionization of non- 
compensated acceptor centers, recombine with ionized 
acceptors producing neutral centers. At 4.2°K (liquid- 
helium temperature) virtually all free holes will have 
disappeared and V 4— V p neutral acceptor centers vill 
have appeared. The electrical effect of the compensated 


3 N. Sclar and E. Burstein, Phys. Rev. 98, 1757 (1955). 
1S. H. Koenig, R. D. Brown, and W. Schillinger, Phys. Rev. 
128, 1668 (1962). 
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SINGLE INJECTION 
centers at low temperature is to produce trapping levels 
for injected carriers. The positively charged donor 
centers would be trapping levels for injected electrons. 
Similarly, the negatively charged acceptor centers 
would be traps for injected holes. In this simplified 
model the impurities consist of V4 shallow acceptors 
and N p shallow donors (shallow at room temperature) 
where X47 N p. The trapping kinetics for this model 
depends in large part on the "capture cross section" of 
the ionized centers. The theoretical considerations 
which allow this cross section to be determined will be 
discussed in a later section dealing with the cascade 
capture process.® 

To fabricate a single-carrier SCLC device, one must 
determine which actual contacts exhibit the properties 
assumed in the model. To reiterate these properties, one 
contact must be “ohmic” and the other “blocking.” At 
the ohmic contact a large reservoir of holes which can 
be readily injected into the bulk under forward bias 
must exist. Such contact is not easily achieved for 
temperatures in the liquid-helium range. For example, 
it is known that most metal contacts to silicon create 
Schottky-type barriers at the metal-semiconductor 
interface which, though small at room temperature in 
some cases, are extremely large compared to kT at 
4.2?K. Such a barrier would, of course, prevent hole 
injection at this temperature. A nondegenerate diffused 
or alloyed p contact would also be unsuitable for use as 
an injecting contact at these low temperatures since the 
contact region would merely be an extension of the 
insulating bulk. However, a degenerate f, contact 
appears to be suitable for the injecting contact at low 
temperatures since in the degenerate p+} region there 
exists a large number of mobile holes which could easily 
be injected into the insulating region because only a 
slight barrier exists. 

The exact nature of the blocking contact at low tem- 
peratures is not so important. Any contact which 
prevents neutralizing electrons from entering when 
holes are being injected at the ohmic contact is satis- 
factory. For this investigation of SCLC the most 
suitable contact would again be a degenerate p, contact 
since no rectification is desired. No electrons can be 
injected at this contact at low temperatures regardless 
of the bias polarity. 

The structures in which SCLC was explored at low 
temperatures were, therefore, p,pp+ devices with de- 
generate p, regions. The schematic potential solution 
for the entire structure in equilibrium is shown in Fig. 1. 
The metallurgical junctions occur at X=0, d in this 


diagram. 


FIELD-DEPENDENT MOBILITIES AT 
LOW TEMPERATURES 


At temperatures in the liquid-helium range carrier 
mobilities become field-dependent for values of electric 


5 M. Lax, Phys. Rev. 119, 1502 (1960). 
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ficld orders of magnitude smaller than at room tempera- 
ture. To obtain an explanation for this phenomenon, 
one must consider the temperature dependence of the 
mobility. Since at 4.2°K the low-field mobility in lightly 
doped silicon is more than two orders of magnitude 
greater than its room-temperature value,’ the carrier 
drift velocity at this temperature for a given field 
strength is likely to greatly exceed the room tempera- 
ture drift velocity at the same field strength. Therefore 
the breakdown of the constant mobility assumption, 
which occurs when the energy given to mobile carriers 
by the field between collisions is comparable to the 
equilibrium carrier energy, should occur at greatly 
reduced electric fields at low temperatures due both to 
the increased mobility and to the decreased equilibrium 
carrier energy at these temperatures. 

The two principal mechanisms which determine the 
form of the mobility field dependence are acoustical 
and optical scattering. Only at very low fields must 
ionized impurity scattering be considered.” 

The equation governing the dependence of the 
mobility y and electric field Æ when only acoustical 
scattering is of importance is 


u= (8/3) (proc) PETE, (1) 


where uro is the acoustical scattering limited zero-field 
mobility and c is the velocity of sound in the semi- 
conductor. 

The field strength at which optical scattering should 
become important in determining the mobility can be 
approximated by employing the above equation. If one 
equates the carrier drift velocity as calculated from this 
equation to the saturated drift velocity as obtained by 
equating the carrier kinetic energy to the Raman energy 
Er the following results, 


(8/3) (uro) 1E: 2221/2(2 &g/ mF”, (2) 


where m* is the density-of-states effective mass. 

When the appropriate constants are inserted, it is 
obtained that in p-type silicon for E>50 V/cm optical 
scattering should determine the mobility at 4.2°K. This 
result is surprising when one considers that electric fields 
near 10° V/cm were required at room temperature? to 
cause the carrier drift velocity to approach saturation. 

As shall be seen in the following section, the minimum 
electric field value for which SCLC is observed in the 


5 E. H. Putley and W. H. Mitchell, Proc. Phys. Soc. (London) 
72, 193 (1958). ipic : 

1J. B. Gunn, Progress in Semiconductors (John Wiley & Sons, 
Inc. New York, 1957), Vol. 2, pp. 213-247. 5 

s À. C. Prior, Phys. Chem. Solids 12, 175 (1959). 
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present compensated silicon devices at low tempera- 
tures exceeds the value of 50 V/cm by more than an 
order of magnitude. One would expect, therefore, that 
for all the SCLC devices investigated at low-tempera- 
ture optical scattering is the process determining the 
carrier mobility. 

The simple model for optical scattering, which 
assumes all carriers are scattered optically upon reach- 
ing the threshold energy for optical-phonon emission, 
allows no carrier energies higher than the threshold 
energy and predicts drift velocity saturation. When 
higher carrier energies are allowed, as would be in any 
realistic model, the drift velocity would not actually 
saturate. If, for example, the mobility varied as E ^, 
where a<1, as some workers have reported, the drift 
velocity would be proportional to Z'~«. If one assumes 
that due to optical scattering the hole mobility up is 
given by the expression 


up- KoE-*, (3) 


the constants Ko and a can be evaluated empirically by 
comparing the observed Child's law region character- 
istics to the theoretical Child's law characteristic. When 
the mobility variation of Eq. (3) is assumed, one can 
calculate the Child's law characteristic. The resulting 
equation relating the current density j to the terminal 
voltage V is 


j-E(8—2)/Q—a) -*(eK9)/Q—o)Ve-?/de-9, (4) 


where e is the permittivity of the material and d is the 
device thickness. This equation can be seen to be correct 
by allowing a to equal zero. In this case the equation 
reduces to the familiar square-law characteristic for 
constant mobility. 


TRAPPING AT LOW TEMPERATURES 


At low temperatures in p;)), single injection devices, 
the hole trapping process should involve ''giant" traps? 
since Coulombic attraction exists between injected 
holes and the negatively charged compensated boron 
atoms. An injected hole would therefore be trapped 
first in a large orbit excited state of a negative boron 
center via the emission of a single acoustical or optical 
phonon. After being trapped in this excited state, the 
hole would decay to the ground state of the center via 
a cascade of single-phonon processes. 

Whenever carriers are injected into insulators where 
deep trapping centers exist, the rate at which the 
carriers are trapped is naturally related to the capture 
cross section of the centers. In the case of holes injected 
into p type silicon, the rate at which the carriers are 
trapped is given by the equation 


dp/di— — BpN 47, (5) 


where JV 4- is the density of ionized acceptors, p is the 
density of injected holes, and B is the mean-capture 
probability. B is obtained by averaging B(6) over the 
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energy distribution of the injected carriers. The energy- 
dependent capture probability can be written as 


B(&)-«(8)Q6/m*)h, (6) 


where c (6) is the energy-dependent cross section. 
'The mean capture probability is therefore seen to be 


B-(s(8)(26/m*)). (7) 


The experimental values of B, from which estimates 
for the capture cross section can be made, can be ob- 
tained by comparing the experimental current tran- 
sients with the analytically derived characteristic. For 
the condition of slow trapping (the drift transit time 
laXr1, the trapping time) the following analysis may 
be employed to obtain the analytical characteristic. 


TRANSIENT SCLC IN THE PRESENCE 
OF SLOW TRAPPING 


Carrier flow in a plane parallel insulator with deep 
traps can be described with four equations: the trans- 
port equation, the continuity equation, Poisson’s 
equation, and a rate equation describing the trapping 
process. If diffusion currents are again neglected, these 
four equations are as written below: 


ERE set d 
ox ot ðt 


OE(X,) q 
S DTN a Nani 
P € 


and 


Op(X,)  _ 
PA _ 5 6c) Nr p(X, (11) 


where q is the electronic charge, f, the density of 
trapped holes, z the density of mobile electrons, 1; the 
density of trapped electrons, and Np the density of 
trapping centers. 

In writing Eq. (11), the generation of carriers from 
the deep traps has been assumed to be negligible. Addi- 
tional assumptions which appear to be appropriate to 
the low-temperature situation are: 


(1) due to the quasistationary? behavior of the 
carrier flow in the slow trapping situation, both (a) the 
current divergence 07(X,/)/9X and (b) displacement 
currents may be neglected ; : 

(2) at 4.2°K, both 2 and m are negligibly small m 
p-type material ; 

(3) since the injected holes are captured on acceptor 
centers, Nr is equal to V4 in Eq. (11); and i 

(4) since tar, the trap free SCLC solution 1$ 
assumed to apply at /— 0. 


e 
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Fhe simplified equations can be rewritten 


IN COMPENSATED Si A1381 


i as To solve this system of equations for j(/), one proceeds 
I) = quyp(X DEX À, (12) 9$ follows: 
Op(X,I) Op. X.) ` 
ELERA PA, d (13) (1) relate p(X) and p(X,t) via Eqs. (13) and 
at gu VU. Wh 
dE(X) q (2) relate j() to p,(X,) using these results and 
"ay TAI eX O+No—Na], (4) Pa (2;and 
4 € (3) substitute SCLC initial conditions into the 
p(X d _ problem obtaining the solution for j(1). 
— = BROGDENA— p(X]. (15) 
ðt The resulting expression for j (f) is 
i=] 1—[2(2—a)*(d/X) 12 a Vrrr/(3—a) v] Doo S ee ] 
1—[20—e d/X) «016-9 Vzs,/ (3—0) V] expBIN p(1— (3—a) (X/d) «98-9 V/2(2 ap Vei] S 


Equation (16) does not agree completely with the 
assumptions made in its derivation since it predicts a 
nondivergenceless current. The divergence is not great, 
however, since the exponent of the X terms, (a—1)/ 
(2—a), is much less than one for values of a appropriate 
to the devices of the following section. In fact, for a=1, 
corresponding to drift-velocity saturation, the X 
dependence disappears altogether yielding a completely 
divergenceless current. Since the assumptions upon 
which the present analysis is based are strictly valid 
only for the case of drift-velocity saturation, the 
theoretical current transients shall be calculated under 
this assumption. The difference in form between the 
characteristics obtained in this manner and the results 
of an exact solution should not be great since, for values 
of o reasonably close to one, the X dependence of the 
current in Eq. (16) is greatly depressed. Hence, by 
matching the theoretical current transients obtained 
assuming drift-velocity saturation to the experimental 
characteristics, one should be able to obtain approxi- 
mate values for B even though the drift velocity has 
not reached saturation. The theoretical transient 
characteristic to be employed in these comparisons is 
obtained by substituting a= 1 into Eq. (16). This yields 


1—Vor1/V | 
1— (Vas V) exp{BiNc(i—Varw/V)}4 


j0—4 
(17) 


Several limit checks may be performed on this solution 
to compare it to reality. For example, an obvious 
limit check is an examination of the behavior of the 
equation as ¿—> «o for the two cases of V« VrrL and 
V Varr. For V < Verr, Eq. (17) predicts that j(/) — 0 
as [— œ. This corresponds to the result expected 
from physical considerations. For the second situation 
where V >Vrru the equation predicts that j()— 
Jo(1— Varz/ V), which also corresponds to the expected 
behavior. An additional check on the equation may be 
obtained by allowing the density of compensation Vp 


(16) 


to become very large. In this case, Vrpr/V would be 
much greater than 1 and. j(/) — Jo exp(— BIN 4). This 
simple exponential is the transient characteristic one 
would expect from physical reasoning for this case, 
since the density of ionized trapping levels is so large 
as to remain essentially constant even though carriers 
are being trapped.’ 


DEVICE CONSTRUCTION AND EVALUATION 


The degenerate p, contacts were formed by com- 
paratively low-temperature boron diffusions. For the 
temperatures employed (900-950°K) the surface con- 
centration of boron exceeded 10° cm-? as was observed 
from sheet resistivity and junction depth measurements 
on identically diffused n-type wafers.” These measure- 
ments indicated junction depths of approximately 1g 
in addition to the above value of surface concentration. 

Two techniques were employed to fabricate single 
injection devices from the diffused wafers. Circular 
devices with areas of approximately 10-? cm? were 
obtained by masking and etching. Rectangular devices 
of comparable area were made by scribing and breaking 
the diffused samples. Leads were attached to the above 
devices by thermocompression bonding aluminum wire. 

To obtain measurements at 4.2°K on these devices, 
a shielded, light-tight cryostat was employed. To obtain 
the necessary low temperature, the cryostat was in- 
serted into the liquid-helium chamber of a double-walled 
transport Dewar. 

To measure the steady-state current-voltage charac- 
teristics over the relatively large range of currents 
observed in SCLC devices, several techniques were 
employed. Below 1 mA, dc measurements could be 
taken without appreciable heating. Consequently 
currents between 10-? and 10-? A were observed using 
dc schemes. Point by point measurements were made 
in some cases using a well-shielded circuit employing a 


? A, Many and G. Rakavy, Phys. Rev. 126, 1980 (1962). 
w G. Backenstoss, Bell System Tech. J. 37, 699 (1958). 
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ES SCOPE (V) 
PULSER m 


DEVICE ATTENUATORS 


Zo SCOPE (I) 


Fic. 2. Pulse measurement circuit. 


precision decade resistance box to sense the current. 
In many instances, however, the dc data were obtained 
using an oscilloscope as an X-Y plotter. This procedure 
speeded up the data taking process considerably while 
still allowing accuracy within the range required. In 
some cases the device characteristics were observed on 
a curve tracer when high accuracy was not needed. 

For currents greater than 107? A where heating would 
prevent dc measurements, the steady-state current- 
voltage characteristics were obtained using repetitive 
submicrosecond pulses. The pulse measurement setup, 
as shown in Fig. 2, differed slightly from the dc setup 
since to prevent reflections the transmission lines were 
matched with their characteristic impedance Zo at all 
terminations. The pulse duration in these measurements 
varied between 100 and 300 nsec. The repetition rate 
was approximately 150-350 pulses per second. A fast 
rise-time oscilloscope (Tektronix 555) was used to 
observe the pulses in all cases. 

The single-shot pulse measurements, used to approxi- 
mate the trap-free SCLC, were taken with the same 
circuit as shown in Fig. 2. A single-shot measurement 
was made by pulsing the device once after it had been 
thermally cycled from 300 to 4.2°K. The oscilloscope 
traces corresponding to the single-shot voltage and 
current pulses were recorded on high speed film, 
providing records from which the data could later be 
taken and analyzed. 


EXPERIMENTAL CURRENT-VOLTAGE 
CHARACTERISTICS 


The experimental steady-state current-voltage char- 
acteristics as obtained at 4.2°K for p, pp, silicon devices 
exhibited the structure one would expect for SCLC in 
an insulator with deep trapping levels. In Fig. 3 the 
steady-state characteristic has been plotted for device 
II. This characteristic, which is typical of all the devices 
tested, clearly shows the presence of a traps-filled 
voltage, at which point the current increases steeply 
with voltage. When the traps-filled voltage is measured 
for this device and for the others constructed from the 
same material, the values tabulated in Table I are 
obtained. If the trap density were uniform, as was 
previously assumed above, a d? dependence would be 
observed in Varr. By plotting logVrrr versus logd, 
where d is the minimum thickness, a straight line of 
slope two would connect the measurements for the 
various devices for this dependence. In Fig. 4 this plot 


: ; is shown including the possible errors in device thickness 
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TABLE I. Traps-filled voltage values and the corresponding 
minimum thicknesses for several devices. 


Device Minimum thickness (cm) Ver (V) 
(0) 7.7X1075 31 
II 6.8X 10-3 25 
III 8.2X 1073 36 
IV 9.0x 1075 44 
VII 1.1107 57 
VIII 7.2X1075 25 
IX 8.2x107* 32.5 


measurements (+5 y). The experimental points lie very 
close to the predicted slope-two line. 'The traps-filled 
voltage value obtained for each device allows a deter- 
mination of the trap density in the device. When this 
is done for all the devices in Table I, a mean trap 
density of 6.1X 10? cm™ is obtained. The maximum 
deviation from this value in the devices measured was 
approximately 10%. 

From the steady-state curve in Fig. 3 the form of the 
current-voltage characteristic in the Child's law region 
can be determined. However, the limited data in this 
region make the determination, with any degree of 
accuracy, rather difficult. To decrease the possible error 
associated with fitting the Child's law region, single-shot 
pulse measurements can be made, to approximate the 
trap-free characteristic. These measurements, as taken 
for the device in Fig. 3 are plotted as the boxed data 
points on the same curve. The agreement between the 
steady-state and single-shot curves in this figure 
indicates the usefulness of the single-shot measurements 
in obtaining an extension of the Child's law region. 
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By measuring the slope m of the characteristic in the 
extended Child’s law region one can obtain from Eq. (4) 
m= (2—a). For the devices analyzed a mean value of 
0.64 was obtained for a, 

From the value of Ky and the slope of the character- 
istic in the Child's law region, the field-dependent 
carrier mobility at low temperatures can be determined. 
In the range of applied fields 3x 10? V/cm « E « 3X 104 
V/cm the relationship between mobility and electric 
field was found to be 


up CE) = (2.622 1.4) X 105 7 (0.04220.04) cm?/Vsec. (18) 


The steeply rising region of the steady-state charac- 
teristic following Vzrr, can be obtained analytically for 
the two cases of constant mobility or constant drift 
velocity. However, for the more general situation where 
the mobility is field-dependent but the drift velocity 
not saturated, no analytical solution can be obtained. 
To obtain the characteristic for this case a numerical 
solution would be necessary. The effort required to 
obtain this solution would not be justifiable since little 
additional information would result. However, the 
basic fact about the steeply rising region is that any 
model, which assumes the onset of SCLC after a traps- 
filled voltage, predicts an extremely steep characteristic 
regardless of the applicable u(E) function. This predic- 
tion is most definitely verified by the previous experi- 
mental characteristics. 


CAPTURE PROBABILITY AND CAPTURE 
CROSS-SECTION MEASUREMENTS 


The mean capture probability;* B, can be obtained 
from the same single-shot pulse measurements that 
were utilized to obtain the extended Child's law region 
of Fig. 3. Two pulse recordings, typical of those em- 
ployed to obtain B, are shown in Fig. 5 as photographed 
from the oscilloscope display. a 

There are two ways in which one can obtain B from 
the single-shot current response. One can (a) match the 
theoretical transient characteristic [ Eq. a7] to the 
experimental pulse, or (b) estimate the initial time 
constant of the decaying current pulse and compare it 
to the theoretical initial time constant. 


102. 


© EXPERIMENTAL VALUES 


Fic. 4. Thickness 
dependence of Vrrt- 
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_ Fic. 5. Typical voltage and current transients obtained by 
single-shot pulse measurements. Scales: 4-403107? sec/major 
division: V —25 V/major division; / =20 mA/major division. 


Equation (17) is rewritten for clarity as 


1— Vrzgi/V 


j(t)- Je E = =, 
1— (Vrrt/ V) expB.V4(1 = V/ Verve 


It is seen to contain the parameters J, and B in addition 
to the previously determined constants Na and Vrrt. 
Some uncertainty exists as to where {=0 should be 
situated on the experimental pulse in order to cause it 
to correspond to the analytical characteristic of Eq. (17). 
Due to the accumulated rise time of the generator, 
oscilloscope, and measuring circuit there exists an 
uncertainty of approximately 20 nsec. Two alternative 
methods of matching the theoretical and experimental 
characteristics present themselves. One can assume 


(a) that the beginning of the current pulse, i(/), 
through the device corresponds to approximately 
20X 10^? sec before the peak of the i(¢) pulse, or 

(b) that negligible trapping occurs prior to the peak 
of the z(t) pulse in which case this point would corre- 
spond to /—O0 in the theoretical characteristic. 


Case (a) involved matching the theoretical and 
experimental characteristics at two points to allow 
determination of both Jy and B for each pulse. This 
technique was therefore called a “two-point fit.” Case 
(b) involved matching the characteristics at only one 
point since Jọ was known. This case was called a 
“one-point fit." In Fig. 6 the experimental pulse 
responses of two typical devices are plotted as circled 
points. Both one- and two-point fits were performed for 
these units and the results are also shown. It was 
observed that for these and other pulses the one-point 
fit tended to more closely approximate the experimental 
results. This would indicate that no appreciable trap 
filling occurred for most pulses before the peak of the 
experimental characteristic. 

The second general technique for calculating the 
mean capture probability from the experimental data 
was to relate the initial time constant rro for decay of 
the current pulse to B. Since at /—0 the density of 
trapping centers is approximately equal to Vp, one 
obtains that 

B=1/rr0N p, (19) 


where rro may be obtained by estimating the initial 


: 
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"TABLE II. Capture probability and capture cross section values 
for negatively ionized boron centers im compensated silicon 
at 4.20K. 


Device Method B (cm/sec) o( 84) (cm?) E (V/cm) 
Il Initial slope 1.0 X10°° 2.0 X1078 3.5 X10? 
II Initial slope 2.2 X10^* 4.0 X1078 4.7 X103 
Ill Initial slope 3.6 X105 6.9 X107! 4.1 X103 
Il Tnitial slope 2.0 X1076 4,9 X10783 2.1 X103 
Il Initial slope 2.8 X107* 6.1 X107 2.9 X10? 
IH 2-Point fit 3.2 X10^* 5.0 X10% 7. X10 
1H 1-Point fit 2.05 X10^* 3.4 X107" 7.1 X103 
II 2-Point fit 1.6 X1075 2.9 X107? 4.8 X10 
IT 1-Point fit 3.0 X10^5 5.5 X107" 1.8 X101 
VII Initial slope 4.0 X107* 7.3 X107 4.9 X101 
VII 2-Point fit 6.0 X1078 1.1 X107? 4.9 X10? 
IX Initial slope (1.75-3.5) X1076 (3.6-7.3) X107? 3.3 X105 
IX Initial slope 1.5 X107 3.3 X10713 2.7 X10 
IX 2-Point fit 3.8 X107* 7.2 X10733 4,4 X10? 
IX 1-Point fit 2.2 X1076 4.2 X1075 4.4 X103 
IX 1-Point fit 1.4 X107* 2.1 X10783 8.3 X103 
IX 1-Point fit 1.4 X107* 2.3 X1078 6.6 X103 


slope of i(/) on a semilogarithmic plot. For this reason 
this technique was called the “initial slope” method. 

In Table II the values of B obtained by these three 
methods for several devices have been tabulated. The 
average electric field (V/d) is shown for each measure- 
ment. Because of the large scatter of the experimental 
results, no field dependence could be detected in the B 
measurements. By averaging the values for each of the 
techniques one obtains: (a) B=2.4X10-° from initial 
slope measurements, (b) B=2.0X10~° from one-point 
fitting of the theoretical characteristic, and (c) 
B=7.2X10 from two-point fitting of the theoretical 
characteristic. Š 

Since the value of B given in (c) does not agree well 
with (a) or (b) the value of B—2.2X 10-* cm?/sec is 
chosen as the most probable. 

In order to calculate a meaningful capture cross 
section for the trapping levels from the B value given 
above, one must know not only the energy distribution 


—-- TWO POINT FIT 
—— ONE POINT FIT 
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J (A/cm*) 
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Fic. 6. Theoretical and experimental transient characteristics 
for devices ITI and IX. 
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of the carriers but also the energy dependence of the 
capture cross section since B is given by Eq. (7). How- 
ever, in the high-field case certain assumptions about 
the energy distribution of the carriers can be made 
which simplify the relationship between B and c. For 
this case, because of the heating effect of the electric 
field on the carriers, optical mode scattering is the 
dominant energy-loss mechanism. If one assumes that 
the component of the carrier energy contributed by the 
field between optical collisions is much greater than the 
carrier energy remaining after a collision, the average 
carrier energy will be given by 


Ga= (1/2)m*og , (20) 


where v4 is the drift velocity. If one also assumes that 
few carriers have energy is excess of Sp, Eq. (7) simplifies 
to 


B=«(6&a) (264/m*)!*.. (21) 
Replacing the quantity 
(26a/m*)? 


by its equivalent 
Ky , 


one obtains from (21) the simple expression 
c(64)- B/KyEt-2. (22) 


The values of ø (64) calculated by applying Eq. (22) 
to the values of B in Table II have been included in this 
table. The mean value of o as taken from the initial 
slope and one point fit calculations is 


87: 4.4X 10? cm?, 


This value applies for a mean electric field of 4.6% 10? 
V/cm. 

The above experimental capture cross section can be 
compared to the prediction of the giant trap model by 
calculating the theoretical value of the energy- 
dependent capture cross section via relationships 
derived by Lax.5 For the case of trapping via optical- 
phonon-emission Lax’s theory predicts values for the 
capture cross section in the range 


1.4 10-8 cm?<o<5.6X10-* cm?, 


when constants appropriate to silicon at 4.2°K are 
inserted. In Appendix A this calculation is performed 
in detail. One observes from this range of values of c 
that the previously obtained experimental value of 
4.4X 1073 cm? is indeed reasonable. 


HALL-EFFECT DETERMINATION OF 
COMPENSATION 


Although the low-temperature single injection exper i- 
ments above provided values for the compensation v! 
traps-filled voltage measurements, it was felt that 4 
separate determination of the compensation by a more 
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for corrobora- 
tion. For this purpose Hall-effect measurements were 
performed over the temperature range 25°K <T ^ 77^K 
whereby one could obtain both the majority dopant 
depth and degree of compensation. 

The temperature-dependent hole concentration in 
p-type silicon at low temperatures is known to be given 
by the expression 


conventional technique was necessary 


2am*T 8P(N A—N p) 
o=( ) exp(—64/kT), (23) 


I? 2Np 


where 64 is the acceptor activation energy. 

To obtain the compensation Np from the Hall data 
one observes that it is necessary to plot the quantity 
In(pT-*/2) versus 1/T. Over the region of applicability 
of Eq. (23) this plot yields a straight line of slope 
— &4/k from which one obtains the acceptor activation 
energy. Furthermore, if the linear region of the plot is 
extrapolated to the 1/T —0 axis, the intercept allows a 
determination of the quantity (W4—N p)/N p. Since 
(N 4— N p) can be obtained from Hall measurements in 
the exhaustion region, a measurement of the intercept 
readily yields the compensation N p. 

For three Hall samples this plot is shown in Fig. 7. 
Device I-II was constructed of material diffused 
simultaneously with that of the single injection devices 
previously considered. Device II-I and II-III were 
fabricated from the results of a separate diffusion per- 
formed under near identical circumstances on the same 
starting material. The results for these three samples 
are tabulated in Table IIT. One observes that the results 
for devices III-II and III-III are quite similar. How- 
ever, the degree of compensation in these two devices 
differs by a factor of 2 from that measured for device 
I-II. This result supports the hypothesis that most of 


1944 Y © DEVICE I-11 
^ 
AM a DEVICE n-i 


DEVICE Il— tf 
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TABLE IIT. Impurity concentrations and acceptor depth 
as determined by Hall-effect measurements, 


Ep 


(Na-N n) Np Na 


47x10  5.6x107 


: Device Type (eV) (cm) (cm) (cm?) 

I-II p  47X10?  L7X10? 66x10? 83x102 
HI-II b ATXIO? 54X10? 39X10? 93x102 
II-III p 38X10? 94x10% 


the compensation present in the devices was created 
during the diffusion step. The compensation observed 
in device I-II (6.6X10? cm) was quite close to the 
average value of 6.1410? cm-? obtained from the 
previous l/zer, measurements. This degree of agreement 
between the two values is quite good, especially since 
finding N p in the Hall experiment involves extrapolat- 
ing the p7~*” curve over a region as extensive as the 
straight line region itself. 

The activation energy 6&4 of the acceptor center, 
corresponding to the trapping depth at 4.2°K, was 
approximately 0.047 eV for all devices. This value is 
known to correspond to boron in silicon." 


DISCUSSION AND CONCLUSIONS 


In considering the model for ppp, devices at liquid- 
helium temperature a threshold voltage for injection is 
found to exist. This threshold is due to the barrier 
created at the p4p junction by holes trapped near the 
junction in the equilibrium situation. The exact form 
of the Child's law region at low temperatures is de- 
pendent on the mobility-electric field relationship 
appropriate to the range of fields employed. For a 
mobility dependence as given by expression (3) the 
general Child's law relationship has the form of Eq. (4). 

In order to relate the observed single-shot current 
pulses to the trapping capture probability, an approxi- 
mate solution has been obtained for the current as a 
function of time under the conditions of slow trapping. 
This solution is as indicated by Eq. (17) if the current 
flow is assumed divergenceless. This assumption is 
strictly valid only for the case of drift-velocity satura- 
tion; therefore for a<1 in expression (3) a slight 
disparity between the analytical expression for the 
current density and the actual characteristics is ex- 
pected to exist. 

The experimental results indicate that the SCLC 
model, modified to include the appropriate field- 
dependent mobility, applies for p-type silicon at 4.2°K. 
The steady-state current-voltage characteristics ob- 
served at this temperature correspond in form and 
magnitude to the predictions of the SCLS model for an 
insulator with a deep trap density of 6.1X 10" cm, By 
single-shot pulse techniques the experimental Child’s 


law regions could be extended to lower applied voltages. _ 


1 N, B. Hannay, Semiconductors (Reinhold Publishing Corpora- 
tion, New York, 1960), p. 472. 
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Based on these extended power law regions the appro- 
priate form of Child's law was found to be j~ V! 35. The 
mobility implied by the characteristic in this region was 


pp (2.62: 1.4) 105 2- 0-440. cm?/Vsec. 


A trap density of 6.1X 10? cm™ was obtained by 
trap-filled voltage measurements and it should corre- 
spond to the compensation in the material. This com- 
pensation was measured separately by a Hall-effect 
experiment over the temperature range W°K<T<77°K 
and found to be approximately 6.610” cm™, which 
agrees reasonably well with the value above. Also from 
the Hall experiment a value for the acceptor level of 
0.047 eV was obtained, which agrees reasonably well 
with the expected value for boron in silicon. 

Experimental values for the capture probability and 

capture cross section were obtained by fitting the 
analytical characteristic for the current density to the 
observed pulses. The average experimental value for the 
mean capture probability was B=2.2X10-® cm? sect. 
Tf all the carriers are assumed to have the same velocity, 
this value could be converted to a corresponding one 
for capture cross section. The latter, as obtained by 
averaging the values obtained from the many experi- 
mental measurements of B, was ¢=4.4 107? cm’. The 
theoretical value of c, as predicted by the giant trap 
model, was calculated for the case of trapping via 
optical-phonon emission. A value of c in the range 
1.4xX 10-3 <a « 5.6X 10-1? cm? resulted. The agreement 
between experimental and theoretical values is again 
quite reasonable. 


APPENDIX A 


For the case of trapping via optical-phonon emission, 
Eq. (A1) relates c and 6, the carrier energy. This 


> ur 
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equation is 


Twp yy 
c(6)-—-— 45) 
8 le € Neto 


A P(hw— 6) 
x 
1—exp(—A) (1— &/ħw)*!? 


(1-Fag)?, (A1) 


where the various quantities are defined as: w= 6, the 
Raman energy (0.063 eV for silicon), A\=fw/kT, 
P(lho— &)-the “sticking probability," a=((hw—6)/ 
hw), g=0.85, l,—the mean free path for acoustical 
scattering, and w=a constant which relates the strength 
of coupling of the carriers to the optical modes to the 
strength of coupling to acoustical modes." After sub- 
stituting the appropriate values of the above terms for 
silicon at 4.2°K into Eq. (A1) one obtains that 


c—8.5X 10719/1,, (A2) 


when E=4.6X 10? V/cm. 

Choosing the pure acoustical scattering mobility at 
4.2°K to be approximately 10° cm?/Vsec for p-type 
silicon, as the results of Putley and Mitchell? would 
indicate, /, would be approximately 6X10~* cm. 
Therefore at 4.2°K 


o~1.4X 1073 (w) cm?. (A3) 


The value of w can be estimated based on the existing 
data for mobility versus temperature in the range 
100°K. <7-<300°K.* It would appear that 1<w<4 is 
applicable for p-type silicon. 

For this range of w one finds that 


1.4X 10-8 cm? o <5.6X 10715 cm?. 


12 C. Herring, Bell System Tech. J. 34, 237 (1955). 
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The effe 5T x : ith n-type ia ili 1 ial i 

Md a ee germanium with n-ty pe material and silicon with p-type material is to increase the 

ds AM a est decrease the elastic moduli. The decrease in the c;, elastic modulus for n-type germanium 

ee a y well with theoretical predictions. However, the modulus decrease in p-type silicon is much 

arger and varies wi mne e ste licte va 7 * Tei 

cie x B pam E DUE much faster than predicted by any present theory. It is suggested that 
€ 1$ a temperature-induced change in hole population along the energy surfaces. The intervalley relaxa- 


tion time, which determines the added attenuation, 


becomes independent of the doping for high dopings 


with the antimony time being about 100 times that for arsenic. This result indicates that the relative values 
are determined by the square of the triplet singlet separation which occurs near the impurity atoms. l'or 
p-type silicon the relaxation time at low temperatures increases very markedly indicating an activation- 
energy effect. The energies agree well with the energies measured by infrared techniques for the largest 
excited state orbits around the impurity atoms. The relaxation times measured at high temperatures indi- 
cate that the hole is transported 4 cycle from one (111) position to the next. 


I. INTRODUCTION 


'T was first pointed out by Keyes! that the introduc- 
tion of arsenic, phosphorus, or antimony into germa- 
nium——Ji-type doping—produces a decrease in the shear 
elastic modulus c44. The result was shown to be due to 
the effect of the electronic part on the strain energy 
function and to involve only those electrons which 
change from one valley to another. The experimental 
results of Keyes and Bruner showed that the measured 
decrease was about 75% of the calculated decrease and 
varied with temperature as predicted by theory. 
Pomerantz, Keyes, and Seiden? showed that the 
finite time required for electrons to go from one valley 
to another was a relaxation effect and should result in 
an attenuation of sound waves at a high frequency. 
Measurements at 9 kMc/sec did show? an absorption so 
high that it could not be quantitatively determined for 
any wave involving the cs elastic modulus, while 
normal phonon-phonon attenuation resulted for any 
mode not involving this elastic constant. 


IL. EFFECT OF DOPING GERMANIUM WITH 
n-TYPE MATERIAL 


A. Experimental Results 


By going to 500 Mc/sec, or less, measurable results 
were obtained? for arsenic doping. The present paper 
extends these results to antimony and phosphorus 
These data are of considerable interest in de- 
e mechanisms causing changes of electrons 
and the relaxation times for 
ave been rather exten- 


doping. 
lineating th 
from one valley to another 
these processes. Hence, they h 
ively i igated. 
nm the increased attenuation for longi- 
tudinal waves along the (110) direction caused by 


1 R. Keyes, I.B.M. J. Res- Develop. » A eme ; L. J. Bruner 
: oves. Phys. Rev. Letters 7, 59 1). 
ME puc W. Keyes, and P. E. Seiden, Phys. Rev. 
p 9 2 E 
OP d T. B. Bateman, Phys. Rev. Letters 10, 151 
(1963). 


doping germanium with 10'3 and 310” arsenic atoms 
per cc. The measurements were made at a frequency of 
475 Mc/sec. The difference between the attenuation of 
the doped and pure crystal is practically independent of 
the temperature. The elastic modulus is determined by 
the sum 

C= (cntcnt 2cs)/2. (1) 


Since it has been shown’? that the only elastic constant 
that is affected by n-type doping of germanium Is css 
the attenuation can be ascribed to the results of a 
resolved shearing stress associated with the longitudinal 
wave. In fact the attenuation is caused by the finite 
relaxation time associated with the motion of electrons 
between the four valleys of the energy surface of n-type 
germanium. It satisfies the relaxation equation 


A (xp.fem) = (Acss/2pV*) (7) , (2) 


valid when er«&1 and (w/V)A<1, where à is the mean 
free electron path. In this equation Ac4 is the change in 
modulus caused by the doping, « is 2xf, p the density, 
and V the velocity for the longitudinal wave. Changes in 
the c4, modulus for a doping of 3X 10” boron atoms per 
cc were measured by Bruner and Keyes! with the results 
shown by Fig. 2. This change was determined by the 
difference in velocities for pure and doped samples 
measured at 10 Mc/sec. The dashed curve shows the 
calculated value—from Eq. (13)—for the sample doped 
with 10'% arsenic atoms per cc. Measurements for 
antimony-doped germanium, shown by Fig. 4, give a 
very similar change. 

The attenuation measurements of Fig. 1 and the 
velocity measurements of Fig. 2 can be used to deter- 
mine the relaxation times, which are 


7=4X10-8 sec; 7—2.3X10-? sec, (3) 


respectively, for dopings of 1055 and 3X10" arsenic 
atoms per cc. These times are shown plotted on Fig. 3. 
On account of the small change in attenuation, this 
method cannot beused toinvestigate dopingsless than 105 
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Fic. 1. Attentuation measurements 
of pure and arsenic-doped germanium, 
Longitudinal waves along (110), fre. 
quency 475 Mc/sec. 
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; arsenic atoms per cc. However, another method appli- 
j cable for lower dopings is the acousto-electric voltage 
1 method of Weinreich et al.4 Figure 3 shows the relaxation 
times for a number of doping levels from 10^ to 10! 


arsenic atoms per cc. At low temperatures the relaxation 
time is determined by impurity scattering and the solid 
line of Fig. 3 shows the values obtained as a function of 
doping using the 40°K values! As the temperature 
increases, scattering occurs also by collisions with 
phonons and the dashed lines of Fig. 3 show the modi- 
fication caused by this type of scattering for the tem- 
peratures shown. 

The relaxation time for germanium doped lightly with 
arsenic impurities has been discussed by Weinreich 


A C44 IN DYNES/CM? 


100 150 : 
TEMPERATURE IN DEGREES KELVIN ded NEP 

Fic. 2. Change in the elastic modulus cq for arsenic-doped 
germanium. Measurements made at 10 Mc/sec. Dashed curve 
f shows estimated value for a doping of 1«10!* arsenic atoms per cc 
- ^ — (after Bruner and Keyes). 


4G. Weinreich, T. M. Sanders, and H. G. White, Phys. Rev. 114, 
33 (1959). 


CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


80 90 100 


et al.4 as being due to a combination of scattering by 
neutral and ionized atoms. At low temperatures neutral 
scattering predominates and this gives a time inversely 
proportional to the impurity density. If we extrapolate 
this down to an impurity density of 10'8, the indicated 
relaxation time of 2X107! is less than the measured 
value of about 4X 107? sec. For a density of 3X10" the 
relaxation time approaches the value of 2.3 10? sec, 
which appears to be an intrinsic time for going from one 
valley to another. 

An even stronger indication of this effect is furnished 
by antimony-doped germanium. Figure 4 shows the 
modulus and attenuation difference between a pure 
germanium sample and a sample doped with 1.510" 


\<—— ANTIMONY 
=X. ARSENIC 


INTERVALLEY RELAXATION TIME, T, 
IN SECONDS 


LETS 10'¢ 1017 1015 10191901029 
DOPING LEVELS FOR IMPURITY ATOMS 
IN GERMANIUM 


Ẹ Fic. 3. Relaxation times for arsenic- and antimony-doped 
germanium as a function of doping. Dashed curves show effect 0: 
phonon scattering while solid curves are for impurity scattering: 


> 


f 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


ULTRASONIC WAVE 


antimony atoms per cc. The attenuation is so high for 
this sample that a measuring frequency of 99 Me/sec 
was used. As before the measurements were made along 
a (110) direction for longitudinal waves. The relaxation 
time calculated from these measurements is shown 
plotted by the dashed line of Fig. 4. At low temperatures 
impurity scattering is dominant and a value of 
3.8X 107! sec results. As the temperature is increased, 
phonon scattering occurs and the value approaches the 
same value found for arsenic doping at 300?K. The 
values for impurity doping by antimony are shown by 
Fig. 3. Another measurement was made for a doping of 
10" giving the value of the relaxation time, as shown. 
Measurements for lower doping have been made by Tell 
and Weinreich.’ The results quoted indicate a relaxation 
time five times as large as that for antimony doping over 
a range from 10" to 2.5X 105 atoms per cc. The resulting 
relaxation time for the entire range is shown by Fig. 3. 
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Fic. 4. Attenuation and modulus changes for germanium doped 
with 1.5X10!5 antimony atoms per cc. Measurements made at 
99 Mc/sec. Dashed line shows corresponding relaxation time. 


Here the constancy of the intervalley relaxation time is 
confirmed over a wide doping range. 


B. Theoretical Interpretation 


Many experimental results for n-type germanium 
and silicon have been interpreted in terms of the many- 
valley model of the energy surfaces. Figure 5 shows the 
energy surfaces for silicon and it is seen that for the 
conduction band, one of the surfaces has a very low 
energy value in & space along the (100) axis. Electrons 
tend to congregate about these momentum values. Since 
there are six (100) directions, there are six energy 
minima or valleys. The six surfaces are shown by Fig. 6. 
The ovals indicate surfaces of constant energy. Similar 
considerations apply to the conduction band for germa- 
nium except that the minimum energy directions lie 


5 B. Tell and G. Weinreich, Bull. Am. Phys. Soc. 7, 546 (1962). 
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Fic. 5. Energy surfaces in valence and conduction band for silicon. 


along (111) axis and occur at the edges of the Brillouin 
zone. 

In the unstressed condition all the valleys have 
equal energies and electron populations. The effect of a 
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Fic. 6. Top figure: 
multivalley energy 
surfaces for #-silicon. 
Bottom figure: cross 
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surfaces a constant 
distance below the 
origin. 
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stress T, along a crystal axis in silicon is to raise the 
energy of the parallel valleys and lower those of the 
perpendicular valleys by amounts 3?S1 and m 
respectively, where 5; is the longitudinal strain induced 
by the applied stress and 4 is the deformation potential. 
"This causes electrons in the parallel valleys to flow into 
the perpendicular valleys. In the nondegenerate range 
the equilibrium ratio of population is determined by 
Boltzmann’s principle 


N 100/ N o10= exp(—05,/kT) 5 (4) 
This, together with the conservation of electrons 


given by 

No= 2N aoo 4Noio E (5) 
determined the equilibrium populations to be 
Noexp(—®Si/kT) | =| 3 


2pexp(—9S:/kT) 3 


2No 2N o $5, 
T — 14 | . 
2kexp(—eS,kT) 3 3kT 


2N10= 


The change in energy AW caused by the applied stress 
is given by 


No 26S, 2 2N9 QS; DS; 
awe su 
3 3 kT 43 3 3kTJ 3 


2No PS? 


QR 


There is no change to first powers of the strain but a 
negative contribution occurs proportional to the square 
of the strain. If we add this to the elastic energy 301151, 
it is seen that a lowering of the elastic modulus occurs 
of a value 


4 Noe 
Acy=—— > 
9 kT 
also (8) 
Aca 2 Not? 
Aci ——— =- A Acas=0. 
2 9 kT 


The last two relations follow since a shearing strain 
affects all the wells in the same fashion while a stress 
along (111) direction also affects all the valleys 
equally. Since the elastic constant for this direction is 
(6314+ 2¢12-+-4c4s)/3, the second relation of (8) must hold. 

— For n germanium, a similar calculation! shows that 


Aci Ac—0; Acas= — (N 0/9) (€?/RT) (9) 


in the nondegenerate range. For the degenerate range 


the limiting value of Aca, is obtained by replacing kT by 


- two-thirds of the Fermi energy or 


2W r= (4/3m*) (3152 Ny^^* , (10) 


AND T. B. 
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where 7; is Plank's constant / divided by 2r, N is the 
number of free electrons, which is equal to the number 
of impurity atoms in the degenerate range, and m* the 
density of states mass 


m*= (mime)? (ny , (11) 


where mı is the longitudinal mass of a single valley, m, 
the transverse mass, and n the number of valleys, 
Inserting the value in (9), the limiting value of Aci, at 
low temperatures is given by 
4 /Ag N22 N'BE (mme) 
AC4497— ( ) . 
3 I 


(12) 


Since m= 1.57903 m, 0.082000, where mo is the electron 
mass 9.1 10-8 g, this equation takes the form 


Acs — 1.58X 1092, (13) 


From piezoresistance measurements and from other 
effects @ is usually taken as 16 eV at room temperature. 
For very low temperatures 19 eV is more probable 
which corresponds to 3.04 107! ergs. Hence, 


Acus 7 1.44% 1041, (14) 


This equation checks quite well with the measured 

values of Figs. 2 and 4 where the theoretical values are 

4.7 X 10'^ and 1.65 10!? dyn per square cm, respectively. 
At the degeneracy temperature, given by 


4? (3rN)28 
2m* k 
=7.61X10-4N2%, (15) 


kTp=Wr or Tp= 


the change in modulus should be one-half of Aci! The 
curves of Fig. 4 indicate a value of 75 — 80°K compared 
to the calculated value of 99°K. Over a temperature 
range the ratio of Acs, to Aca, follows the relation’ 
shown by Fig. 7. Using Tp=80°K for Fig. 4, the agree- 
ment with the measured value is excellent. 

The attenuation A is determined by the product of 
Acss and the relaxation time 7 which is the time required 
to go from one valley to another. Measured values are 
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Fic. 7. Calculated ratio of Acu/Aca as a function of the ratio 
of the temperature to the degeneracy temperature (after R. W- 
Keyes). 
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shown by Fig. 3. A number of different mechanisms 
determine this time. For dopings from 10" to 10^ and 
temperatures above 150°K, the scattering mechanism is 
the collision of electrons with phonons. The total 
relaxation time, as determined by the conductivity 
equation, is given by 


r— m"0/Ne=pm*/e=0.37K 1.1% 10-#!/1.6 10-19 
=2.5X10-" sec, (16) 


where c is the conductivity in mho/meter, V the number 
of electrons per cubic meter, e the electronic charge of 
1.6X 107* C, u the mobility in M?/V sec, and m* the 
effective drift mass in kilograms. Using the values for 
lightly doped samples, the indicated value is about 
2.5X 10^? sec at 300°K. Hence, about one in every 40 
collisions results in a change in valleys. 

As the temperature becomes lower, or the doping 
higher, the intervalley relaxation time is controlled by 
scattering from impurities. For low dopings this process 
has been discussed by Weinreich ef al.* as being due to 
a combination of scattering by neutral and ionized 
arsenic atoms. At low temperatures neutral scattering 
predominates and this gives a time inversely propor- 
tional to the impurity density. Scattering is probably 
due to elastic scattering by the impurity atoms which 
follows? a law 


1/r-xNRY, (17) 


where JV is the density of impurity atoms per cc, R the 
effective radius, and V the electron velocity. At 40°K 
the indicated cross section R? is about 5X 10-9 cm?. This 
is attributed*? to an exchange interaction between 
valleys although no values are given. 

As the temperature increases some of the arsenic 
atoms become ionized and a component is found which 
varies inversely as the electron density. This component, 
which has a strong temperature variation, is at- 
tributed‘? to a capture of electrons in bound donor 
states followed by re-emission in another valley. As will 
be shown in the next section, this process has an activa- 
tion energy and hence the time between capture and re- 
emission decreases as the temperature increases. 

For densities above 10'8, the data of Figs. 1 and 4 
show that there is no activation energy for the process 
and the data of Fig. 3 show that the relaxation time 
tends to reach a constant value independent of the 
|. All of the impurity atoms will be ionized 
8 occurs in germanium at an 
107 and 103 impurity 
degeneracy? are in the 
d 5X10" for arsenic 


doping leve 
since Fermi degeneracy 
impurity concentration between 
atoms per cc. The impurities for 
order of 10!7 for antimony an 


ory of Metals (Cambridge University 


ublished) have recently 
he scattering of ionize 


* A. H. Wilson, The The 
Press, New York, 1958), p. 10. = 
? P, J. Price and R. L. Hartman (to 
given a derivation which accounts for t 

arsenic atoms in germanium. | 
8 F, J. Morin and J. P. Maita, 
? H. Fritzche, Phys. Rev. 


Phys. Rev. 96, 28 (1954). 
125, 1552 (1962). 
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doping. Hence, bound donor states do not exist for these 
dopings. 

In the neighborhood of an impurity atom the four 
degenerate energy minima are split into a singlet and 
triplet states by the valley-orbit splitting. The separa- 
tions caused by antimony, phosphorus, and arsenic are 
0.57X 107, 2.0X 10-*, and 4.15X 10% eV, respectively. 
One mechanism proposed for the interchange of elec- 
trons between valleys is a “giant trapping’ of electrons 
which involves a change in momentum between valleys 
by means of an interaction between the triplet and 
singlet states. If the electron is placed in the 1s state 
and simultaneously in a known valley, it will not be in a 
stationary state, but in a linear combination. At a later 
time, depending on the relative phases of the singlet and 
triplet state, the electron will be in a different valley. 
Weinreich ef al^ have estimated that the relaxation 
time for such a valley change should be 


T> (16/3) (h/8E) — 8.4X 10-9? sec(arsenic) 
—6.14X 10-2 (Sb). (18) 


These values are too large for arsenic-doped germanium 
and too small for antimony-doped germanium. 

Another derivation" indicates that the relaxation time 
should be in the ratio of the squares of the singlet- 
triplet separation and this is in better agreement with 
the experimental values. However, since this is essen- 
tially an elastic scattering process, the independence of 
the scattering time versus impurity density is still 
anomalous. The same is true for the total scattering 
time of Eq. (17). This time varies from 6 to 2 10- sec 
for the doping ranges from 10'* to 3X10" impurity 
atoms per cc. The ratios of the intervalley to total 
scattering times are about 10 for arsenic and about 1000 
for antimony. 


II. EFFECT OF DOPING SILICON WITH 
p-TYPE MATERIALS 


A. Experimental Results 


Measurements have also been made of the effect of 
doping silicon with the p-type materials boron and 
gallium. The first measurements’ were made for 
2.5 10!? boron atoms per cc. Measurements were made 
for longitudinal waves along the (110) and (100) direc- 
tions and shear waves along (110) with polarizations 
along (001)—which involves the shear modulus ¢4<— 
and with polarization along (110) which involves the 
(c11—¢12)/2 elastic modulus. As shown by Fig. 8a), 
curve A, the attenuation for shear waves along (100) 
increases very markedly at low temperatures and be- 
comes so high below 6°K and for a 475 Mc/sec frequency, 


10M. Lax, Phys. Rev. 119, 1502 (1960). 

u P, J. Price, J. Appl. Phys. 31, 849 (1960) ; measurements have 
recently been made for one value (1.5X 10/5) of doping by phos- 
phorus atoms in germanium. The value of 8X10~ sec is inter- 
mediate between As and Sb and agrees well with the square of the 


singlet-triplet separation. 
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Fic. 8. (a) Measured attenu- 
ation as a function of tempera- 
ture for A (2.5X10!5 boron 


atoms per cc), B (5X 10! boron 
atoms per cc), C (1.51018 


ATTENUATION (DB PER CM) 


galium atoms per cc). (b) 
Measured changes in the cq 
elastic constants for the same 
conditions. 


1 2 4 6 810 20 
TEMPERATURE (^K) 


thatit could not be measured. The frequency was shifted 
to 98 Mc/sec and the curve was measured from 1.5 to 
15°K. In the overlap region, it is seen that the attenua- 
tion is square law as indicated by the relaxation Eq. (2). 

Velocity measurements for pure and doped silicon 
were made by the pulse overlap method,? which is 
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Fic. 9. Measured shear wave velocities along (100) direction for 
pure silicon and silicon doped with 2.510! boron atoms per cc. 
Measurements made at 20 Mc/sec. 
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capable of good precision. Figure 9 shows the measured 
velocities for pure and doped samples. From these 
measurements, the change in modulus Acı, can be 
determined and it is shown plotted by Fig. 8(b), curve 
A. The relaxation time from the measured attenuation 
and modulus differences can be calculated by using 
Eq. (2). The result is shown plotted on Fig. 10, curve A. 
For temperatures above 8°K the relaxation time satisfies 
the equation 

T= 1.9 1052e4"7 5 (19) 
indicating an activation energy àE of 0.00113 eV. At 
lower temperatures the curve bends off and approaches 


10710 


RELAXATION TIME IN SECONDS 
o 


Fic. 10. Relaxation times for p-doped silicon plotted against 
1/T. A (2.5X10!8 boron atoms per cc); B (5X10! boron atoms 
per cc); C (1.5X10!8 gallium atoms per cc). 
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another straight line having an equation 


T—143X109 9/7; 55.-256*10-eV, (20) 

Detailed measurements were also made for longi- 
tudinal waves along the (100) direction with the results 
shown by Fig. 11. Here an increase in both attenuation 
and modulus change occurred as the temperature de- 
creased, but their ratio, which determines the relaxa- 
tion time, is approximately constant with a value 
T2=4.2X 107 sec, 

The shear wave measurements along (110) with 
polarization along (110) and the longitudinal measure- 
ments along (110) confirmed the expected relation that 


Acye= — Acy;/2. (21) 
The attenuations for these modes satisfy, respectively, 


the equations 


3 2. 
FACIO To 


[(Acis/4) 72+ Acri ]o? 


me (22) 
29V $ 2pV f 


7. 


The relaxation time 7; for shear waves along (100) 
has an activation energy term. In order to determine the 
origin of this term, measurements were made for a range 
of boron doping from 5X10" to 3X10" atoms per cc. 
Figures 8(a) and (b), curves B, show the results for 
shear waves along (100) measured at 500 Mc/sec for the 
doping of 5X 10". The relaxation time is shown plotted 
on Fig. 10, with an indicated activation energy of 
0.00117 eV or nearly the same as for the higher doping. 
The curves of Figs. 12 and 13 show the attenuation and 
velocities for the dopings of 10? and 3X10” boron 
atoms per cc. The lower value still has an increase in 
attenuation as the temperature is lowered, but the 
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Fic. 11. Attenuation and modulus change Aci for longitudinal 
waves along the (100) axis. Dashed line shows an average relaxa- 
tion time. 


higher doping value has an added attenuation which is 
independent of the temperature with no activation 
energy. The doping of 10? satisfies the relaxation 
equation 

7,7 2.1X 107 9gtT, (23) 


while the relaxation time for the doping of 310" is 
107" seconds independent of the temperature. 

In order to complete the interpretation, a measure- 
ment was made for a doping of 1.5 10^ gallium atoms 
per cc in a (100) silicon crystal. Measurements were 
made for shear waves in this direction at a frequency of 
500 Mc/sec. The results for the attenuation and 
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Fic. 12. Attenuation meas- 
ured at 500 Mc/sec for boron- 
doped silicon as a function of 


the temperature. The curve 
labeled 0.01 Q-cm is for a 
doping of 10 boron atoms per 
cc Ship the one labelled 0.0042 


is for 3X10 boron atoms 
per cc. 
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Fic. 13. Measured velocities 
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for pure and doped silicon 
samples as described in Fig. 12. 
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modulus changes are shown by Figs. 8(a) and (b) by the 
curve labeled C. In the lower temperature range, 
measurements of the attenuation were made down to 
1.5°K for a frequency of 99 Mc/sec. A plot of the 
relaxation time is shown by Fig. 10, curve C. The 
initial part of the relaxation time satisfies the equation 


71=5X10-%e!? sec, (24) 


giving an activation energy ôE of 0.0035 eV. At lower 
temperatures the relation becomes curved but ends up 
with another straight line 7;^ which has a very low or 
zero activation energy. For intermediate temperatures 
the points agree fairly well with the equation 


1/r— 1/r14- 1/71 . (25) 
B. Theoretical Interpretation 


The interpretation for the relaxation time for the 
p region appears straightforward. Since the degeneracy 
‘temperature is 50°K or higher for all the doping levels 
used, the valence bands are filled up to 0.004 eV or 
higher above the band minima. Hence, it is believed 


that the relaxations measured are between different 


points on the same energy surfaces rather than an inter- 


- change between surfaces. The effect of a strain is to 


arp the surfaces, lowering one part with respect to 
other. This causes a flow from the higher energy- 
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change momentum directions near the impurity atoms 
by being captured in one of the excited states. This can 
happen with only a small number of collisions with 
phonons since the energies of the holes and the excited 
states are not very different. To leave the orbit, how- 
ever, requires thermal energy to overcome the binding 
energy of the state, and this energy represent the activa- 
tion energy measured. As the doping increases the 
activation energy decreases, and for a doping of 3X 10? 
boron atoms per cc it disappears entirely. 

The activation energies for the relaxation time 71 of 
the shearing mode are in good agreement with the 
infrared measurements for the largest orbits in boron 
and gallium. The energy levels of the ground state ^ 
innermost orbit—and the excited states in relation to 
the band edge have been discussed by Kohn." These 
energy differences were measured by infrared absorp- 
tion. The values of 0.0012 and 0.0035 eV agree well with 
the largest orbits, lowest energies, for the measure 
excited states. At lower temperatures, the relaxation 
times for the primary modes for both boron an 
gallium become so large that other processes take ae 
For boron doping, the next process is another excite 
state having the constants of Eq. (20), while for gallium 


15H. J. Hrostowski and R. H. Kaiser, Bull. Am. Phys. Soc, [2] 
2, 66 (1957); see also W. Kohn in Solid State Physics, edit). 
F. Seitz and D. Turnbull (Academic Press, Inc., New York, ! 
Vol. 5, p. 257. 
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doping it appears to be a process without activation 
energy such as elastic scattering by the impurity atoms. 

Another verification that the processes observed are 
the excited states comes from the high-temperature 
values of 7; and 7’ for both boron and gallium doping. 
The frequency with which an orbit is traversed is 
given by 


[= 6E/h=2.8X 10" cycles/sec (boron) 


=8.4X 10" cycles/sec (gallium) (26) 


for the first—r,—process. The time of a cycle is the 
inverse of these values and the measured constants of 
1.3%X10- sec (boron) and 5X10” sec (gallium) 
indicate that the transfer of momentum at high tem- 
peratures takes place between points that are about one- 
third of a complete orbit apart. Since the shearing stress 
raises or lowers the energy surfaces along (111) direc- 
tions, the closest equivalent positions are 109? apart or 
about one-third of a complete cycle. The second process 
71 of Eq. (20) indicates that the hole traverses two- 
thirds of à much larger orbit before it transfers its 
momentum. 

As the doping increases, the impurity atoms come so 
close together that bound orbits are no longer possible. 
In this region the only process left is theelasticscattering 
by the ionized impurities which follows the law of Eq. 
(17). If we use the measured relaxation time of 10? sec 
for the impurity doping of 3X 10? boron atoms per cc, 
and calculate the velocity from the equation 


im*V?—0.6W p= $5 (h2/m*) (312N)5, (27) 
one finds for the scattering radius R the value of 
8.5X 10-8 cm. Comparing the relaxation time 10-7? with 
the total relaxation time from the conductivity Eq. (16) 
it appears that about 1 out of 10 collisions results in a 
change of position on the surface. 

The theoretical situation for the change in moduli due 
to doping for p-type silicon is far from satisfactory. 
Three theoretical derivations!“ have been given for 
the change in shearing modulus. The first solution! 
assumes spherical energy surfaces, as shown by the 
dashed lines of Fig. 6, and limits consideration to the 
heavy hole surface. The second solution’ takes account 
of the curvature of the two surfaces. In the degenerate 
range these solutions reduce to the forms 


(81/3) 5m* 2N1 


A — 
a 5h? 
—0.825p*d: : 
nid MUS 
k 
Acs = (—2.38m*] 7sq2/f2) (Bir and Tursunov). (29) 


u G., L. Bir and A. Tursunov, Fiz. Tverd. Tela 4, e foh 
[English transl.: Soviet Phys.—Solid State 4, m ( TA 
15 P. Csavinszky and N. G. Einspruch, Phys. Rev. 192, 


(1963). 
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Parse I. Values of Acu at 1.5°K for various dopings. 


Gallium 

ER. Joron doping doping 

Doping — — 3X109 10" 25x10 5x10"  LS5x109 
124 13.5 10.5 


AcaXi0? — 179 


17.2 


Table I shows measurements for Acs at 1.5°K for 
different dopings for boron and one value for galliurn 
doping. 

In the degenerate region from 10'? to 310" boron 
atoms per cc, the variation is proportional to the one- 
third power of the doping level in agreement with Eqs. 
(29) and (30). Taking m* as 0.44my as an average value 
for the two surfaces, the deformation potential values 
are 

d=17.2 eV (Keyes); 


d=10.1 eV (Bir and Tursunov). 


(30) 


A third calculation has recently been made!* which 
yields values close to the value given by Keyes. 
Csavinszky and Einspruch" have considered the spin- 
orbit coupled band in addition to the two degenerate 
bands but have assumed spherical symmetry of the 
bands. For doping up to 10° boron atoms per cc, the two 
upper bands are sufficient for the calculation and the 
result obtained is 


1/874? d ; ] 
Åc = -G) —[maN3--m4N43], (09 


5N3 I? 


where m. =0.49m is the heavy hole mass and m,, 
—0.16mo is the light hole mass. Since 


Ni/N2= (m,/ m, ^ = 5.35, (34) 
N,20.85N, and N3-0.15N,. Hence, for a doping of 
10! boron atoms per cc, the value of d= 15.4 eV. 
However, the larger values measured at the lower 
dopings at 1.5°K—for which asymptotic values have not 
been reached—and the large variation with temperature 
shown by Fig. 8(b) cast doubt on the validity of these 
calculations. It is conjectured that there is a tempera- 
ture variation of the position of the holes on the energy 
surfaces and at low temperatures it seems likely that the 
heavy holes collect near the (111) positions while light 
holes collect near the (100) positions. As seen from Fig. 
6, lower part, which shows the cross sections of the 
energy surfaces a constant distance below the origin, 
the heavy-hole (111) positions and the light-hole (100) 
positions are farthest from the origin and require elec- 
trons of higher momentum to reach the surface. Hence, 
if momentum is equalized in all directions, the surfaces 
near the origin should be filled up to a level higher than 
those far away and hence the (111) positions for the 
heavy-hole surface and the (100) positions for the light- 
hole surface should have the lower energies. At very 
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low temperatures, holes tend to congregate in these 
directions. 

If this conjecture is correct, the change in modulus for 
the heavy-hole surface should follow that for germa- 
nium, as given by Eq. (12), while that for the light-hole 
surface should follow Eq. (8) with kT replaced by (10), 
for the low temperatures, giving 


16 /7N?5 m*N' 8a? 
E cm 
3 N3 he 


(31) 


If we take an average value of 15 eV for the shear defor- 
mation potential d, the indicated change in modulus 
from Eq. (12)—using m*=0.49m and N=0.85No, i.e., 
2.12 10'8 heavy holes—is 26.5X 10° dyn/cm’, which is 
not out of line with the values of Fig. 8(b) extrapolated 
to lower temperatures. For higher doping levels this 
effect disappears since the momentum differences for 
the various positions become smaller. 

Another piece of evidence that the light holes congre- 
gate along (100) positions at low temperatures is 
furnished by the nearly constant relaxation time of 
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Fig. 11 for longitudinal waves propagated along the 
(100) direction. The data of Fig. 12 show that p-silicon 
with boron becomes degenerate for a doping of 3X 1Q19 
atoms per cc. This is taken to mean that the impurities 
are near enough together to prevent any excited state 
orbits around the boron atoms. The smallest orbits wil] 
be executed by the heavy holes and hence they will 
reach degeneracy at a higher doping level than the light 
hole surface. With a mass ratio of 0.49 to 0.16 or 3.06 
the radius will be this factor larger for the light-hole 
surface. Hence, this surface should become degenerate 
for a doping of 


3x(1012/(3.06)*— 105 boron atoms per cc. (32) 


Therefore, the constant relaxation time of 4.2 10-!? sec 
for a sample doped with 2.5X 103 boron atoms per cc is 
a confirmation that, at low temperatures, a longitudinal 
stress along the (100) axis actuates mostly light holes. 

If the relaxation time is due to scattering of light 
holes by impurities, calculations indicate that the 
scattering radius is about 5X 10-5cm,in good agreement 
with the value obtained for the degenerate heavy-hole 
surface. 
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Distribution Functions for the Number of Distinct Sites Visited in a 
Random Walk on Cubic Lattices: Relation to Defect Annealing* 


J. R. BEELER, Jr. 
Nuclear Materials and Propulsion Operation, General Electric Company, Cincinnati, Ohio 
(Received 17 January 1964) 


Distribution functions for the number of distinct sites S (7) visited by a point defect executing a symmetric 
random walk of n jumps on two- and three-dimensional lattices were computed using the Monte Carlo 
method. The square planar, simple cubic, bcc, and fcc lattices were treated. In three dimensions, the normal 
distribution appears to describe S (») for n>10‘ jumps and at 104 jumps the derivative d&(n)/dn of the 
average number, S (n), of distinct sites is within 0.5% of the value given by Vineyard's exact asymptotic 
solution. The defect annealing rate was computed using the S (7) distribution in a simple example and this 
result compared with an analog Monte Carlo solution. The comparison indicated that fluctuations in the 
initial defect concentration must be considered in computing the initial annealing rate and the mobile defect 
concentration as a function of time. After 500 jumps the annealing rate, but not the concentration, can be 
closely approximated without accounting for fluctuations in the initial concentration. i 


1. INTRODUCTION 


i a monatomic crystal, the migration of a defect, 
such as a vacancy or an interstitial atom, proceeds 
as a symmetric random walk! on the crystal lattice. The 
rate at which mobile defects are annihilated or trapped 
at point sinks is proportional to the rate at which they 


* Work supported by the U. S. Atomic Energy Commission. 

lIn this discussion a symmetric random walk is one wherein 
the jump probabilities for each possible jump direction are equal 
and constant. The vacancy random walk in an alloy is in general 
asymmetric because of its ordering energy, i.e., the jump proba- 
bilities for each possible direction are not equal and also depend 
upon the position of the vacancy. 
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encounter fresh sites which have not been visited pre- 
viously. Damask and Dienes? will treat the physical 
side of this process in a forthcoming book. On the basis 
of a Monte Carlo study, Beeler and Delaney? concluded 
that the average number of distinct sites, S (n), visited 
by a point defect in either a symmetric or an asym 
metric random walk of m jumps was of the form, 


S (1) An* (t) 


? A. C. Damask and G. J. Di “Poi is in Metals” 
(to b pubes) J. Dienes, “Point Defects in À a 
3J. R. Beeler, Jr., and J. A. Delaney, Phys. Rev. 130, 9 
(1963); J. R. Beeler, Jr., U.S.A.F. Report ASD-TDR 63-215 

(unpublished). 
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for a two-dimensional lattice, and of the form 
S(n)=B+Cn (2) 


for a three-dimensional lattice, where A, B, C, and E are 
constants for a given lattice structure. Their computa- 
tions pertained to vacancy migration in a binary alloy 
and describe the relation between S(11) and the rate of 
ordering (disordering) for »« 10* jumps. Migration on 
the square planar simple cubic, bcc, and fcc lattices was 
investigated. Vineyard* subsequently proved that the 
asymptotic form of S(w) for a three-dimensional 
symmetric random walk is 


lim $(2) - Cn. (3) 
The Monte Carlo results of Beeler and Delaney for 
dS (n)/dn-— C on the interval 10°<n< 104, in the case of 
zero ordering energy, agree closely with the exact 
asymptotic results given by Vineyard's analysis, the 
Monte Carlo results being about 1% larger than the 
exact asymptotic values. Vineyard also showed that 


S (n) = (81) */z (4) 
in one dimension. 

In this communication we describe the distribution 
function F(S;z) for S(z), as given by Monte Carlo 
calculations, for the square planar (sp), simple cubic 
(sc), bec and fcc lattices. F (S; n) is the probability that 
a defect visits S or less distinct lattice sites in n jumps. 
Direct application of these distributions to defect 
annealing calculations is illustrated in a simple, yet 
physically significant numerical example wherein atten- 
tion is called to the effect of fluctuations in the initial 
defect concentration upon the initial defect annealing 
rate. This effect first came to our attention in a com- 
parison of the annealing rate predicted by F(S;n), on 
the basis of the macroscopic (average) defect concentra- 
tion, with that obtained directly in a separate analog 
Monte Carlo annealing calculation wherein fluctuation 
effects were automatically accounted for. The presence 
of fluctuations significantly increased the initial 
annealing rate relative to that obtained on the basis of 
the macroscopic concentration. 

On the interval 1000<n<5000, F(S; n) appears to 
start an approach to the normal distribution. A x 
analysis indicated that F (S; n) is very probably normal 
for n» 104 in three-dimensional lattices, but gives no 
information on the convergence to à normal distribution 
in two dimensions. Up to n= 10* the density function 
f(S;n)-dF(S; n)/dS is definitely skewed to the right. 
This feature enhances the increased initial annealing 
rate contribution arising from concentration fluctua- 
tions. The indicated increase is possibly large enough 
to be detectable in high-temperature quenching experi- 
ments and should exert a dominant influence during 
the initial stage of defect annihilation and clustering 1n 
irradiated specimens. 


4G. H. Vineyard, J. Math. Phys. 4, 1191 (1963). 
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2. COMPUTATIONAL METHOD 


Distribution functions F(S; n) and density functions 
JS; n) for each lattice were estimated by running 1000 
independent random walk histories of 10* jumps each 
using an IBM-7090 computer. The mean (S), mode 
(S*), median (Sm) and standard deviation (s) were 
computed at 10-jump intervals up to 100 jumps, 
100-jump intervals up to 10? jumps, and 1000-jump 
intervals up to 10* jumps. In each instance an expected 
tendency toward the normal distribution was monitored 
by performing an x? test for 21 deg of freedom® and 
computing the coefficients of skewness (y1) and excess 
(52). S(n) and o were computed using the first 500 
histories, the second 500 histories and all 1000 histories. 
The differences between the results given by each 
sample set were found to be negligible and on this basis 
we decided that the 1000-history sample size was 
sufficiently large to form a basis for annealing calcula- 
tions. In this regard, it is interesting to note that the 
difference between the present S results, based on a least- 
squares fit to data from 1000 histories, and the original 
calculations of Beeler and Delaney, based on a least- 
squares fit to the data from only 10 histories, is less 
than 0.567. 


3. DISTRIBUTION FUNCTION RESULTS 


Selected plots of F(S; n) for n= 10, 20, 50, 100, 1000, 
and 10 000 jumps are given in Figs. 1-4 in terms of the 
ratio S(z)/n. This mode of presentation makes it 
easier to see how F(S; n) changes as n increases, than if 
the curves were plotted versus S(). Let S1(») be the 
largest S value such that F(S;5)-—0 and Ss(u) the 
smallest S wave such that F(S;)=1. The range of 
the distribution is defined as AS(n)=S2(n)—Si(n). In 
all instances AS(n)/n, S(n)/n, Sn(n)/n, and S*(u)/n 
decrease monotonically as n increases. 51(11)/n remains 
more or less constant in the two-dimensional case, the 
diminution in AS/n arising essentially from a mono- 
tonic decrease in S;(4)/m. In the three-dimensional 


LATTICE 


CUMULATIVE DISTRIBUTION, F (S; n) 


Sín)/n 


Fic. 1. Distribution function for square planar lattice, 


5D. R. Evans, The Atomic Nucleus (McGraw-Hill Book _ 
e Inc., New York, 1955); see pp. 774-777. 2 E 

sH. Cramer, Mathematical Methods of Statistics (Princeton 
University Press, Princeton, New Jersey, 1954); see p. 183. 


CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


| 
| 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


A1398 J. R. 


SIMPLE CUBIC 
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CUMULATIVE DISTRIBUTION, F (S;n) 


Fic. 2. Distribution function for simple cubic lattice. Vertical 
tick in Figs. 2-4 indicates Vineyard’s asymptotic solution for 
S (n)/n. 


lattices, however, both S,(m)/m and S2(1)/n change 
as n increases, Sı(n)/n tending toward S(»)/n from 
below and S2(n)/n approaching it from above. The 
vertical tick defines the lower limit of S(s)/n given by 
Vineyard’s calculations for n— «co. For all practical 
purposes, one can consider this asymptotic behavior to 
be attained at n=10* jumps. Table I compares the 
1000-history Monte Carlo results for dS/dn at n= 10t 
with Vineyard’s evaluations for the asymptotic case. 

All distributions were skewed negatively for 7<10! 
as shown by the coefficient of skewness listed in 
Table II. This skewness is evident from inspection of 
the curves in Figs. 1-4 for 7<50. By definition? 


Y1= Hs/0*, (5) 


where un is the nth moment of the distribution about 
the mean. A negative yı means that the tail of the 
distribution for S< 5 is longer than the forward tail 
for $78. If yı>0 the opposite is true and y1—0 for 
the normal distribution. A measure of how the shape of 
the density function f(S ; n), in the vicinity of the mode, 
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S(n)/n 


Fic. 4. Distribution function for fcc lattice. 


differs from that for the normal density function is 
given by the coefficient of excess? 


*ya7 a/o* (6) 


listed in Table III. If y3«0, f(S; n) is more flat in the 
vicinity of the mode than the normal density function 
and more peaked if y2>0. In this regard y2<0 for the 
sp and sc lattices but y2>0 for bcc and fcc lattices. This 
suggests that yı is independent of the lattice coordina- 
tion number z but that y2<0 for z< zo where 6<2<8. 

A x? test was used to judge the over-all fit of a normal 
distribution to the Monte Carlo data. The averages of 
the x? values obtained using 21 deg of freedom (23 
intervals) are given in Table IV over 500-900 jumps, 
1000-5000 jumps, and 5000-10000 jumps for each 
lattice. A composite average for the bcc and fcc distribu- 
tions appears in the last column. According to Evans,’ 
for example, given 21 deg of freedom, x? in the range 
13<x?<30 indicates that the data at hand is very 
probably described by the assumed distribution, i.e., 
the normal distribution in this case. On the other hand, 
either x? « 10 or x?7 36 indicates that it is very unlikely 
that the data represents a randomly drawn set of values 
from the assumed distribution. Excepting the sp 
lattice, x? decreased with increasing n for n> 500. All 
x? values were in the plausible fit range 10<x?< 36 for 
121000. Both the distribution for the sc lattice and 
that for the bcc lattice exhibited x? values within the 


"TABLE I. Values of C in S (n) 2 B--Cn given by (a) 1000-history 
Monte Carlo (MC) calculations at 2—10* and (b) Vineyard's 
exact solution for n — co. 


————————————————————————————— 


(a) (b) 
Monte Carlo Vineyard 
(n.— 107) (n — c) (3)/ (b) 
sc 0.6641 0.6595 1.005 
bcc 0.7216 0.7178 1.005 
fcc 0.7472 0.7437 1.005 


eee 
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TABLE II. Coefficient of skewness (y) 
. Coefficie skewness (^i) for 
Monte Carlo distributions E i 


Skewness 

n SPL SCL BCC FCC 
10 —0.054 —0.386 —0.558  —0.663 
50 +0.07 —0.286 | —0.320 —0.550 
100 +0.003 — —0.333 —0364 0.444 
500 —0.130 -0.270 -0540  —0.300 
1000 —0.150 — —0238  —0418  —0.293 
2000 —0230 | —0330  —0.40 — 0.330 
3000 —0440 — —0.330 — —028 —0.320 
4000 —0400 | —0310  —026 —0300 
5000 —0.300  —04320 —0.19 —0.340 
6000 —0.3220 | —0:50 | —020 —0340 
7000 —0.300 | —0280  —019 —0300 
8000 0.310 0.230 0:21 —0.260 
9000 —0.870 | —0200 | —020 —0.220 
10 000 —0.382 —0.169  —0.19 —0205 


very probable range for n> 6000 jumps. Because yı and 
y2 for the bcc and fcc distribution behave similarly 
above n= 1000 we feel that a composite average of x: 
for these two lattices may be meaningful. This com- 
posite average is listed in the right-hand column. The 
X test indicates: (1) F(S; n) is not well approximated 
by the normal distribution for n < 1000; (2) the normal 
distribution is a good approximation to F(S;1) for 
105«51«10*; (3) F(S;n) is very probably normal for 
n> 10* in the three-dimensional case. 

A least-squares analysis gives the following results 
for S(z) and e(n) on the interval 10? n « 10: 


S(n)=B+Cn, (7) 
o(n)=An*, (8) 


Values for the constants A, B, C, and k appear in 
Table V. 


4, EFFECT OF FLUCTUATIONS ON THE 
INITIAL ANNEALING RATE 


The distributions described in the previous section 
were computed to supplement a separate analog Monte 


TABLE III. Coefficient of excess (y2) for 
Monte Carlo distributions F(S; n). 


Excess 
n SPL SCL BCC FCC 
—0.455 —0.355 —0.301 —0.055 
2 —0.270 +0.120 —0.14 +0.12 
100 —0.158 +0.103 +0.339 +0.216 
[500 —0.170 +0.210 +0.56 +0.150 
1000 —0.048 —0.094 +0.388 +0.043 
2000 — 0.030 —0.01 +0.76 +0.29 
3000 +0.028 —0.10 +0.40 ae 
4000 +0.22 —0.08 +0.16 +0.21 
5000 -+0.35 +0.006 +0.13 Od 
6000 +0.17 —0.10 +0.04 -+0.27 
7000 —0.05 —0.13 +0.07 ieee 
8000 —0.001 —0.20 +0.11 40.3 
9000 —0.05 —0.20 +0.13 TON 
10 000 —0.059 —0.12 +0.13 +0.12 
Less slim More slim 
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Faste IV. Average x? values over An ranges indicated. 
8 the average of columns three and four. 


mr An sp sc hee fee pe 

500-900 76 398 ns B8 sop 
1000-5000 348 34- 268 — 3222 X238 
310 221 . 247. 312 99204 


Carlo study on defect annealing in cubic crystallites 
containing up to 2XX 10* lattice sites. In order to obtain 
even approximate agreement between the initial 
annealing rate given by the analog Monte Carlo? 
calculations and that computed using F(S;m), it was 
necessary to consider the effect of fluctuations in the 
initial defect concentration within the crystallite. In 
this section the number of mobile defects remaining 
after an average of n jumps per defect is computed 
analytically in two ways. In the first instance the defect 
encounter probability is computed using F(S;) and 
the average concentration over the crystallite; in the 
second, it is computed using F(S; n) and the average 
concentrations in each of .V, noninteracting crystallite 
subvolumes populated according to the random distri- 
bution law for Mo defects in Ve boxes of equal volume. 
The two expressions are then applied in a numerical 
example for Mo= 100 initially mobile defects in a fcc 
crystallite containing 5X10 sites and the results 
compared with those obtained in the Monte Carlo 
solution for this system. 


A. Average Concentration Approach 


Let there exist Mo mobile defects randomly distri- 
buted on X lattice sites and M, defects contained in 
randomly distributed stable, immobile defect clusters 
at time /—0. Assume that the sole mobile defect 
removal process is association with other defects at 
contact. A pair of defects will be taken as stable and 
immobile in this example. Further, assume that the 
association probability e at contact is the same when 
one member of the contact pair is a cluster member as 
when both defects were mobile just before contact. Ma 
will denote the number of mobile defects remaining 
after n jumps per defect. Let a 1/X. The probability 


S(u) - B--Cn o (n) s An* 
B c A k 
sp Ue ee 0.194 — 0.799 
sc 1627 0.6642 0414 — 0.608 
bcc 1690 — 07217 0.385 — 0.604 
fcc 15.84 0.7473 0.308 0.635 


a ———— M —Ó—— — 
s Sin) = 0.764870, 
1 N. R. Baumgardt and J. R. Beeler, Jr., Bull. Am. Phys. Soc. 
9, 294 (1964). 
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that a mobile defect will encounter a cluster member on 
the zth jump is 


a[M t Mo— M a ]p* (5; n— )/(;n—1), (9) 


where p*(S;n— 1) is the conditional probability for a 
defect visiting (S4-1) distinct sites on the (n-4-1)th 
jump provided it had visited exactly S distinct sites in 
n jumps. The probability that a mobile defect will 
contact another mobile defect is 


aM, 1—1]p* (5; n— 1) f(5; n—1). (10) 


Contact with a cluster leads to the removal of one 
mobile defect but contact between two mobile defect 
leads to a depletion of two mobile defects; hence, the 
difference equation for M n is 
M,— Ms-a—acLM dr Mo M a] 
S=So(n—1) k 
x X IMf(S;n-1]P*G;n—l) 
SeS1(n—1) 
—2a« M n1] 
S=S2(n—1) 
pO» MET (Son-1)9*(S;n—1)]- (11) 


S-81(n—1) 


By the definition of f(S; n) the summation reduces to 
M. a (p* (n—1)). Equation (11) leads to the differential 
approximation 


dM /dn=—aeM (p*(n)) [M+ (M+ M ,—2) ]. 
Integration gives 
InLM/ (M4-M .+-Mo—2) jv? i 
= -acM +2) | (p*(n)dn. (13) 
0 


(12) 


Collecting terms, one obtains 


M (n) MdEM 4-M—2Y (2(Ms— 1)2-M.] 
Xexploe(Me+Mo—2)PnJ—Mo}, (14) 


where P= Jf (p*(n))dn. Because (p*(w)) is closely 
approximated by dS (1)/dn, we will use 


P. [| tas] 80). (15) 


M (n) was evaluated for the particular case a= 2X 10-5, 
Mo=100, M.—0, and e—1. The results are discussed 
in Sec. 4C. 


B. Consideration of Fluctuations 


In the estimation of M (n) outlined above one con- 

sidered (M.4-M) defects in a cell of Z sites and 

, computed the contact probability from the average 
concentration 


C- (M-t-Mg/Z (16) 
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TanrE VI. Probability g (4) to find & defects in a subvolume of 
5000 sites when the macroscopic defect concentration is 
G=2X10-4 defects per site. Cx is the associated cell average 
concentration. 


g(k) Cex 104 
0.366032 
0.369730 
0.184865 
0.0609992 
0.0149417 
0.00289778 
0.0046345 
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defects per site. The effect of fluctuations in the initial 
defect concentration upon the annealing rate can be 
estimated by introducing the density function g(k) for 
finding exactly k mobile defects (&—1,2,:::,M») in a 
given subvolume of a collection of Ne equal subvolumes 
with atomic-scale linear dimensions. Let M (n) be the 
number of mobile defects remaining after n jumps, 
computed on the basis of (g(k)) rather than C, and 
set M.=0. M (n) then is 
Mo 


M(n)=>X Mi(n). 


kel 


(17) 


The quantity M,(k) is obtained by substituting 
Mo=kg(k)N. for Mo, and o,—1/g(k)E for o, in 
Eq. (14). The result is 


Mi(n) 9 Mox(M o.— 2)/ (2(M o.— 1) 


Xexp[oxe(M ox— 2)P4.]— Mor} 5 (18) 

M(n) was evaluated for the particular case 
a=2X10-§, Mo=N.= 100 and e— 1, using? 

g(k)=Mo!(N-—1)¥o-*/k!(Mo—k) NM. (19) 


Values of g(k) for k<6 are given in Table VI along with 
the associated subvolume concentrations. The average 
value (k)=1 corresponds to an average concentration 
of 210-4 defects per site over the entire crystallite, 
the concentration used in computing M (n). 


C. Comparison with Monte Carlo Solution 


In Fig. 5, M (n), M(n) and Monte Carlo solution, 
Mrme(n), are compared for a—2X10-5, Mo—N.— 100, 
and e=1 for n 1000. We will assume that Mimo 1$ à 
physically more realistic solution than either M or 
because one would expect that it better describes the 
association of initially closely spaced defects. It will be 
used as the reference in all discussion. The curves 
become nearly parallel at 2— 500, and although it 1 
not shown in the figure, they remained nearly parallel 
up to n= 2000, the point at which the Monte Carlo runs 
were terminated. It appears, therefore, that dM/dn 15 


8 W. Feller An Introduction to Probability Theory and lis 
A Pelications (John Wiley & Sons, Inc., New York, 1957), 2nd ed., 
p- 34. 
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a good approximation to the true annealing rate for 
n> 500 even though M itself is too large. Rapid associa- 
tion of defects, initially closely spaced, accounts for the 
more Tapid falloff in Mine for n<500 relative to that 
exhibited by M and M. The Monte Carlo method 
enables one to easily describe the removal of these 
initially close defects during the early annealing stage, 
a process only crudely described by JZ. Tt would seem 
that a rigorous analytical description of fluctuation 
effects in the association of randomly walking defects 
on a lattice would be somewhat complicated. 

If one accepts Mme as being more realistic than either 
M or M, our results indicate that the use of M in 
interpreting defect removal data would cause one to 
overestimate the number of jumps per defect required 
to attain a given removal fraction? Table VII lists the 
number of jumps 2(M) and n(M me) given by M and 
M mo, respectively, required to remove a given fraction 
of mobile defects. Extrapolation of the straight line 
log-log plot of 2 (M)/n (M) versus the removal frac- 
tion, gave a limiting value of ~1.1 for this ratio. In a 
strict sense, the ratios listed in Table VII should be 
regarded as lower bounds, especially when they are 
associated with 221000. Although each defect was 
initially contained within a cube of 5X 105 sites, in the 
Monte Carlo calculations, some defects migrated out 
of the cube into a defect-free environment during the 
annealing process. These escaping defects served to 
increase E a maximum of 2.4% at n=1000, i.e., a 
maximum of 1.2 10! sites outside the 5X 105 site cube 
were visited. This caused dM m-/dn to be slightly smaller 
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80 


M GIVEN BY EQ. (14) 
M GIVEN BY EQ. (17) 
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Fic. 5. Comparison of extant 1 
jumps as given by M, M, an 
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* Aft letion of this article, experimenta 1 : 
DOE hie conclusion was found in a paper P Daa 
K. Herschbach, and J. S. Koehler, Phys. Rev. 133, A2 i 
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Tarte VIL. Comparison of the number of jumps nM) and 
nCM s) per extant mobile defect, given by M and M, respec- 
tively, required to remove a given fraction of mobile defects when 
M»=100 and «— 2x 10-5, 


Fraction he 
, Temoved n(M) nOM we) nM )/n( M 
0.1 340 0. "SPEM 
02 770 200 3.8 
0.3 1350 520 7.6 
04 2300 1070 2.2 
0.5 3300 2000 1.6 
1.0 e tee ~1.1 


than it would have been had all defects remained inside 
the 5X10* site cell. At 5220002 was increased a 
maximum of 3.5% by escaping defects. 

Twenty-five Monte Carlo runs of 2000 jumps per 
extant mobile defect were made to obtain Mme. Five 
independent runs were made for each of five inde- 
pendent, randomly sampled initial defect distributions 
in a cube of 5X 10° sites. A x? analysis of the five initial 
distributions showed that they were consistent with the 
density function g(&) of Eq. (19). 


5. SUMMARY 
The calculations described above indicate: 


(1) F(S;2) is not well approximated by the normal 
distribution for »<5000 jumps. However, (5S; 2) 
appears to approach the normal distribution on the 
interval 5000 <n <10 000 and to be normal for n> 10 
in three-dimensional lattices. 

(2) Our results give no conclusive evidence that 
F(S;n) either does or does not approach the normal 
distribution in two-dimensional lattices. S 

(3) The use of Vineyard's asymptotic result for (n), 
and the assumption that F(S;z) is normal [using 
Table V for e (1) ] should give a very good approxima- 
tion to the distribution of S() for n> 10*. 

(4) A computation of the mobile defect concentration 
and the initial annealing rate must account for the 
effect of fluctuations in the initial defect concentration. 
Apparently, however, the annealing rate after the 
initial annealing stage can be well approximated by 
considering only the average defect concentration. 
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The microscopic dielectric susceptibility e(g) is calculated for germanium using the shell model. It is 
found that except at g=0 the longitudinal and transverse susceptibilities differ and are anisotropic. The re- 
sults agree qualitatively with Penn's isotropic quantum model based on the random-phase approximation. 
It is suggested that the quantitative differences may in part represent shortcomings of the RPA in treating 


the stiffness of covalent bonds. 


1. CLASSICAL LATTICE VIBRATION MODELS 


ANY classical models have been used to discuss 

the lattice vibrations of insulators. The interest 

in such models has been greatly enhanced by the 

availability of experimentally determined phonon 

dispersion relations for wavelengths comparable to 

interatomic separations. Through these models, we may 

hope to determine certain properties of the valence 
electron polarizability at short wavelengths. 

We may distinguish between two kinds of classical 
atoms: rigid or polarizable. In each case the valence 
charge may be regarded as either distributed or localized 
at a point. For the moment we restrict ourselves to the 
latter case. The local field corrections in the former case 
are more involved (Sec. 5). 

The rigid point atom yields the simplest models, but 
is inconsistent with the dielectric properties of ionic 
crystals. For homopolar Ge, on the other hand, it was 
natural to suppose that this model would work fairly 
well. Its failure to fit the dispersion curves determined 
by inelastic neutron scattering! except with forces out 
to fifth neighbors was shown by Herman.” 

Polarizable valence electrons can be treated within 
the adiabatic approximation as charged massless shells. 

In a free atom the shell and core are connected by an 
isotropic force constant; in a crystal each shell is also 
linked to near neighbor cores and shells. This shell model 
takes account of the fact that an atom can be polarized 
either through Coulomb or short-range interactions. In 
germanium displacement of an atom produces a dipole 
moment on it, but the lowest moment on the atom and 
its near neighbors is a quadrupole moment. In Lax's 
theory? of the lattice dynamics of germanium, this 
quadrupole moment is taken to be on the displaced 
atom; when the shell model is used* the interaction 
involves the component dipoles. This method finds 


.1B. N. Brockhouse and P. K. Iyengar, Phys. Rev. 111, 747 
(1958); A. Ghose, H. Palevsky, D. J. Hughes, I. Pelah, and 


- C. M. Eisenhauer, ibid. 113, 49 (1959). 


? F, Herman, Phys. Chem. Solids 8, 405 (1959). 
3 M. Lax, Phys. Rev. Letters 1, 133 (1958); Phys. Chem. Solids 


8, 422 (1959). 


£W. Cochran, Proc. Roy. Soc. (London) A253, 260 (1959). 
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some quantum-mechanical justification in the work of 
Mashkevich and Tolpygo.5 It is equivalent to using the 
atomic displacements and shell displacements (or 
atomic dipole moments) as generalized coordinates, and 
writing the energy as a general quadratic form in these 
coordinates. The shell-shell forces appear formally as 
the (non-Coulomb) interaction energy of induced 
dipoles; quantum mechanically they represent the 
energy required to distort the covalent bonds (Sec. 4). 
The microscopic dielectric susceptibility tensor e(g) 

is defined by 
[«(g) — 1/4 P(Q/E(g). (1.1) 


Here E(q) is a macroscopic field of wave number q: 


E(q)= Ee * (1.2) 
and P is the dipole moment per unit cell volume v 
P=}; p;, (1.3) 


the sum extending over both atoms in the unit cell. 
For q along [100] or [111] axes in cubic crystals, the 
tensor e(q) has only longitudinal and transverse com- 
ponents, e;(q) and e;(q), respectively. 

The shell model gives a good fit* to the experimental 
lattice vibration spectrum. From this classical model we 
calculate e,(g) in Sec. 2 and ei(g) in Sec. 3. Within the 
Hartree (or random-phase) approximation, one can 
also calculate e(g) quantum mechanically (Sec. 4). It 
follows that e(g) represents a natural bridge between 
the experimental dispersion curves and a fundamental 
quantum-mechanical treatment. The limitations of this 
approach due to local field and umklapp corrections are 


discussed in Sec. 5. 
2. TRANSVERSE DIELECTRIC SUSCEPTIBILITY 


Within the framework of the shell model, this is 4 
purely classical calculation of the microscopic polariza 
tion (1.3) produced by the applied field (1.2). We have 


VE=(S,+ (Y?/5) C)W, (2.1) 


.5 V. S. Mashkevich and K. B. Tolpygo, Zh. Eksperim. i Teor: 
s 520 (1957) [English transl.: Soviet Phys.—JETP 5, 495 


( 
* W. Cochran, Advan. Phys. 10, 401 (1961). 
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where Y is a constant equal to the charge on a shell. 
The applied electric field and dipole morents at the 
two atoms in the unit cell are vector elements of the 
column matrices E and YW. (The elements of E are 
therefore equal.) The short-range and Coulornb forces 
are represented by the 2X2 matrices $, and C, respec- 
tively. Because we are taking q along a [111] or [100] 
symmetry direction, and ELq, the matrix elements of 
Sı and C, are scalar functions of q. For example, for 
q— 0 all four elements of C have the value =fr. 
Different scalar functions enter for E| |q, so that e(q) 
calculated here differs from e:(q) calculated in the next 
section. 

According to these definitions we can write the 
polarization (1.3) as 


vP=YW (2.2) 
and from (2.1) and (2.2) it follows that 
P=((oS,/Y?)+ CE. (2.3) 


Since for the Ge lattice the off-diagonal matrix elements 
of (v8,/Y?+ C,) are complex conjugates, the polarization 
waves through the two types of atom are not in phase 
with the applied field E. Because the matrix is Hermi- 
tian, one lags as much as the other leads. We may take 
the sum of the two to define a real microscopic dielectric 
susceptibility given by 


ei(q) — 14r Ds (S, Y?)4- C) D 


where the sum is over all four elements of the matrix. 

We can check the correctness of this procedure by 
evaluating (2.4) at q=0. In this limit the shell-shell 
forces do not affect the polarizability, which depends 
only on core-shell forces. We find that in terms of the 
free atom polarizability per unit cell volume 8 (2.4) 
reduces to 


[es(0)—1]/[eee(0)+2]= 3728, 


(2.4) 


(2.5) 


which is the Clausius-Mossotti formula." Thus (2.4) 
includes local field corrections properly when the 
valence electrons are idealized as nonoverlapping 
spherical shells. E r ; 

Using the values of C, and S, given previously in 
Table IIT and Appendix 4, respectively,! one can now 
evaluate (2.4) along the [100] and [111] axes. The 
results are shown in Figs. 1 and 2. t NS. 

A notable feature of these curves is their periodicity, 
which is not that of the reduced Brillouin zone. The 
periodicity is a result of the localization of the two 
valence shells on the atoms rather than merely in the 
unit cell. We also note that e:(g) along the [111] 
direction has a maximum near q-K —2za-! (111). This 
may be an umklapp effect (Sec. 3» 


i hich gives (0) —16.0. It 
7The value of 8 used here is 0.099, whic} 
differs slightly from the value of 0.105 used in Ref. 4. 
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Fic. 1. The longitudinal and transverse shell-model suscepti- 
bilities es: and es, respectively, as a function of g/Er along [100] 
axes, where kp is the Fermi wave number of a free-electron gas of 
density equal to the valence electron density in Ge. The curve 
marked band theory is based on Penn's isotropic band model. 


3. LONGITUDINAL SUSCEPTIBILITY 
In this case we have 
YE=(S (¥7/2) C)W, (3.1) 


where as before the elements of the matrix E are the 
amplitude of the applied field. The effective field is the 
sum of the applied, local and macroscopic fields, the 
two latter being given by —(Y/c) CiW. That is, the 
local and macroscopic fields are not calculated sepa- 
rately by Kellermann's method. At g — 0, for example, 
the elements of C; are all 8/3. The amplitude of the 
macroscopic field is —4z times the amplitude of the 
polarization wave, and this is true for general values 
of g.? 

Let P be the amplitude of the polarization wave, £ 
that of the applied field and Ej/— —4zP that of the 
macroscopic field. We define «:(g) by 

Cal) -1]/4« 5 P/ (E-- Ew) , 
so that 


Le(q) = 1] "Am eig) = P/E. (3.2) 


—-—BAND THEORY 


ZONE 
EDGE 


Fic. 2. The same as Fig. 1, but with q parallel to [111]. 
3 E. W. Kellerman, Phil. Trans. Roy. Soc. (London) 4238, 513 


con and K. Huang, Dynamical Theory of Crystal Lattices 
(Oxford University Press, New York, 1954). 
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By the same argument as was given in the previous 
section, we then have 


PE v -1 
es: (s ze) (3.3) 
4rei(q) y? 
Making use of the fact that if 
(e- 1)/4r- X, X^, 


then 
(e—1)/4re— »; (X—4a1)7, 
where 
1 1 
E 
1 
We finally have 


e(g)= 1--4- x (Si(v/ Y2)4- C;— 4x1)! . (3.4) 
The elements of C; and of C, are related by? 
C;+2C,=0. (3.5) 


It is readily verified from (2.4), (3.4), and (3.5), 
together with the fact that $,(0)— $,(0), that e:(0) 
= (0) (as required by cubic symmetry). 

Using the expressions for S appropriate to longi- 
tudinal modes, we have evaluated (3.5) along the 
[100] and [111] symmetry directions. The results are 
shown in Figs. 1 and 2, where they are compared with 
&:(q). Except at q=0 (where shell-shell forces play no 
role), e;(q) is not equal to e,;(q) ; the difference is small 
along the [100] direction where q makes equal angles 
with all shell-shell force constants, and large along [111 ] 
directions where q is parallel to one of the shell-shell 
force constants. 

The idea that shell-shell forces are chiefly responsible 
for the difference between e;(q) and ¢,:(q) is borne out 
by the explicit expressions! for —S/So. These are 


[100] trans. cosga/4—7y, singa/4, (3.6) 
[100] long.  cosqa/4, (3.7) 
[111] trans. 

cos*9-I- y, sin’8 cos6-I-i(sin'6-I-»y, cos? sing), (3.8) 
[111] long. 


cos!0— 2y, sin'0 cos0-]-2(sin*— 27, cos0 sin6) , (3.9) 


where 0—ga/4v3. With «,—0 the longitudinal and 
transverse short-range forces become equal. The 
parameter y, measures! the ratio of transverse to 
longitudinal shell-shell forces along cube axes: 


t7 [$2 (s—5) l/ [02s (s— 5) ] , (3.10) 


where ¢-, is proportional to the force in the x direction 
on shell 1 when shell 2 is displaced in the y direction. 
The value of y, used previously was 0.7. 

One can express y, in terms of the ratio of bond 


- bending to bond stretching forces. One obtains 


a= (E, — kg)/ (k,T-2kg) . (3.11) 
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Thus, y.=0.7 corresponds to kg/k,=0.13. This is in 
quantitative agreement with the ratio of core-core and 
core-shell bond bending and bond stretching forces 
required to fit the elastic constants! and other theo. 
retical! and experimental data? in the g=0 limit, 


4. QUANTUM-MECHANICAL SUSCEPTIBILITY 


In treating the real crystal quantum mechanically, 
we are concerned with fields throughout the unit cell 
and not merely at the atomic sites. If the external 
potential is v(x) and the self-consistent total potential 
is V(x’), we have the general relation 


1 


v(x) =z [2062 V (x^)d«' . (4.1) 


Here the integral extends over the volume of the crystal, 

The microscopic dielectric susceptibility is D(a,2’), 
When (4.1) is Fourier analyzed, taking advantage of 
periodicity, one obtains9-15 


v(a, K») => Dq, Kp, Kp) V (a, K,) . (4.2) 
p’ 


We are interested in the case K,=0. Then (4.2) may be 
rewritten 


(g0) — e(g) V (g)--27 9 (a,0,K p) V (a, Kp), (43) 


where the prine on the summation excludes K, — 0. 

We have chosen to regard e(g), the K,= Kj =0 com- 
ponent of the operator $, as the quantum-mechanical 
analog of e,(g). Because of the point nature of the shell 
model, this choice is not unique. We shall see in Sec. 5 
that with this choice the local field corrections to elg) 
associated with a distributed shell charge are small, at 
least within the first Brillouin zone. We also postpone 
to Sec. 5 the discussion of the second set of terms on 
the right of (4.3), which may be called umklapp 
corrections. 

If one knows one-electron wave functions and energies 
throughout the Brillouin zone, ¢(g) may be calculated 
using the random-phase approximation!® (RPA). For 
the static dielectric function we are considering, 


«uunc CAEN (44) 
nu 9 


n wno 


The oscillator strengths fon(q) are, in general, different 
for longitudinal and transverse polarizations; they arè 


1 A. D. B. Woods, W. Cochran, and B. N. Brockhouse, Phys- 
Rev. 119, 980 (1960). 

n L. Kleinman, Phys. Rev. 128, 2614 (1962). 

12 A. Segmuller, Phys. Letters 4, 277 (1963). 

13S. L. Adler, Phys. Rev. 126, 413 (1962). 

14 N. Wiser, Phys. Rev. 129, 62 (1963). 

!* M. Azuma, J. Phys. Soc. Japan 18, 194 (1963). 

1* P. Nozieres and D. Pines, Phys. Rev. 109, 762 (1958). 
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the same only in the case of an isotropic solid such as 
a free-electron gas. 3 

An isotropic model of a semiconductor has been 
studied by Penn." The model contains an energy gap 
between the valence and conduction bands. Near this 
gap the Bloch functions are made out of two plane 
waves, k and k— K. The "reciprocal lattice vectors" K 
involved are always parallel to k. The umklapp terms 
are negligible, being of order N~, where N is the num- 
ber of atoms in the crystal. The model also makes the 
longitudinal and transverse dielectric susceptibilities 
equal. This result may apply more generally than 
merely to Penn's model. That is, the anisotropy of e(q) 
may be a measure of the magnitude of umklapp 
corrections. 

Penn shows that a dielectric susceptibility consistent 
with his band model is given by the formula 


Tig? 1 
«o-i«( = ) me (45) 
E, / [1-- (Eg/ E) (g/ kg T 


Here Zp and kp are the Fermi energy and wave number 
of the valence electrons regarded as a free-electron gas. 
With Ep—12 eV, E,—4 eV, (4.5) reduces to 


15 
1+ : 
[1--3.0 (g/ Eg) T 


We have called (4.6) the dielectric function obtained 
from band theory. It is also plotted in Figs. 1 and 2. 

We see that along the [100] axis, where the difference 
between e; and e,; is small, e(g) from (4.6) agrees well 
with e, This confirms our identification of e(q) with 
the K,=0 component of D(q,K,). That the agreement 
of a classical model with point charges with a quantum- 
mechanical band model should be so good is at first 
unexpected. We are less surprised, however, when we 
remember that the shell model gives a good fit to the 
neutron data in a rather simple way. 

Along the [111] axis, e; differs substantially from 
€ even within the zone. (At the zone edge, & 73e). 
Within the zone, e falls between €s: and ést. We now wish 
to consider whether the large difference between €i and 
és: is real, and if so, what shortcoming of the band model 
makes the difference zero there. : 

To answer this question we must discuss the 
quantum-mechanical equivalent of the parameter Ys- 
According to (3.11), Soy, increases with kss, the force 
constant involved in stretching the covalent bond. The 
tendency of shell-core forces is to pull the valence 
charge distributions apart; it is necessary to balance 
this with a bond stretching force. Quantum mechani- 
cally, bond stretching causes 4 substantial eae z 
valence charge density near the covalent plane, ees 
as the plane midway between the two atoms in une = 
cell. This requires an increase m kinetic energy, Whic 


^ 


e(q) 


(4.6) 


11 D, Penn, Phys. Rev. 128, 2093 (1962). 
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1s represented by the shell-shell force constant. The 
charge density at the covalent plane is little altered by 
bond bending so that kyk. 

We can now see that the quantum-mechanical 
generalization of the bond stretching force constant k, 
is a force distributed over the covalent face of the 
atomic cell and acting on a distributed and deformable 
valence shell. The effect of these refinements should be 
to reduce the differences between e, and e,; along the 
[111] axes, perhaps by 50%, but not eliminate them 
altogether. 

It would be interesting to know whether a realistic 
band model would produce a substantial difference 
between e; and er From previous experience’? with 
e(q) at g=0, it appears that much ingenuity would be 
required to carry out calculations for q0. It also 
appears that with many bands, the tendency towards 
isotropy within the random-phase approximation (4.3) 
is strong. We suspect that the anisotropy manifested 
in Fig. 2 indicates that covalent bonding involves 
short-range correlations which are not treated accu- 
rately by the RPA. 


5. LOCAL FIELD AND,UMKLAPP CORRECTIONS 


It was remarked in Sec. 2 that e,(g) is periodic along 
[100] and [111] axes. If we replace a point shell at R 
by a specified charge distribution o(r—R), we find that 
the force on a shell is 


Bf sie nemo. (5.1) 


This is just the previous value, multiplied by the form 
factor of o(r), which we write as 


Y(q)= J o(r)e tdr. (5.2) 


On Fourier analyzing the polarization one finds, e.g., 


for €st, 
4(g)—1  Y*(g) :S =A 
calg) —1 ex( t c) 


= — (5.3) 
dr Y*(0) 


y? 


When an extended charge distribution is used the 
elements of C, will not be the same as those we have 
met previously, which apply to charges localized ata 
point (or to nonoverlapping spherical distributions). 
For example, as the charge is spread out, C,(0) changes 
from —4z/3 to zero, and C,(0) from T8z/3 to +4r. 
However these changes can be taken up in S, in other 
words it will always be possible to keep GS/Y3)2-C 
unchanged. Thus with this quantity unchanged, so that 
the phonon dispersion curves are unchanged, the right- 
hand side of (5.3) is multiplied by the damping factor 
¥2(q)/¥2() compared to (2.4). 


^5 D. Brust, J. C. Phillips, and F. Bassani, Phys. Rev. Letters 
9, 89 (1962). 
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By comparing e(g) with elg), we can see that the 
damping factor must be nearly 1 within the Brillouin 
zone and start to decrease rapidly for qZ kr. Outside 
the zone e (g) represents an analytic continuation of the 
susceptibility which gives a good fit to that deduced 
from the phonon dispersion curves inside the zone. The 
continuation is not unique, as we see from the presence 
of the damping factor in (5.3). Nevertheless, the 
damping is probably not so rapid as to destroy the 
maxima in e; and er for q near L=[111] reciprocal 
lattice vector. 

Such maxima can be called umklapp effects. Azuma!® 
has shown that the umklapp terms in (4.2) lead to an 
effect of this sort. He finds 


- — €(K,)= D(K,,0)/[1—C,/e(0)], (5.4) 


where C, is a number of order unity. The enhancement 
predicted by (5.4) may be related to the subsidiary 
maxima found in e, for q near L. 


6. CONCLUSIONS 


In spite of the differences between a classical model 
of point valence shells and a quantum-mechanical 
model with valence charge spread over the atomic cell, 
the microscopic dielectric susceptibility obtained from 
the shell model is in qualitative agreement with the 
RPA result, especially within the zone. This provides 
some microscopic justification of the shell model. 

Lax? has criticized the shell model on the grounds 
that its point nature grossly overemphasizes the local 
field corrections. He cites Stark shifts of nuclear reso- 
nances,? which place an upper limit on the corrections 


19M. Lax, 1963 Copenhagen Conference on Lattice Dynamics 
(to be published). 

20 N. Bloembergen, Proceedings of the Eindhoven Conference on 
Magnetic and Electric Resonance and Relaxation (North-Holland 
Publishing Company, Amsterdam, 1963), p. 39. 
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of order 50%, whereas the point charge model 
corrections by a factor (e+2)/3~5, i.e., ten time 
than the experimental shifts. 

Our calculations here demonstrate how the 
model overcomes these difficulties. The microscopic 
susceptibility calculated from band theory, « 
implies small (or, in the case of the isotropic model, no) 
local field corrections. Nevertheless e(g) agrees well 
with e(g) near the zone edge. This is because the short. 
range Íorces dominate both the lattice vibration 
frequencies and e(g) in this region. As q— 0 phase 
differences between nearest neighbors are reduced, the 
short-range forces are reduced, and the long-range 
Coulomb forces become important. In this limit the 
nature of the charge distribution in the unit cell is 
irrelevant, so long as it is treated consislently to give 
the right limit, 


implies 
S larger 


shel] 


lime;(g) = lime, (q) = ey— 16, 
En 


q0 q 


for the dielectric properties. This the shell model does; 
adjustment of 8 to give eo— 16 automatically renor- 
malizes the theory. 

We can now reverse this argument and use the fact 
that short-range forces dominate near the zone edge to 
argue that the differences between &,,(g) and ej;(q) along 
[111] axes are real. These differences may represent the 
first observable evidence for departures from the RPA 
in metals or semiconductors. 3 
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- 1 he superconductivity, the low-temperature specific heat, and the paramagnetic susceptibility have been 
investigated in a series of dilute hexagonal titanium alloys. In the solid solutions with V or Nb, both the 
transition temperature and the density of states are strongly increased and the entire volume exhibits 
superconductivity. A similar study of Ti-Fe alloys suggests either an incomplete volume transition or a 
different entropy variation in the superconducting state. 


INTRODUCTION 


HE significance of the enhancement of the super- 
conducting critical temperature, which is ob- 
served in dilute alloys of titanium with other transi- 
tion elements, has been subject to considerable con- 
troversy.-* Matthias et al. interpreted the behavior of 
these alloys as evidence of the existence of a mechanism 
of a different type from that provided by the electron- 
phonon interaction,^? while recent work by Cape’ on 
Ti-Mn alloys suggested . at the high transition tem- 
peratures might simply be due to filamentary cubic 
precipitations in the hexagonal matrix. The experiments 
to be described indicate (a) that in dilute solid solutions 
of Ti with V or Nb, in which cases the hexagonal phase 
extends to the highest solute concentrations, the en- 
hancement of superconductivity is a real bulk effect; 
and (b) that the substantial increase of the transition 
temperature is accompanied by a marked increase of 
both the electronic specific heat and the magnetic sus- 


E Present address: Institut de Physique Expérimentale, 
Université de Genève, Geneva, Switzerland. 2 ; 

1 B. T. Matthias, V. B. Compton, H. Suhl, and E. Corenzwit, 
Phys. Rev. 115, 1597 (1959). 

* E. Bucher and J. Muller, Helv. Phys. Acta 34, 410 (1961). 

3 W. Buckel, G. Dummer, and W. Gey, Z. Angew. Phys. 14, 
703 (1962). : EE = 

HC. qb Ae and C. A. Andersen, Z. Physik 175, 105 (1963). 

5J. A. Cape and R. R. Hake, Bull. Am. Phys. Soc. 8, 192 
(1963); R. R. Hake and J. A. Cape, mia m 419 (1963). 

6 J. A. Cape, Phys. Rev. 132, 1486 (1963). 

7R. L. Falge, Phys. Rev. Letters 11, 248 (1963). ee 

3 See, for example, B. T. Matthias, in Proceedings of the Bisse 
International Conference on Low Temperature Piae: Ten i 
1962 (Butterworths Scientific Publications Ltd., London, i 


p. 135. 


ceptibility. On the other hand, dilute alloys of Ti with 
Fe show a different calorimetric behavior at the tem- 
perature of the diamagnetic transition, and the normal 
electronic specific heat remains essentially unchanged. 


HEXAGONAL ALLOYS OF Ti WITH V AND Nb 


The structure of Ti-rich alloys is goverened by the 
a— phase transition and the Martensite-type diffusion- 
less transformation. The hexagonal alloys containing V 
or Nb were obtained by quenching after repeated melt- 
ing in the argon arc. According to x-ray examination, the 
quenched specimens were single phase up to more than 
10 at.% in both cases. The initial slope of the super- 
conducting transition temperature T, versus solute 
concentration is practically identical in the V and Nb 
series, the increase near pure Ti being approximately 
1°K per at.%. The observation that very rapid quench- 
ing yields somewhat higher transition temperatures has 
to be interpreted in terms of the state of internal strains 
or a high concentration of defects, since we have found 
that Ti-Zr alloys show a similar effect. 

Bearing in mind that even a complete diamagnetic 
transition may not unambiguously indicate a volume 
effect, we have to rely on the calorimetric transition 
which is insensitive to possible partial screening. The 
specific heat of representative samples weighing 3-10 g 
(small specimens are essential for efficient quenching?) 


9 Qur specimens are quenched from slightly below the solidus 
pu in an intense stream of cold argon. In another case 
where the quenching rate is critical [E. Bucher, F. Laves, J. 
Muller, and H. von Philipsborn, Phys. Letters 8, 27 (1961) 
this method has proved to yield better results than water 


quenching. 
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A1408 P. HEINIGER AND J. MULLER 
a S T - TABLE I. Variation of electronic specific heat vy, Debye tem- 
i | perature 0, and magnetic susceptibility x, of hexagonal alloys of 
| Ti with V and Nb. 
| ¢ | pe Ti Tivos Vo.ot Tione NO i AAN 
10 L E Ti 96 Vos - , TE P d Diete Dose Nbo.o4 | 
Dru | y (1075 J mole™ °K~) 3.3 4.6 43 k 
| l | 0, (°K) 415 365 340 * 
fea. | | x«/X« (Ti) (1) 1.09 1.05 
| c | Xat (10°K)/Xat (300°K) 0.92 0.92 0.92 ) 
i zin | eN 
a 7 | = = = —— 
D L | | 
To Ee | portance of the eflective Coulomb interaction is not | 
J MEE BÉ known and no information on the isotope effect in | 
9t \ ccr M | group IV transition elements is available. | 
EU PE 3 In fact, a knowledge of the isotope effect from experi- | 
> med 2E ment would permit separation of the interaction into a T 
^o wu DNE c ic i Coulomb and a phonon part.? Unfortunately, attempts | 
Daan a of determining the isotopic shift of the transition tem- | 
1 1 Sax l perature in? Ti and" Zr have been unsuccessful due to f 
| Si the difficulties of the sample preparation. | 
a DILUTE Ti-Fe ALLOYS | 
, . . | 
| For comparison, several dilute Ti-Fe alloys with iron | 
à UT etd d G content up to 1.5 at.% have been prepared in the same | 
o 10 20 30 40 | 
T’ (eK?) aan z =} | 
Fic. 1. Specific heat (C/T versus T?) and magnetic permeability | | 
p of hexagonal Tio.95 Vo.oa. Lower broken line indicates C/T versus | 
T? of pure Ti. For Te versus concentration, see Ref. 2. ] 
Tie Nb o4 = | 


have been determined with an absolute accuracy of 
+2%.0 Figures 1 and 2 show the results for two alloys 
(single-phase hcp) containing 4 at.% V or Nb, respect- 
ively. Considering the discontinuity at the superconduct- 
ing transition [(C;—Cn)/yT ]r-r,, we conclude that 
the bulk of the material is superconducting below the 
temperature of complete diamagnetic shielding. There- 
fore, the marked increase of the magnetically observed 
transition temperature? compared to T, of pure Ti, 
is characteristic of the solid solution and is not due to 
precipitations. Furthermore, as it is seen from Table I, 
the V and Nb solutes significantly affect the normal 
electronic specific heat. We also note that the change of 
y is much more pronounced than the change of x. 
It is reasonable to assume the presence of a large para- 
magnetic contribution in addition to the spin suscepti- 
bility, although in regard to different interaction cor- 
rections, y and X,pin alone need not vary similarly upon 
alloying. A plot of In(T./@) versus 1/7 reveals that the 
data for dilute Ti-V and Ti-Nb essentially agree with 
those obtained with the isoelectronic Ti-Zr system." 


E Such a relationship is expected qualitatively on the basis 


of the BCS theory. For the time being, a more detailed 
interpretation is of little use since the relative im- 


10 Jean Muller, Helv. Phys. Acta 33, 514 (1960). 
u E, Bucher, F. Heiniger, J. Muheim, and J. Muller, Rev. Mod. 


eS 
. 


Phys, 36, 146 (1964). 


^ 
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C/T (IO J mole 


I i IR 
20 30 40 
T? (ek?) 


Fic. 2. Specific heat and permeability of hexagonal Tio.so Nbo.ot- 


? E. Bucher, F. Heiniger, a . Muller, Phys. Kondens- 
Materie 2, 210 (1964). aT: Maa 

?: R. G. Netzel and J. R. Dillinger, Proceedings of the 71h Inter- 
national Conference on Low Temperature Physics and Chemistry 
Toronto, 1960 (University of Toronto Press, Toronto, 1961), p. 382- 

1 E, Bucher, J. Muller, and J. L. Olsen (unpublished). 
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BULK SUPERCONDUCTIVITY IN 
way as the Ti-V and Ti-Nb specimens. X-ray diffraction 
showed only hexagonal lines in the quenched state. 
Optical micrographs closely resembled the one obtained 
by Cape? for as-cast Tios4Mnoa. No evidence of a pro- 
nounced precipitation of the 8 phase could be detected. 
From 0.6 to 1.5 at.% Fe, the midpoint temperature of 
the change in permeability, observed ballistically in a 
field of 1 Oe, was found to increase linearly with con- 
centration, in agreement with the earlier data! However, 
in contrast to the complete diamagnetic transition, the 
specific heat shows only a slight anomaly (Fig. 3). 
Similar discrepancies between the amount of flux exclu- 
sion and the calorimetric transition have been observed 
in a number of cases as, e.g., in the « phase of the Nb-Ir 
system.” A smaller than expected calorimetric anomaly 
has also been noted by Morin! at the apparent T, of 
Tio.g9F€o.01. According to the arguments given below, 
the hypothesis that the observed effect is caused by 
minor body-centered cubic inclusions seems very im- 
probable, although we cannot rule out completely 
that traces of the latter may be present. Rather, we are 
left with two possible interpretations assuming that the 
transitions do occur in the hexagonal phases (a and a’). 
Unlike the behavior of Mn in a-Ti^ the present specific 
heat data and susceptibility measurements? indicate no 
localized moments on the iron solute. 

If we assume a normalized distribution function 
f(T.) of the actual volume fraction undergoing a 
transition at T., the observed electronic specific heat 
will be given by 


T 


Tcmax i 
C(T)= / CATT) f(L)dT + | Y(C)T f(T )dT, 
T 0 


where 
Te max 
l f(T)dT.=1. 
0 


For simplicity, we define 


T2 
=f f(Tr3dT., 
Ti 


with T, representing the upper onset of the transition, 
and T,— T5/2. Further, we insert the value for C,(T,T.) 
provided by the two-fluid model 


CT) RICE 


15 F. J. Morin (unpublished), quoted in Ref. 6. 
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T qeu — — 
| 
r [e Ti-Fe 4 
T | 
DE 2 at% Fe iA 
2 o 710 ath Fe ee E 
Ty |z ee emot. UNUS eeu 4 
É B =i” Crean E 2 
nz. 
mi | 
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= H5 y | 
a j 
o f | Jus at% Fe | 
FEES p 
4 


Tek 


Fic. 3. Specific heat of dilute Ti-Fe alloys. Note 
complete magnetic transition (u). 


since in the region between T, and T'/2 this is, for many 
superconductors, a better approximation than the cor- 
responding BCS function. By the use of a numerical 
trial and error analysis, we obtain reliably 0.95 «x«1 
for the V and Nb alloys, but 0.3« «0.35 for Ti+1.5 
at.% Fe. If, in fact, only 30% of the volume were super- 
conducting in the latter, a core structure with appreci- 
able microscopic concentration gradients would be re- 
quired to explain the magnetic transition. On the other 
hand, this figure almost excludes the possibility of a 
transition just due to cubic precipitations which are not 
detected in the x-ray patterns, unless induced super- 
conductivity in the environment of such precipitations 
were important. 

An alternative interpretation of the reduced calori- 
metric anomaly would have to be based on the assump- 
tion that the entropy of the superconducting state is 
lowered much less than usual, with a corresponding 
different temperature variation of the energy gap. But 
if so, it would be difficult to understand why isostruc- 
tural and isoelectronic dilute Ti-V and Ti-Fe solid 
solutions, both without localized magnetic states, should 
exhibit a different kind of superconducting transition. 
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A set of reversible equations for Fi, the first dist 


derived for the weak force case. The “forward motion, 
conditions, and the corresponding reverse motion are ex 
F, evolves into the Fokker-Planck equation while in the c 


anti-Fokker-Planck equation. 


I. INTRODUCTION 


HE problem of irreversibility has existed since the 
time of Boltzmann. The problem may be stated 
as: How can one derive an irreversible equation (e.g., 
the Boltzmann equation, the Fokker-Planck equation) 
on the basis of reversible mechanics? In recent years a 
number of derivations of irreversible equations have 
been accomplished based on a variety of assumptions. 
However, these methods have not completely illumi- 
nated the transition from the reversible to the irre- 
versible equations. Most of these methods start from 
the Bogoliubov-Born-Green-Kirkwood-Y von (BBKGY) 
equations,! which for an infinite system are 


F(x Val) 
ðt 


-F36,F (x1, t Tal) 


1 
SL Sy bi spP aas cr egal) (1) 
U ics 

I 1, 2 ety 
where x; (qi pi); qi p; being the position and mo- 


mentum of the ith particle, 


and e;(|q;— q;]) is the intermolecular potential. 3C, is 
given by 


s Pi à E 
Balms) o ——— 2 Bus 
iim OG; i<i 


and v is the volume per particle. F, is defined through 
Dy (21,: --xw,)) the probability distribution in T space 
for the entire system by 


F, (x3, GO Xel) 


= mE s [ose E Xwul)dasgi : -dxy. (2) 


The sth equation of (1) is obtained by integrating the 


* Supported by the U.S. Office of Naval Research. 

1G. E. Uhlenbeck and G. W. Ford, Lectures in Statistical Me- 
chanics (American Mathematical Society, Providence, Rhode 
Island, 1963). 
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ribution function and g, the correlation function, are 
" i.e., development in time from uncorrelated initial 


amined. In the “forward motion" the equation for 
orresponding reverse motion F: is described by an 


Liouville equation 
(8Dy/910) 3-96 Dx —0 (3) 


over the coordinates X54, * *Ya- 


In particular, the method of Bogoliubov? is to assume * 


that all F,(s> 2) are functionals of Fı, and the form of 
the functionals are determined by an assumed boundary 
condition on the functionals. On this basis, the first 
BBKGY equation [Eq. (1), s 1] becomes the Boltz- 
mann equation (or the Fokker-Planck equation depend- 
ing on whether the expansion parameter for the dis- 
tribution function is v-* or the strength of the inter- 
action). The assumptions of Bogoliubov obscure the 
transition from the set of reversible equations (1) to 
the irreversible equations. In fact Cohen and Berlin? 
have shown that, on the basis of an equally plausible 
boundary condition on the functionals, the collision 
term will be the negative of the Boltzmann collision 
term (an anti-Boltzmann equation). On the other hand, 
Green and Piccirelli* have shown that, on the basis of a 
product type condition on F,(s2 2) at the initial lime, 
the higher distribution functions do, in the course of 
time, become the functionals of F, predicted by 
Bogoliubov. From this point of view it is no longer im- 
portant that the Boltzmann equation is time irreversible 
since the Boltzmann equation evolves from the first 
BBKGY equation only after some period of time pro- 
viding the initial F,(s2 2) fall within the assumed class 
of initial conditions. The reverse motion is presumably 
accomplished, since the BBKGY equations are Ie- 
versible, by another special class of initial F.(s2 2); 
this motion is probably describable by a single equation 
for F, (such as the anti-Boltzmann equation).9* It 
is to these points that this paper is devoted. 


>N. N. Bogoliubov, Problems of a Dynamical Theory in Sla- 
listical Physics, translation by E. K. Gora, Studies in Statistical 
M SUR (North-Holland Publishing Company, Amsterdam, 
`E. G. D. Cohen and T. H. Berlin, Physica 26, 717 (1960). 
M. S. Green and R. A. Piccirelli, Phys. Rev. 132, 1388 (1963). 
5 There remains the question of how one can justify the use 0 
the Boltzmann equation when other types of developments are 
possible with other types of initial conditions. The answer must 
be in the following: If we consider a system in which all that 15 
known initially is F,, then there exists a large class of initial Dy’s 
that give the same / but different F,(s 2.2). In the spirit of sta- 
tistical mechanics we can assign a probability to these Dy’s anc 
obtain a probability distribution for F, and ask for the most 


mam E 


n 


E 


\3 
f 


Lu NN 
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We consider a special case, namely a weak potential 
in a spatially uniform system. This case is general 
enough to produce an “irreversible” equation, the 
Fokker-Planck equation, yet simple enough to examine 
both the forward motion (i.e., the development in time at 
from uncorrelated initial conditions, a special case of 
the Green and Piccirelli initial condition) and the corre- OFa(x1,x2,1) 
sponding reverse motion. The evolution of P, for the IE een) 
forward motion is governed by the Fokker-Planck equa- dt 
tion while the corresponding reverse motion is governed 1 


by an anti-Fokker-Planck equation. SL (4b) 
A 


uniform case, are 


OF (pit) lp 3 
x J dx y (31,32) F(x), (4a) 


t 


II. BASIC EQUATIONS 


We start our development with the BBKGY equa- We introduce the two particle, three particle—correla- 
tions, Eq. (1), the first two of which, in the spatially tion functions g and h—by 


Fa(xyxs,t) = F (pit) Fi(po,t)+-g (1,22,0) D (Sa) 


F3 (x1,22,35,0) = Fi(pyf)Fi(psI)F, (ps) + Fi (Ba,t)g(22,23,0) +F 1( Boyt) g Gn t) + Fi (post) g cient) Grana) 7 (5b) 
etc. With these definitions, Eqs. (4a) and (4b) are 


OF (pit) 1 
M- f itat (6a) 
ðt T 


9g (%1,%2,!) . 
— — t6 (71) 4-91 (22) ]g (ert) 


1 
= PCP (psh qu)-Eg Cr H+ desiit 


1 1 
+f deat oadelsursd-+ tct (6b) 


Define the operator S_;(s1,42) in terms of the single-particle Liouville operator 3€, by 
Sí (1,2) S exp{ — 56161) +51 (22) ]) , (7) 


i.e., S (x13) when operating on the point x; (i— 1, 2) propagates the point back in time along a free-particle orbit 
over a time interval /. Equation (6b) can be formally integrated along free-particle orbits to give 


t—te 


E(,xs,0) 9S ua) (sssde ennt | dt! S_ 1 (21,2) [estrstos L—Ü)Fi(ps, (—1')+-g (x1, xo, t— 1) ] 
0 


1 T À A 
+= debuti(os, 1—(Ü)g (x2, x3, t9 | debatos 1—UÜ)gQn, xs, t7 U) 
V v. 


1 
LU 32, 33, [—U) p. (8) 3 


e 
At this point we introduce the weak short-range potential and assume we can find a solution in the form 


gruss) m Gris) H- eg Grut) 

T (23,3255, = 1 yy xoc t) H- eh (raso) He, 
etc., where eis an expansion parameter that measures the strength of the porous (actually a FE of the 
of the average potential to the average relative kinetic energy). For our purpose it will be sufficient to 
ucc seas 


Probable form for F,. This has been considered by Grad (Ref. 6) who has shown that most likely the F, are uncorrelated. 
* H. Grad, J. Chem. Phys. 33, 1342 (1960). 
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g° (a1,%2,l0) = Ao (ary,0'2,3,l0) = °° * — 0. The zeroth- and first-order equation 


2 (¥1,%2,!) EIU 5 


1—to 
gO (33) = Saca) Qn) g (Enzl) Jl di S. X1,x2)0 1o (a2) Fi (Bv; (—U)F i(p», t—U) , 
Jo 


etc. 


To this order, the equation for F1 [ Eq. (6a) ] is 


aFi(pit) 1 
es [ iste 
at U 


(11) 


In Appendix A we prove that the set of Eqs. (10) and 

(11) are still reversible; the e expansion has not de- 
1 stroyed the reversibility. This point is taken up again 
| in Sec. IV. 


III. RELAXATION OF THE CORRELATION FUNCTION 


We now consider a special case, namely (taking /o— 0), 
g (21,230) — 0. 

(a) Consider first that |qi—q2| <R, where R is the 
range of the force. If we assume there exists a time 7, 
the “time of a collision,” such that for essentially all 
relevant momenta 


«= a. 


E 


8eis(|q— | ) 


oq: 
(pi— P2) 


"i aon 
i m 


S_, (21,22) 


m= 


) au 


then Eq. (10) reduces, for /2 7 to 


ge (31,35,1) 


3 / di S_y (1,%2)012F (pi, I)E Cp, tt), 
0 


for (27; 


(12) 


and, from (11) we have 
Fi (pi, (—U) =F (pit) HOC T) for 0€ <7, (13) 


where T is the mean time between collisions. 
To lowest order, 


grr? (21,%2,1) 


=| dt S..v (xy,x2)61F (py Fs (ps!) , 
0 


for izr. (14) 


‘Therefore, to this order, the functional form proposed 
by Bogoliubov develops, for |qi—q2| <R, in a time 7 
- from the initial time. The subscript F-P on g^? indicates 

that this is the form of g% that together with Eq. (11) 
s the Fokker-Planck equation (or, as Bogoliubov calls 
the Landau equation). The Fokker-Planck equation 
reversible in the sense that it obeys an H theorem.” 


“Lenard, Ann Phys. (N. Y.) 10, 390 (1960). 


LEWIS 


s are then (dropping the e notation) 


(10) 


(b) Consider | qi—42| > &. 

Suppose p: and p: are such that at no earlier time 
have the two points interacted, ie., |S (£12) (d:1— qe} | 
>R for 0€/& c ; then g D (1,35) =0 for all time. 

If pı and p» are such that at an earlier time ro(xs,xs) 
the points are just starting to interact, i.e., 


[S-4(q—a93| =R, 
then Eq. (10) is 
gO (xol) 
=0, t$€To; 


rt 


= | di S. v (xyxs)8o' (pi, 1— 0) FY(ps, i—i’) 5) 
ue ToK IS ro 7; 
rot T 
= f di' S i015 Y(py, L—U)Fi(po, i—i’) s 
D IZ Tor. 


(15) 


The asymptotic form for g™ is reached only after 
> 794-7 which can be (for given pı and p») arbitrarily 
large for points qı and q» sufficiently far apart. It is 
interesting to notice that the previous history of the 
system, in terms of F;, is stored in the correlation func- 
tion of distant points; for, at any time /, there are 
pairs of points (q1,q») such that the corresponding g” 
depends on the value of F, at any given time between 
zero and £. 
The asymptotic form of (15) can be written as 


gO (xxt) = J(t— ro) In, !2Totr, 


PEE ; 
(y (-— JeitmorisD, (16) 


Opi Ope 


TOT T 0 qi» 
t= | dl Sv (x1,2)— } 
TO ð 


qı 


where we have again used Eq. (13) and, from the 
spherical symmetry of the force law and the fact that 
the collision is completed, L, is in the direction of the 
distance of closest approach ro. Therefore, g^? is non- 
zero if J has a component in the ro direction. In equi- 
librium, J is in the direction of the relative momentum 
P= p:— pı which is normal to ro so that g(? =0. We See 
then, in the nonequilibrium state, the range of g can 
be arbitrarily large, but the range of (py,D2) which 
gives a nonzero value to g'? goes down with increasing 


|qd:— q»|. In Appendix B we show that the long range -— 
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of g does not cause the next approximation g to 


become unbounded with time. 


IV. THE REVERSE MOTION 


te In Sec. IIIa we considered the evolution of a system 
that at zero time was uncorrelated. In this section we 
shall describe the corresponding reverse motion. For 
le this purpose we consider the evolution of a second sys- 
tem constructed at an arbitrary time /g270 that is 
identical to the first system but with all momenta 
reversed. This second system is initially correlated in a 
special way and it is this initial correlation which 
causes the reverse motion to be accomplished. 

Consider again the initial condition g% (x;,x; 
We have, from Eq. (10) with 450 


2,0) =0. 


g (0125,51) 


XF(0| ps, 1g ity (17) 


where the zero preceding the vertical bar in g^? and Fy 
indicate the initial time was zero. In particular (sup- 


to 


| 
r 
| 
t 
| -f dl^S e (x1,%2)01F1(0| Pi, i—) 
| 0 
j 


g™ (lo| py, Pot) = S. cto) (one) | 


0 


the first integral is 


t—t6 


)- a Changing time variables, Eq. (21) becomes 


t—to 


g (fol Di, pat) -| dt'S_v (Pu p2)012(— pi, — ps) F1 (0| 
0 


2tg—t 
db | di'S-e(—py, — p3tis(— pi, 
0 
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dS v (— pi, — po)&is(— py, — p3)Fi(0] — pi, t5— t )Fv(0| — po, t5— t) t 


| £t , 
| "i di e (py,pa)6is (py po) Fillol py, t—)Fi(to| pa, 
|] E 0 


In the first term in Eq. (21) we can combine the S operators 


S-a- (By B2)S_—v (— pj, — D2) = S_ (eto) (Pi, P2) = See Pi — D2). 
Splitting the range of integration in the first integral in (21) into 


| 
| 
S (0,71—14) and (t—to lo) for tXtX2h, 


t—to to 
f if dtS_ (tev) (Pip) || d'S wrinol pr, =p) ati. 
; 0 


— pu2ts-t -0)F«(0|— p», 2t 


aE a dt'S-e (py, P2)012 (py, po) 


KINETIC THEOR' 


pressing qi, q2) we have 


£^ (01 — pi, —ps, f) 
t - 
= " dS. (— pi, — p2)612(— py, — 
a 


<Fi(0| — pi, (-0)F\(0| — pi, 
At time la start another system off E 


conditions 

Fiol Prb) |en= FiO! — pr, fo), — 

g™ (lo! Popat) | s g” (0] — pi, — pa, 1 

For £2 la we have from Eq. (10) 
g” (I| py pay) 

=S ato (Pr P2)g (to| py, pato). 


t—to 


+ dt’'S_v(p1,P2)912( Pi, pa) 
KF i(lo| py, t—)Py(fo| ps, à 
Using (19b) and (18) in (20) we have 


— p2)F 1(0| — pi, 2fo 


A1414 


which satisfy the initial conditions (19a) a1 
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xd (19b). To show that this is the case, suppose (24a) is the solution of 


(23) and (11), then the first term in (23) cancels the last term and the remaining term is, by (18), g® (0| —p;, 


— ps, 2fo— 1) thus establishing (24b); and (24b) in (11) le 
9PY(to| p, 


at v 
The right-hand side of (25) is 


v 


1 
3 | O p, —p» 25-)- — 


ads to (24a), i.e, Eq. (11) is 


1 
=- i dass (py, p2)g® (to| p, P2, - (25) 


lf ‘ 
A dx28 (— Pi, — p2)g) (0| — P, — P» 219— 1) (25a) 


v 


and the right side of (25a) is, by Eq. (11), 0F1(0|—p1, 2to—1)/ðt; we have for (o / S 2to, oF (to py)/0t 
=0F\(0|— pı, 215—1)/8t, or Fi(to| Pat) — Fi(fo| pfo) =F1(0| — Pi 219—1) — F4(0| — pi, to) and, with initial condition 


(19a), this is (24a). 


Over the time interval 45 1X 2f, the equation for F;(fo| 


can be written, using (24a) and (13), as 


pif) can be obtained from (25) and (23). Equation (23) 


219— 
e (Imus | d'S_y(— pi, — pais (— py, — p3)Fi(0| — pi, 2t0— l’ — 1) F'1(0| — p», 2l0— l — 1) 
0 


2t0— 
- -| di'S v (Pu P2)012 (py P2) 'Y(fo| pil) Fi(to| pf) - (26) 


0 


Equation (26), together with (25), is the anti-Fokker-Planck equation, i.e., the collision term is the negative of 
the Fokker-Planck collision term. If we call q1— q»— r we have 


1 ð ð 2to— 
OF \(to| pul/ata=—~ [tpe er Inf 
? ar lapi ape! Wo 


Changing r —r 


dF; (to| pat) 1 9e(|r]) (| a ð 2to-t 
aea aL ir tmu 
p: x or Opi Op) Wo 


which is the anti-Fokker-Planck equation for f<t 
<2to—r. 
At time ¢=2t, from (24a) and (24b) we obtain 


g” (to| pi,p2,2l0) =0, (28a) 
F (to| p1,2to) = F1(0| — pi, 0). (28b) 


The equation describing the motion for t> 2 is then, 
from (11) and (10) 


OF ,(2to| pif) 1 
a [tton pu D2,/) , (29a) 
ot v 
g” (2to| pu P2,!) 
t—2t0 


= dt'S—v (P1, P2)012(P1, P2) 
xFiQto| ps,t)F1(2to| ps), (29b) 
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: (pi— P?) 
aras r+ Y /^ 
m 
ð ð 
{——— viol PaCol po. 
Op: Ope 


(pi— p») ) 
r— t [2 
m 


à à 
i [—— rt pul)Fi(t| ps), (27) 
OP: Ope 


where 
g™ (2to| py pz,2/o) =0, . 
FiQtol pif) | i-o — F1(0| — pı, 0) , 
and, therefore, after #>2¢+7 the equation for Fi is 


again the Fokker-Planck equation and the system is 
again approaching equilibrium. 


(29c) 


V. SUMMARY 


From the BBKGY set of equations, we have derived 
an approximate set of equations for the weak force 
case, Eqs. (10) and (11); these have been shown to be 
reversible in Appendix A. In the special case that the 
initial g?-—0, the equation for F, evolves into the 
“irreversible” Fokker-Planck equation after a time 7 
(on the order of a time of a collision) and the system 
approaches equilibrium. 


| 


uc o 0o 0o Lr 


r ipse. 
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TIME-REVERSED 


Suppose, on the other hand, the system is allowed to 
run for an arbitrary length of time to; then in order to 
describe the reverse motion, we consider a second sys- 
tem identical to the first at time £j, but with all mo- 
menta reversed. The initial conditions on F and g'^ 
for this second system are constructed from the F; 
and g) of the first system at f». The second system 
during the time interval (¢,,2¢) now performs exactly 
the reverse motion that the original system performed 
during the time (0,5). The equation for F, for the 
second system during the time interval (Io, 24,—7) is the 
anti-Fokker-Planck equation and after the time 2454-7 
is the Fokker-Planck equation. 

For the second system the dependence of g® on its 
initial value causes the reverse motion to be accom- 
plished: “built into” the initial value of g® at pairs 
of points outside the range of the force is the earlier 
behavior of F, for the first system. The dependence of 
g® on its initial value would be missing if the mo- 
mentum reversal were applied to the Fokker-Planck 
equation. 

The final question of what to use in describing a 
given system is answered in the spirit of statistical 
mechanics by the argument that if all that is known 
initially about a system is F;, then the most likely g 
is zero and F; will most likely evolve according to the 
Fokker-Planck equation.® 


APPENDIX A 


Equations (10) and (11), with fo=0, suppressing 
spatial coordinates in g and S, are 


OF 1(pi,f) 1 Ó qi» ð ð E 
es [ tate (Jets, (A1) 
ðt v ôqı Opi Ops 


g (Pi, P2,t) 


=S—:(P1,P2)g™ (py P2,0) 


: Qe; / ô ð 
| dt'S_+(p1,P2) (—-=) 
0 ðqı \Opi Ops 


XFi(pi, t—U)F (pe, tb) . 


(A2) 


Consider ¥(pi,é) and G(pu ps!) to be the solutions of 
(A1) and (A2) with initial conditions $(p,,0) and 
G(p1,p2,0). In Eqs. (A1) and (A2) change pi— — pi 
and t—>—#; then using the fact that SS ,(pips) 
=S,(—pi, — p2) we have 


àFi(—p,-—1! 1 óc: /0 9 
at ? ðqı \Opi Op2 


Xe(—pi, — Pa —0, (A3) 


CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


MOTION IN 


KINETIC THEORY A1415 
g(— pi, — ps, —1) 
= S-í(py,p)g P (— pi, — pz, 0) 
t 4 ^ ^ 
ðe / ô ð 
+f E) 

o dq: Nópi Ope 

KF (— py, —tE)FiC— ps —1+0). (A4) 


Comparing Eqs. (A3) and (A4) with (A1) and (A2) we 
see that if we take as initial conditions in (43) and (A4) 
FiC— pi, —2)| eo F(pi,0), 
g"(—py —ps —1£)l o 9(pyp»0), 
then the solutions of (A3) and (A4) are 

Fi(— p, —2)—$(pyl), 


g"(—piy —ps —)= g(pups/), 
or 


F,(pi,t)= F(— Pi, —i) , 
g” (pups) = g(— pi — P2, —t) : 


Therefore, the approximation used to obtain Eqs. (A1) 
and (A2) have not destroyed the reversibility. 


(A5) 


APPENDIX B 


We show here that the long range of gU (see Sec. 
IIIb) does not cause the next approximation g” to 
become unbounded with increasing time. g% will con- 
tain a term 


1 rt 
-f dS. (xi) | dru 1— tjg (29, x, i=) 
vJ0 

OF (py, t—1) 


tere Ó cis = 
= -| dt | tuse; (1 d: — qz! ) 
vo Oqi 9pi 


Xg D Gef, 2, A) 


(B1) 
where 


x/-—S.e(xx)x; i=1, 2. 


Figure 1 shows a configuration at time ¢ where the 
integrand in (B1) is nonzero. 


81* 


Fic. 1. An inter- 
acting configuration. 
(a) qi*=ai—pil/m; 
i=1, 2. (b) q is 
within range of force 
of qi’ and (c) p is 
such that at an 
earlier time (but be- 
fore particle 2 is at 
q;*) particles 2 and 3 
interact since g! 740 
at that time. 
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In terms of the relative momentum P32; P5» must be 
within a solid angle zR?/ | r2s(¢’) |? in the direction ra (o), 
which at large distances can be replaced by c R?/ 
| ms (P) |*, where ris() 9 12— Pis/m and r= q1— 05 


Pio= pi— Pe. r ; 
In (B1) changing the momentum integration to P22 


we have 
t ^ 
fu | dQ py , 
EA 4 


Ó q13 oly 


Bec 


ôqı Opi 


(B2) 


1 
"n — [tnbus 


Ve 
To obtain the magnitude of (B2) we neglect the varia- 
tion of a with time and treat it as constant with respect 
to the angles when it is nonzero. We then have 


PHYSICAL REVIEW VOLUME 


significance. 


I. INTRODUCTION 


TN the forward to Vol. 1 of his monographs on 

superfluids, F. London! questions whether a 
superfluid-like state might occur in certain macro- 
molecules which play an important role in biochemical 
reactions. If this should be the case, an entirely new and 
important consideration would be added to the problem 
of understanding living systems. In view of the signifi- 
cance of such an effect, it appears appropriate at this 
time, when a theory of superconductivity, the Bardeen- 
Cooper-Schrieffer (BCS) theory? has been so remark- 
ibly successful in explaining much of the behavior of 
uperconductors, to examine in the light of this whether 
or not a superconducting state might occur in certain 
macromolecules. In view of the extreme complexity of 
biological systems, it would be folly for a physicist to 


* Supported in part by the National Science Foundation and 
the "S. Navy Office of Naval Research. 
1 F, London, Superfluids (John Wiley & Sons, Inc., New York, 


950), Vol. 1. : 
2 J. Bardeen, L. N. Cooper, and J. R. Schrieffer, Phys. Rev. 108, 


134, 


W. A. 
Department of Physics, Stanford University, Stanford, California 
(Received 13 November 1963; revised manuscript received 27 January 1964) 
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t ah? 
«f iia em 
o |n)! 


act Pysl/mry3— cos 
ES = tan“ mE 
ry (P3;)/m) (1 — cos6)'* (1— cos)" 


— cosh 
— 14 7 | me » 
tan Es ze (B3) 


where 0 is the angle between ri» and P,» Equation (B3) 
approaches a limit as /—~, according to 


ark? T — cosh 

— = SS ilche >| — — | 

r1a(Pro/'m) (1—cos’e)!”2| 2 (1—cos?0)!/? 
(1 — cosa) 


P yol/mr12 | 


(B4) 
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Possibility of Synthesizing an Organic Superconductor* 


LITTLE 


London’s idea that superconductivity might occur in organic macromolecules is examined in the light 
of the BCS theory of superconductivity. It is shown that the criterion for the occurrance of such a state 
can be met in certain organic polymers. A particular example is considered in detail. From a realistic estima- 
tion of the matrix elements and density of states in this polymer it is concluded that superconductivity should 
occur even at temperatures well above room temperature. The physical reason for this remarkable high 
transition temperature is discussed. It is shown further that the superconducting state of these polymers 
should be distinguished by certain unique chemical properties which could have considerable biological 


attempt to experiment in such an environment. Instead 
of attempting this, we shall tackle the problem on our 
own grounds. The BCS theory, while by no means 
complete and exact, has succeeded in providing a model 
with most of the essential features of a superconductor. 
In particular, it prescribes certain criteria for a system 
which, if satisfied, should lead to the superconducting 
state. Our approach is to consider how these criteria 
might be applied to the design of a particular organic 
molecule which, if its synthesis is possible, should show 
some of the essential features of a superconductor and, 
as we shall show, some remarkable chemical properties 
as well. One of the interesting features about the 
particular class of molecules we investigate in detail is 
that the molecules should be superconducting at room 
temperature and, indeed, to temperatures well above 
room temperatures. We can show on simple physical 
grounds why this is so and perhaps, with hindsight, why 
this was to be expected. 

The idea of superconductivity in organic systems is 


not a new idea, however, there is a considerable amount — | 


» 


— 


™ 


CY* 
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of confusion as to the exact meaning of this. The 
diamagnetic ring currents of aromatic molecules such 
as benzene, naphthalene, etc., are nondissipative cur- 
rents similar in many respects to the persistent currents 
of superconducting rings and, have often been referred 
to as a form of superconductivity. However, the *Super- 
conductivity" of these molecules is not the same as the 
superconductivity of bulk materials. The reason, I 
believe, is the following. In macroscopically large 
superconductors, if superconductivity exists, then a 
finite fraction of the charge carriers, in general, the BCS 
pairs are in identically the same center-of-mass momen- 
tum state. This state then has a macroscopic occupation. 
In a magnetic field the canonical momentum of this 
state remains unchanged, but due to the vector potential 
term contained in it a current is induced and the energy 
of the state changes. l'or a macroscopically large super- 
conductor the kinetic energy of the different center-of- 
mass momentum states of the pairs lie extremely close 
to one another, however, because the coherence energy 
of each state depends upon the square of the number of 
pairs in that state, the state which is macroscopically 
occupied is appreciably lower in energy than any of the 
neighboring states even in a moderate magnetic field. 
It is only by transitions in which practically all the 
pairs in the macroscopically occupied state simul- 
taneously move to another state that a lower energy 
final state can be reached. This is obviously highly 
forbidden and, consequently, the system of pairs 
remains in the momentum state into which condensa- 
tion originally occurred. Thus, it is the coherence energy 
which prevents the system from freely adjusting itself 
to take the lowest possible energy. In the aromatic 
ring compounds practically all the molecules are in 
their ground states. In a magnetic field the canonical 
momentum of the electrons in this state remain un- 
changed and diamagnetic currents flow in the molecule 
similar to those of a bulk superconductor. The energy 
of the different momentum states of the electrons in 
each molecule in this case are well separated though, 
because the molecules are of microscopic size. Thus, the 
momenta of the electrons do not change because for 
fields as large as those available in the laboratory, the 
state which evolves out of the original ground state still 
is lower in energy than any other in the presence of the 
field. Tf, however, the aromatic system is made arbi- 
trarily large such as in graphite, bulk superconductivity 
does not result because as the system gets bigger, the 
different momentum states of the electrons approach 
each other in energy. Transitions can then occur be- 
tween states and the induced currents are dissipated. 
So that in order to get superconductivity in a macro- 
molecule or in a bulk material, something of the nature 
of a coherence energy is required. In conventional 
superconductors this is provided by the phonon- 
induced, electron-electron interaction; in attempting to 
devise a macromolecule which is to be superconducting 
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one must provide, therefore, some mechanism similar 
to this. In our model we do this in the following manner. 


Il. MODEL SYSTEM 


We shall consider a molecule consisting of two parts, 
a long chain called the “spine” in which electrons fill 
the various states and may or may not form a con- 
ducting system; and secondly, a series of arms or side 
chains attached to the spine as indicated in Fig. 1. We 
will show that by appropriate choice of the molecules 
which constitute the side chains, the virtual oscillation 
of charge in these side chains can provide an interaction 
between the electrons moving in the spine. This can be 
made a sufficiently attractive interaction so that the 
superconducting state results. We can show further that 
even if the spine by itself is initially an insulator due 
to the valence band being full and the conduction band 
empty, the addition of side chains can increase the 
clectron-electron attraction to the point where it 
becomes energetically favorable to enter the super- 
conducting state by mixing in states of the conduction 
band. The spine thus transforms from the insulating or 
semiconducting state directly to the superconducting 
metallic state upon the addition of the side chains. 

Consider a long chain molecule as shown in the left 
half of Fig. 1. We will assume this is a conjugated chain 
of double and single bonds resonating between the two 
at each link. This corresponds in the band theory of 
metals to a band which is half filled and ideally is a 
metallic conductor. (See, however, Sec. III.) At the 
points P, P’, +-+, a regular array of side chain mole- 
cules B are attached. The individual side-chain 
molecules are chosen to have a low-lying excited state 
such that transitions from the ground state to the 
excited state correspond classically to an oscillation of 
charge from end to end of the molecule. 

The electrons moving in the spine may be described 


Fic. 1. Proposed 
model of a super- 
conducting organic 
molecule. The mole- 
cule A is a long un- 
saturated — polyene 
chain called the 
“spine.” The mole- 
cules B are side 
chains attached to 
the spine at points 


as 


or 
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in the tight-binding approximation? by eigenfunctions 
of the form 


1 
Pa 77 x e* RU, (r— Rj) , 


(2.1) 


where U,,(r— R;) is the wave function of an electron in 
a single atom located at Ri. G is the number of links in 
the chain and & is the energy of this electron and we 
will assume cyclic boundary conditions. To avoid un- 
necessary complication we shall ignore the electron 
spin throughout the discussion except where necessary. 

The wave function of the electrons in a single side- 
chain molecule in an eigenstate » we designate by 
Va (rura, :-). Due to the interaction of one side chain 
upon its neighbors, the degeneracy of the levels of the 
group of side chains will be removed when the side 
chains are brought together as in Fig. 1. The band of 
levels of the system of side chains as a whole can be 
described then by a new wave function similar to (2.1) 
above. 


1 
Xon=— X, ePi (ri Rjro- Ry). (2.2) 
qu 


The Coulomb interaction between the electrons in the 
spine and those on the side chain will provide an inter- 
action between the electrons in the spine and the 
side-chain modes. Let this Coulomb interaction be 
V=} V(r—R;n-Rjr-—R;:-:-) which will give 
rise to a typical interaction matrix element 


i Í z4 Í ine ODE (rra: o) 


X ViX qu (rro: e )óxm (r) dird?r; - PS (2.3) 


In the model we shall consider, screening reduces the 
range of the Coulomb interaction so that one can safely 
disregard the contribution to the interaction from any 
but the neighboring side chains. Secondly, in our model 
there is negligible overlap between side chains, and also 
overlap between sites on the spine is relatively small. 
These features together with the properties of (2.1) and 
(2.2) and the assumption of regular substitution along 
the spine allow us to approximate the interaction (2.3) 
to give the following 


V km* ,q' n' ankm = 2. 


1 
= cosQn"a | --- 
G »''—21,0:41 0 | 

x [v (Nba (n—n"a,- E -) 


xVws(n-n'a)Us(r)xdirdiri--., (2.4) 


where £' — k-4- Q, q' —q— Q, and a is the spacing between 
-side chains. 


3 See for example N. F. Mott and H. Jones, The Theory of the 


: "Properties of Metals and Alloys (Clarendon Press, Oxford, 1936). 


- 
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It is convenient to write this as (1/G)V (Q) for the 
moment. Then the complete spine side-chain interaction 
can be described in the representation of second 
quantization as 


1 
V side chain — — 2 V(Q)( » €q—Q,n'16q,5n) G4. Q mr T, J 


(2.5) 


G Qnn’ a 
k,m,m’ 


where the at and a are the creation and destruction 
operators for the electron in the spine, and ct and c the 
corresponding operators for the side-chain modes. The 
particle-hole operator 3664-0, 64. Which occurs in 
(2.5) appears in a similar manner to the phonon creation 
operator bgt in the phonon-electron interaction in a 
metal. The terms linear in this particle-hole operator 
which appear through the interaction (2.5) in the total 
Hamiltonian can be eliminated by the same type of 
transformation! which eliminates the terms linear in 
the phonon creation operator ba! in the conventional 
theory. This leads to a side-chain induced electron- 
electron interaction V2 between electrons in the spine of 
the form 


2V*(Q) hwg 
G (erro — €x)? — (hwg)? 


X dry qtay—etaya,. (2.6) 


We have written fiw for the difference (En —ÆEn) be- 
tween the energy of the states of the side-chain modes 
and have considered for simplicity only one excited 
state n’. This is identical in form to the usual phonon- 
induced electron-electron coupling. We note that if 
[w| | (erpo—e)| then the term in brackets reduces 
to an attractive interaction, V~—2(|V(Q)|*)ay/Ghw, 
where (|V(Q)|?)ay is the average of the square of the 
interaction V (Q). 

In addition to this attractive term, the screened 
Coulomb interaction gives a repulsive term. The sum 
of these two is the net electron-electron interaction. Let 


this sum be (V(Q)).y; then the total Hamiltonian for 
the electrons in the spine is 


N=} (er—u)artar 
k 


um» UO Jay. ola Qa d , (2.7) 


where (VQ) day is given above and u is the Fermi energy. 

The BCS theory then shows that a superconducting 
State is possible if the following equation can be 
satished for a nonzero gap, A: 


V tanhi8E; 
= nhi i: (2.8) 
k 2E; 


‘J. Bardeen, Encyclopedia of Physi i , Flügge 
(Springer-Verlag, Berlin, 1956) VoL dS e pupa 
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where Er e [Ce.— u)y-- Ae}? and the sum is limited by 
| e—u| «Ite. V is the average over the region of Q 
where (V (Q)).v is attractive. The critical temperature 
is given by 


kT .=1.14hw exp[ — 1/N (0)V ], (2.9) 


where JV(0) is the density of states of one spin at the 
Fermi surface. 

It is appropriate to dispose of a difficulty here which 
will arise shortly in considering a particular example of 
a molecule. In long-chain conjugated molecules the 
double and single bonds do not resonate freely between 
each position in the chain, but tend to localize so that a 
stationary periodic charge distribution and periodic 
bond length is established. This introduces a periodic 
potential of twice the atomic spacing in the chain which 
in turn produces a gap in the density of states halfway 
up the band. As there is one electron per atom in the 
band, this band is filled up to the new gap. The conju- 
gated system is then no longer a metallic conductor but 
a semiconductor or insulator, i.e., the density of states 
at the Fermi surface N (0) would be zero.* However, it 
is incorrect to interpret Eq. (2.9) as indicating that 
T.=0 in this case, for Eq. (2.9) is only an approxima- 
tion. Instead, one must go back to (2.8) and examine 
this to see whether in this case a solution is possible. 
In the next section we consider this in detail. 


III. SUPERCONDUCTIVITY IN A SEMICONDUCTOR 


Let us consider a somewhat idealized case of a semi- 
conductor with a band structure as shown in Fig. 2. 
Before the introduction of the periodic potential which 
generates the gap, the energy of the states would have 
been given by that shown by the dashed line in the 
figure. Let us take the band gap as ô and for convenience 
we will assume the density of states near the band edge 
is the same in the two bands. If the lower band is 
completely filled and the upper empty, then the Fermi 
level will be halfway between the two bands. We will 
assume that fw is greater than 6. 

Let N (0) be the density of states at the Fermi surface 
for the system prior to the introduction of the periodic 
potential and let e be the energy of these states measured 
with respect to the Fermi surface. The semiconductor 
band gap can then be conveniently introduced by 
changing the energy of each state e to e = [e+ (8/2) ] ^. 
We can now use Eq. (2.8) to see if the superconducting 
state can occur at any temperature. At T=0°K a 


5 Where one does not have alternating double and single bonds, 
but a double bond separated from the next by two single bonds, 
the band theory does not appear to work and one finds that the 
chain is an insulator instead of a metal. This empirical fact does 
not appear to have been explained, but seems to be a consequence 
of the same considerations of the metallic and insulating states 
discussed by N. F. Mott [ Phil. Mag. 6, 281 (1961)], and W. Kohn 
[Phys. Rev. 133, A171 (1964)] As this question is not quite 
settled we have limited ourselves to a conjugated chain which is 
known to behave as one would expect from the band theory. 
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Fic. 2. Band structure of a conjugated chain semiconductor 
caused by the localization of the double bond. The energy hw is 
the energy of the transitions of the side chains. 


superconducting state is possible if V is negative and 
vi ; 
lx ee (3.1) 

e 2[e-F (6/2) j^ 


where the sum is limited by | e| <ho. 
Transforming to an integral, we require 


1 x de 
e x i 
NO) LV) o [e+ (6/2)? }? 


(3.2) 


where x*= (fiw)?— (6/2)? and N (0) is defined above prior 
to the introduction of the gap. 


1 [o — (6/2)? Jha 
K nf | = (3:3) 
N(0)|V) 


6/2 


If (6/2)’< (hw)? then the criterion for obtaining the 
superconducting state is that 


—1 
-——] (3.4) 
N(0)| V! 


ô < Alte; exe] 
If this criterion can be satisfied, then the transition 
temperature can be obtained from the expression 


e 


1 7 tanhl8.e 
Se Í —e, (3.5) 
AN (0) V] 0 


which is the same expression for determining the tem- 
perature at which the energy gap of a superconductor 
becomes equal to 6/2. The form of this expression is 
given graphically in the BCS paper. We only need note 
that T. rapidly approaches the transition temperature 
for 5=0 as | V |N (0) is increased beyond that necessary 
to satisfy the criterion. So that if the criterion can be 
satisfied, then the transition temperature generally will 
be of the same order as the transition temperature of 
the metallic superconductor. We see then that a gap 
in the band structure does not necessarily exclude the 
superconducting state. Incidentally, this conclusion does 


* 
` 
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not violate Yang’s® statement that superconductivity 
or off-diagonal-long-range order (ODLRO) cannot occur 
in an insulator because his definition of an insulator is 
one in which there are no available empty states. Our 
point is that these states are; in fact, available in our 
model. 

It is perhaps appropriate to note that a gap in the 
density of states such as that considered above would 
make the average side-chain induced interaction more 
strongly attractive because for interband transitions 
(erpo— €x) would be at least 6 and as can be seen from 
(2.6), the attractive term would thus become larger 
provided that 5</w. If one takes the variation of 
(V(Q))av with Q into account, the details of the above 
inequality are, of course, changed, but the general 

: feature that the existence of superconductivity is 
determined by some such criterion remains. 


IV. PARTICULAR EXAMPLE 


As a particular example of a molecule of the type 
considered in Sec. II, we will consider in detail the 
molecule illustrated in Fig. 3. The spine is a conjugated 
chain of alternating double and single bonds. To this is 
attached a series of side-chain molecules as shown. 
Because of the great thickness of the benzene rings in 
the side chains, compared to the carbon-carbon spacing 
in the spine, it is not possible to attach a side chain to 
every carbon atom on the spine. This can be seen more 
clearly in Fig. 4 where the molecule is drawn to scale 
using the known values for the van der Waals radii of 
the constituent atoms. The changed periodicity of the 
new structure necessitates a slight modification of the 
wave function used in (2.1) and (2.2) above, but the 
modification is an obvious one which we will handle 
later. The side-chain molecule is part of a well-known 
dye molecule used for sensitizing photographic plates 
in the red, a diethyl-cyanine iodide, and it has been 
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ture proposed 
superconducting organic 
polymer. At each point R 
on the spine a similar side 
/ chain to the one shown is 
7° attached. These fide 

n chains are resonatin - 
ame brids of the two Bue 
structures shown in the 
inset. The positive charge 
resonates between the two 
nitrogen sites as illus- 
trated. 


C. N. Yang, Rev. Mod. Phys. 34, 694 (1962). 
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Fic. 4. Approximate scale draw- 
ing of the proposed superconduct- 
ing organic polymer. The plane of 
the benzene rings in the side chains 
are oriented at right angles to the 
spine. The two nitrogen sites on 
each side chain are indicated, but 
the iodine site has been omitted 
for the sake of clarity. 


chosen because its absorption spectrum is known, and 
its ground and excited states are well understood.’ 

Before estimating the matrix elements of the inter- 
action of the side chains with the electrons in the spine, 
it is useful to consider first what effect screening will 
have upon the Coulomb interaction. 


Screening 


The Coulomb potential at a distance 7 from a charge 
which is placed in a conducting medium is screened by 
the rearrangement of the charges of the medium. The 
potential is then given approximately by (e/r)e™. The 
screening length 1/\ in a free-electron gas can be 
estimated by using the Thomas-Fermi method as 
shown by Mott and Jones. From this one finds that 
X= (4me'/li?) (3N o/r)!!? where No is the number of 
electrons per unit volume. For the carbon atoms on the 
spine, one valence electron per atom is relatively free, 
and from the known size of the atom one can estimate 
Noand thus à. We find that 1/A is approximately 0.5 A. 
This value varies extremely slowly with the number of 
valence electrons so that our choice of one free valence 
electron is not critical. We note then that the Coulomb 
interaction is screened out in an extremely short 
distance. 

It should be noted too that screening can occur only 
where the charges of the medium are free to move, as for 
example, within an atom or along the conjugated series 
of atoms of the spine. Where the charges are not free 
to move indefinitely, then only a limited displacement 
of the charges occurs which merely modifies the 
Coulomb interaction by the dielectric constant of the 
medium, but does not screen it. These two cases must 
be born in mind in considering the Coulomb interaction 
and the side-chain interaction. Because the electrons 
are free to move in the spine one must use the screened 
Coulomb interaction for computing the Coulomb 
repulsion between the electrons in the spine. However; 


1K. Mees, The Theory of the Photographic Process (The 
Macmillan Company, New York, 1942), p. 987. 
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the interaction between the electrons in the spine and 
the charges on the side chain is only partially screened 
because the side chain is insulated from the spine and 
a free movement of charge between the two cannot 
occur, i.e., the conjugation of the side chain does not 
extend to the spine. Here one must use the Coulomb 
interaction modified by the dielectric constant of the 
medium together with some screening due to the in- 
duced movement of charge in the spine. 


Coulomb Repulsion 


For steric reasons, it is not possible to attach a side 
chain to each carbon atom of the spine. Consequently, 
the Hamiltonian of the spine is not invariant under a 
displacement from one atom to the next as assumed in 
our earlier discussion. In our example, the unit cell is 
repeated only after four carbon atoms and thus, we are 
dealing here with a lattice with a basis of four atoms. 
The wave function of the spine is then 


1 fe} 4 
Qm p» giis py aU s (r—r;.;) . (4. 1) 
(G) i i 


R, is now the position of the lattice point measured 
along the zig-zag line joining the carbon atoms and 7;,; 
the position of the jth atom in the unit cell measured 
with respect to Ri. o; is a phase factor which one would 
expect would be very nearly (1/4!/)eik6;,0, which is 
the value it would have if the Hamiltonian was per- 
fectly invariant under a C—C displacement. The 
number of unit cells, G is now a quarter of the number 
of carbon atoms in the spine. 

Let us now calculate the Coulomb repulsion between 
electrons in states given by (4.1): 


V (Q)Coutomb = [| $i-Q. mdrr (r2) y (12) 


X bu (ro)ór(ridirdiro. (4.2) 


Using (4.1) and the fact that the Coulomb interaction 
is screened from all except immediate neighboring ions, 
we obtain in the long-wave limit, i.e., Q~0 


1 TM e © 
ecu] f. | U(ri—7rj) [ay (riz) | U (r2—1;) |? rio 
4G 


+E five-nevedive—mlera|. 63 


kj 


The second term should be much smaller than the 
first because V (r) is heavily screened. To a first approxi- 
mation we shall ignore it compared to the first. A 
reasonable exact value of the dominant term could be 
obtained by using the known form of the sp carbon 
orbital and evaluating the integral, however, one can 
obtain a reasonable estimate of the approximate magni- 
tude of the term by considering the electron density in 
the orbital as constant and occupying a volume of about 
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half the volume of the atom. The integral is easily done 
and gives 6e?/\2R% where R is the van der Waals radius 
of the carbon atom. Using the value obtained above 
for X and a van der Waals radius of 1.5 À* we obtain a 
value of 6 eV for the integral. This seems to be a reason- 
able estimate for one might expect it to be comparable 
to the energy necessary to add one additional electron 
to a carbon atom. The energy necessary to form the C~ 
ion is known as the electron affinity A and is related 
to the ionization energy 7 and the electronegativity x 
of the atom by the relation? (/--4)/5.4— where J and 
A are expressed in electron volts. The ionization energy 
of carbon is 11.3 eV and x is 2.5 giving an electron 
affinity for carbon of 2.2 eV. This is of the same order 
of magnitude estimated above. Taking the larger value 
of 6 eV to be safe we find 


V (0) coutomb* 1.5 eV/G. (4.4) 


For larger values of Q the first term in (4.3) remains 
unchanged in the tight-binding approximation while 
the second term which we have neglected above is 
reduced by an additional factor of cosQa where a is the 
carbon-carbon spacing. Over the whole range of Q, then 
we can take the Coulomb repulsion to be of the order 
of that given by (4.4) 


Side-Chain Interaction 


The wave function of the side chain must be modified 
in the same way as the wave function for the electrons 
of the spine. It is now 


tg 4 
X qn=- eani y Ban Tii £ e (4.5) 


Gi? i-i j=l 


At the sites, 7, where side chains are attached 8 is 
approximately e'%#i/2!/?, and zero where there are no 
side chains. 

The side chain we have chosen is a resonating hybrid 
of the two extreme structures shown in the inset to 
Fig. 3. If the wave functions of these two extreme 
structures are y and y, respectively, then the ground 
state, Vo is (1/v2)(¥i:+y_) and the excited state of 
interest to us, Yı is the orthogonal hybrid structure 
(1/72) (¥4—y_). In the matrix element (2.4) we require 
vivo which is simply 3E p— Iy-[2]. 

Using the new wave functions for the spine (4.1) 
and (4.5), Eq. (2.4) reduces to 


4 42 
C | f Y È anojas| Unt- r)? 


j=l n-— 
; 4 
XV (r)ei9n593 E Barot Bat 
i=) 


X {lve Gras)? Wo Gruas))?), Ge 


* L. Pauling, The Nature of the Chemical Bond (Cornell Univer- - 


sity Press, Ithaca, New York, 1960), pp. 257, 95. 
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Fic. 5. Enlarged 
view of atoms of the 
spine of the proposed 
polymer. 


where we have confined ourselves to a single atomic 
level m. The different & levels then describe the bands 
of the type discussed in Sec. TT. We recall that j and / 
refer to each of the four atomic sites within the unit 
cell; and z has values 0 and +1 which refer to the same 
unit cell, i and the two adjacent cells, respectively. 

In the superconducting state two types of matrix 
elements are important; those which describe scattering 
between states close to the Fermi surface with momen- 
tum transfer Q~0 and those for scattering from one 
side of the Fermi surface to the other. The latter 
scattering involves a Q~7/a because there is one 
electron per atom in the band originating from the 
fourth sps orbital of the carbon atom. We shall estimate 
the side-chain interaction matrix element in the 
neighborhood of these values of the momentum transfer. 

Firstly, the interaction between the spine and the 
iodine ion of the side chain cancels in the matrix element 

(4.6) because the ion is located at the same point in the 
two extreme structures of the hybrid. Secondly, in the 
y- structure the positive charge is located on the 
nitrogen site which is remote from the spine, conse- 
quently, the interaction with this structure is weak. 
The only important contribution to the matrix element 
then, is that which is due to the positive charge on the 
nitrogen close to the spine in the y4 structure, and the 
electron on each of the sites r;. 

In Fig. 5 we show an enlarged drawing of the spine 
and part of the side chains. In view of our earlier 
discussion of screening, it would be consistent to treat 
the interaction between the positive charge on nitrogen 
site Ns (see Fig. 5) and carbon site 1 as virtually un- 
screened; the interaction with sites 2 and 3 as half 
screened and sites beyond this as completely screened. 
Our choice is based on the picture that the migration of 
negative charge from the vicinity of C2 and C; towards 
C; would partially screen the field produced by the 


TABLE I. Carbon-nitrogen distances and matrix elements. 


Matrix element V (Q) 
(eV) 


Carbon Distance to Ns 
site (A) Q=0 Q=n/a 
1 1.5 0.76 0.41 
2 7S) 0.47 0.11 
& 3 3.0 0.47 0.11 
4 (4.5) 0.76 0.41 
i Total 2.46 1.04 
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positive charge on Naso that at Cy the interaction would 
be effectively.reduced to zero. We arrive at the figure 
of 4 for the screening of sites 2 and 3 because half of the 
atomic orbital adjacent to the nitrogen site on these 
sites is practically unscreened, while the opposite side is 
quite strongly screened. There is very little materia] 
between the nitrogen site and each of these three carbon 
sites, so that it seems reasonable to leave the Coulomb 
interaction with the dielectric constant that of free 
space. In. Table I we tabulate the distances between 
the nitrogen site and each carbon site. We include also 
the matrix element for each site, j, in Eq. (4.5) com- 
puted for Q=0 and Q=7/a using the approximate form 
of the a’s and B's given after Eq. (4.1) and (4.5) and 
the unmodified Coulomb interaction limited in the 
manner described above. 

It is reasonable to neglect the interaction with the 
positive charge on the remote nitrogen site on the y.. 
structure because the distance to the spine is about 14 A 
and the interaction is further reduced by the dielectric 
constant of the material of the side chain. This dielectric 
constant must be similar to that of benzene which is 
about 2.2. This gives a total matrix element of 0.1 eV 
for Q=0 which we can neglect compared to the total 
computed in Table I considering the approximations 
we have made. 

In order to calculate the side-chain induced electron- 
electron interaction, we must know the energy ñw for 
the transitions of the side chains. For an isolated side- 
chain molecule of 1,1’-diethyl-4,4’-cyanine iodide the 
absorption maximum for the transition we are consider- 
ing occurs at 600 my giving a value of 2 eV for lic." 

In the polymer the side chains will interact with one 
another and change the frequency of oscillation to some 
extent, but as the molecules are quite well separated 
and the charges are quite well screened from one 
another, let us take the frequency to be about 2 eV 
nevertheless. Then the side-chain induced attraction 
V = (2V (Q))).,/Glio will be approximately —3.5 eV/G 
where we have taken the mean of the square of the 
interaction for Q—0 and Q=7/a as given in Table I. 
This is greater than the Coulomb repulsion of 1.5 eV/G 
estimated in (4.4) so that the net interaction is an 
attractive one, 


(4.7) 


The reason this is so strongly attractive is that we have 
seen to it that the nitrogen sites lie close to the spine S0 
that the matrix element (4.6) is large and at the same 
time have chosen a side chain with a fairly low-frequency 
transition so that fiw is small. 

Some idea of the superconducting transition tempera- 
ture now can be obtained by estimating the density of 
states for the electrons in the spine, N (0). This can be 
done in the following way. The spine itself is Very 
similar to a conjugated polyene chain {CH=CH—}n 
except that the side chains replace certain of the 
hydrogens so that one can crudely estimate the density 


V~—2 eV/G. 
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of states in the spine from a knowledge of it in a polyene 
chain. The benzene ring is essentially a triene 
{CH=CH— } tied back on itself in the form of a loop 
and can be described in terms of a simple band pic- 
ture.” The first absorption at 250 my is believed to cor- 
respond to a transition of a z-electron from a m=1 to 
m= 2 state which is approximately half the total width 
of the band. The total width of the band for benzene is 
thus = 10 eV and should be approximately the same for 
the polvene chain. If we assume the e versus £ curve is 
parabolic up to the halfway point in the band (up to 
5 eV), then the density of states of one spin at the Fermi 
surface is approximately 4G states/eV. This is probably 
a reasonable estimate of the density of states as it 
corresponds to an effective mass for the electrons in the 
spine of 0.7 the electron mass. i 

If there was no bond localization in the spine, then 
we could use Eq. (2.9) to estimate the superconducting 
transition temperature using (4.3) and the above 
density of states. One obtains a temperature ~2200°K 
in this case! This extremely high transition temperature 
can be understood when it is realized that in the chosen 
structure it is an electronic oscillation which provides 
the coupling between the electrons rather than the 
oscillation of the nuclei as in a conventional super- 
conductor. The simple argument of the isotope effect 
that the transition temperature for a phonon-coupled 
superconductor is proportional to 1/M'/?, where M is 
the isotopic mass of the nuclei indicates that for an 
electron-coupled superconductor the transition tem- 
perature should be a factor of (M/s)? (i.e., 2300) 
times larger. This is, perhaps, too glib an answer for it 
is necessary to choose the over-all structure so as to 
obtain a sufficiently strong coupling matrix element 
(2.6). Our particular model illustrates this in detail. 

If there is considerable bond localization in the spine, 
then our inequality (3.4) shows that the superconduct- 
ing state can still occur if the semiconductor gap ô is 
somewhat less than 0.67 eV. If this is satisfied, the 
transition. temperature in this case should still be 
several hundred °K. 

For transition temperatures as high as this the 
coherence energy of the superconducting state becomes 
comparable to the chemical binding energy. This energy 
is approximately 


Wo= —2N (0) (fiw)? expl—2/(V(0)| VD]. (4-8) 


In our example, the coherence energy is about 0.1 eV 
per unit cell of the chain. This is not very large, how- 
ever, if one synthesized a polymer in which the density 
of states is large but |V| is small so as to obtain the 
same transition temperature, then the coherence energy 
would become quite large. A coherence energy of as 
much as 1 eV per unit cell of the chain appears possible 


3 J.C. Slater, Quantum Theory of Molecules and Solids (McGraw- 
Hill Book Company, Inc., New York, 1963), p. 234. 4 

10C, R. Noller, Chemistry of Organic Compounds (W. B. 
Saunders Co., Philadelphia, 1960), 2nd ed., p. 665. 
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in such a polymer. As this energy is comparable to the 
resonance energy of the benzene ring, one should expect 
a considerable stabilization of the polymer on this 
account. It is interesting to note, too, that the destruc- 
tion of superconductivity at one point in the chain 
raises the energy by the coherence energy per unit 
length times the coherence length >. The coherence 
length ¢a for these molecules should be about 30 A as it 
is inversely proportional to T, and ¢y~104A for 
conventional superconductors. Consequently, it would 
require a large amount of energy to destroy the super- 
conductivity locally. 


V. DISCUSSION 


We believe that while the estimates for the various 
matrix elements in the above example are crude, they 
arc not unrealistic. This forces upon us the remarkable 
conclusion that superconductivity could and should 
occur in structures such as this even at room tempera- 
tures. There are many other possible structures similar 
to the one shown involving a semiconducting chain for 
the spine and a dye-like molecule for the side chain 
which would also be superconducting. Tt is unlikely that 
our particular choice described above would be the 
easiest to synthesize or have the optimum super- 
conducting properties, but it illustrates the possibility 
in a detailed manner. 

In these molecules we should expect the usual elec- 
trical properties of a metallic superconductor, however, 
in order to observe such effects contact would have to 
be made to the ends of the spine. This could be a difficult 
problem to solve, but may be possible by cross-linking 
the spines so as to form a three-dimensional net of the 
filamentary molecules. Because of the large transition 
temperature, one would expect the critical field for the 
destruction of superconductivity to be very high 
compared to that of conventional superconductors. The 
highly divided filamentary structure of a bulk sample 
of the polymer should mask any appreciable Meissner 
effect. Perhaps the most interesting feature of these 
molecules, however, lies in the phase correlation of the 
electron pairs throughout the molecule. This phase 
correlation should impose certain restraints upon the 
ability of the molecule to react chemically with other 
such molecules. The reason is that in order to form a 
covalent bond, the electrons must interfere construc- 
tively in the region of positive potential. Consequently, 
the relative phase of the electrons forming the bond are 
important. Such an effect has been discussed briefly by 
Ambegaokar and Baratofí" in regard to tunneling 
between conventional superconductors in the Josephson 
effect. The superconducting state is unique in that thís 
long-range phase correlation, “‘off-diagonal-long-range- 
order" (ODLROJ? distinguishes it from the normal or 


insulating states. This we have shown can occur even. 


1 V, Ambegaokar and A. Baratoff, Phys. Rev. Letters 10, 186 
(1963). 
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in our structure which is essentially a one-dimensional 
chain, which is in striking contrast to a classical one- 
dimensional interacting chain such as the Ising chain 
which cannot exhibit long-range order.?? Because of this 
ODLRO, which is a property of the molecule as a whole 
and the above chemical properties which are related to 
it, these molecules will have the property of reacting 
as a single entity which is precisely what London! was 
seeking to understand in regard to biologically im- 
portant macromolecules. 

In regard to the possible biological significance of our 
results, it is appropriate to mention a theorem which 
was established by Wigner on the probability of a 
quantum mechanical system reproducing itself. He 
succeeded. in showing that under two reasonable 
assumptions, this probability is essentially zero. The 
relevant assumption is that his “collision matrix" S, 
which generates the final state from the initial state, is 
assumed to be a random matrix. This assumption may 
be violated in the superconducting state because of the 
singular nature of the pair distribution associated with 
ODLRO. It would be useful to reexamine Wigner's 
theorem to see whether a superconducting quantum 
system would be capable of reproducing. The curious 
chemical selectivity mentioned earlier suggests that 
this may be the case. 
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Note added in proof. Yt is appropriate to clarify a point 
in regard to the band structure of our chosen model. 
The partial charges on the nitrogen sites of the side 
chains produce a periodically varying potential along 


12 J. Ashkin and W. E. Lamb, Jr., Phys. Rev. 64, 159 (1943). 
ac E. P. Wigner, in The Logic of Personal Knowledge (Polanyi 
Festschrift) (Routledge and Kegan Paul, London, 1961), p. 231. 
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the spine with a fundamental period of 4a. This pro- 
duces a gap at k-m/4a but no gap at 7/2a where the 
Fermi surface lies. The reason there is no gap at z/2a is 
that in the tight binding approximation, where one can 
consider an electron as sampling the potential at all 
points within a particular orbital before moving on, the 
effective potential of each orbital may be taken as that 
of the appropriate carbon nucleus. From a calculation 
similar to that of Table II, one can easily show that 
there is then no Fourier component of period 2a and 
therefore no gap at the Fermi surface. 'The only effect 
which tends to produce a semiconductor gap (i.e., one 
at the Fermi surface) is the tendency for the double 
bond to localize at alternate sites. This was considered 
in Sec. III, where we showed that for a fixed semicon- 
ductor gap, the superconducting state should occur if a 
particular inequality could be satisfied. The band gap 
produced by bond alternation, however, is not fixed, 
but depends upon the amplitude of the periodic distor- 
tion; consequently, the actual gap and distortion must 
be determined in a self-consistent manner. In this case 
our earlier arguments do not apply and whether the 
superconducting state or the semiconductor state occurs 
depends upon which has the lower energy. Longuet- 
Higgins and Salem [Proc. Roy. Soc. (London) A251, 172 
(1959) ] have calculated the stabilization energy for a 
polyene with alternation of bond lengths and obtain an 
energy of 0.019 kcal/mole per bond (~0.001 eV/bond) 
for the semiconductor state. We have calculated it for the 
superconducting state [ Eq. (4.8) ] and obtain (~0.025 
eV/bond) so that, in our model, the superconducting 
state should be favored. 

Finally, our calculation has shown that a phase transi- 
tion from the normal to the superconducting state 
should occur even in our one-dimensional system. This 
is unusual [see, for example, L. van Hove, Physica 16, 
137 (1950)] and one may question whether our result 
follows because the BCS theory, upon which it is based, 
is not sufficiently exact. We have some reason for believ- 
ing that our conclusions are valid nevertheless, but this 
point requires further investigation. 
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Cavitation in Liquid Helium* 
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Ultrasonic cavitation has been induced in liquid helium over the temperature range 1.2 to 2.3°K, using 
a pair of identical transducers. The transducers were calibrated using a reciprocity technique and the cavita- 
tion threshold was determined at 90 kc/sec. It was found that this threshold has a sharp peak at the À point 
but is at all temperatures quite low, with an approximate range of 0.001-0.01 atm. The significance of these 


results is discussed. 


INTRODUCTION 


STIMATES of the tensile strengths of liquids based 
on the assumption that it is possible to rupture 
molecular bonds, are considerably higher than measured 
values. l'or example, in the case of water at room tem- 
peratures such predictions of tensile strength vary from 
500 to 10 000 atm while empirical values range from a 
fraction of an atmosphere to 280 atm.!? To explain 
these discrepancies it has been proposed that true 
rupture does not occur in practice but that the applied 
tension is relieved by the sudden growth of relatively 
large cavities from microscopic nuclei. It is possible to 
nucleate cavitation in normal liquids by the following 
principal agencies: (1) minute gas or vapor filled 
bubbles, mechanically or electrolytically created, (2) 
inclusions of gas or vapor in cracks on solid surfaces, 
(3) solid particles unwetted by the liquid, (4) cosmic 
rays and radioactivity. 

The last mentioned agency was first suggested since 
even degassed distilled liquids have relatively low 
thresholds.’ There is now a growing body of phenomena 
in which cosmic rays and radioactivity nucleate 
cavitation.“ 

It has been pointed out by Beams! that the variety 
of possible nuclei is much restricted in liquid helium 
since it cannot contain bubbles filled with anything 
but its own vapor, it is a very efficient wetting agent 
and due to its low viscosity and density cannot readily 
carry solid particles in suspension. If cavitation in 
liquid helium is nucleated, then the most likely agency 
would be cosmic rays. 


* Work supported by the U. S. Office of Naval Research. 
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When the vapor above liquid helium I is pumped 
away, evaporative cooling occurs, and bubbles rise 
through the liquid. On reaching the point the bubbling 
stops and below this temperature the liquid, helium I, 
has excellent heat transfer properties, causing the 
disappearance of bubbles.” This and other manifesta- 
tions of superfluidity make it especially interesting for 
cavitation studies. 

The tensile strength of helium IT was first measured 
by Misener and Herbert" in the temperature range 
1.092-2.147°K using a bellows technique. They con- 
cluded that if helium II has a tensile strength it could 
not exceed 0.3 atm. Beams” measured the tensile 
strength of helium II by a spinning capillary method 
and by a piston and cylinder method at a temperature 
of about 1.9°K. He found the negative pressures neces- 
sary for rupture were 0.14+0.02 atm with the first 
method and about one-half of this value with the 
second method. Since these values are much less than 
theoretical estimates, Beams concluded that there 
must have been nuclei present. During preliminary 
experiments for the design of a liquid-helium bubble 
chamber Fairbank et al.“ applied negative pressures 
of about 100 mm to helium II and obtained photographs 
of bubbles. No tracks were found but there were some 
“double” bubbles, which Fairbank et al. conjectured 
might have some relationship to quantized vortices. 

The uncertainties in these experiments arose from 
difficulties in the measurement of small negative pres- 
sures and in the detection of the onset of cavitation. 
Relatively low negative pressures can be readily applied 
and measured using ultrasonic techniques. Further, 
ultrasonic cavitation gives rise to a characteristic noise 
which is generated by the collapse of cavities during the. 
positive pressure phase. This noise, when amplified, 
affords a sensitive means of detecting the onset of 
cavitation. 


APPARATUS 


The object of the present experiments was, then, to 
measure the threshold of ultrasonic cavitation in 


2K. R. Atkins, Liquid Helium (Cambridge University Press, 
Cambridge, England, 1959). 

13 A. D. Misener and G. R. Herbert, Nature 177, 946 (1956). 

4 W, M. Fairbank e£ al., Problems of Low Temperature Physics 
and Thermodynamics (Pergamon Press, Inc., New York, 1058), 
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Tektronix 
Oscilloscope 
585 
Fiiter 
SKL 302 
MICROPHONE V 
Voltmeter Amplifier 
HP 400D HP 450A 


Fic. 1. Block diagram of driver and microphone circuits. With 
identical PZT-4 transducers as driver and microphone it was 
possible to deduce acoustic pressures directly from measurements 
of driver and microphone voltages using a reciprocity technique. 
The cavitation threshold was determined by the onset of noise 
from the microphone as heard through earphones and as displayed 
on an oscilloscope. 
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liquid helium over a temperature range. To do this 
two transducers were employed, one to act as a driver 
and the other as a microphone. A block diagram of the 
driving and microphone circuits is given in Fig. 1. As 
will be explained later, it was possible to measure 
acoustic pressures simply by monitoring the driver and 
microphone voltages, while the threshold could be 
detected with cavitation noise, heard in earphones and 
seen on an oscilloscope trace. 

'The two transducers were identical disks of Clevite 
PZT-4 ceramic, 4 in. in radius and in thickness, with 
electrodes on the flat faces. The resonant frequencies 
of the transducers were both within 0.2% of 91.15 
kc/sec in liquid helium and their capacitances were also 
the same to within 0.595. Tests made with a third 
transducer showed that their sensitivities as micro- 
phones were also essentially the same. They were 
supported by threads (for vibration isolation) from a 
metal cradle with their opposed faces parallel, there 
being a separation of some 4.5 cm between them, as 
seen in Fig. 2. This figure also shows a carbon resistor 
which when immersed in helium formed one arm of a 
bridge so that by monitoring the off-balance current an 
accurate indication of temperature change was possible. 
The assembly shown, in Fig. 2, was suspended vertically 
in a standard low-temperature double-Dewar system. 
The assembly could be viewed through parallel vertical 
slits in the silvering of the Dewars. Temperatures were 
determined by measuring vapor pressure using Wallace 
and Tiernan gauges. 


PRELIMINARY OBSERVATIONS OF CAVITATION 


When the driving voltage was increased from zero 
there came a point at which intermittent bursts of 
noise could be hear in the earphones. This point cor- 
responded to the cavitation threshold and could also 
be observed as “noise” on an oscilloscope trace of the 
microphone output after it had been filtered to eliminate 
the 91-kc/sec driving frequency. With further increase 
of the driving current the bursts became a steady white 
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RUDNICK 


noise hiss, characteristic of ultrasonic cavitation, this 
noise being easily distinguished from that of boiling 
above the \ point, when the appearance of relatively 
large rising bubbles could be correlated with “burbling” 
sounds. With driving currents an order of magnitude or 
so higher than at threshold the cavitation noise was 
sufficiently loud to be heard outside the cryostat with 
an unaided ear. At these power levels the cavities were 
visible as transient bubbles of about 3-1 mm in diame- 
ter, uniformly distributed throughout the volume be- 
tween the transducers. This was the case whichever 
transducer was driven. It should be noted, however, 
that it was not easy to see these bubbles due to the low 
refractive index of liquid helium, and the use of photog- 
raphy was not practical since the cryostat could not 
readily accommodate optical systems of any size. With 
these reservations in mind it may be stated that no 
“double” bubbles were seen. An examination was made 
with an RCA 6810-A photomultiplier (cathode sensi- 
tivity: 60 pA/lumen, current gain: 12.5X 105) but no 
sonoluminescence could be detected at any temperature 
in the operating range. 


MEASUREMENT OF CAVITATION THRESHOLDS 


It has been shown by Maclean’ that using the laws 
of reciprocity for a four-terminal network it is possible 
to determine the sensitivity of a reversible transducer 
using only one other identical transducer. Following 


NYLON STRINGS 


CARBON RESISTOR 


Fic. 2. Driver and microphone suspension. The PZT-4 trans 
ducers were 1 in. in diam and 3 in. in thickness. A carbon resistor, 
used to determine temperature stability, is seen below the lower 
transducer. 


15 W. R. Maclean, J. Acoust. Soc. Am. 12, (1940). 
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CAVITATION 


the principles laid down by Maclean it may be shown 
that for two identical transducers: 


Mo=((V2/1;) (1/0) }'?, (1) 


where M o— sensitivity of a transducer used as a micro- 
phone in esu of potential/dyne/cm?. V, and I1 are, 
respectively, the microphone open circuit potential 
difference and driving current in esu as indicated in 
Fig. 3. Q is given by 


Q = P2/U,S , 


where P»- acoustic pressure (dynes/cm?) at the micro- 
phone face, U;=particle velocity at the driver face 
(cm/sec) and S — area of a transducer face (cm?). 

In the present case a standing wave is set up in the 
volume between the transducers. However, since the 
driver has a high impedance the particle velocity U, 
will not differ appreciably from the free field particle 
velocity U p. Suppose that due to the nondirectionality 
of the beam the pressure in a progressive wave is 
reduced by a factor r in traveling once between the 
transducers. The pressure at the microphone, P2, will 
then be given by 


P,—-2Ppr/(1—7), 


where Ppr is the free field pressure at the driver. Thus, 
in this case, 


Q= (pc/ S) 2r/ (1—r^) ], 


where p and c are the density and sound velocity of 
liquid helium, respectively. In Fig. 3 Cr represents the 
capacitance of the transducer and C, the capacitance 
of the cable. The capacitances of the cables had essen- 
tially the same value (within 0.5% of 524 uF) showing 
no appreciable change over the operating temperature 
ranges. Under these circumstances V1, =V, and 
I — Vy'Zp, where Zp is the combined impedance of a 
transducer and its cable. A further test was made to 
confirm that Zp did not vary with temperature. Hence, 
substituting in Eq. (1) for Q and taking values of V2’ 
and J,’ appropriate to the onset of cavitation then the 


MICROPHONE 


RIVER 


Fic. 3. Four terminal reciprocity network. Ci: capacitance of a 
transducer. C,: cable capacitance. C, and Co were identical to 
within 4% for both transducers. Ze: combined impedance ofa 
transducer and its cable. Vs: generated microphone voltage. 
72: measured microphone voltage. Jı: actual driving current. 
Ji’: measured driving current. V;' measured driving voltage. 
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Pi(rms dynes / cm?) 


T (*K)—— 


Fic. 4. Cavitation thresholds. The principal feature is a sharp 
peaking to a maximum value at the A point (2.172°K). This rise 
all occurs within 0.01°K below the A point. At other temperatures 
the threshold is comparable to the liquid pressure head, which 
varied smoothly from about 1600 to 1100 dynes/cm? at the driver 
face during the experiment. 


pressure at the microphone, Pm, will be: 
pc 2r YN) 
P.» (- See —) Vika)”, 
S (1—r) Zp 


where V; and V2 are expressed in volts and Zp in ohms. 
The pressure at the driver is then the cavitation thresh- 
old, P:, given by 


A+r) spe 2r 10732 
S (1—7?) Zp 


2r 


The ratio r was measured by applying pulses of about 
100-usec width to the driver, without removing the 
assembly from the helium, displaying the microphone 
voltage on the oscilloscope and measuring the ratio of 
successive reflected pulses. This gave 7, which was 
found to be 0.69 by averaging a number of measure- 
ments, so that 7— 0.83. Then, taking p=0.146 gm/cm’, 
c=2.38X 10! cm/sec, S=5.07 cm?, and Zp=2.75K 10? 
Q, from Eq. (2), P,-3.72X10* (VV). Thus the 
threshold at any temperature could be deduced directly 
from two voltage readings. 

The procedure adopted was to increase the driving 
voltage until cavitation noise could just be heard con- 
tinuously. Then by gradually reducing the driving 
voltage a point was reached with careful tuning where 
a burst of noise could not be heard more than once every 
five seconds. The voltage readings were then made. 
Certain precautions had to be observed to obtain 
consistent readings. To begin with, care was taken to 
ensure that particles of frozen air did not get into the 
helium. Secondly, it was found that the process of * 
temperature stabilization (achieved by balancing the 
heat input from the transducer against the pumping 
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rate) had to be truly effective and the threshold meas- 
urement not made too hurriedly after stabilization. 
The results shown in Fig. 4 were obtained only after 
the temperature had remained stable for some ten to 
fifteen minutes. The liquid level was 7.2 cm above the 
ace of the top transducer during this run and the 
bottom transducer was used as the driver. 

Some calorimetric measurements were carried out 
by timing the temperature rise from 1.53 to 1.73°K 
with the sound pressure maintained just above thresh- 
old level, and then with no sound at all. The time 
intervals in the two cases did not differ by more than 
about 5%, so that the threshold could not be deter- 
mined accurately. However, it was found that the 
total power flux due to self-heating and sound across 
the surface of the transducer was of the order of magni- 
tude of one m\W/cm?. 


DISCUSSION OF RESULTS 


In the first place, it is necessary to establish that the 
phenomenon observed was truly cavitation and not 
boiling at the surface of the transducer. In order to 
boil helium II it is necessary to generate a heat current 
of sufficient magnitude to convert the liquid surround- 
ing the source into helium I after which the boiling 
occurs. A number of workers have found that this heat 
current is of the order of 1 W/cm? !9 and that heat 
currents of the same order of magnitude are required 
for nucleate boiling of helium I.'5 In the present experi- 
ments the heat current at threshold was of the order 
of a mW/cm?. Only when driving the transducer at full 
power could the heat flux have attained 1 W/cm?. This 
is strong evidence that boiling was not present, a con- 
clusion corroborated by a number of facts. For instance 
the white noise obtained was quite distinct from the 
sounds of nucleate boiling and typical of ultrasonic 
cavitation. Again, at high ultrasonic power transient 
bubbles could be seen uniformly distributed in the 


18H. E. Rorschach and F. A. Romberg, Proceedings of the 
Fifth International Conference on Low Temperature Physics and 
Chemistry (Universiy of Wisconsin Press, Madison, Wisconsin, 
1958), p. 35. 
(1959) Haenssler and L. Rinderer, Helv. Phys. Acta 32, 322 
Ew Karagounis, Suppl. Bull. Inst. Int. du Froid, Annexe 2, 
195 (1956). 


BARMATZ, 


CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


AND RUDNI GK 

space between the transducers, even when driving the 
upper one. Whereas if the phenomenon were boiling 
the vapor phase might be expected to occur in the 
locality of the driving transducer or as stable bubbles 
rising from it. 

A significant aspect of the results is the fact that the 
threshold is very low at all temperatures. Apart from 
the À point the situation is similar to the case of gassy 
water where the threshold is determined by the static 
pressure head. However, the methods for estimating 
the tensile strength of a normal liquid may not be 
applicable in the case of helium, so that the values of 
4 atm and over, as cited by Beams" should be treated 
with some reserve. 

The other remarkable feature of the results is the 
sharp peak at the à point. Many of the properties of 
liquid helium show pronounced effects in this vicinity, 
especially the thermal properties. There is for instance 
the well-known behavior of the specific heat which 
rises sharply to very high values at the ^ point. One 
possible tentative explanation of this apparent cor- 
relation between the threshold and thermal properties 
can be given in terms of the Seitz theory of the bubble 
chamber.'® 

As has been mentioned in the Introduction, of the 
agents that are known to nucleate cavitation in normal 
liquids, cosmic rays and radioactivity are the most 
likely to be operative in liquid helium. That the passage 
of high-energy particles in liquid helium can give rise 
to bubbles is evidenced by the successful operation of 
helium bubble chambers. According to Seitz, the par- 
ticle produces highly localized energy deposits or 
“thermal spikes” in the liquid. The thermal spikes 
then cause the growth of microbubbles whose equilib- 
rium size is partly determined by the thermal prop- 
erties of the liquid. Possibly it is these microbubbles 
which nucleate the cavitation event. If the threshold 
is a function of nucleus size, as in normal liquids, then 
a relationship might be expected between the threshold 
and the thermal properties. 


ACKNOWLEDGMENT 


The authors wish to thank K. A. Shapiro for making 
available the cryostat used in the experiments. 


19 F, Seitz, Phys. Fluids 1, 2 (1958). 
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A theoretical model for the behavior of an optica 
is treated classically, and the active medium is made 
clectric dipole moments under the action of the field 
responding macroscopic electric polarization of the 


l maser is presented in which the electromagnetic field 
up of thermally moving atoms which acquire nonlinear 
according to the laws of quantum mechanics. The cor- 
medium acts as a source for an electromagnetic field. 


The self-consistency requirement that a quasistationary field should be sustained by the induced polariza- 


tion leads to equations which determine the amplitudes and frequen 


of the various parameters characterizing the maser. 


cies of multimode oscillation as functions 
Among the results obtained are: threshold conditions, 


single-mode output as a function of cavity tuning, frequency pulling and pushing, mode competition phe- 


nomena including frequency locking, production of 
approximate discussion of maser action using rate 
burning” plays a role. 


1. INTRODUCTION 


HIS paper gives a theoretical description of the 

operation of multimode maser oscillators. The 
type of approach is particularly suitable for gaseous 
optical masers of the type suggested by Schawlow and 
Townes,’ and first realized experimentally by Javan, 
Bennett, and Herriott,? but the equations should also 
find use in the description of some features of solid- 
state optical masers. 


2. BASIS FOR CALCULATION 


We consider a high-Q multimode cavity in which 
there is a given classical electromagnetic field acting on 
a material medium which consists of a collection of 
atoms described by the laws of quantum mechanics. 
No attempt is made to consider noise due to spon- 
taneous emission and thermal, density, or quantum 
fluctuations. The high degree of spectral purity ob- 
served by Javan and co-workers? suggests that these 
should be good approximations. 

The effect of the electromagnetic field on the atoms 
in the cavity is to produce a macroscopic electric 
polarization P(r,/) of the medium. This acts as a source 
lor the electromagnetic field in accordance with Max- 
well's equations. The conditions for self-consistency 
(that the field produced should be equal to the field 
assumed) determine the amplitudes and frequencies of 
the possible oscillations. The calculations will include 
nonlinear effects, so that phenomena of frequency pull- 
ing and pushing, mode competition, frequency locking, 
etc., can be described. 


* This work was supported in part by the U. S. Air Force Office of 
Scientific Research. The main results of the paper were reported 
at the Third International Conference on Quantum Electronics, 
Paris, February, 1963. Lectures on some of the material were 
given at the 1963 Varenna Summer School. 

1A. L. Schawlow and C. H. Townes, Phys. Rev. 112, 1940 
(1958). ; 

!A. Javan, W. R. Bennett and D. R. Herriott, Phys. Rev. 
Letters 6, 106 (1961). 

? T. S. Jaseja, A. Javan, and C. H. Townes, Phys. Rev. Letters 
10, 165 (1963). 


combination tones, and population pulsations. A more 
equations is also given in which the concept of “hole 


The thermal motion of an atom during its natural 
decay time may carry it several wavelengths through 
the standing wave pattern of the electromagnetic field. 
As a result, the atom “sees” Doppler-shifted optical 
frequencies which depend on its trajectory. This im- 
portant circumstance considerably influences the be- 
havior of the Javan-Bennett-Herriott rnaser. When, 
however, thermal motion is neglected the equations of 
the paper can be used in a model calculation for an 
ideal solid-state optical maser. 

We will assume that only two atomic states a and b 
contribute to the maser action. As a related simplifica- 
tion the vector character (polarization) of the electro- 
magnetic field will be ignored. In order to ensure that 
our analysis should apply, it would be desirable to 
have the optical configuration favor one plane of 
polarization, as with windows of the Brewster's angle 
type. The more complicated problem of a general state 
of polarization will be dealt with in another paper. 

A cavity of the Fabry-Perot tvpe used by Javan, 
Bennett, and Herriott has, of course, a continuum of 
modes because it is not enclosed by reflecting walls. 
However, it follows from work of Fox and Li* that 
there are discrete sets of quasimodes for which the dif- 
fractive leakage from the tube is small. The cavity 
modes of highest Q are the even symmetric ones whose 
circular frequencies are given by 


Q,=ane/L, (1) 


where c is the velocity of light, L is the distance be- 
tween the reflecting plates (L~ 100 cm) and wis a large 
integer, typically of order 2X10*. Fox and Li have 
shown that the modes of next highest Q are those 
possessing odd radial symmetry, which, for typical 
geometry differ by about 1 Mc/sec from the former 
modes. Our discussion will be specifically, but not 
inevitably, aimed at the modes of highest Q. 


4A. G. Fox and T. Li, Bell System Tech. J. 40, 61 (1961). 
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3. ELECTROMAGNETIC FIELD EQUATIONS 


We write Maxwell's equations in mks units as 


divD=0 curlE=—déB/ot (2) 

divB=0 cudlH- J4-0D/9!, 

where ) 
D=eaE+D, B-jH, J-cE. (2a) 


To an approximation whose validity will be dis- 
cussed in another paper, the array of excited atoms may 
be regarded as a medium with an electrical state de- 
scribed by a macroscopic polarization P(rj/) (electric 
dipole moment density). In order to avoid a complicated 
boundary value problem, it is convenient to assume the 
presence of a lossy medium with an Ohmic conductivity 
c adjusted to give the desired damping of a normal 
mode. The electric field then obeys a wave equation 


curl curlE+ p00 E/0t2-uoe:9?E/02 — — u0? P/3. (3) 


In the subsequent calculations the main effect of the 
space dependence of E(x,y,2,)) comes from the motion 
of the excited atoms through the field which leads to 
amplitude modulation of the fields seen by the atoms. 
'The analysis of Fox and Li for the even symmetric 
modes indicates that the electric field does not vary 
rapidly across the tube diameter. Accordingly we take 
only the axial variation of E into account. Then 
curl curlE is replaced by —0?E/8z?, where z is the axial 
coordinate, and E is the transverse electric field. For 


the zth normal mode (unnormalized), we have 
eigenfunctions 
U,(2)=sink,z, (4) 
with wave number 
K,=nn/L, (5) 


where 7: is a large integer. 

In the presence of a given polarization P(z,!), quasi- 
stationary forced oscillations of the electric field can be 
expanded in normal mode eigenfunctions 


BEND, A TOUR (2) , (6) 


where the amplitudes A n (/) obey a differential equation 
of a forced, damped simple harmonic oscillator 


A n g dA n 1 EPan (1) 
3r (5) T OA hm (2) ? 
df? €o/ dl e/ df 


in which P,(f) is the space Fourier component of 
P (zt) 


2 L 
Pa (= Í dzP (zl) sinK „Z. (8) 
LJ 


Since P,(/) will be very nearly monochromatic at an 
optical frequency? (o.f.) v, we replace its second time 


5 We adopt the convention that all symbols for frequencies 
should denote circular frequencies. A numerical value, e.g., 150 
^ Mc/sec, however, denotes an ordinary frequency. À decay con- 
stant like ya which denotes a reciprocal life time 1/74 often plays 
the role of a circular frequency. Numerical values of ya will be 


x given as ordinary frequencies. 
Ş r 


^ 
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derivative by —»?Pn on the right side of Eq. (7). Aq. 
justing the fictional conductivity o to give the desired 
Qn of the nth mode, we write 


o= ew/Qn. (9) 
Then 4, (f) obeys 


dA, /v\dA, y 
+( ) +0,7°A n (- Tn 
dP Q,/ di €0 


In the typical gaseous optical maser, the separation 
of the principal modes A~150 Mc/sec is much larger 
than the cavity mode band width »/Q~1 Mc/sec, 
Hence we may hope to neglect time Fourier components 
of A, (I) and P, (t) which are at frequencies far from the 
cavity resonance frequency Qa, and write? 


(10) 


Anli) 9 En (t) cosGat+¢n()), (11) 
and 
J£ (t) —C, (0) cos(v,.t3- Qn (0) 
--S,() siny t+ pal), (12) 


where the amplitudes E,(f) and phases o; (t), as well 
as the in-phase and quadrature coefficients C,(/) and 
S,,(t) are slowly varying functions of ¢ which, together 
with the frequencies v,, are still to be determined. The 
expressions (11) and (12) are put into Eq. (10) with 
only the first time derivatives of E,(f) and @n(t) re- 
tained. Equating the coefficients of cos(vnt+¢n) and 
sin(y,t-+ pn) separately to zero, and further neglecting 
small terms involving v,Z,/Q,, é,E, and vn GnEn/Qn, 
and recognizing that vn+ ¢, is very close to n, we find 
the self-consistency equations 


(nt @n—Qn) En= —4(p/€0)Cr (13) 
and 
E,A4(v/Qn)En= —4(v/€0)Sn; (14) 


which serve to determine the amplitudes, frequencies 
and phases of the o.f. radiation once the polarization 
state of the medium is known in terms of the Z,(t). 


4. POLARIZATION OF THE MEDIUM 


The maser action arises from the establishment of a 
negative temperature distribution for the two excite 
states a and 5 of the atoms constituting the medium as 
shown in Fig. 1. The ground state, far below a and b, 
is not shown. Consider what happens to an atom which 
at time fy is excited by some process (electron bom 
bardment, collision of the second kind, absorption 0 
resonance radiation, decay from some higher excite 


6The representation of an arbitrary function An(?) in the 
form (11) in terms of a variable amplitude E,(#) and phase gall) 
is not unique. Despite this, because of the use of the i UR 
wave approximation it seems possible through Eqs. (13) and (1 
to determine both amplitude and phase. [The positive frequency 
part of (11) is a complex function very closely equal to Ant n 
=E,(t) exp—i(vnt+ es(1)) which does have a unique amplitu 
papka] 
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state, etc.) into the upper maser state a. Let the atom 
be at position ro at to, and have velocity v. For the 
present, we neglect collisions, so that at time t> te the 
atom will be at r=ry+yv(t—J,). If there is an o.f. 
electric field E(r,t) in the cavity, the atom sees a time- 
dependent field E(rs4- vt— vis, f) for 12» fs. Associated 
with this field is a time-dependent perturbation energy 
whose matrix element is 3 


AV (t) — gE(rs- v— vis, t), (15) 


where @ (assumed real) is the matrix element for the 
electric dipole moment of the atom between states a 
and 5. The perturbation causes the atomic wave func- 
tion to become a time-dependent linear combination 
a (Dyad- b (Dy. The quantum-mechanical average value 
of the electric dipole operator for the atom is 
(a*b--ab*) 9. 

To follow the time-dependent wave function (in the 
subspace of V, and y), we start from the equations of 
time-dependent perturbation theory 


iá— W.ad-V (t)b—żiyaa, 


pte 5 (16) 
ib=W b+ V ()a—żiyib, 


in which the radiative decay of states a and 6 is de- 
scribed by phenomenological terms containing the de- 
cay constants y, and ys for the two states. Here AW a 
and ZW, are the unperturbed energies of states a and 
b, and the matrix element of the perturbation V (£) is 
given by Eq. (15). 

If the motion of the atom were neglected, and if the 
maser were working in a single cavity mode, V(/) 
would be monochromatic, and the rotating wave ap- 
proximation would allow the Eqs. (16) to be integrated 
exactly. Even so, there are great algebraic simplifica- 
tions to be gained by going over to a density matrix 
description? of an ensemble of atoms consisting of all 
those of a given category which are produced during 
all times fo<¢. A theory of maser action in this case 
has already been given? which is valid when the signals 
are strong enough to fully saturate the transition a €» 0. 
For multimode operation, such an exact solution can no 
longer be obtained. However, the simpler theory can 
help with the interpretation of our rather complicated 
equations, and it will be discussed in Secs. 16-20. 

When atomic motion through the electromagnetic 
field is taken into account an atom does not see a 
monochromatic perturbation even in single-mode opera- 
tion. The equations can only be solved in a perturbation 
expansion of the solution in powers of the E,(t). It is 
still advantageous to use the density matrix method, 
considering first only those atoms characterized by 


? W. E. Lamb, Jr. and T. M. Sanders, Jr, Phys. Rev. 119, 

1901 (1960), D pp. 1902-1903; L. R. Wilcox and W. E. 
Lamb, Jr., ibid. 119, 1915 (1960), especially p. 1928. 
. W. E. Lamb, Jr., Quantum M. echanical A amplifiers, in True 
in Theoretical Physics, edited by W. E. Brittin and B. W. Downs 
(Interscience Publishers, Inc., New York, 1960), Vol. II, espe- 
cially pp. 472-476. 
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Frc. 1. Two excited energy levels a and b between which the 
maser action takes place. The levels have a resonance transition 
frequency w>0, and are given phenomenological decay constants 
va and y». The excitation of the states is described by the func- 
tions ha(rs,to,2) which are introduced in Eq. (22). 


a, To, lo, V. The density matrix 


jal? abt 
Peut ( em 
ab lol? 


(17) 


obeys an equation of motion 
B —i(36,p]— 3 (Fp-FpT), (18) 


where T' is the diagonal matrix 


Ya. 0 
r=( ) (19) 
c qs 


and the Hamiltonian matrix 3C is 


We V(t) 
x«( ) (20) 
vi) W, 


with V(¢), as given by Eq. (15), having a complicated 
time dependence because of atomic motion. A solution 
of Eq. (18) which satisfies the initial conditions 


t 0 
pla, Tolo, V, la) = ( ) (21) 
0 0 


is required. The average electric dipole moment corre- 
sponding to this density matrix p is g (past pra). 

To obtain the macroscopic polarization P(r,t) we 
have to combine the contributions of all atoms which 
arrive at rat time /, no matter when or where they were 
excited to state a, and also a similar contribution from 
atoms excited initially to state b. Let Aa(ro,to,¥) be the 
number of atoms excited to state a=a, b per unit time 
per unit volume. We have 


t 
P(r,i)= foj Ss EP 
a=a,b} _.. 
X [pos (a, Tolo, Vt) + pna (a, roto, V,t) ] 
X&(r—r.—v(1—1)). (92) 


In practice, As (rofo,v) will be a slowly varying function 
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= of re so that it can be replaced by Na(t,lo,v). After 

integration over ro 

EU PT i à (23) 

‘ig a=a,b/ _,, 

Eu 

E : Xr Y) Loos (o, r— v (I— lo), fo, V, 1)4- conj. ]. 
NS 

E. Similarly, we will have use for a density matrix de- 


-  — scribing an ensemble of atoms which arrive at r with 
velocity v at time / regardless of their place ro, time fo 
or state a=a, b of excitation. This will be denoted by 


E- plr y, D= L | dta arose 


y a=a,b, 


X p(a,ro,l0,V,/)5(r—to— viH- vio). (24) 


The density matrix resulting from (24) by integration 
. over all velocities will be denoted by p(r,t). 

5. INTEGRATION OF THE EQUATIONS OF MOTION 
= The matrix equation of motion for the density matrix 
Tic p(esroll Yt) has components 


=A Bas —iwpar—YarparttV (t) (0a — prs) ; 


Daa = —Yapaa HiV (t) (pab— pba) D (25) 

Poa = — Yopa — İV (t) (oa5— poc) 5 

Pba= Pat", (26) 
Yar=3 (Yat o) (26a) 
9-W.,—Wy-0. (27) 


e consider first the case of excitation to the upper 
er state a. At /— o, paa — 1 and peb = pab = poa = 0. The 
tion to any desired order in the perturbation V (/) 
can be obtained by iteration. There are contributions to 
— pab = pba" in first and third order, to p» in second order, 
n Paa in zeroth and second order. Thus, in zeroth 


paa (a, Toto, Vl) —EXp— Ya (L— 19) (28) 


di V (P) expl (Yat iw) (t — i+ yalto—t)], (29) 


e third-order contribution is 


E 


Ü 
=f di'V(r) 
EGO 


LAMB, 


) triangular area in the fo, / plane. 
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where 


paa D? (a, ro lo, Vt) i 


t^ pan q 
=~ =| ar | di" V (t")V (i^) i 
* to to i. 


X (expira (l + atio) (7 — (7) | 


4-y«(fo— 0”) ]- conj.) > (31) ; 
and similarly » 
por (a, ro,lo, V.) 
v [i 
= + ar | di" V (")V (t^) 
to to 5 
X (texpEys (^ — !") + (Yatio) (t — 0) 
+Yallo— t”) ]H-conj.). (32) 


6. FIRST-ORDER THEORY 


In order to convert the expression (29) for pas” into 
a macroscopic polarization, we must first calculate 


pay (a, r,v,l) 
t 
-f dlopas (a, Yo=r— vi+ Vio, lo, MS t) (33) 


as in Eq. (24). 

The perturbation V(/) acting at time /’ on the 
atom specified by ro, fo, v is — (Q7) E(rod- y(t — to), t), 
but for Eq. (33) we require this for an atom char- 
acterized by ro=r— ví4-vío, V, fo, for which the ef- 
fective perturbation 


Vt‘) = — (g/A)E— v(L- n), t) 


does not depend on fo. We may then perform the above 
integration over lọ if we treat ħa as a slowly varying 
function of fo and evaluate it at /. For the first-order 
terms, we have to deal with an expression of the form 


t t 
] a. di F (L,)ev« (tot) , 
CAPE ME 


By an interchange? of the order of. integrations this 
becomes 


| 
| 


(34) 


t [14 t 
Jl di! | dil (Li! )ev« (7) = (1/^y,) Jl davrli). (35) 


; Let us assume now that the maser oscillator is running | 
simultaneously in M cavity modes, so that ; 


E()- Y. E,() i oae erc 


3 In each double integral an integration is carried over | 


9a 


ne 


E 


v 


a 


| 
| 
l 
| 
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where £,(/) and e,(f) are slowly varying functions of 
time. We make a rotating wave approximation by keep- 
ing only exponential factors like expi(w—v,)f/ and neg- 
lecting rapidly varying exponentials like expi(w-+v,)/’. 
Then pay (a,r, vd) A 


YN Ass) w rt 
(S i» di Elt) 
h Ya ul. 


— 


XU (Gr v(t’) exp—i(v,tt e,(!")) 
Xexp[— Ya i(v,—e)]u—1). (37) 
We also assume that the amplitudes E,(/) and phases 
g(t’) do not vary much in a time 1/*,;, so that they 


can be evaluated at time 4. With a change of variable 
of integration from /' to 7 —1—/', we find 


pan D (ar, vL) = 


353 


9NFAs(rv,l) x 
— (2, ] 25 E(t) exp—iG,l+ e.) 
P 


Ya 


3 dr'U ,(r— vr’) exp—(yaaritw—v,))r . (38) 
0 


The corresponding contribution to the polarization 
of the medium is 


PO (a,r,v,0) 


oe zilo SA Nalt Y) Ta] 
xE E, exp ites e dz'U ,(x— v7!) 
# 0 
xexp- Gaio) Heconi (39) 


Let us first assume that the excitation rate density 
has the form! 


dalr, v, D - W(v)As(r!) a=a,b, (40) 


where W(v) is the normalized velocity distribution 
function and A«(r,£) is the number of atoms excited to 
state œ per unit volume and time. Because we are 
assuming a spatial dependence of the electric field only 
on z, we may change over from a three- to a one- 
dimensional description. Then the velocity distribution 
W (v) refers to the z component v of v, and r is replaced 
by z. 

It will be noted that the quantity P(a,3,2,t) is pro- 
portional to A,(z/)/y,. When we now consider the 
contribution of atoms excited to the lower maser state 
b there is a complication which we did not meet in the 
case of a excitation. Spontaneous decay of atoms in the 
upper maser level a may be one of the excitation 

Tt would be easy to modify the 
excited to state 6 to have a different ve 
those excited to state a. 


theory to allow the atoms 
locity distribution from 
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mechanisms for state b. This could be plausibly repre- 
sented by replacing the excitation rate density A,(2,0/) 
by Ao(2,0,)+ frepaa(z,0,l), where f is the branching 
ratio (decay from a to 6)/(total decay from a) and X; 
now describes only “external” excitation processes. 

In order to reduce somewhat the complexity of the 
subsequent equations we will now proceed as if f were 
zero. The effects of cascade excitation a — b will be 
discussed by an approximate method in Sec. 20. With 
this simplification it turns out, as one would expect, 
that for b excitation P(b,z,5/) is exactly like (39) 
except for an over-all sign change and interchange of a 
and b. Hence, the total polarization P (2,0,4) = P (a,2,2,0) 
+ P(6,z,2,t) is proportional to a quantity 


N (2,1) m KES (z,l) /^fa) xm (Ay (2,0/*0)] , (41) 


which we will call the “excitation density." This is 
simply the excess density of active atoms in a steady 
state in the absence of optical oscillations. 

The first-order polarization 


PO (zl) e| deW (v)[pas "^ (a,2,7,1) 

ds + pas) (D, t)--conj.] (42) 
is also proportional to .V(z,t). For use in Eqs. (13), (14) 
a spatial Fourier projection on the nth cavity mode is 
next to be made 


p, o) Q/L) f 


J0 


L 


dzPU (s D) U (zy. (43) 


The product U,(z)U,(z—vr') which occurs in (43) 
may be written as 


sink ,2 sinK,(z—vr') 
=} cos{(K,—K,)s+K,or'} 
—hcos{(K,+K,)s—K,or'}. (44) 


The last term will not contribute appreciably to the 
z integration (43) because the excitation density JN (2,t) 
changes little in an o.f. wavelength. Since the velocity 
distribution is normally an even function of v only 
that part of the remainder of (44) which is even in v 
will contribute to the polarization, i.e., 


A[cos(K , — K,)z ] cosKzz', 


where the subscript » has been dropped in the last 
factor since all of the modes considered have very 
nearly the same wave number K=», d 

We find 


P.O (0)2 —Mi(g/h) E E, exp—i(v,I-- ex) Vus 


e 


xj deW Dilor tH Kr) Jr coni, (45) 


where 
Dle) = 1/ (Yat de) 


> 
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is a convenient abbreviation for a frequently occurring 
denominator and where 


iL pe TZ 
Nat) - | dzN (s,t) zi (n— | (47) 
Lja L 


is a spatial Fourier component of the excitation density 
N (,)). It should be noted that (45) has a very simple 
interpretation in terms of Doppler shifts of the atomic 
transition frequencies by Kv due to the atomic motion. 
This simplicity will be lost when nonlinear effects are 
considered. 

For the following detailed calculations a Max- 
wellian distribution 


W (v) = (um?) exp— (2/42) 


will be assumed. The speed parameter % is related to an 
effective temperature T by the equation 


(48) 


(49) 


where m is the atomic mass and ks is the Boltzmann 
constant. If it should develop that a Maxwellian dis- 
tribution is not realized in practice, some obvious 
changes in the later work can be made. 

With Eq. (48) the integration over ? may profitably 
be done on (39) before that over 7’, and we find 


PaP (N= —3(gt/hKu) 


M 


xt» E, (0) exp— i(v,t4- ex(0)) 
«Nu (OZ (v,.—w) + conj. ], 


tmu?r=keT , 


(50) 
where Z (v—w) is an abbreviation for 


Z(v—o, Yas, Ku) 
=iKu | dr exp[;(v—e)r—yavr—X3K^w)], (51) 
0 


which is a complex function well known in the theory 
of Doppler broadening." The function Z is, in fact, a 
function of a single complex variable ¢ 


it 
Z(¢)=2i i diexp—(P+2), 


(52) 
where 
s f=f+m, (53) 
with 
t= (y—o)/ Ku , (54) 
and 
n=Ya/ Ku. (55) 


It is fortunate that extensive tables! of the real 


: adig d o Springer-Verlag, Berlin, 1933), pp. 

; 1? B. D. Fried and S. D. Conte, The Plasma Dispersion Functi 

= (Hilbert Transform of the Gaussian) (Academic bee Inc., New 
= York, 1961). 


t 


a 
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part Z, and the imaginary part Zi of Z(E+in) are now 
available. 

P,O (I) is a linear function of the complex electric 
fields E,(/) exp—iv,t—ig,(t) of the cavity modes and 
apart from amplitude modulation arising from a possible 
slow time variation of the excitation density N (z,t) con. 
tains the same frequencies as the cavity field. 

'To determine amplitude and frequency (or phase) of 
the oscillations, we write out the contributions of P,» 
to C, and S, of Eqs. (13), (14). These are 


S, 2 — (gh/hKu)NZi(vn—o)E,, (56) 
C, O0 — —(9°/hKu)NZ,(vn—w) En, (57) 
where 
7 
Ñ=N(ù= anf dzN (zl) , (58) 
0 


which will be called the “excitation,” is the average of 
the excitation density over the cavity. We have now 
reverted to the notation of Eq. (51) for the Z function, 
but to shorten equations have dropped the parameters 
Yab and Ku which appear as arguments in (51). 

In this approximation, without nonlinear terms, we 
can only hope to obtain the condition for starting of 
oscillations and their frequency at threshold. Further- 
more, if the conditions are such that several modes can 
oscillate, they do so independently of each other and 
hence can be considered separately. The amplitude 
equation (14) gives 


E,— —3(/Q.) E, —À (y/ e), (59) 
or for a steady state, for which E,—0, 
(9?/ eotKu)NZ;(vn—w) = 1/Qn- (60) 


To first order in 7=~7Yas/Ku, we have 
t 
Ze - 2| -2 | eatin? ett (61) 
0 


Hence for pure Doppler line shape the condition (60) 
for the onset of oscillations in the th mode may be 
written as 


2m? e?/ (Amr eft) ] 
X (/eYAN exp— (yn—w)?/(Ku)2=1/Qn, (82 


where \=27/K is the wavelength. In these units, 
e?/ (Amevhc) e 1/137 


is the fine structure constant, so the left-hand side of 
(62) is the product of this and four other dimensionless 
factors: c/u, 21!?, exp— (v, —«)?/ (Ku)? and ($/ e) 

which is the net number of active atoms in a cylinder 
of cross sectional area (@/e)? and length A. As the fre- 
quency detuning v,—« increases, the excitation N re- 
quired to initiate oscillations increases in proportion to 
expt (r,—)?/ (Ku)*. The frequency of oscillation 1 


(63) 
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determined by Eq. (13) in which we may set $,—0 
without loss of generality. Using Eq. (57), we find 


ya — Q4 (v/2) (9? N / edu Ku)Z. (v, c). (64) 


It is convenient to use the threshold condition (60) to 
express N in terms of Q,. We find 


vn— n=} (/Qn) CZ, rs —9)/ Zio, —)] (65) 
or in the approximation y4/ Ku«&1 


1 p N pk 
ya—Q,— -( JEJ dxe”. (66) 
TIANO Jo 


c 


If the integral (66) is expanded to first order in £, 
= (v, —o)/ Ku we obtain 


(v, —95)/ (o—v«)z (1/7!) (j/QKu)—- S, — (67) 


which implies “linear pulling,” i.e., the detuning of v, 


from the cavity frequency 9, is proportional to the 
amount w—v, by which the oscillator frequency v; is 
removed from the atomic resonance frequency w. The 
right-hand side of (67) (S— "stabilization factor") is 
about 1/800 for typical values of the parameters used 
in Sec. 2. 

The more accurate expression (65) indicates a “non- 
linear pulling” such that the oscillator frequency is 
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nearer to the atomic frequency than it would be for 
linear pulling. To the next order in (Q,—w)/Ku, but 
still for ta Ku, we find 


(v«—€4)/ (w= v.) [1--3(0,—)/ (Ku)*]S. (68) 
7. THIRD-ORDER TERMS 


We now carry out a similar calculation for the third- 
order quantity P, (f) using Eqs. (8), (23), (30), (31) 
and (32). The integration of a(z, to) pas ® (a,2,t0,2,t) 
over times fy of excitation involves integrals like 


t t t pe 
| dilo j dt’ | dt" f di" F (tt t" 1") Exp Yalt” — lo), 
-n to ta J ly 


which by a repeated interchange of orders of integra- 
tion can be reduced to 


t ru fee 
(1/2) l dt’ | dt" ] di" Ff t^). 


Again, we keep only exponential factors in the time 
integration which are able to have resonance, and find 
after changes of variables 


j= t=’, af = fa", We es s ut (69) 


and some algebraic manipulations 


pas 9 (zo D =i PIN (2) 35 05 22 ELE, [exp— i(»,H- gy) tilt op) —i(vel-+ e») ] 


Mop c 
* E E: 
: : ; orl — gr) 
X | a| ar | dr” U,(2—tr')U,(2—tr' — vr") Us (a— tr tr tr’) 
40 0 0 


X {exp— [yas — ivy tivo ivot iw) T+ uer iy,— ive) AR (Ya tiw iv;)T )) 


-FEexp— iGout- Pp) d i(y fr Gp) +t (vol €)] 


x | ar | a” | dr” U,(z—vr) U,(z—vr' —v7") Us (z—tr' — or" — vr") 
0 0 0 


X [exp— Eao ivp t ivet iw)r + (Ya— iysd- ivo) + (Yay — iet iva) xk] Í 


+same with a and b interchanged. (70) 


In calculation of the Fourier projection Pa ® (f) integrals of the form 


2 £ n H" FEES Te eth 
O dN (1) UQ Un (s — tr) Us (aor — vr) U tr ort ori) 
0 


appear. The product of the four sine functions can be reduced to 


l[cos(K ,— K,-- K,— Kn) cosKv(z" — 7) H-cos(K ,— K.— 


137 alU " (zcavity band width/atom b is 
The term “stabilization factor" ( lue was large compared to unity. 


maser (Ref. 8, p. 460) where its numerical va 


K,4- K 4): cosKolr” +7") 


-pcos(K,-- K — K,— Kn) cosKs(s 2s" sy]. 


andwidth) was previously used in a theory of the ammonia beam. 


El 
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apart from rapidly oscillating terms and those odd in v. The integration over v may now be carried out, and we find 


P4 (0) m dip! > UU D [exp— i (rut- ey) Hit e) — i(vel-+ e) | 
hop c 


5 | 
E 30 E w% 1 | 
Y AV T " hU Hu I» 
3 xj ar | in| a UN peed pon) EXD 2 u)? (z^ —7')H- N (p—o- nen) exp—3 (Ku) (r +r) w 
3 0 EA) 0 | 
E JN Greca-n exp—4(Ku)*(7 4-27" - 7)? JLexp— (yay — ivt iv, — ive ie) 
E — (ya iv, — ive) T” — (ralio — ivo) r” ]H-Lexp— i(vul-- ey) — rst Gp) 31st e] hd 
E. E3 w% Lj i 
lc x i ao | an | THEN e sco exp —} (Ku) — EN caue exp 1G GA +r) 
X 0 Jo 0 
+N (ppe-uen) exp— 1 (Ku)*(r 4-27 +r”) ]Lexp— (va —iv.— ivptpive tin)’ — (ya — ive tive) T" | 
x ^w 
— (yay— io -iv;) 7] | - same with y, and Yo interchanged4-complex conjugate. (71) 
E One of the most important characteristics of the third-order polarization is that it has constituents which ; 
- oscillate at all possible frequencies wc v;d-v». The combination of signs appearing here is correlated with use of — 
a the rotating wave approximation. Terms with frequencies such as v,--rv,d-v. (near the third harmonic) are — 
—  — thereby neglected. 
- The formidable expression (71) can be simplified in either of two limiting cases: (a) no atomic motion (u- 0), ! 
i or (b) “Doppler limit,” i.e., Ku much larger than yap and various frequency differences such as v,-4-v,— 2»,, etc. | 
Jn the absence of atomic motion, one finds after some rewriting 


[0 PSO (1) — Ail? E Y; E EE E[esp ilot eu) Hitto) —i(vet+ ve) J 
: he qX 0 


X [BV Gehan) N nose Greca] ` $(»,— vn vedo) 
X[29.(v,—»)--:9(v,— v.) JED(e— v2) + (v, —e) ]H-conjugate, (72) 


where D(w) was defined by Eq. (46) and 


| 
] 
| 


Da(w)=1/(yetio), aa, b. (73) 


A The “Doppler limit” is appropriate for many possible gaseous optical masers, and for most of the remainder 
of this paper we will be dealing only with this case. Then 


exp— 4 (Ku) (r — 7’)? 


ug 


like a delta function of 7" — 7' and the integration over 7’” can be done in the form 


L3 L3 2 1/2 po 
Í dr' | BUG 2) exp - X (Ka)? (v rji — | di'G(r^jr). (14) 
0 0 Ku Jo 


er Gaussian factors do not have their full peaks in the range of integration, and give contributions which we 
glect because they lead to expressions with higher powers of Kw in the denominator. Then, after performing the 
e integrations over 7’ and 7”, we find 


grin? eu (Ku) ] DO DRE EN ODA D(w— a Vo) [Da (v, — ve) + $y(v,— va) ] 


X[exp— i(v,— Dol yo)t—i(eu— Pot p) |+ Sj(—Àv.—3v,-- Ve) Dal(ve— va + Di (v,— v,)] 
XL[expL— £(vd-v,— v))t—i(e,4- e,— ec) ]]) J- complex conjugate. (75) — | 
ging p and c in the second group of terms, this may be written more compactly as 


(Ku) | 2» 2» p» E,E,E,[exp[— i(v,—v,4- v;)t—i(e,— ej e.) || 


ae 


cA -by,— va) dV (epum) D(— 2a iret va) Dar, Vq)-+ Sy (v,— ve) ] : 


e SEEN ; complex conjugate. 
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8. SINGLE FREQUENCY OPERATION 


For this case, the triple summation over i, p, and « 
reduces to a single term 


P, (0) Teir N ot/ (foy y Ku) | 
Kal Dlw— rn) + D0) JE, exp—i(val-+ gn) 
complex conjugate. (77) 
For use in Eqs. (13), (14) we need the in-phase and 
quadrature components of this 
Ci (Y= e NOY Gv yy KC) ] 
XYaslw— vn) L (o — v, En? (78) 
and 
5, (0) - ge NCQ" (yy Ku)] 
XL ya^ (ova) JE? D (79) 
where the Lorentzian function is denoted by 
£(o—») — [Yat (o—»)?]7 . (80) 


The amplitude determining Eq. (13) is then of the 
form"! 


En=Onlin—BrEn’, (81) 
where 


an= — 3 (0/0) +N g? (hK) IZ: ono), (82) 


Ba = (v/A6)z*N[ 91/ (ech*yaveKu) | 
X[t--ya^£(v.—o)]. (83) 
It is useful to employ the starting condition (60) 


to express the coefficients o4, Bn (and others which 
appear later) in terms of a ratio 


3. N/No (84) 


called the “relative excitation" where Ñr is the excita- 
tion required for threshold oscillations when the cavity 
frequency Q, is tuned to the peak w of the atomic 
resonance curve. We find 


os 3/0.)(LZi9»—9)/2.(0)]t—1) — (85) 
and 
Ba= gr Q/ QEN e*/ G^2:(0))] 
X[1--va? (v, — 0) J. (86) 
A stable steady state occurs for an intensity of 
oscillations 
E,2=an/Bn J (87) 
which is easily related to the relative excitation with 


the help of Eqs. (85) and (86). siae 
The frequency determining equation (again ¢, may 


a, is simply related to the 
of the medium at frequency 
in coefficient a str 
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be taken zero) is Bate 
Yn=Qn—4(v/ (esE,) [Cn 9 4- C, ] 


Since the frequency of oscillation v, will diff 
from the cavity resonance frequency 2,, the rig 
of Eq. (88) may, to a sufficiently good approxin 
be evaluated for y, €,. We may then write 
V, Vn Font palin; 

where 

m, iy[ oN | Cet Ku) ]Z.(Q,, —o) 
and 
Dn— E tem oL PN / letar Ku) ] - 7 

X7as(Qn—w)L(Q,—w) (91 

or expressing these coefficients in terms of the rel: 
excitation (84) 3 


n=4(0/0n)MZ(Q,—w)/Z,(0) 


and 


p. Hir O/O ER Q3 UP ys (0))] 
X Yaon —9)£ (Q,—w) 


so that 


PrEn = pnotn/ Bn 


(and hence on the power level) as well as on th 

ing, i.e., there is frequency “pushing” as 

“pulling.” À 
Equations (89), (92), and (94) indicate thai 


(Q,, —)*-2y 4? < — Yat (05—9)Z:(Q.—)/ 
Zi 


an increase of excitation should move the 
the direction from w toward Qa. For small 
right side of (95) is approximately 4 


detuning is given by Eq. (87). 
Zi(£.m) for N=0, this equat 
proximately as 


(QE. (I yayo) 
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Fic. 2. Relative intensity of oscillation as a. function of de- 
tuning. The solid curve, drawn for parameters N =2Nr and Ku 
=4ya represents Eq. (96). The dotted curve indicates the Doppler 
gain profile of the numerator of (96). 


parameter"!5 
Tn=3( QE,)?/ (yas) 


much less than unity. 

The numerator in Eq. (96) has a peak for resonant 
tuning 9, —c, but the denominator, which comes from 
the nonlinear term involving the coefficient 6, also has 
a peak when Q,=w. Under certain conditions, the 
over-all curve of E,?=an/8, versus detuning Q,—«c 
should have a flattened peak at resonance, or even a 
dip between two maxima. The condition for the appear- 
ance of two maxima is 


(va/ Ku)? « 3(1—exp— [(9,* —)/ (Ku) ]) , 


where Q,*—c is the detuning required to stop oscilla- 
tions at the given level of excitation. The double peak" 
(see Fig. 2) should thus be seen somewhat above 
threshold, i.e., for relative excitation 


9U» 1/[1—2(va/ (Ku))]. 


(98) 


(99) 


(100) 


Under this degree of relative excitation, the electric 
field at central tuning is given by 


Tn=3(@En)*/ Pyare) e 4G (Ku))«1.— (101) 


so that the neglect of higher orders of perturbation 
theory should not be too serious, provided, as we 
assume, that yak& Ku. 


3 16 The significance of this quantity is shown more clearly in 
* Sec. 18. 
16 This dip has recently been observed. R. A. McFarlane, W. 
R. Bennett, and W. E. Lamb, Appl. Phys. Letters 2, 189 (1963); 
A. Szöke and A. Javan, Phys. Rev. Letters 10, 521 (1963). 


r 


LAMB, 


IR 


Taste I. This shows the twenty-seven possible values of the 
summation indices u, p, o which appear in Eq. (76) for "three". 
mode oscillation. The fourth column gives the corresponding fre- 
quencies v,— rpt ro: The last column contains numerical values 
for the amount by which these frequencies exceed vi. We have 
taken ve—v1=150 Mc/sec and y3—r3—151 Mc/sec in order to 
simulate (but greatly to exaggerate) nonlinear pulling effects, 


Typical à» 


m p c Vy —Vp-l- Vo (Mc/sec) 
1 1 1 yi 0 
1 1 2 Va 150 
1 1 3 v3 301 
1 2 1 2vi— rs —150 
1 2 2 vi 0 
1 2 3 vit va— vo 151 
1 3 1 2vi— v3 — 301 
1 3 2 vi-Hvz— r3 —151 
1 3 3 yi 0 
2 1 1 V2 150 
2 1 2 2vs—»i 300 
2 1 3 Va-l-va— vi 45] 
2 2 1 Lat 0 
2 2 2 ya 150 
2 2 3 Va 301 
D 3 1 yi va— vs —151 
2 3 2 2vs— V3 = 1 
2 3 3 v2 150 
3 1 1 V3 301 
& 1 2 va-d-va3— Vi 451 
3 1 S 2v3— vı 602 
3 2 1 vi Hrs — va 151 
3 2 2 V3 301 
3 2 3 2v3— vo 452 
3 3 1 vi 0 
3 3 2 va 150 
3 3 3 v3 301 


The physical interpretation of the dip is discussed 
in Secs. 17 and 18 from several points of view. 


9. MULTIPLE MODE OPERATION 


As the excitation is increased beyond that required 
for threshold of single frequency oscillation, other fre- 
quencies appear in the output of an optical maser oscil- 
lator. We wish to use the expression (76) for the non- 
linear polarization in the electromagnetic field equa- 
tions (13), (14) in order to account for the observed 
phenomena. 

The first theory of an oscillator capable of multi- 
frequency operation was given by van der Pol in 
1921-22. The necessary nonlinear features were pro- 
vided by cubic terms in the current-voltage character- 
istic of a triode vacuum tube. The tank had two R-L-C 
circuits with resonance frequencies 9; and Q. Van der 
Pol found that steady oscillations could occur only at 
frequencies near Q; or 9», but that simultaneous steady 
operation at the two frequencies was impossible. There 
were hysteresis phenomena, i.e., the choice of steady 
state of oscillation depended on the past history of the 
circuit parameters. 

Multicavity magnetrons provide very important 


17 B. van der Pol, Phil. Mag. 43, 700 (1922) and a review article 
Proc. Inst. Radio Engrs. 22, 1051 (1934). 
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) arious quantities needed for the evaluation of Eq. (76) for 
three-frequency oscillation as described in the text. 


p-ag-n s—ptu-m 


0 yi 0 0 
yA Vi 0 0 
A vi 0 09 
ga yi 2 0 
A vi 4 0 
A 2va— v 2 0 
0 va 0 0 
—44^ Va —2 0 
—4A vy 0 0 
24 v3 0 6 
2A vy 2 0 
“ay? vibes —2 0 
"0r vic v3 va 2 0 

0 v 0 0 

—A V3 —4 0 
Ix yA v3 —2 0 
EM v; 0 0 
= jA v3 0 0 
=A 2va— v; —2 0 


examples of oscillators capable of multifrequency opera- 
tion, and here again the normal pattern is that an 
oscillation existing at one frequency tends to suppress 
one at another frequency. 

"There is, of course, a very close connection between 
van der Pol's work and ours. Where he dealt with a 
nonlinear triode characteristic for a vacuum tube, we 
are concerned with the nonlinear response of an as- 
sembly of atomic systems which obey the laws of 
quantum mechanics. A one-to-one correspondence can 
be set up between the two problems. The Fabry-Perot 
cavity modes correspond to the resonant constituents 
of van der Pol's plate circuit. As we will see, however, 
the effective tube characteristics of the optical medium 
differ qualitatively from that assumed by van der Pol, 
and hence the optical maser behaves in a very different 
fashion with respect to multifrequency operation than 
the oscillator in van der Pol's original model. 

Rather than deal next with the case of two-frequency 
operation, it will perhaps save space to consider first the 
more general case of three-frequency operation. Prob- 
ably most of the interesting phenomena for optical 
gaseous masers can be understood without dealing ex- 
plicitly with more than three frequencies. After the 
more general equations have been obtained, we can 
easily drop terms and discuss two frequency oscillation 
as a special and simpler case. , 

We consider the expression (76) for the third-order 
polarization, in which the indices u, p, ¢, and » can each 
take on values 1, 2, and 3. It is useful to have the 
ingredients of the summands in tabular form. The 
entries in the fourth column of Table I give the fre- 
quencies of the various summands in P, identified 
by the u, p, and o values in the first three columns. It 
will be noted that besides the three frequencies vi, vs, 
and v; assumed in the cavity excitation, there are nine 
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additional frequencies present in the polarization of the 
medium. Hence there must be fields in the cavity at 
these new frequencies, and the desired self-consistency 
for three-frequency operation is in jeopardy. However, 
under certain conditions which will be determined later, 
the fields at the new frequencies do not produce appreci- 
able effects, even though the frequencies lie close to 
cavity resonance, so that the calculation can be made 
as planned. 

Asis already apparent from the single-frequency case, 
the oscillation frequencies v, are typically very close 
to the cavity frequencies 2,, which are equally spaced 
and separated by 

A= 150 Mc/sec. (102) 


Hence three of the new frequencies: 2va— va, vitna vo 
and 2vs—», are very close to the three main frequencies 
v1, v2, and vs, respectively, and the corresponding terms 
are carried along in the calculations. The remaining fre- 
quencies can be ignored as long as the oscillator is appre- 
ciably below threshold for four-frequency operation. 
Any pulling of v, from 9, is typically measured in 
kc/sec and hence the v, are not detuned from 2, by an 
appreciable fraction either of the cavity bandwidths 
v/Qn which are about 1 Mc/sec, or of the radiative 
decay constants Ya, Yt, OF Yap Which may be 10 Mc/sec 
or more. The entries in columns 5, 6, and 7 oí Table II 
occur in the frequency denominators in Eq. (76) and 
are there to be combined with imaginary numbers — ya, 
—iys or —iya. We have neglected the small terms 
arising from frequency pulling. Thus v2—» is freely 
replaced by A of Eq. (102), etc. A symbol likes 
notes a frequency halfway between vi 


v127— 3 (viT-v2), etc. 


AAs Ree SOR 
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integers characterizing the spatial Fourier components 
(47) of the excitation density which are needed for the 
evaluation of the contribution to (76) arising from the 
indices u, p, c, and 2. 

The combinations (mps) which contribute third 
order polarizations at frequency vı are (1,1,1), (1,2,2), 


LAMB, JR. 


tribution near vı: Each summand has a product of two- 
resonance denominators. The dominant terms are 
(1,1,1), (1,2,2), and (1,3,3) since they do not necessarily 
contain inverse powers of A. The other terms have at 
least one power of A in the denominator. All terms will 
ultimately be expanded to order 1/4. 

We find 


(1,3,3), (2,2,1), and (3,3,1) while (2,3,2) gives a con- 


PO (D — Aie t/t? Ku) EUN Qra/ (Qr) D (971) + D(0)) 
+ E,ESN Qra/ (ve7s))(D(@—= i2) + (A/2)) --EXEZN Qra/ (vare) (9 (9— r) + D(A) 
T BAR END (o— 1) 4N'D(A/2)]0D«(A)+ D(A) H-EPESN«9 @— 7) 4N:0(4)] 
X[Da(2A)-+Ds(2A)]} exp—i(it e) + 3EPEsLN 2D (o— v) +ÑD(A) LDa(A)+ D2(4) ] 
Xexp—i((2v2—vs)t— (2e2— ¢3)) H-complex conjugate. (104) 
The in-phase coefficient Ci is then 
Gata? g*/G&Kw) LEAN Qro/ Qyavo)) (e— v3) £(o— yi) +E EÑ (ya0/ (vere) )L(o— 12)  (9— v2) + (2/4) ] 
3 4 E,ESN (ya (vavo) (e — v2) 8 (9 — v2) + (1/4) H-EZEW. zyav(à— rit A) £ (e — 1)/ * 
| HEEN syav(w— rit 2) € (9— y)/ At3- BPE A UN ya (0 vis A) £(9— 012) cosy 
HN ela (0 112) 4) (e— »1)—N ]siny)], (105) 
where the “relative phase angle" y is defined as 
P= (2v2—1— v3) + Qez— ex 3) - 


The quadrature coefficient S1® is given by 
S= prL pt (KUKEN ya? &(o— v1) +1)/ (vers) HEEN [ya £ (@—vi2) +47 ab A J/ (vas) 
EE EPN [ya (9— v) yA] (avo) + BPE ALN (yai — (— »))A)(e—»)—2N] 
HEFE ALN (ya 2(w— vı)A)L(w—rı)— 2N H- EZ2ESN?LEN»Qra?— (»— 712) 4) £(9— vio) —N] cosy 
—Neya(w— ve +A)&(w— vie) siny ]}. (107) 
ax are similar expressions for the other coefficients which appear in the self-consistent field equations (13) 
'The generalization of Eq. (81) takes the form 
Èı=o E pEi — OE E? OnE Es — (n23 cosy + £os sin) EZEs , 
E= a3E3— B5 Ed — 0: ESE 31— 0:5 EE — (ms cost Eos siny) HEE, 
E a5E3— 3E 3 — 03 EsE y — 0: E3Ed— (121 cosy-A- £n siny) EZ, 


The coefficients a, and 8, were already calculated in the single-f re gi d 
(83). The other coefficients are given by M de Em E 


Bio gr pt (eS Ks) Neyer? (verve) Le (9 va2) 3-441] 2N A NA ya (9—»1)]8(e—»:)) , 09) 
615— igni» (o*/ (A Ku) Ny a? (y ey) [8 (9 — 2) + A2 ]—3N A? -XN A Dy a?— 2 (9— v1) ^ ]g (e— »1) ; (110) 


(106) 


(108) 


: —2)— Tery ( g*/ (ef Kus) ) QN [Ya (w— vi) A] (9— »12) — N) , (111) 
Eg — Fer" v Ie Kus) VN vy svo — vis A) 8 (— r), (112) 
(113) 

(114) 


—j3!, Qi (egi Kus) N gy al. (6— v23— A) (e — v23) + (9 — vrt A): (e —v12) ] 

Ter v (eh? Ku? CN [Yat (w— v23) A] (e — v23) + No ya? — (w— v12) A] (t  v12) + 2N), 

Leo l2y ge Ku) (N: "Yat. (yayo) [8 (w— v2) +A |H- XN AN [y a?2-2 (— va) A] (e — v3) — INA), 
my gt (egi Ku) {NYa (Yayo) LL (0 23) - 4^7 H- NA [oy a? (— 3) A ] (9 — v3) 2N A?) , 
AKu AN [yat (e — vss) A] (o — v23) —N ) , 


Ac 


‘ad 


— er 


) 


0 


x 
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The frequency and phase determining equations are of the form 
nt p= otp E+ rE P+ Tiki? + E? EES (qa sinj— £z; cosy) , (121) 
vat p= rai EpE?- rnE?H - iE EyEa(nis simj— £y cosy), (122) 
Vi 6377-03 4E e+ Tal + rook? + EPE Es (5 simp— £n cosy) , (123) 


where the coefficients c, and p, are already gi ) 
s n n y given by Eqs. (90) and (91), and the 7’s and £'s by Eqs. (111), (112 
(115), (116), (119), and (120). The remaining coefficients are a an 


T= — Fem! "v f (eh Ku) (N (y oy) [yas (0 — 910) £(o— v12) + 2y AHN Ayalon A) L(o—v)}, (124) 
713= — TET" v (ef? Ku)? (I (y yy)" y (— v2) (a — v2) -"r AT ]H-3N aA ra (o—71--2A)£(o—71)), (125) 
T2317 — eT"? p Cei Ku) US Greve)" Dan (0 — 12) £ (o— 011) - y A7! ]H- NaA"y as (o — v2—A)L(w—v2)},, (126) 
7237 — yer v Q (eh Kcu) UV (yayi) [as (— vas) £ (o— 71))- 2r A7 ]H- NA^ yao vi - A)£(—73)) , (127) 
ra — TaT v Qi (h Ku) AUN (yayi) "vaso — v2) £ (o— v2) Yash 7H EN Aro i—2A)£(—7,)), (128) 
75:7 —yer ^» Q (eh KUHN vy) van (0— vas)  (— vos) — yas" ]H- NA^, (07— — A) £ (v1). (129) 


10. TWO-FREQUENCY OPERATION 


We may here drop all terms referring to the third 
frequency vz. There are now no “combination tones" in 
near resonance with 9; and Qs. The amplitudes E; and 
Es are determined by the differential equations 


Éy—oE,—BIES— 0E E, 
Es a3Es— 0s, EE?—BE£ , 
where the coefficients æn, Bn, 612, and 021 are now given 


by Eqs. (82), (83), (109), and (113). 
Introducing the squared amplitudes 


(130) 


X=Ef and Y=Eë, (131) 
Eqs. (126) become 

X= —fiX —61Y), 

X —2X (a1—f1 1:Y) (132) 


Y —2Y (o?—0nX —B3;Y). 


'The condition for a steady state of oscillation is 
X —0, Y 20 and may represent graphically in an X-Y 
plane by the point of intersection of the two straight 
lines 

Li: B1iX-01Y —01, 


La: 031X +B2Y — a» D 


if there is one in the first quadrant, together with the 
Single-frequency solutions 


X-a/fi Y-0 and X=0, Y-a»6:. 


The differential equations (132) allow us to follow the 
temporal behavior of the state X, Y of oscillation in the 
phase plane X, Y. Through any pomt m this plane (ex- 
cept stationary points) there passes a curve which indi- 
cates the path followed by the representative point 
(X,Y) on its way to a stable state of oscillation: The 
parametric equations of the curves are X — X (),¥=1} i 
With the time / as parameter. From the differenti 


(133) 


(134) 
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equations it Is seen that each curve has a vertical tan- 
gent when it crosses the first of the straight lines (133), 
and a horizontal tangent when it crosses the second 
straight line. This principle facilitates a. very simple, if 
qualitative, graphical integration of the differential 
equations (132) for the paths of the phase points in the 
cases discussed below. 

The various possibilities are depicted! in Figs. 3-5 
where it is assumed that both a; and az are positive, 


cum. 
o SE : 
0.5 3 EO - 
Fic. 3. Phase curves showing the transient behavior of two- 
mode oscillation. The straight lines Lı and Lz of Eq. (133) are 
taken to have coefficients a;—1, a2=0.4, 81—8:—2, 013—605 — 1. 
The slope of a phase curve is zero when it crosses line Lz and in- 
finite when it crosses line Lı. Although both modes are mon 
threshold, the favored X oscillation is able to quench the Y 


oscillation. E 
18 The phase paths of Figs. 2-4 were kindly integrated on Y 
analog computer by Dr. B. Wise of the Engineering Science — 
Laboratory, Oxford, to whom the author is very indebted. 


a 
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Fic. 4. Diagram similar to Fig. 3, except that the gain pa- 
rameter for the second mode has been raised to az=1. Simul- 
taneous oscillations at both frequencies occur at the single stable 
steady state. Both Figs. 3 and 4 correspond to “weak” coupling. 


so that the two modes are individually above threshold. 
The coefficients f, are necessarily positive if the optical 
medium is an active one, and while the mode competi- 
tion coefficients @ could conceivably have the opposite 
sign, they have been assumed positive (and equal to 
each other) in drawing the figures. 

Figure 3 applies when mode 1 is well above threshold, 
but mode 2 is only a little above its threshold, either 
because the cavity resonance frequency Q» is detuned 
from the atomic transition frequency w or possibly be- 
cause Qs« Qi. It is clear that the point (a1/61,0) repre- 
sents a stable state of oscillation, while (025/85) 
corresponds to an unstable steady state. Hence there is 
a range of operation above threshold of modes 1 and 2 
where oscillations in the favored mode 1 are able to 
inhibit oscillations at the second frequency. One might 
say that the effective gain for the second mode 


ao! =a2—0X — o — 02/83 


is being made negative by the presence of oscillations 
at vı. From Eqs. (109) and (113) it is seen that the in- 
hibiting effect is enhanced when vı2=w, or when the 
two cavity modes are on opposite sides of the atomic 
transition frequency, and approximately equally far 
from it. The physical interpretation of this effect which 
clearly involves o.f. saturation will be brought out more 
clearly in Sec. 18. 

As the excitation increases, a2’ will eventually become 
positive, and the relevant diagrams are Figs. 4 and 5. 
The former applies when 818276? (weak coupling) and 

- the latter when 818»«6* (strong coupling). 

'The two cases of weak and strong coupling give very 

different behaviors. For weak coupling the point of 


" 


LAMB, JR. 


intersection of the two straight lines gives stable 
steady-state operation, while the single-frequency oper- 
ating points are unstable. The optical maser oscillates 
simultaneously at two frequencies under these condi- 
tions. For strong coupling, on the other hand, the point 
of intersection of the two straight lines represents an 
unstable steady state and would not be realized in 
practice. All other points in the state diagram evolve 
into one or the other of the single-frequency operation 
points. Which of the two is reached depends on the past 
history of the state of oscillation. In other words, there 
is hysteresis. 

In the Doppler broadened gaseous optical maser 
Eqs. (83), (109), and (113) indicate that the case of 
weak coupling is naturally favored, since 6,62 tends to 
be greater than 012921. Hence, with possible exceptions 
such as the one discussed in the next section, double 
frequency operation is preferred. However, when van 
der Pol’s theory of a double resonance feed back triode 
oscillator is transcribed into our notation, one finds 
that in his case 0— 2G. This results from his assumption 
of a term (E, cosri- E» coset)? in the triode output 
current. After discarding terms which have frequencies 
far from v; and vs, this becomes 


2(E9-4-2E,E2) cosvitd-$ (e+ 2 ESE?) cosvet , 


which leads to the stated relation 0— 28. The van der 
Pol oscillator prefers operation at a single frequency and 
exhibits hysteresis phenomena, as in the case of strong 
coupling. Evidently the atomic medium of an optical 


N 


Y Lo 


A 


: jme SSS 
o o5 - 1.0 
X 
Fic. 5. Phase curves showing the transient behavior of two 
mode oscillation when the straight lines Lı and Le of Eq- (133) 
are taken to have the coefficients a; =a2=1, 81—82— 1, 815—0n-— 
(strong coupling). There are two possible stable steady states, 
each corresponding to single-frequency operation. The particular 
state reached depends on the initial conditions. Hysteresis phe 
nomena would occur if the parameters characterizing the 0507 
lator were slowly changed. 
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maser differs from van der Pol’s triode oscillator be- 
cause the nonlinear response of the atorns to frequencies 
vı and v; has à resonant character not assumed of the 
triode. It will become clearer from the discussion of 
Sec. 18 that two groups of atoms with different velocity 
are driving the two oscillations with only a limited 
degree of interference. One could easily make a com- 
parable model of the van der Pol type with two triodes 
each with its own tank circuit. A small amount of 
coupling between the two oscillators would produce 
intermode effects described by the @ coefficients in the 
weak-coupling case. 

There is also a further case in which the two straight 
lines coincide, i.e., when 01/057 83/07 0/8». In that m 
there is à neutral steady state for the representative 
point lying anywhere on the line within the first auad- 
rant. In practice, the state of operation when this 
condition is nearly satisfied should be verv sensitive to 
microphonic disturbances. : 


11. INTENSITIES AND FREQUENCIES IN 
TWO-MODE OPERATION 


The two-mode steady-state solution of (130) is 
given by 


Ej— (B2a1— 01202)/ (B182— 01:021) , 
E — (8105—02101)/ (8182— 015021) , 


where, as explained before, the right-hand sides may be 
evaluated for v,—9, and v2=Q2 without appreciable 
error. The frequencies are obtained by dropping in- 
applicable terms from Eqs. (121)-(123). We may set 
¢1= ¢2 without loss of generality, and find 


v= orp Ert TE, 
va Qs--as--poE2-- tab, 


where the right-hand sides are to be evaluated for 
v, Q,, and the E,? are as given in (135). 

These equations are fairly complicated, and probably 
can be used in full generality only for a numerical 
analysis" of very detailed data on optical maser opera- 
tion. Such a study would be simplified if the values of 
the E's could be inferred experimentally, since that 
would effectively reduce the dependence of Eqs. (136) 
on cavity tuning. For the present we will merely work 
out the frequencies for the important case of “mid- 
tuning” where viz 3 (vid-v2) =, w—=4A and vs—o 
—42A. If we regard A as being much greater than the 
7’s, we then may approximate the coefficients appearing 
in Eqs. (135)- (136) as follows: 


Bie Bur Dr bsc yr GIN (col Kuyar) ^8, (137) 
(138) 
(139) 


(135) 


(136) 


po —p1— 2yaATB— p, 


Ti Ta Ps 
ees na I 
TE n 5 L. Fork and Dr. 
3 Such an analysis is being made by Dr. R: 3 j 
M. A. Pollack. The author is very grateful to then for helpful 
discussions on this and other parts of the manuscript. 
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and assuming Q;— Qs Q, the gain parameters become 
etat QST Z:4)/20)]- 1) a. (140) 


The intensities are Ej E= 1/8, and the frequencies 
are 


vi Q1--21— pa/8 , 


141 
¥a=Qo-+o2+pa/f, ues 
so that the beat frequency is 
Vo vitz A+ (e—01)-- ra] A. (142) 


It will be seen that in the approximations of Eq. 
(137) we have 8182= 6021, and hence in order to decide 
whether the coupling is “weak” or “strong” it is neces- 
sary to evaluate the coefficients 8 and 6 more exactly. 
In case of exact midtuning, one finds that £?—6* has 
the same sign as N-Na. Ordinarily this would be 
positive since N must be positive for oscillations to 
occur. However, it is possible in principle to arrange to 
have N20 and N;« —N by having N(z)>0 in the 
middle two quarters of the tube length, and N(z)<0 
in the end quarters. In practice the last requirement 
could be met by adjusting the gas discharge conditions 
near the ends so that the lower maser atomic level is 
more populated than the upper one. In He-Ne masers 
an increase of tube diameter near the ends might be 
helpful in this respect. Diffusion of Ne metastables to 
the walls is thereby reduced, and electron excitation of 
the lower maser level is increased. 


12. NORMAL THREE-FREQUENCY OPERATION 


Equations (108) and (121)-(123) are fairly compli- 
cated, but can be readily used to discuss a number of 
special cases, as indicated in the following three sections. 

In general, unless care is taken to adjust the cavity 
tuning very accurately, the three frequencies vi, vz, and 
v3 will be such that the relative phase angle V of Eq. 
(106) is a linear function of the time. Then the last 
terms in Eqs. (108), (121)-(123) are periodic functions 
of time, and in some approximation their effects average 
out. If we neglect these terms, we can get a steady-state 
solution for the intensities E?, EŻ, E from the system 
of inhomogeneous linear equations 


a=b Er -015E2--01,E E 


a= BoE e+ 0s Et - UE , (143) 
az= bE} - 0s Ef -032E2 , 
and for the frequencies vi, v2, v; from 
yy Qoi piE?-- TiEZ-- Ti Ez, 
737 Qs - a2 poet Ta Ej rls, (144) 


va 7 Qi as pEr Ta ÉL E Tair, 


again taking gi= €» ¢3=0. Since the coefficients in 
Eqs. (143) and (144) are slowly varying functions of 
frequency, it will suffice first to determine the E? from 


E 
E 
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(143) and then to calculate the v.s from (144). The 
equations are a fairly obvious generalization of those 
for the two frequency case. It will be noted that opera- 
tion with E20, E20 can inhibit normal oscillation 
at v; until, with increasing excitation, 


az — a3— 03 E — 03s E? (145) 
becomes positive. 

Further discussion of pulling pushing, and mode 
competition for this case will be left to readers requiring 
a numerical analysis of their data. 


13. COMBINATION TONES 


As mentioned in Secs. 7, 9, the third-order polariza- 
tion P,G(/) of the active medium has constituents 
which oscillate at all possible frequencies of the form 
vu— vp tve. Even for *two"-frequency oscillation, there 
are additional frequencies 2v;—»:—vs' and 2yy—v27 vo 
in the polarization which are very close to resonance 
with the principal cavity modes just above and below 
the two Q; and Q% of main interest. As a consequence of 
Maxwell’s equations, fields at frequencies vs’ and vo. 
necessarily exist in the cavity and can appear in the 
output. Well below the threshold for normal three- 
frequency oscillation, E; (and also Eo) will be much 
smaller than E, and Es. We also assume yax<AKKu 
and vx~w so that 


na~ Agr» prese KuAT)3[N--2N;], (146) 


fo~0. 
For a steady state, the third Eq. (108) then gives 
Fem EgEs[o3—03:E?—032E£ | cosy. (147) 


The relative phase angle y is determined by Eq. (123) 
to have sinv~0, whence |cosy| 21, and 


Es — dum?» Qi (ey Kuh?) 
X[os—03:E?—03E7 ] * |N4-2N.|EZ2E;. (148) 


This expression is intended to be used under excitation 
conditions for which the denominator is negative. When 
the excitation increases, and the factor involving o; 
turns positive, the neglected nonlinear terms in Es 
would have to be taken into account in order to de- 
scribe the previously discussed normal three-frequency 
operation. 

In order to obtain the combination tone (2v;—»1) 
experimentally, one should adjust the cavity tuning so 
Qə is slightly above the atomic transition frequency w, 
thereby making Q, a little nearer resonance than Q5. 
The excitation should be increased until "two"-fre- 
quency operation is obtained, but not yet genuine 
tihree-frequency operation. 

Under these conditions, the 6’s in Eq. (148) contain 
a factor A™ and to simplify the discussion they will 
now be neglected. Equation (148) then has a factor in 


— its denominator 
a37: —1(v/Q3)/G3, (149) 


LAMB, 


JR. 


where the gain factor G; is given approximately by 
Ga~ [1— (V/N ):(09/2«00)]7 , (150) 


if one is not too near to threshold for normal three- 
frequency operation. Equations (148)-(150), within 
their domain of validity, indicate that E; is smaller 
than E; by a factor 


Ey/ Ey! ph Eg (h Kuh?) 1QsGs| N24] . (151) 


Using Eq. (62) for n=3, this can be expressed in a 
convenient form 


Es/ Ey 303 (@E2)*/ vavo) E Qvav)/ &* 6s 
x [1+ QN;/N)|exp(A/Kwuy, (152) 


which shows how the amplitude of the combination 
tone depends on Gs, and on the saturation parameter 
as given by Eq. (96) for n— 2. It should be noted that 
Eq. (152) could vanish if the spatial distribution of the 
excitation density is such that 2V2+N=0. (If the ex- 
citation is confined to the central region of the Fabry- 
Perot tube, JV» and N, by Eqs. (47), (58) have opposite 
signs.) An experimental study of the above phenomena 
might facilitate determination of some of the quantities 
which enter into our equations but for which direct 
experimental values are not yet available. 


14. FREQUENCY LOCKING PHENOMENA 


It has been observed by Javan? and by Fork? that 
when the cavity tuning is gradually changed in normal 
three-frequency operation so that the separation of the 
beat notes v;—»; and v3—v2 approaches a small value 
(typically of order 1 kc/sec), a frequency jump occurs. 
This phenomenon can be easily understood by refer- 
ence to Eqs. (121)-(123). For simplicity, we neglect 
the small frequency pushing associated with the terms 
involving pam and Tam, since the nonlinear pulling 
terms c, already give sufficient generality to the fre- 
quency relationships. By subtracting the sum of Eqs. 
(121)- (123) from twice Eq. (122), we find a differential 
equation for the relative phase angle y of Eq. (106) in 
the form 

v=c+A sinj4- B cosy, (153) 
where 


(154) 


and A and B are slowly varying quantities which de- 

pend on the En, Enm, and nam. We evaluate the Enm and 

nm With the usual approximations ya«xAKKw. Then 
Teu er y Ue Ku) (N - 23) ; 
misery 04 ei KuAt) (N — 2N») , 


Enceinceuce0. 


0—202—01— 03, 


(155) 


Because of the nearly symmetrical tuning of 1 and vs 


20 . H 
A. Javan, private communications for which the author È — 


very grateful. 
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we may set EXE, and find 

Az ry p Cei Kua 
XQGN-F2N)E24-2(N—2N3)B2), (156) 

B=0. 

The differential equation 


Y=o-+A sinj4- B cosy (157) 


has an implicit solution 


Y 
(y= | dx/ (c4- A sinx4- B cosx), (158) 


/ Yo 


where Vo is the value of y at t=0. The character of 
the function y/(/) depends critically on whether 


(424+ BY"2<(¢| or (A2+B%)'2> |o]. 


In the first case, the integrand of (158) has no singu- 
larities in the range of integration, and one finds that 
as V approaches infinity so does /. Asymptotically, 
apart from pulsations, one has ¥~ol-+-const. On inser- 
tion of this value in Eqs. (121)-(123) one finds that the 
frequencies are given by 


Vn ~a Fn (159) 


apart from pushing effects and pulsations of phase gn, 
in agreement with the results of Sec. 6 with the approxi- 
mations made here. 

In the second case, (A?+B?)>o?, the integral di- 
verges, i.e. ¿—>œ when y reaches the value —sin™ 
X (c/ (A2-I- B2)? Tn other words, V(/) approaches this 
value asymptotically. The disappearance from y of 
any linear dependence on / forces vs—vs—vs—»; and 
frequency locking ensues, with a definite relative phase 
angle 2¢2— q1— es. 

Let us suppose that the maser is in normal three- 
frequency operation with two distinct beat notes va— va 
and v»—», near to A150 Mc/sec, i.e., o= (vs—v2) 
— (va—»1) is somewhat greater than (A?+- B?)?. As the 
middle cavity frequency Q» is tuned closer to the atomic 
resonance frequency w, the separation of beat note fre- 
quencies |c| decreases. There should be some pulsa- 
tions in phase which would increase in amplitude as 
symmetrical tuning is approached. When |o| reaches 
(42+ B2)/2, a quick transition to the locked state 
should be made, and only one beat note should be 
observed. Under the additional simplifying assumption 
E> E~ E; and with use of the starting condition 
(60) for single-frequency oscillation, the separation of 
the two beat notes which could be attained just before 
locking occurs should be given by 


le] —-A3EQ-2N3)/Nv C gEs/1A)v/Q , 


which is conveniently expressed as a small fraction of 


the cavity bandwidth. 
It might be pointed out that the above phenomenon 


(160) 
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is very closely related to one discussed by van der Pol” 
in 1924-27. He considered a self-sustained triode oscilla- 
tor, capable of oscillation at frequency vı. If an external 
signal at v is injected into the tank circuit, it may be 
possible to detect a beat note at !»—»,| using a square- 
law detector. If, however, v is tuned gradually towards 
vy, a very sudden jump occurs, after which oscillations 
occur only at v and the beat note disappears. The width 
of the “quiet” frequency range depends approximately 
linearly on the amplitude of the injected signal, when 
this is small. In the case of the optical maser, we can 
think that an oscillator at v; is being perturbed by an 
"external" signal at the combination tone frequency 
v= 2v5— v; which arises from the third-order polarization 
P(t) induced in the nonlinear active medium. 


15. POPULATION CHANGES AND PULSATIONS 


In the absence of o.f. oscillations, the density of 
atoms in one of the two maser states, say a, can be 
determined by suitable integrations from paa?) (d,20,t0,0,t) 
as given by Eq. (24). When oscillations set in, there are 
contributions of second order which can be calculated 
from paa®(a,2Z0,to,2,t) using Eq. (31) for a=a and a 
similar equation for a=b. One has 


Ea t 
pooled) | Zl dt | dude tht 


X X Aa (2o,lo,7) paa (a,20,19,2,L) « 


a=a,b 


(161) 


It will suffice merely to give the result. One finds, with 
obvious approximations 
paa(2,t)=[Aa(2,l)/Ya]+ EAD] nra] E02 0/0] 
xi(g/ Ku) LX (E,E, Dalva — v ))iZ (o — v) 
Hop 


Xexpil (v,—v,)t-- (Gu— gp) |+ c.c.) 


xcos[G1,—21,)2/L] (162) 
apart from terms with rapid spatial oscillations. 
For single-frequency operation, one finds 
Paa(2,t)= [As(2,0)/v] 
— (AsG,D/ va ]— LA (2,0/75]]) 
Xi(gE)GQ.Ku)Zi(e—»), (163) 


which contains the lowest order effects of o.f. saturation. 
In some cases, the density could be monitored by ob- 
servation of the decay radiation emitted from state a 
(or 6) in transition to some lower level. (Of course, if 
trapping of resonance radiation were involved, the 
interpretation would be somewhat complicated.) The 
change produced by o.f. radiation in paa(z,t) might 


serve to aid in the determination of parameters like 
€- 


?: B, van der Pol, Phil. Mag. 3, 65 (1927) and the review article 
cited in Ref. 17. 


^ 
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ya, Ex), ctc., which enter into all our equations, but 
which might not otherwise be known. 

For two-frequency operation of the optical maser, 
paa has a pulsating constituent at a frequency near A 
besides a dc part. If we assume Yay ~ Var AK 


paa (Zl) 
= [As(2D/v«]1— (Ao (50/v«]7 Lt (60/ ve] 
<a? g? (Kuya) LE? exp— (e—v1?/ (Ku)? 
+ Ex? exp— (wo—v2)2/ (Ku)? J+ (LAo/v«]— LA/mJ) 
dal? g? CuA)- E Ex exp— (o—v2/ (Ku) 
+exp— (w—v:)?/ (Ku)? ] cos(7z/L) sin(va—i)t. 
(164) 


From this expression one sees that the amplitude of the 
pulsations at frequency near A, relative to the dc 
change in population due to o.f. oscillation, should be 


QE, Eo(Ep+ Es?) (y«/ A) cos(nz/L) , 


if, for simplicity, we neglect the Gaussian exponentials. 
Here again, it might be useful to use this phenomenon 
as a diagnostic tool while undertaking a systematic 
study of two-frequency operation. 


16. CONNECTIONS WITH PREVIOUS CALCULATIONS 


The basic paper in this field is, of course, that of 
Schawlow and Townes! who give expressions for thresh- 
old equivalent to (62). Townes” has also given an equa- 
tion for linear pulling, as has Javan? for nonlinear 
pulling. 

Oscillations of an optical maser involve the propaga- 
tion of radiation in a nonlinear medium. Several papers 
have recently appeared which deal with this subject. 
For various reasons, these do not apply very closely to 
our particular problem. Thus, Bloembergen et al. and 
Franken and Ward” treated harmonic generation which 
plays a relatively minor role for us. Teng and Statz? 
discussed low-order nonlinearities in a gaseous medium, 
but, as will be discussed below, their treatment of 
Doppler broadening is not adequate for our purposes. 
Also our model for radiation damping is more realistic 
than theirs which involves just one relaxation time r, 
while our equations contain two decay constants Ya 
and yz. To be sure, the combination yas=3(ya+7») 
enters most equations, and this might be identified 
with 7. 

Among other publications which deal with maser 
theory are those of Wagner and Birnbaum?' and of 


? C. H. Townes, Advances in Quantum Electronics, edited by 
J. Singer (Columbia University Press, New York, 1961), pp. 3-11. 

2» A. Javan, E. A. Ballik, and W. L. Bond, J. Opt. Soc. Am. 52, 
96 (1962). 

^]. A. Armstrong, N. Bloembergen, J. Ducuing, and P. S. 
Pershan, Phys. Rev. 127, 1918 (1962). 

25 P. A. Franken and J. F. Ward, Rev. Mod. Phys. 35, 23 (1963). 

26 C. L. Tang and H. Statz, Phys. Rev. 128, 1013 (1962). 

21 W. G. Wagner and G. Birnbaum, J. Appl. Phys. 32, 1185 


.' — (1961). 
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McCumber. These papers consider to some extent 
the quantum nature of the electromagnetic field. They 
differ greatly in spirit and content from ours, and we 
will not attempt to make a comparison here. The work 
of Haken and Sauermann? and of Davis? seems much 
closer to ours, but there are significant differences in the 
models used, and in the appearance of our equations, 

As mentioned in Sec. 4 an earlier calculation? appli- 
cable to an optical gaseous maser neglected complica- 
tions arising from the atomic motions and multimode 
oscillation. It was then possible to work with a density 
matrix p(r,f) characterizing an ensemble? of atoms at 
position r at time / which were excited at any time fy £/, 
This obeyed 


ip=(5C,p |—(i/2)(Vpt+el) +74, (165) 


which differs from (18) by the term containing a source 
matrix A describing the (slowly varying) rate densities 
of excitations Ag and A, 


(166) 


(In most applications A will be a diagonal matrix.) 

It is possible to carry the calculation to higher order 
in the E, for multimode oscillation without atomic 
motion by an iterative procedure in which we begin by 
neglecting any time dependence in the population dif- 
ference paa— po. In the rotating wave approximation, 
one of Eqs. (25) then gives a quasisteady-state solution 
for Pav(Z,t) 


par(2,l)= —3i (9/4) EU y(2)Do—r) 


X (paa— pos) exp—i(val+ en). (167) 


Inserting this in another one of Eqs. (25) and again 
making a rotating wave approximation we find rate 
equations 

Daa = —Yapaa FR (Q5— Paa) HAa D 


(168) 
Bus — Yop H R (paa— poo) Ho, 


where 


R-i(g/h) 2 2; EJE,UA(Z)U,(z) 


X[D(o—»,)(expi(n—rJI+i(er— e9)4-c.c-1- 069) 


The “rate constant” R has pulsations for cases of multi- 
frequency operation, and through Eq. (168) these would 
lead to pulsations in the populations paa and pos at 2 

beat frequencies vy—v,. If it were deemed necessary to 
continue the iteration procedure the pulsating popula- 
tion difference paa—pss could be approximately evalu- 


23 D. E. McCumber, Phys. Rev. 130, 675 (1963). 63); 
2 H. Haken and H. Sauermann, Z. Physik 173, 261 (1902/3 
176, 47 (1963). 
2L. W. Davis, Proc. Inst. Elec. Engrs. 51, 76 (1963). 
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ated from Eq. (168) and put in Eq 
improved Eq. (167). 
If we neglect the pulsations in R,w 


8. (25) to obtain an 


€ obtain 


R=3(9/h Ya Y, EPU Pelr), (170) 
A 

which is a plausible generalization of the rate of transi- 
tions previously obtained. In a steady state, the popula- 
tion difference implied by (168), (170) is 


Paa— Dbb — [(As/y«)— (Av/y)] 
X[1+ QyaR/(vays))], (171) 


which shows clearly the effect of 0.f. saturation. 

Since the rate constant (170) depends on position z 
through normal modes U,,(z), the population difference 
(171) also depends on position, and may be said to 
have “holes” burned in it. Consequences of this for 
mode competition can be discussed along the lines fol- 
lowed in Sec. 10. Under most conditions the behavior 
will correspond to the weak-coupling case, although if 
the excitation density were such that — N, were larger 
than Ñ the strong coupling case might be realized. 

Combining Eqs. (165), (167), (171), and (41) we 
obtain a polarization 


P(2,!)= Q (paot pas") (172) 


and the coefficients S, and Cn which enter the amplitude 
and frequency determining Eqs. (13) and (14) are 


2 
Bem ( P/A) YL lw— Un) En 
JE, x 
(173) 
L 
x | dU YEN EDH QR OY) 
0 


and 
C, [(o—v5)/ras 19s. (174) 
These expressions depend nonlinearly on the mode 
amplitudes E, because of saturation. However, since 
we have already neglected the beat frequency pulsa- 
tions of paa—pe» which lead to terms with A in the 
denominator, it might not be consistent to keep any 
terms in R which are off-resonance by more than about 
|o—v, | ^ (yas A) ?. à : 
We will now consider only single-frequency operation, 
for which our Eqs. (173)-(174) are essentially exact. 
Then a single summand “=” contributes to R and we 


find 
2 f - 2! 2 
S, — gras | dsl U (2) EN @L 108+ (e 
L 0 


Hra (QEY LU nls) F/ Greve)". (475) 


The integral may be easily calculated if N (2) is a slowly 
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varying function of position, and we find 
Su — gr yaN £v.) f(w)E,, (176) 
where 
2— [CoE,/ yari) yat &(o—v4) (177) 
and 
fw) = (2/w)[1— (0-4 -w)-]. (178) 


If Eq. (175) for S, is expanded to third order in En, 
we obtain a result in agreement with that given by Eqs. 
(56), (72) for the single-frequency case with no atomic 
motion. It should be noted that whether (175) is ex- 
panded or not, the amplitude of oscillation has a maxi- 
mum for resonant tuning and falls off monotonically 
with detuning. There is no indication here that the 
double maximum of Sec. 8 might be a spurious one which 
arises from the neglect of fifth or higher order terms. 


17. DISCUSSION OF DOPPLER BROADENING 


The effect of atomic motion upon our equations is 
rather curious and warrants discussion which, for sim- 
plicity, will be given for single-frequency operation. As 
we have seen in Sec. 16, the optical properties of the 
medium may be described by a nonlinear susceptibility 


x(w—a, En)=P af (ecEn)- (179) 


It would perhaps be plausible to hope that the following 
simple recipe would take atomic motion into account. 
Because an atom moving with velocity v sees a Doppler 
shifted frequency, in the laboratory frame of reference 
it effectively has its resonance frequency shifted by 
w/c. The effective susceptibility ought then to be 


ae Í deW (s) Co eo nf ed rA 


which can easily be expressed in terms of the Z(¢) 
function which is discussed in Sec. 6. The effective 
damping constant becomes 


Yal IF (QE) A CP yay) ] 


instead of yas, and when this is much smaller than the 
Doppler parameter Aw, the line shape should be 
Gaussian with a normal Doppler width. 

The above prescription is incorrect except to the 
first order in E, if standing waves rather than running 
waves are involved in any way. This follows from a 
study of Eq. (70) contributing to the third-order polari- 
zation which involves a threefold integration over times 
7', 7", and 7". For single-frequency oscillation, the 
integrand contains 
U.(s—vr )U .(—07 — vr ) Us (a — 90! — or — tr) 

Xexp[ —i(e—»v4)7' ] exp[-E1(e —v4)2" ^ ]. 
Each of the U,’s is a standing wave which can be ex- 
pressed as the sum of two running waves like 
exp-ki(Ks—tr’), etc. The physical interpretation is 
that in order to contribute to P„® at time / an atom 


£ 
y 
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has to interact three times with the o.f. field: first at 
dor er eit then at i =i— 7'— 1", and finally 
at //z 1— r. At each of these interactions the atom has 
a “choice” of interacting with one or the other of the 
two running waves. The 7 dependence of a typical term 
of the integrand of (71) is then 


Boa (rip mar apio o) (rer ) ei vr 
+iKo(r 4-7 )2ei Ko (7^ + rit”). 


The physical consequence of the appearance of terms 
involving 3E Kv in the exponent is that each interaction 
involves a Doppler shift. Since we are interested in the 
case of a large Doppler width Ku, such a shift can take 
a given interaction very far from resonance even if 
o v4. This is expressed by the destructive interference 
of contributions to the integral at various value of 7’, 
7", and 7”. The interference will be least when the 


accumulated Doppler phase angle 
x Kor Ke(/ +r) EKo rtr" 7) 


is zero. In order to obtain a nonvanishing spatial Fourier 
projection (8) for P,,®, the choices + can not be all 
alike. The six remaining possibilities for the phase 
angle are 


+Kof7’+ (74-77 )— (r+ 1 +7r”)]= +Ki(r— r”) ; 
EK (Hr) + (Hr 1) ]e EK +2"), 


and 


FRA rA rN k 77) 
-EKo(1 4-27" +r”). 


As seen in Sec. 7, only the first two possibilities are 
able to lead to vanishing interference, and then only 
when z'— 7". 

Physically, one may say that a dominant type of 
process involves three interactions: first, one with a 
right (left) running wave at Į”, then one with a left 
(right) running wave at /’, and finally one with a left 
(right) running wave at /, with the time intervals 
obeying 4—// — 1/ —1'" so that the accumulated Doppler 
phase angle 


EK (1—10)-4-Kv(1—t")— Kolit] 


cancels out at time /. 

The above cancellation of Doppler interference would 
not occur if waves running in only one direction were 
present. The nonlinear terms are much less broadened 
and weakened by Doppler effect for a standing wave 
maser oscillator than would be inferred from a study of 
nonlinear propagation alone. The double peak in the 


9 ies " . A 5 
- power as a function of tuning met in Sec. 8 can occur 


y because n (saturation) is not as much Doppler 
'oadened as æn (linear gain profile). 
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18. HOLE BURNING 


In his discussion of maser action Bennett?! has made 
use of the notion of “hole burning." Since it aids the 
physical understanding of the rather complicated equa- 
tions, we will now show how this phenomenon is de- 
scribed in the present work. As Bennett's treatment 
does not bring in the population pulsations of Sec. 15 
we will base our discussion on the simplified theory of 
Sec. 16 in which pulsations of population were neglected. 

In Sec. 16 the atoms had zero velocity. It is possible 
to generalize the discussion for the case of atoms having 
a velocity distribution W (v) at the cost of further fairly 
plausible assumptions which are no worse than approxi- 
mations already made. We deal first with those atoms 
which have a definite velocity v and which were excited 
at zo at time fo. The perturbation experienced by such 
an atom is 


V()=—(e/it) 2 E U „(zot v(1— lo) ) cos(v,l-F pu) (181) 


so that instead of seeing fields at frequencies v, the 
moving atom sees fields at twice as many frequencies, 
i.e., put Kv. The rate concept approach of Sec. 16 can 
not be used since the atoms characterized by different 
values of zo and fo experience different perturbations, 
i.e., the phases are not the same for the various members 
of the ensemble of atoms arriving at z at time /. It is 
plausible, however, to estimate the effect of saturation 
on the population difference Pao (v,1) — os (v,/) by using 
an equation like (171) with a perturbation V(t) given 
by (181) but with the terms involving zo and ty omitted. 
The replacement for the velocity-dependent rate con- 
stant R(v) is then plausibly 


RO) =tral o/h)? E E/[g(v,—o Kv) 


+.£(v,-w—K»)]. (182) 


If v=0, this reduces to the space average of Eq. (170). 
The corresponding population difference for atoms hav- 
ing velocity v is then 


paa (91) — psv (vf) 
=W (v) (Aa/Ya)— (As/Ye) ] 
X[t-2vaRQ/Grv)]*.. 089) 


A plot of (183) against v would show the assumed ve- 
locity distribution with “holes” burned into it due to 
o.f. saturation effects. These holes would be appreciable 
whenever R(v) became comparable to vavel (Yor? 
which could occur near Kv»— --(v,—c), so that iwo 
holes could be burned for each cavity mode in oscilla- 
tion. At a resonance, where v,—c, the two holes for 
the nth mode would merge and reinforce one another. 
The holes in the velocity distribution would have been 


3 W. R. Bennett, Jr. Appl Opt. Suppl 1, 24-61 (1962; 


especially pp. 58-59. It should be noted that the holes are in first 
instance burned in the curve of population difference versus vec 


locity, and only indirectly in a curve of gain versus frequency- 
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scen in Sec. 15 if the integration over velocities had not 
been carried out so soon. 

With the above approximate expression for Paal?) 
—pw(v) we may use Eqs. (25) to calculate a pseudo- 
first-order value of paz!) and P(z,v,t) in the manner 
of Sec. 16. The result will be the same as before, but 
with a velocity dependent reduction factor 


[1E 2yaR (2)/ (yave) ] 


to express the effect of saturation, where R(v) is given 
by (182). There are similar reductions in the related 
functions S4(v) and C, (v). The coefficients S, and C, 
which enter the equations of self-consistency [ (13), (14) ] 
result from integration of these quantities over velocity. 
Thus 


S,— — ghi NE, | doW (v) &(w—v,-+ Kv) 


XL 2ya R(9)/ (avo) ] 


For single-mode operation, the rate R(v) in the de- 
nominator of (184) contains the two terms given by 
Eq. (182). If an expansion is made to first order in 
R(v) the integration over v can be done easily for a 
Maxwellian velocity distribution in the limit of large 
Doppler width Ku, and one gets results equivalent to 
those of Sec. 8. The possible dip in output power as a 
function of cavity tuning in single-mode operation can 
be interpreted as a consequence of the merging of the 
two holes at Kv— + (v, —e) — 0. 

For multimode operation a similar calculation can 
be made with the complete expression (183) for R(v). 
This will give the dominant terms of the equations of 
Sec. 9 for a Maxwellian velocity distribution and large 
Doppler width. However, the frequency locking terms 
involving V will be missing. 

For two-mode oscillation, (184) leads to especially 
strong mode competition attributable to hole burning 
when Kv=12.—w=w—, i.e., the traveling wave along 
+z for mode 2 and the traveling wave along —z for 
mode 1 are both in approximate resonance with the 
atomic transition frequency for an atom having ve- 
locity v. This effect can be correlated with the peak in 
6 for o—vis» mentioned in Sec. 10. 


(184) 


19. APPROXIMATE HIGHER ORDER THEORY 
FOR SINGLE MODE OPERATION 


It would be possible, but quite tedious, to extend the 
calculations of the text to fifth and higher order for the 
single-frequency case. The simpler approximate pro- 
cedure outlined below may serve in the absence of such 
calculations. It was mentioned above that an expansion 
of (184) to first order in R(v) reproduces the equations 
of single-mode operation correct to third order in En. 
If this expansion is not made one may hope to have 
equations which are valid for stronger signals. The v 
integration is complicated, and we will content our- 
selves here with two special cases (a) Ku» |vs—e|2»Y« 
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and (b) vn=w, Kuyar: In the former case, one finds 
approximately 


Sí—R412 9NE,/ 

(AKw) UE (QEN) (Pyare), (185) 
while at resonance, when ya=w, 
S, TONE, / 

(hKu)[1+3( pESY/ GPysy)) J^, (186) 


and the merging of the two holes shows up in a simple 
manner through the doubling of the term expressing 
the effects of saturation. It will be remembered that a 
similar doubling of the coefficient 8, took place in 
Sec. 8 and was responsible for the possible dip in maser 
output versus cavity tuning. Although the more general 
behavior of output versus tuning implied by Eqs. 
(185)- (186) should be qualitatively correct, it must be 
remembered that rather uncontrolled approximations 
have been made in their derivation. 


Cs eee 


i al tii a: 


20. EXCITATION OF LOWER MASER LEVEL BY 
SPONTANEOUS DECAY OF UPPER 
MASER STATE 


It was mentioned in Sec. 6 that the lower maser level — = 
could, at least in part, be excited by spontaneous emis- — Em 
sion from the upper. For the present this complication — 
will be treated only in an approximation in which the 
rate concept is valid. For simplicity we ignore atomic x 
motion, although the work of Secs. 16 and 19 suggests — E 
how this could be allowed for approximately. We write 
rate equations like (168) dE 


B. 
d 
mE 
= 
fi- 


Paa = — YaDaaT- R(p55— Paa) t Aa D 
Dbb— — yipu- R(paa — pin) HÀ ot fYapaa ? 


where the extra source term fypaa describes the effects aye 
of radiative cascade excitation of b from a assuming 

that a fraction f of the decays from a are to b. The A’s 
describe the uncorrelated excitation of the two levels. 3 
In a steady state one finds a population density E 
difference 3 


paa— pa [As Ya) fé 9) Qu/0)] E 
X[z-RGi«G-f)krM en) ]E*, (188) — 


which should be compared to Eq. (171). It will be seen ^ 

that the effect of a nonvanishing branching ratio f is ^ - 
merely to change the unsaturated population difference E 
(obtained for R=0), and also to modify the value of R — — 
for which a given degree of saturation would be ob- — 
tained. The saturation parameter of Sec. 8 will be — 
modified in an obvious fashion. Thus if f=0, a value 
of rate R=}yc70/7as would cause 50% saturation, — 
while if f=1 the value would be R—,. It should be — 


(187) 


terms S,, and C, are direct manifestations of satur 
tion phenomena. At the present state of maser art, 1 


US 


decay constants ya and y, are not well enough known 


et Oe! oe As bs a <_< 
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for the effect of a nonvanishing value of f to be easily 
seen. 

When population pulsations are taking place there 
will be a correlated time-dependent excitation of the 
lower level by cascade. It is possible that more interest- 
ing consequences than those obtained would result, and 
it is hoped to explore this possibility in a later paper. 


21. OTHER SOURCES OF BROADENING 


For some kinds of line broadening, especially in 
certain solid-state optical masers, one could adopt the 
recipe proposed in Sec. 17, and rejected for the case of 
Doppler broadening. If the effect of environment could 
be described by a distribution function for the atomic 
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134, 


AMB, JR. 


resonance frequencies w, an averaged nonlinear suscepti- 
bility could be used. This could also be done for the 
case of isotopic mixtures of the active atoms in gaseous 
masers. 

Although ya and y+ were introduced into our equa- 
tions to describe spontaneous radiative decay of the 
states a and b, it is plausible that such phenomeno- 
logical decay constants might also describe certain kinds 
of collision broadening. In that case, the y’s would be 
functions of the pressure? A more detailed discussion 
of collision broadening for a gaseous optical maser will 
be given in another paper. 


3? Evidence for such a dependence has recently been obtained 
by Javan and Szóke, Ref. 16. 
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Correlation Effects in Many Fermion Systems. II. Linked Clusters* 


Hucnu P. KELLY 
Department of Physics, University of California at San Diego, La Jolla, California 
(Received 31 January 1964) 


In a previous paper a set of coupled equations was derived for the ground-state wave function and energy 
of a finite system of interacting Fermions. ‘The equations are now modified so as to be more applicable to 
systems in which the number of particles becomes large. The resulting equations are shown to be equivalent 
to those obtained from many-body perturbation theory. 


I. LINKED CLUSTERS 


Ls a previous paper,! a set of coupled equations was 
derived for the ground-state wave function and 
energy of a finite system of interacting Fermions. The 
wave function was expanded in terms of multiple- 
particle excitations on an uncorrelated zero-order state. 
The total energy E of the system appeared in the result- 
ing equations and it was pointed out that this restricts 
the application of these equations to finite systems; in 
general, the restriction is to systems of small V. In the 
equations, the amplitudes for one-particle excitations 
are coupled to those for two-particle and three-particle 
excitations. The two-particle amplitudes are coupled to 
those for one-particle, three-particle, and four-particle 
excitations, and similarly for higher particle excitations. 
Tt was mentioned in I that it might be reasonable to 
approximate four-particle excitation terms, for example, 
as products of independent two-particle excitations. 

It is shown here that four-particle terms involving 
two independently propagating pairs enter the equa- 
tions in such a way as to eliminate the dependence of 
the two-particle excitation equations on the total 


ees Work supported in part by the U. S. Atomic Energy Com- 
" mission. 
1H. P. Kelly and A. M. Sessler, Phys. Rev. 132, 2091 (1963), 
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energy E, and similarly for the other excitations? 
Explicit inclusion of products of independent excitations 
yields the equations of the linked cluster expansion. 
'The resulting equations are shown to be the same as 
those obtained from many-body perturbation theory as 
formulated by Brueckner? and by Goldstone.* 

In I, the ground-state wave function is expanded as 


Y= |e)+2f (k; a)n "na| Po) 


ar pr f(RR’ ; oB)nc m gna | Eo) + BOO (1) 


a, Bk, k' 


The unperturbed solution |do) is a determinant com- 
posed of the N single-particle states which are lowest 
in energy. 

Equations are then derived for f(k; o) and f (Ek! ; o) 
by inserting |V) from Eq. (1) into 


H|J)- E), Q 
where H is written in the usual second-quantized form.’ 


2? T am indebted to Dr. A. M. Sessler for stressing the desirability 
of including products of independent pair excitations in the four 
particle excitations, so as to make the resulting equations more 
applicable to systems of large N. 7 

îK. A. Brueckner, in The Many Body Problem, edited by 
C. DeWitt (John Wiley & Sons, Inc., New York, 1958). 

4 J. Goldstone, Proc. Roy. Soc. (London) A239, 267 (1957). 
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Fro. 1. (a) In perturbation theory 
two independent pair excitations 
factor when both time orderings are 
considered. (b) Typical terms which 
always link any two excited pairs. 


An approximate solution for f(kk’; aß) is given in Eq. 

(15) of I, in this paper referred to as Eq. (1.15), 

SCAR; 0B) = (eat eg— e&— e —( (8) 2| v| 8) 
+E—Enp)( (kk) ex| v| a8). (3) 


If we neglect the term E— Zr, then this expression is 
that which is obtained from Rayleigh-Schrédinger per- 
turbation theory when we use a Hartree-Fock basis and 
include the first-order term and all higher order diagonal 
hole-hole interaction terms.: However, for large N the 
term E— Enr may become so large as to invalidate the 
equations of I. Such difficulties have been described by 
Brueckner in his comparison of Rayleigh-Schrédinger 
and Brillouin-Wigner perturbation theories.? 

We now show that the term E— Enr may be removed 
by considering the coupling of four-particle excita- 
tions in Eq. (I.11) for f(kk'; aß). In the equation for 
f (RE  ; ap), coupling with four-particle excitations adds 
the term 


Z ((y8) zl v| R Ey f(RR ROR : aßyô) (4) 


yokrikin 


to the left-hand side of Eq. (2) or Eq. (1.11). The 
coefficients f(kk'b"k'" ; aßyò) are composed of parts in 
which two pairs propagate independently and of 
remaining parts involving more complicated interac- 
tions among the four particles. 

The approximation of regarding f (RRR E" ; oBy6) 
solely as a product of two pair excitations has been 
discussed by Brenig and Sinanoğlu.” A justification for 
this decomposition is found in perturbation theory 
where the lowest order four-particle excitation is given 


5 H. P. Kelly, Phys. Rev. 131 684 (1963). 

6 Hartree-Fock single-particle states are assumed, although 
they are not essential for this discussion. 3 " 

7? W. Brenig, Nucl. Phys. 4, 363 (1957); O. Sinanoglu, J. Chem. 
Phys. 36, 706 (1962). 
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by the product of two pair excitations when we consider 
both possible time orderings as in Fig. 1(a). Those 
parts of f (EE E"k'" ; aßyô) in which two pairs of particles 
propagate independently should be written as products. 
of f(kk';aß). It may be noted that there are always. 
linked terms connecting any two excited pairs, however, 
as shown in Fig. 1(b). > 
When product pairs such as f(kk"; ay) f (EE; 88). 
are inserted into Eq. (4) and used in the equation for 
f (EK. ; aß), than linked terms result as shown in Fig. 2. 
When we consider the products 


Í kk; aE f(E" R^; ô), ; 
ya k'#k,k (3) 
640,68 k” Akk, 


these give us terms 


SRR; aB)E S| v| (EE), 2) f(E" E^; 78) 
yzaj8 Ek'kE 
64a,8 Lk" b 


on the left-hand side of Eq. (2). 
In Eq. (1.8) it is shown that 


E—Ene= $ (a8|o| (RE), f(RR ; aB). 


a, B. h.t 


NE 


Fic. 2. Diagram illustrating how H may couple two indepe 
pair excitations with a single pair excitation. There 
diagram in which the time ordering of the first two i 
is reversed. This term enters the equation for f(s’; ag) 
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Equation (6) is then rewritten 


J(M.ogLE-Ene- X, (e| K ea) (ER 5a) 


„kit kur 


Ee pa (yB\ | (RR) y f (ROR ; y8) 


yak krni 


z Dr (ye | v| (R^) 2) f (RR; 8) 


krri; y4a,B; 64a,B 


= D» (y8| 0] (k Rez) f (RR ; v8) ]. 


kik; ya, B; 575a. 


(8) 


When the term of Eq. (8) is added to the left-hand side 
of Eq. (1.11), f(kk' ; aß) (E— Eur) cancels on both sides 
of Eq. (1.11). 

At this point we now have the linked cluster expan- 
sion for two-particle excitations. Equation (I.11) is 
also now nonlinear but in many cases this should not 
cause any difficulty.» The remaining terms are not 
proportional to JV like E— Egr; they will be given a 
simple physical interpretation. The denominators in 
Eqs. (1.15) and (1.16) are now modified by replacing the 
terms E—Egr by the summations in Eq. (8) with 
positive signs. For example, Eq. (1.15), an approximate 


“equation for f(kk'; a), becomes 


Ir 


z AM krr; y4a,8; 54a, 


ae > 


J(kE' ; ap) 
= [eat ea— &— er — (a8) «sl 0] a8) 
E =, | »| (BUR) ex) f(k” k” ; aô) 
«8| v| (R' k^) Nu ; yB) 


ya, k! k 


ES rv otlel G^), (487578) 


m |0| (kk) ex) f(k” k ; y0) T 
X(RR^) "di v|o) : 


RR yo, B; ô*a, 


(9) 


HUGH P. 


KELLY 


Fic. 3. (a) Diagram 
factorization when the 
two disconnected parts 
have one or more par- 
ticle or hole lines in 
common. The two sub- 
tracted diagrams are 
really linked diagrams 
as shown in (b). 


Eq. (9) which refer to states |o) and |8} combine with 
the single-particle energies e4 and eg and with the 
diagonal hole-hole interaction term ((o8)e«|v|a)) to 
give an effective two-body energy for |a8). When the 
denominator also includes the particle-particle and 
exclusion principle violating (EPV) hole-particle terms 
of Eq. (L17), then the denominator becomes approxi- 
mately the difference between the two-body energies of 
the states |o) and |kk’). However, the two-body 
energy of | kk’) is not given so accurately as that of |a) 
because the interaction of the excited particles with 
unexcited particles is accounted for only in the Hartree- 
Fock approximation. However, pair-correlation energy 
terms of a ground-state particle with the other ground- 
state particles are now included. The last two terms in 
the denominator of Eq. (9) which refer to states |yô) 
account for the fact that correlation terms for ground- 
state particles which involve excited states |k) or |’) 
will be eliminated by the Pauli principle when there are 
other particles excited into |k} or |’). 

As a simple example of this result we consider the 
beryllium atom and let a, B, y, ô refer to the 25+, 25^, 
1st, and 1s- single-particle states, respectively. In 
Ref. 5 it was shown that the 1s—2s interactions are 
small compared to 15— 1s and 2s—2s interactions and 
so f(kk' ; 1525) is omitted in the denominators. Also the 
excited single-particle states used in Ref. 5 were all in 
the continuum and so the sums in Eq. (9) which do 
not run over both continuum states vanish because, 25 
shown in Ref. 5, Z= (Ro/) dk and continuum states 
have normalization (2/R9)!?. Equation (9) becomes 


F(R’; ab) 


= (eat ea—((a8)ezlola8)+ Z (alol (k'k Yea) 
Xf(E"k"' 5 8) — er — ex) X((RR')oz|v|a8). (10) 


In the denominator of Eq. (10) we have the effective 


0 
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— ((a8)««| v|af) corrects for the inclusion of the Coulomb 
interaction of |a) and |8) in both single-particle energies 
€a and eg. The term Verw(aB|o| (RR Jez) f (E E" ; aß) 
accounts for second and higher order interactions 
among the particles in states |a) and |B). In the calcula- 
tions of Ref. 5 it was found that omission of this term 
would have resulted in approximately a 1075 error in 
the correlation energy among the two 2s electrons 
When there are bound "excited" states in the complete 
set of single-particle states then the energy denomina- 
tors will also include terms from the last two summa- 
tions in the denominator of Eq. (9). These correlation 
terms which we have been considering may be related 
to a type of "rearrangement diagram" considered by 
Brueckner and Goldman.? 
The equation for one-particle excitations f(k; a) also 
has the term E— Egr removed when coupling with the 
three-particle excitation term is considered. We write 


fER';aBy)—f(h;m) 2, JRR BY) 
Kl kl kh Byz 


SE ait 

The result is that we replace (E—Eyr) in Eq. (1.10) by 
E (vl WH.) {HR a) 

T + OE QGubeQEOQfGE e). 02) 


B,yséa,ktt 


I. CONNECTION WITH PERTURBATION THEORY 


The results which we have just obtained may also 
be derived by a consideration of the perturbation 
expansion.4 We begin by examining the factorization 
of unlinked diagrams as shown in Fig. 3(a). The two 
apparently disconnected parts of each diagram are 
assumed to have one or more hole or particle lines in 
common and the two diagrams on the right of Fig. 3(a) 
must not be considered as “unlinked.” They are really 
“inked” as is shown in Fig. 3(b). The first term on the 
right-hand side of Fig. 3(a) is eliminated by the usual 
cancellation of unlinked clusters?^ Similar factoriza- 
tions were considered in Ref. 5 where the two diagrams 
on the right of Fig. 3(a) were called third-class EPV 
diagrams. These diagrams were summed in an approx- 
imate way, although with sufficient accuracy for the 
numerical calculations of that paper. However, these 
two diagrams on the right of Fig. 3(a) may be summed 
exactly by noticing that both possible time orderings 
appear? or by merely writing down the algebraic expres- 
sions and adding.? For example, in the first subtracted 

3 K. A. Brueckner and D. T. Goldman, Phys. Rev. 117, 207 


AEn. Bethe, B. H. Brandow, and A. G. Petschek, Phys. Rev. 


129, 225 (1963). 
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Frc. 4. Factorization of an unlinked diagram. It is assumed that 
all the disconnected parts have at least one hole or particle line 
in common so that there are correction terms (c), (d), (e), and 
(0 to the factorization (b). Diagrams (c) and (d) must be factored 
urther, 


diagram of Fig. 3(a), shown in Fig. 3(b), the product of 
the energy denominators is (B(A+#B)A)-1, where B is 
the denominator of the closed part considered separately 
and A is the denominator of the other part. The second 
subtracted diagram of Fig. 3(a) has the same matrix 
elements and a contribution (4(A4-I-B)A)- from the 
denominators. The sum of these expressions is (478), 
so the two subtracted expressions add to give a result 
which is the first correction term obtained in an 
expansion of the linked part with a shifted denominator. 
In other words, it is the second term obtained in an ex- 
pansion of the denominator of the following expression: 


(eat ae ev t(be | v| (CR Vax) (ey4- Eo €p— erry 


XR" E" e| bc)“ (RR )en| v1 ab). (13) 


The higher order terms are obtained from a considera- 
tion of the factorizations of diagrams of Fig. 4. Such 
factorizations were shown in more detail in Ref. 5. 
Correction terms resulting from the factorization of the 
diagrams in Figs. 4(c) and 4(d) and from the diagrams 
of 4(e) and 4(f) add to give the second correction term 
in an expansion of the denominator of Eq. (13) and 
the first correction term which would result from using 
the appropriately shifted denominator in the correction 
term of the denominator of Eq. (13). The result of such 
factorizations then gives the summation terms in the 
denominator of Eq. (9) or, in a more simple case, it 
produces the shift in energy denominators as shown in 
Eq. (10). 
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The time-dependent behavior of an ide 
equations and includes N-particle “superradiant” 


N two-level atoms interacting strongly with a single-ra 


tions which are integrated numerically for some special 
? conditions. Loss effects are grafted onto the dynamical equations 


ate conditions, the laser is expected to emit one or more 
an initial giant pulse, and that the subsequent emissions may 


power output as well as “Q-spoiling 
phenomenologically. Tt is seen that under appropri 
appreciable subsequent output bursts following 


be emitted by a population which is below inversion. 


I. INTRODUCTION 


HE continuing refinement of experimental tech- 
niques as well as of crystal growing techniques 
allows the mechanism underlying the production of 
coherent bursts of radiation, such as produced in lasers, 
to be probed in more detail. In the past, the analysis 
of the time dependence of a system of atoms coupled to 
a radiation field has most frequently been based on a 
rate equation, or energy balance approach. While 
this approach is satisfactory in many situations, the 
rate equations ignore phase effects between the dipole 
moment of the radiators and the radiation field. As 
will be pointed out below, a more complete analysis 
can lead to interesting effects in lasers under certain 
: conditions; in particular, to the extraction of coherent 
: radiation from a population which is below inversion. 
The model is essentially semiclassical, since the 
expectation values of products of field and atomic 
operators are factored into a product of field and atom 
expectation values. The results of this factorization do 
not differ significantly from the electrodynamic treat- 
ment for these purposes, and it is of some interest to 
note that “superradiant” spontaneous emission, 
including the correct Einstein B coefficient, follows 
naturally from the resulting equations. 
Phonomenological terms are added to the basic set 
of equations, in a way familiar from the field of magnetic 
resonance, to describe the loss of radiation from the 
resonant structure, the “dephasing” of the total dipole 
moment and the effect of pumping radiation. The 
equations are cast into a form amenable to numerical 
solution and a three-level, ruby prototype laser is 
discussed under various conditions such as Q spoiling’ 
and variation of the material parameters. 


* Present address: Physics Department, Universi = 
fornia, Riverside, California. p OU 

J H. Statz and G. A. deMars in Quantum Electronics, edited by 
C. H. Townes (Columbia University Press, New York, 1960). 

2 A. A. Vuylsteke, J. Appl. Phys. 34, 1615 (1963). 
< 3 R. H. Dicke, Phys. Rev. 93, 99 (1954). 

4 R. H. Hellwarth, Advances in Quantum Electronics, edited by 
J. R. Singer (Columbia University Press, New York, 1961). 
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alized laser is analyzed. The analysis avoids the use of the rate 
effects. Perturbation theory is not used; the dynamics of 
diation field mode yields four coupled nonlinear equa- 


cases appropriate to lasers operating under ordinary 


IL MATHEMATICAL FORMULATION 


Consider an N-atom system in which only two levels 
of each atom interact appreciably with the resonant 
modes of the electromagnetic (e.m.) field. In the dipole 
approximation, the Hamiltonian for the system, with 
zero-point energy of the e.m. field subtracted, is taken as 


RED AIR). VENAN 
i A 
+E AK) (@tRi+amR,). (IL-1) 
Mi 
The operators R;’, R47, following Dicke’s notation,’ in 
the space of the jth atom satisfy the commutation rules 


(II.2a) 
(II.2b) 


[ Ry, Ra? ]- ZR; , 
[ Ri, Re” ]- 2R90j; , 
respectively, and the field creation and destruction 


operators a^ and a, of the Ath mode satisfy the com- 
mutation rules 


(II.3a) 
(I1.3b) 


[anant ]  & , 
[a0 ]- 0. 


The electric and magnetic fields are assumed to have 
the normal mode expansions 


Es) - -E Onion) "PEs o (ata) (1.42) 
H(xi)a —i x (2x) PH. (x)hy (ax*—ay). (11.40) 
The A(x) and H,(x) are normalized so that 


(e 2’) [acon (x)d (vol) = NV 5 (II.52) 


hh’) Í H.(x)Hy(x)d(vo)-$». (IL5b) 
The electric dipole moment operator for the jth atom 


is taken as the odd operator 


uop/—u (R iR 3)8(x— x). (1.6) 
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We have defined the coupling constant 


K= Qon /fi) Ui. e, E (x) (11.7) 
and have also defined the zero of energy as midway 
between the two levels of each atom so that Ej— Ej 
-g and F2/-+ Ej-0,"where Ex and Fy are the 
energies of the upper and lower state of the jth atom 
respectively. d 

The operator identity valid for any time-independent 
operator O : 


ihÓ- [0,5c] (11.8) 
is easily worked out and gives the set 
iR,i=— QR 42D Kyayt R4, (IT.9a) 
: | 
ii i— QiR ij—2 Y; Kyo, Ry, (II.9b) 
À 
ik=- Kilat RiR,  (IL9o) 
À 
id, =e,+>, K Ri, (II.9d) 
7 
(IT.9e) 


iüy t = —wat—>y K Ry. 
i 


Normally, expectation values of these operator 
identities are taken over the complete density matrix 
of the system (atoms+field). Taking expectation 
values on the right-hand side of (II.92), (II.9b), and 
(II.9c) gives expectation values of products of atom and 
field operators, e.g., (a,Rs’), etc. The problem is then es- 
sentially intractable, except in perturbation theory, for 
systems more complicated than a single atom interact- 
ing with a single mode. However, if we now assume 
that the density matrix of the system can be factored 
into the direct product 


p— pjyG pa D p8 - pa” (11.10) 


of atom and field density matrices, terms such as 
(aR; factor to the product of expectation values 
(aX R). This is equivalent to phenomenological semi- 
classical theory, or the “S.C.F.A.” (self-consistent field 
approximation) of Willis.’ 1 

Tt is convenient to define the slowly varying complex 
quantities 45, 77 by 


(a) Axe, (II.11a) 
(m) Atem (II.11b) 
(Ry N= rie, (II.116) 
(Re teh (IL11d) 


new variables, we obtain from (II.9), 
n values with respect to the 
(11.10), the set of five coupled 


In terms of these i 
after taking expectatio 
factored density matrix 


5 C. R. Willis, Bull. Am. Phys. Soc. 9, 4, 399 aoe): 


A1455 
equations 
iA, Y, Kiirtee, (11.122) 
7 
ipm 2 s KA pei WR (11.125) 
ilti — =e K3i (Ay fri gi 91 
—Ayrig tot), (IT120) 


as well as their complex conjugates (where (R;/) is 
written simply as R;/). 

We digress somewhat now and consider a special 
case of (IT.12) to point out some of Dicke's results 
using these equations. Suppose for the moment that 
(7-9 and K,7=K), independent of j. That is, all 
atoms are identical, as well as being confimed in a 
volume whose dimensions are small compared to a 
wavelength Ac2zc/0. We can then define 


r=% r (IL13a) 
i 


and 


n= Re, 


i 


(II.13b) 


in which special case we find the first integrals of (11.12). 
> da? 2 const (II.14a) 
` 

and 


(IL.14b) 


(II.14a) is, of course, an expression of the conservation 
of energy. In analogy to Dicke's terminology, we would 
call (II.14b) the conservation of “cooperation.” In a 
discussion of permanent dipole moments (spins) 
(II.14b) would be an expression of the conservation of 
total angular momentum. Here, however, we explicitly 
have in mind induced electric dipole moments. It is 
convenient to represent a radiation process geomet- 
rically in a 3-dimensional space labeling the coordinates 
(1,2,3) following Feynman.^? According to (II.14) we 
can picture this simple radiation process as the motion 
of a vector R on the surface of a sphere in this space, 
(which, incidentally, is real space when considering 
permanent dipoles). The projection of R onto the (3) 
axis is the value of 2, the population difference divided 
by two, and the projection of R onto the (1) axis gives 
Re(r) which is proportional to the expectation value of 
the induced dipole moment. The value of the induced 
dipole moment is zero when all atoms are exactly in their 
upper or lower states 2— 4c (/V/2) and so we see that 
the maximum possible value of |R| —.V/2 (N is the 
total number of radiators), and it can have the (con- 
stant) values (V/2), (N/2)—1, ---0, depending on 
initial conditions of the problem. 


|r]? — const R. 


5 R. P. Feynman, F. L. Vernon, Jr., and R. W. Hellwartlz, 
J. Appl. Phys. 28, 49 (1957). 
? Y, H. Pao, J. Opt. Soc. Am. 52, 871 (1962). 
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To obtain spontaneous emission (using the same 
restrictive assumptions =, K,/=K,), we simply 
perform a power-series expansion in powers of the 
coupling constant Ky 


A) =AMO+ KAP (DH 77, 
rQ=M(NEK yO (D+. 
Inserting these into (11.12) and equating like powers of 


Ky gives I T 
iA D (= Kyr (Octo, (II.15a) 


79(1)=1r9(0)=r(0). (II.15b) 


Axy* (1)= A,(0)=0 for spontaneous emission. Integration 
of (II.9a) and insertion into (II.9c) leads to 


3 T .d e sin (o, —9)//2 
n= —|r(0)| "P X [o -9/2ZP . 


For spontaneous emission from the small volume into 
a large box of volume V, we find, in the usual way? 
(replacing the summation by an integral) 


(IL16) 


ac«— 2|r(0) |?o(Q) Kg? . (11.17) 
Using 
p(Q)=VOr/e and (Ko*)sphere= 272u?/3HV — (11.18) 
gives 


i= —|r(0)|2/r= —(RI—38(0)/7, (11.19) 


which is the desired result. Here 7 is the usual 
Einstein B coefficient 


r= $ (Qtu/he). 


Suppose the system is started off with all atoms 
exactly in the upper state, R?= (V/2)?. Then, according 
to (II.19), nothing happens. This is a point of unstable 
equilibrium in the semiclassical theory. However, if 
R?= (N/2)?, but N — 1 atoms are excited and one atom is 
in its ground state, then n= (N/2)—1, ñ= — (N —1)/r, 
and the system emits “normally,” that is, proportional 
to the number of excited atoms. But if we start off with 
R?— (N/2)? and with half the atoms excited and half in 
their lower states, then 7(0)=0 and ñ= — [ (N/2)}/7]. 
The population then starts to decrease at a rate propor- 
tional to the square of the total number of atoms, and 
it is this situation that Dicke calls “superradiant”’ or 
“coherent” spontaneous emission. The situation is 
analogous to a physical pendulum which “radiates” at 
— — a very slow rate initially when aligned closely along the 
© (3) axis but rather much more rapidly when in the (1-2) 
plane (when the electric field then sees a large dipole 

ment). 
- — We now return to consideration of the more general 
set of Eqs. (IL.12). Interest is in casting these into a 
form describing an idealized laser operating in one 
mode ; the resulting set of 4 nonlinear coupled equation 


8L. I. Schiff, Quantum Mechanics (McGraw-Hill Book Com- 
pany, Inc., New York, 1955). 


F. W. CUMMINGS 
will be cast in a form which will readily yield to numer- 
ical integration. 
Define the new variables 7, Ya so that 
myer = » Kirt eot) 5 
i 


(11.20) 


which is the (complex) dipole moment which couples 
into the Ath mode; also define the e.m. field amplitude 


and phase e, and ¢, of mode à by 
Ax ee». (II.21) 


In terms of the new variables we have from (11.12) 


= — my sin(dr-Yr) ; (11.222) 
eaga 77 — m cos (dy — vA) 5 (II.22b) 
aye ime, 
=F UK Kyien e t eio ERU 
i LEY) Kir (ox), (IL220) 


1 


We now make the reasonable assumption that if we 
take an average of these equations assuming a random 
distribution of atomic sites x’, the modes will decouple 
due to the orthogonality condition (II.5a). That is, 
we now assume that 


(IK Qe /A VK 2)! e: 


£ (Za /h V Jux , (II.22d) 


taking wu, for all j. uy is the component of the 
dipole moment which couples into mode A. 

The second term on the right of (II.22c) would 
vanish if Q2’=Q=a. It represents a dephasing of the 
total dipole moment due to different local environments 
giving rise to a spread in natural frequencies about 2. 
We wish to take this term into account in a way 
familiar from nuclear resonance theory by saying that 
when @=0, its effect is to cause the magnitude m to 
decay exponentially to zero at a rate y,. Then we write 


Ty-- yt, — — 20 en sin(a ya), (II.23) 
Jam = 2Teyn cos($y—yA) 5 (11.24) 
with 4 
1—»,Rj (11.25) 
7 
and 
T= 2rean?/hV . (11.26) 


From (II.12c), (11.20), and (11.21), we have that 
Ti— eyny sinleas) exmy Sin (Gy — Yr’) - (11.27) 
Msi 


The sum X runs over all those modes, except mode ^ 
which are at approximately frequency c» but which 
are appreciably more lossy than the mode designated 
by A. For instance, this term represents the losses © 
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radiation from the sides of a lasing crystal due to 
spontaneous emission. The usually small stimulated 
emission into these modes which are characterized by 
high loss (free-space-like modes) will be neglected. 

If we add a phenomenological mode loss rate term to 
(11.222), and collect equations, we have finally the set 


(d/di) (e^) + 28e! = — me sind, , 
(d/di)my--yynmy = — 2T'eyn sinb, 


(II.28a) 

(11.28b) 

(d/di)n — eam sinha tE exit, siny , (IT.28c) 
n 


y (d/ diy — 21 en cosh, , (II.28d) 
ex (d/dt)dy — — m cosh, (11.28e) 
0,7 d. — YA. (11.29) 


From (11.282), (I1.28d), and (II.28e) we note the first 
integral 


exam cosh — (eymy cos6,)ge- “Fat! (I1.30) 


which states that if 7, e, or cos, were ever zero, then 
cos#,=0 for all time for a nontrivial solution to exist. 
Otherwise, cosh will approach zero at a rate Batya- 
Cos6,—0 implies sinó,—-E1. It must be noticed, 
however, that sinô, as a function of time takes on the 
two values +1 or —1, since by inspection of (II.28a), 
(II.28c) the sign of sin@, determines whether the atomic 
system is emitting or absorbing radiation. 

If in (II.28b) we assume that 7%<Aym, then we 
may neglect f, solve for 7m, and insert its value into 
(11.282) and (I1.28c). This gives, by use of (11.30) with 
cosh — 0, 

(d/dt)e?+26,e°= (20 /y)ern, — (IL31a) 


(d/dt)n=— (20 /n)en— (n+N/2)/7. (11.31b) 


The term (»4-N/2)/7 has been used as an approxima- 
tion for the last term of (II.28c) ; »4- (N/2) is simply 
the population of the excited state and 7 is the 
spontaneous rate. The two coupled equations constitute 
the well-known rate equation approach to radiation 
dynamics.! The absorption rate (per atom) for mode A 
is given by a. — T/2yy— rox? / Voy. 

At the other extreme from the rate equation approxi- 
mation (II.31), the completely “coherent” description 
of radiation into a single mode is afforded by dropping 
all terms in (IL.19) which describe losses, that is by 
setting y= b=} em sind — 0. In this case, by 
defining the angle variable © in the m/(2T)'”, n plane 
as @=tan—[(2T"2)m/n] we find the equation of 
motion for © 

(d?0/d£)4- (2T)sinO-— 0, (11.32) 
which is simply the equation of motion of a spherical 
pendulum. The initial conditions are determined from 
d@/dt= — (2T)e,. The solutions are elliptic functions 
and become sinusoids when 4 (0) gets large (Z 10). 
The case e,(0)— 0 appropriate to spontaneous emission 


ELEME V eo oc 


TA T K : 


leads to the same general results as in Eq. (JL19) — 
concerning the rate of spontaneouse mission as à  _ 
function of N. Keeping only the mode loss rate in — 
(11.28) results in the equation =a 


((O/dP)--B,(dO/di)--2P smO=0, (11.33) 


easily recognized as the equation for a damped physical 
pendulum. 

The “perfectly coherent" process is one which 
describes a circle in the n, 9m/ (21)? plane. The solu- 
tions to the rate equation, on the other hand, are 
restricted to one quadrant in this plane. Since mT eyn 
^», n=0 implies that 75,—0, and thus the only crossing 
point of the axis n=0 is the improbable point 220, — 
147 0, in the rate equation approximation. 3 

In order to facilitate nurnerical integration 
set of equations, we define the new variables 


I-—eo2N , 

m-my/N(r)y", 

ç= — me, sind/(2N)'? , 

A=n/N, 

a= (NT). 

The equations (11.28) then take the form 

Í+ =at, 

dm?/dt+ 2e m*-— aA, 

d¢/dt-+ (83-y)£ — a(m?-- 21A) , 
A=—af—[(4+4)/7]—P(A—4), 


term oy emy sinf as (A4-3)/r, and added 
to the equation for the rate of change of the po 
difference A (normalized to 2N) to describe the 
of a “pump” which takes atoms from the gro 
and puts them into the upper state. We 
assuming an infinitely fast transition from the pum 
band to the upper laser level, as well as an optic 
thin material so that the pumping is uniform throug 
the volume V. We can allow P (the pumping ra 
well as 8, the cavity mode loss rate, to be functi 
time; the numerical solutions below, howeve 
include the case of a time varying B. 


IH. RESULTS OF NUMERICAL INTEGRA 


The set of Eqs. (11.35) were integrated 
speed digital computer for several values of a (cor 
pump rate P, and the coupling parameter a, and 
dipole moment dephasing rate y. The fun 
taken to be a “Fermi function” of time 


B()- [(6:—8)/ 0-2 -«t7 9 9 4-8, 


and will represent approximately i 
by a Kerr cell or similar app 


CC-0. In Public Domain. Gurukul Kangri Collection, Hari 


aS E TES 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


RRB UCEY AND 


1(PHOTORS. M) 
E 
s 


1x15 2x15 3x15 
(0 ——e 


(a) 


CUMMINGS 


p. W- 


axi 
Txi04 
lew 
a 
= 
E sxw* 
Z 
i | 
= 
z 
gax 
= 
: | 
4 


1x ic^ 


[] -E ee - 
a 1x14 3 5 7 9 u 3 15 7x 104 


om COStU (DIPOLE MOMENT) ———e- 


(b) 


Fic. 1. Laser behavior with constant pump and constant cavity loss rate under conditions which would be well approximated by 


the rate equation approach. Parameter values are 8— 10", y2103, P=103, a=10 and 73—3X10*, all in sec. t=0 is chosen from 


computed data near the start of significant behavior. (a) Intensity dependence on time. (b) Dipole moment versus population inversion. 


time somewhere just before the populations A reach a 
threshold determined by f;. 

Figure 1 shows the "relaxation oscillations" for a 
laser operating under conditions of ordinary power 
output.” Both the intensity 7 as a function of time as 
well as the corresponding phase diagram of (m cosy) 
versus A is shown for each case. The dots in the phase 
diagrams correspond in time to the peaks in the 


[] 1x15 2 3 4 sx ` 9x10 


16EC —9 


(a) 


intensity diagram. In Fig. 1(b), the entire phase 
diagram is seen to lie in the first quadrant and the 
solution is very well predicted by the rate equations, 
since the rate y is much greater than the fractional 
rate of change of m. The intensity 7 is shown in units 
of photons/ V, where N is the effective number of atoms 
participating. By changing the cavity loss rate to a 
larger value we see in Fig. 2, as expected, that the 


^ 

* 

L] 
P 


A (POPULATION INVERSION) ——e- 
X 
z 
ù 


Q rxz, 5x? 
TOM COS (DIPOLE MOMENT) ——— 


(b) 
Fic. 2. Same as Fig. 1, except for higher cavity loss rate, B= 109 sec"). 


9 For all the following figures, the quantity (em cos&i)s was taken as zero. [See Eq. (II.30).] Moreover, for convenience in 


graphing, 


the same discontinuous properties are attributed to y and ¢ as are exhibited by 6=¢—y. That is, sing and siny as 


functions of time are made to alternate between the values +1 and —1 in such a manner that sing (ż) takes on the correct values an 


the functions 7: cosy and e sing are continuous. 
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Fic. 3. Same as Fig. 1, except for a larger pump P= 10 sec, a smaller dephasing rate y=10" sec, and the addit 
with cavity loss rate initially 8; — 10? sec“! changing to a final cavity loss rate of By = 10" sec in a time Al=2.5. 


spiking is almost continuous; clearly the radiation taken as 3X10~% sec in all computations. 
almost completely escapes from the cavity before the shows a case of “Q switching” (Eq. (I. 
pump can again bring the population over threshold, parameters are chosen to exhibit a rather e 
A,=By/2c?, and the process simply repeats itself. (The ation (in present day practice) of very low 
parameter 7, the spontaneous emission time has been The second, third, etc., spikes in Fig. 


Tic. 4. Same as Fig. 3, 
except that final cavity 
loss rate 8;—10* sec! 
and Q spoiling occurs 
over time Af=5.10 sec. 
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to occur when the populations are below inversion, 
A<0. This is strictly a possibility inherent in this 
analysis and not in a rate equation description. Figure 4 
again exhibits the same phenomena; the second spike 
occurs totally when A «0. 

In both Figs. 3(b) and 4(b) the initial and final 
threshold values A,(?9 and A, are shown. Notice the 
difference in scale between Figs. 1, 2 and 3, 4. Figs. 1(b) 
and 2(b) would appear as diffuse dots near the A axis in 
Figs. 3(b) and 4(b). A case has also been run under 
“normal” circumstances (no Q spoiling) with a low 
threshold (A,7:0.001) in which the output J versus / 
appeared as a series of almost random spikes. Ap- 
parently, a great variety of solutions is possible here as 
compared to the rate equation approach. 

These results are intended to serve as illustrative of 
some possibilities in coherent device dynamics. The 
case of a “two-level” device akin to a ruby laser has 
guided the discussion. However, one expects that the 
Jaser line in ruby is homogeneously broadened, and at 
room temperatures the parameter y would be ~ 10!! cps, 
the full time width of the R, transition. If this is the 
case then the rate equations would be expected to as 
adequately describe the dynamics as the full set, 

(11.35) and secondary pulses as described above would 
not be possible unless one operated as a much lower 

temperature, <70°K. However, if the laser line is 
inhomogeneously broadened even at room temperature, 


F. W. CUMMINGS 

(which appears unlikely in ruby), then such “hopping” 
might occur, since y would be expected to be a good bit 
smaller. The above set of equations should approxi- 
mately describe the dynamics of such a system where y 
would be determined by the cavity parameter 8 and 
the effective number of atoms participating in the 
radiation process into a single mode would be roughly 
(Neff = (Aw-/Aw)N, where Awe is the cavity linewidth 
(presuming this is what governs the linewidth in this 
case) and Aw is the full laser atomic linewidth. 

It may be pertinent to point out that one cannot 
expect to see such “superradiant” hopping effects by 
«Q spoiling” a gas laser such as the helium-neon, 4-level 
laser of Javan.? The point is simply that due to the 
very rapid emptying of the terminal laser state into a 
metastable state compared to the time width of a giant 
pulse, one can never achieve a condition of A<0 with 
any appreciable population in the lower laser state; 
that is, these equations do not describe such a device 
even over the time interval of a giant pulse. 
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Collisional Electron Detachment from 20-MeV D? and D- Ions* 


KrAvs H. BERKNER, SELIG N. KAPLAN, AND ROBERT V. PYLE 
Lawrence Radiation Laboratory, University of California, Berkeley, California 
(Received 27 January 1964) 


Cross sections for collisional electron detachment from 20-MeV deuterium atoms and negative ions have 
been measured in Hz, He, Ns, and Ar. The cross sections as; per target molecule for the ionization of neutral 
atoms are 2.2, 1.6, 20, and 36, respectively, in units of 107!* cm?, Results for single-electron detachment from 
D^ ions, o-10, are 7.5, 4.7, 58, and 75, respectively. All values have an uncertainty of -- 1047, chiefly from 
absolute-pressure calibration uncertainties. The experimental values for atoms in Hz and He are in agree- 
ment with calculations that use the Born and free-collision approximations. The measured cross sections for 
single-electron detachment from negative ions in Hz and He are about one-half of the calculated values. 
The two-electron detachment cross sections, c-11, are approximately 4% of the single-detachment cross 


sections, c. 16, in all gases. 


I. INTRODUCTION 


HE cross sections for electron loss from fast hydro- 
gen atoms in hydrogen and helium targets have 
been calculated by using three models: the semiclassical 
model of Bohr,! the *free-collision" (impulse) approxi- 
mation, and the first Born approximation. All of 
these models predict an £^ energy dependence at 
high energies (i.e., where Ze/7:««1). For a hydrogen 
target, this condition implies energies above a few 
hundred kilovolts. In this asymptotic region, the Born 
and free-collision approximations give numerical values 
that agree with each other and with experiment? to 
within 1597. 

For single-electron detachment from fast H- ions, 
the evaluation of the cross section by the Born approxi- 
mation has been considered too cumbersome to carry 
out with the best available H- wave function, and the 
result is so sensitive to the choice of other approximate 
wave functions that the magnitude of the cross section 
is very uncertain. The free-collision approximation, on 
the other hand, permits a simpler calculation; since it 
has given good results in the case of electron detach- 
ment from H? it might be expected to be fairly reliable 
in the H- case. It has been used to calculate H™ cross 
sections in H and He? and in Nz and Os? 

We know of only three H- experiments at energies 
that are sufficiently high to test the free-collision ap- 
proximation. Rose e! al. have made gas-cell measure- 


* This work done under the auspices of the U. S. Atomic 
Energy Commission. 3 
! N. Bohr, The Penetration of Atomic Particles Through Matter 
(I Kommission Hos E. Munksgaard, København, 1948). : 
1I. S. Dmitriev and V. S. Nikolaev, Zh. Eksperim. i Teor. Fiz. 
44, 660 (1963) [English transl: Soviet Phys.—JETP 17, 447 
1963) ]. 
l > m Bates, and A. Williams, Proc. Phys. Soc. (London) 
A70, 306 (1957). 
1D. R. Bates and G. W. Griffing, Proc. Phys. Soc. (London) 
A68, 90 (1955). x ees 
5C. EE and H. K. Reynolds, Phys. Rev. 109, 355 (1958). 
*'M. R. McDowell and G. Peach, Proc. Phys. Soc. (London) 
74, 463 (1959). 2 
7B. T C Arch. Math. Naturvidenskab. 53, 9 (1957). 
An asymptotic expression for ¢_10 in Ar is also given; it should 
be applicable only at energies considerably higher than those 
di: d here. 
“TP. H. Roc R. J. Connor, and R. P. Bastide, Bull. Am. Phys. 
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ments at energies up to 1.8 MeV in H;, Os, Ar, and 
CO». Fremlin and Spiers? and Verba et al. have r 
ported measurements in air at higher energies (but 
under less well-known conditions) by observing ti 

radial attenuation of H- internal cyclotron beams. Ir 
general, the experimental cross sections are well below | 
the calculated values. : 

In the present work we have measured the 
sections for electron detachment from hydrogen ato 
co, and from negative hydrogen ions, e- and o- 
in a single high-energy experiment under wi 
trolled conditions. 

We note that the magnitudes of these cross sectio 
and the way they extrapolate to higher energi 
of interest in certain high-energy-accelerator di 
considerations.” 


II. EXPERIMENTAL ARRANGEMENT  — 


'The experimental arrangement is shown sc 
cally in Fig. 1. A beam of 20-MeV D- ions" fro 
Berkeley heavy-ion linear accelerator was bent 1 
to remove any possible contaminants. For the m 
ments of eo, the neutral beam was obtained b 
electron stripping in the first gas cell; all ions en 
from the gas cell were swept out by magnet S 


magnet S1 was set at 0 kG. E 
The collision chamber was a differentially pui 
gas cell, consisting of a high-pressure target 
sandwiched between two intermediate-pressure i 
bers backed by a 1500-liter/sec oil diffusion pun 
chambers were connected by 6-mm-diam 5- 


Soc. 3, 40 (1958). Numerical values taken from Allison 
Estimated errors are from a private communication 


Rose. E 
Do H. Fremlin and V. M. Spiers, Proc. Phys. Soc. (I 


linear accelerator does not operate stably at. 
necessary for hydrogen acceleration. 
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Fic. 1. The experimental arrangement. 


tubes (inset Fig. 1). With this cell a base pressure of 
5X (10-5 Torr could be maintained in the drift sections 
when the pressure in the target chamber was 5X107 
Torr. The effective length of the target chamber was 
L=24+1 cm. Integrated target densities ranged from 
NL=0.5 to 30X105 atoms/cm? for Ar and Nz, and 
NL=2 to 200X10! atoms/cm? for He and H». The 
pressure in the target chamber was monitored by a 
Westinghouse type-7676 high-pressure ionization gauge 
(Schulz-Phelps gauge) which was cross calibrated with 
three liquid-nitrogen-trapped McLeod gauges. Fluctua- 
tions of the calibration from day to day and uncer- 
tainty of the absolute accuracy of the McLeod gauges 
indicate an uncertainty of +10% in the pressure. 

The ions in the beam emerging from the gas target 
were deflected by magnet S2 (3 kG) and steered into 
the detector CC by magnet A(~8kG), the neutrals 
were detected by NC. The detectors NC and CC each 
consisted of a. 5-cm-diam plastic scintillator and pho- 
tomultipler. By means of collimating slits it was 
determined that the ions reaching the detector were 
well localized in a 5-mm-diam beam spot, which was 
small compared to the size of the detector. If two 
species of ions were to be detected, magnet A was 
used to steer first one and then the other into the 
detector CC, while the neutral counter was used as a 
monitor. 


II. RESULTS AND ANALYSIS 


Cross sections were calculated at a number of pres- 
sures for each target gas according to the following 
formulas. (We neglect cross sections for electron cap- 
ture by the fast ions; these are at least three orders of 
magnitude smaller than the electron loss values con- 
sidered here.) 

Tf m is the number of hydrogen atoms at a depth v 
in the target, co; is the cross section for electron de- 
tachment, II— NL is the target thickness (atoms/cm?), 
and %4 is the number of positive ions produced in the 
thickness r, then we have 


^ dno/ dz — —119091— —dn4/dz , (1) 
12S, K. Allison, Rev. Mod. Phys. 30, 1137 (1958). 


^ 
^ 
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with the solution 


1 
ca In| 1+ 
I 


Similarly, the disappearance rate for negative hydrogen 
ions due to loss of one electron (s—10) and two electrons 
(11) in a single collision is given by 


i4 (II) 
| (2) 


no(I) 


dn_/dx=—n_(o-10+o-11) ; (3) 
with the solution 
1 n, (11)-+ 70 (11) 
0—101-0—1 In| 1+ : ] . (4) 
IH 1. (II) 
Using the additional relation 
dno/dz = .0—39— 119001; (5) 
we obtain 
ng (II) 901— (o_19+o~-11) 
10> | ] O 
n_(TI) 1— expli (c 10d-0—11— 001) 


Equations (4) and (6) were solved for c—1o and ou, 
and Eq. (2) was solved for co. Uncertainties in the 
pressure, cell length, background correction, and count- 
ing statistics were propagated through the above equa- 
tions to obtain the uncertainties in the cross sections. 

At sufficiently low pressures, the above equations 
and the uncertainties in the results can be approxi- 
mated by relatively simple expressions that demon- 
strate the correct limiting behavior. 

The co, calculations quoted in the text have been 
carried out for fast hydrogen atoms in the ground state. 
If an appreciable fraction of the atoms of our beam were 
in excited states, and if the variation of co; with quan- 
tum level, z, were sufficiently large, then the experi- 
mental values could be considerably greater than the 
ground-state cross sections. This situation would show 
up as a decrease in the measured cross sections with 
increasing gas-cell pressure, closely approaching the 
ground-state value at the higher pressures. Unfortu- 
nately, we did not have time to take oo: over a large 
pressure range (the maximum was a factor of 10 in 
Hs), but no variation of the kind mentioned above 
occurred in any of the gases. 

It is known: that perhaps 15% of the neutral 
atoms produced by charge exchange or Hz+ breakup 
are excited to 223; the fraction in each state is 
f=(2—3)n. (The n=2 states completely decay in 
our geometry.) Preliminary measurements by us indi- 


8A. C. Riviere and D. R. Sweetman, in Comptes Rendus de la 
Vie Conférence Internationale sur les Phénomènes d'Ionisation dans 
les Gaz (Paris 1963), Vol. 1, 105. d 

4 K. H. Berkner, J. R. Hiskes, S. N. Kaplan, G. A. Paulikas, 
and R. V. Pyle, in Proceedings of the Third International Con- 
ference on the Physics of Electronic and Atomic Collisions, Un- 
versity College, London, 1963 (North-Holland Publishing Com- 
pany, Amsterdam, in press). 
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cate that the excited-state populations are an order of 
magnitude smaller when the atoms are produced by 
electron detachment from negative hydrogen ions, as 
in this experiment. There is also evidence that the 
detachment cross section varies more slowly than n. 
We therefore conclude that the measured cross section 
is appropriate to the ground state. This question does 
not arise for the negative-hydrogen-detachment cross 
sections. 

Results for nitrogen are shown in Fig. 2 in order to 
illustrate the degree of internal consistency. Results for 
other targets are of similar quality. 

The weighted averages of the various cross-section 
measurements are given in Table I. To aid in the dis- 
cussion, they are also plotted in Fig. 3 together with 
lower energy data from other experiments and theo- 
retical estimates. In this figure, and in the following 
discussion, the ions are treated as if they were 10-MeV 
hydrogen rather than 20-MeV deuterium particles. 


IV. DISCUSSION AND CONCLUSIONS 
A. Electron Detachment from Neutral Atoms, ooi 


Our measured cross sections for electron detachment 
from neutral hydrogen in Hs and He are in excellent 
agreement with the theoretical predictions of the free- 
collision? and Born approximations? and are consistent 
with an E-! extrapolation of the lower energy results 
of Barnett and Reynolds® (Fig. 3). For the heavier 
targets, Ne and Ar, such cross-section calculations have 
not been reported. A linear extrapolation to the results 
of Bamett and Reynolds indicates that certainly in 
Ar, and probably also in Ne, the cross section falls off 
less rapidly than the E~! predicted for and observed 
in the lighter targets. The semiclassical theory of 
Bohr, which also predicts the E^! energy dependence 
in light elements (although it considerably overesti- 
mates the magnitudes of the cross sections), predicts a 
weaker energy dependence, approximately proportional 
to E- for medium-Z elements. For reference, lines 
corresponding to E-!, £~/*, and E~! variations are 
drawn through our go: point in Ar, but these are not 


'TABLE I. Measured cross sections per target molecule 
for 20-MeV deuterium (in units of 107 cm?). 


oil gor 

ou oi Hoo o_10 5 9-1 7 A TER: 

+10% +10% +10% +25% g-1i6 C19 

He 2.2 7.8 7.5 0.29 0.038 0.31 
He 1.6 4.9 4.7 0.14 0.029 0.34 
Na 20 61 58 2.7 0.046 0.31 
Ar 36 18 15 25 0.033 0.48 


E uero eei VE ERI ES 


15 A. C. Riviere and D. R. Sweetman, in Proceedings of the Third 
International Conference on the Physics of Electronic and Atomic 
Collisions, University College, London, 1963 (North-Holland 
Publishing Company, Amsterdam, to be published). 
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Cross section per molecule (cm?) 


Pressure (105 torr) 


Fic. 2. Cross-section measurements in Ns versus pressure. 


intended to suggest that the asymptotic energy region 
has been reached. 


B. Single-Electron Detachment from Negative 
Hydrogen Ions, g- 


The results for single-electron loss from negative 
hydrogen ions, oi, seem to show the same energy 
dependence as the oo: cross sections at high energies — 
(Fig. 3). The Born-approximation calculation of Me- E 
Dowell and Peach? for a hydrogen target predicts a 
cross section that is 24 times our experimental value. - 
Moreover, in the 1-10-MeV energy range their calcu- 
lated cross section has not yet reached the asymptotic 
dependence and thus varies much more slowly than. 
Ei, (The line marked M-P in Fig. 3 is their asymptotic 
solution.) As mentioned in the introduction, this calcu- 
lation is extremely sensitive to the choice of the ap- 
proximation to the H- wave function. Because of dde 22 
they state that their results should not be in error by —— 
more than a factor of 5 at high energies. A 

The free-collision approximation, which requires only 
a knowledge of the ionization potential, also 
results that are higher than the experimental values 
for H»? He,? and N».* a 


C. The Two-Electron Detachment 
Cross Sections, o_11 


The cross section for the loss of both electrons fro 
an H- ion, o-u, is approximately 4% of the | 
electron detachment cross section, a-m at 10 
in all of the target gases. This proportion is simi 
that observed by Fogel ef al. at energies of 40 
less.8 We know of no theoretical predictions fo 
c, cross sections; the problem is currently v 
consideration by McDowell.” : 


16 Ia. M. Fogel, V. A. Ankudinov, and R. E. Slabo: 
Zh. Pepe i Teor. Fiz. 32, 453 (1957) [English 
Soviet Phys.—JETP 5, 382 (1957) 1. : : 

1: M. R. McDowell, Royal Holloway College, England ( 
communication). EM 
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Fic. 3. Energy dependence of the hydrogen cross sections. 6:010; M:co; A:o_1. The “10-MeV” points are the 20-MeV deuterium 


3 data of this paper, and the others are from Allison's review article (Ref. 12), which includes measurements by Barnett and Reynolds 
(B-R), Fogel e£ al. (F), Hasted and Stedeford (H-S), Rose eż al. (R), Stier and Barnett (S-B), and Whittier Ww). The solid lines are 
internal cyclotron-beam measurements by Fremlin and Spiers (F-S) (Ref. 9) and Verba et al. (V) (Ref. 10). The broken lines give Born- 

i approximation calculations by Bates and Williams (B-W) (Ref. 3), Bates and Griffing (B-G) (Ref. 4), and McDowell and Peach (M-P) 

E (Ref. 6), and free-collision-approximation calculations by Dmitriev and Nikolaev (D-N) (Ref. 2) and Wright (Wr) (Ref. 7). 

s A v 

E D. Summary ACKNOWLEDGMENTS 


= "The measured oo; cross sections are in good agree- 

‘ment with theory for H» and He target gases; calcula- 

tions are not available in higher Z materials. The o_1 

cross sections are not in good agreement with theory. 

_ As yet, no calculations of o_,; have been reported. 

- «Measurements at considerably higher energies are 
ecessary before the asymptotic nature of many of the 


oss sections can be established. 
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The Investigation of the hysteresis occurring in the threshold conditions for the helical instability oscilla- 


tions in electron-hole 


plasma has been extended. The hysteresis in electric field strength Æ can exceed 45 


V/cm and 50% of the applied E at threshold. By determining the stability-instability boundary in the p-type 


InSb 


as a function of the parallel (or antiparallel) electric and magnetic field strengths and also the plasma 


density, the magnetic field induced by the formation of the helical density perturbation is deduced. Resulting 
displaced loop B-H curves are presented, The magnitude of the induction enhancement Byya can be as large 
as 165 Oe and 55% of the applied B at threshold. The extent of applied magnetic field (or electric field 
strength) over which the loops can occur is limited at the high magnetic field end (typically <600 G) by 
vanishingly small plasma density and at the other end (2:280 G) by the occurrence of magnetic pinching, 
The loops are largest at the low magnetic field end. The range of loop energy, defined as the product of Enys 
and Bhys, increases with decreasing plasma cross-sectional area until saturation occurs at à plasma radius of 
™~3X 107? cm. The full range from largest to smallest loop in any one sample is achieved by a relatively small 
variation in the “input energies,” i.e., the product of the magnetic and electric field strengths at threshold. 


I. INTRODUCTION 


T is well known that the stability of a plasma may be 
enhanced by the application of a longitudinal mag- 
netic field.! Early stability and confinement experi- 
ments also showed the existence of a definite limit to 
the improvement so obtainable. Indeed, too large a 
magnetic field causes the onset of oscillations in the 
plasma and its rapid radial diffusion to the container 
boundaries. The helical instability theory first pro- 
posed by Kadomtsev and Nedospasov? accounts for 
these occurrences and describes the boundary between 
stability and instability in terms of the applied, parallel 
electric and magnetic field strengths. Johnson and 
Jerde* supplied a firm mathematical base for the 
theory and improved its predictions. 

Almost concurrently with these electron-ion plasma 
experiments,! Ivanov and Ryvkin* reported the occur- 
rence of current oscillations in germanium immersed in 
a longitudinal magnetic field. Larrabee and Steele® 
made a detailed study of this phenomenon and named 
it the oscillistor. They demonstrated that the oscilla- 
tions occur only if excess carriers, ie. an electron-hole 
plasma, is present in the semiconductor. Glicksman" 
recognized the applicability of Kadomtsev and Nedos- 
pasov’s theory to this phenomenon and adapted it to 
electron-hole plasmas in an insulator. Holter’ used 


ited Nations, Geneva, 1958), p. 146. — 

omga XU ER and A. V. Nedospasov, J. Nucl. Energy 1, 
230 (1960); see F. C. 
a review of experiments and 
S E E D. A. Jerde, Phys. Fluids 2 988 aysa. 

4T. L. Ivanov and S. M. Ryvkin, Zh. Techn. n E (1958) 
[English transl: Soviet Phys.— Tech. Phys. 3, 722 d 25 d on 

SR. D. Larrabee and M. C. Steele, J- Appl. DENS. l, x 
(1960); R. D. Larrabee, J. Appl. Phys. 2 sro ue ; see also, 
J. Bok and R. Veilex, Compt. Rend; P o 58). 

EE p 129, 2548 (1963); for more details see 
Boeing Scientific etch Labs Document. DE 
ber 1962 (unpublished), or Arbok Univ. 1 Bergen, Mat- E 
Ser. 8, 3 (1963). 
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Johnson and Jerde's? approach to derive again explicit 
expressions for the threshold frequency and electric 
field strength as functions of magnetic field strength, 
some parameters of the plasma and its container (the 
semiconductor), this time applicable also to both ex- 
trinsic and intrinsic semiconductors as well as to in- 
sulators. His comparison with Ancker-Johnson's* meas- 
urements of the threshold conditions showed good 
agreement. Other data’ on the threshold conditions 
which do not include the densities of the plasmas, a 
parameter of the theory, are compatible with theory. 

Recently, hysteresis in the conditions necessary for 
the production of the helical instability oscillations in 
electron-hole plasmas was reported along with the 
suggestion of its practical application to memory 
devices.!! This hysteresis differs from the familiar type 
possessed, for example, by ferrites, in that it exhibits 
two loops which are displaced from the origin of the 
B-H curve, as illustrated by Fig. 1(a). A plasma is 


AT 


Fic. 1. Hysteresis loop diagrams for helical instability oscil- 
lations: (a) B-H curves; (b) B-E curves. The dot-dash line in 
each first quadrant shows another possible loop shape. , 
8B. Ancker-Johnson, in Proceedings of the International Con- 
ference on Physics of Semiconductors, Exeter (The Institute of — 
‘Physics and the Physical Society, London, 1962), p. 131, 
3(a) T. Misawa and T. Yamada, Japan. J. Appl. Phys. 2,19 
(1963); (b) T. Misawa, Japan. J. Appl. Phys. 1, 67 and 131 (1962), 0 
X F, Okamoto, T. Koike, and S. Tosima, J. Phys. Soe, Japan 
17, 804 (1962). oe 
1! B. Ancker-Johnson, Appl. Phys. Letters 3, 104 (1963). 
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TABLE I. Sample properties and hysteresis data. Sce Fig. 3 for dimensions. 
iti Byys deduced from 
Initial Hole NA. 
Sample hole density mobility i : Eyal V/em) TERNOS ae E 
No. (cm) (cm?/V-sec) — Eu (V/cm) Bu(G) £ Enys ary undary 
eee HRS 9.5 282 0.53" 35 65 20 
E E ieee 865 302.5 0.248 26 65 18 
75 317 onsa 17 62 13 
69 337 0.085 13 48 9 
63.5 355 0.045 10 48 8 
56.5 385 0.02; 9 59 1 
51.5 417 0.012 i ee 4 
2D2-7 5.0x 104 S3 $9.5 299 0.122° 48. . 
UM b "à 325 0.07¢ 34 144 68 
66 365 0.055 29 114 47 
63 376 0.045 21 83 36 
60.5 390 0.045 14.5 54 20 
52 421 0.023 10 38 11 
212-8 4 MH 0 164 345 0.087 40 93 31 
= oS e 159 370 0.075 42 91 26 
143 394 0.05; 32 88 18 
127 430 0.032 24 80 26 
118 460 0.03; 19 64 21 
110 485 0.022 14.5 51 15 
103 510 0.015 9.5 34 6 
2D2-2 3.7X104 7000 85.5 390 0.113 22.5 69 37 
79 410 0.100 18 59 28 
73 430 0.08; 15 55.5 24 
66 450 0.072 13 59.5 ee 
60 471 0.06; 11.5 70.5 21 
57 490 0.05; 10.5 77 tee 
53 510 0.045 9 82 30 
50 530 0.04; 8 87 30 
48 550 0.035 7 80 28 
46 570 0.033 6 70 24 
44.5 590 0.035 5 53 20 
42.5 610 0.02; 3 36 12 


* Obtained by extrapolation of the J-V characteristic after the onset of current pinching. 


normally diamagnetic but at the onset of the helical 
instability it becomes paramagnetic as experimentally 
demonstrated in an electron-ion plasma by Johnson.” 
This paramagnetism occurs because the formation of 
the helical density perturbation produces an enhanced 
induction. The hysteresis follows from the fact that as 
the applied magnetic field is reduced, the threshold in- 
duction is not reached until the applied field has a 
smaller magnitude than it had at the onset of instability. 
Drummond" first deduced that Johnson's results im- 
plied the existence of such a new hysteresis effect. 
The hysteresis observed" in an electron-hole plasma 
is an hysteresis in the parameter which defines the 
boundary between a stable and unstable plasma, namely 
the magnitude of the produce of the electric and magne- 
tic fields, as illustrated in Fig. 1(b). The present paper 
extends these hysteresis measurements to the determina- 
tion of the induction enhancement caused by the helical 
instability and, hence, B-H curves of the type illustrated 
in Fig. 1(a) are presented. 

?R. R. Johnson, in Proceedings of the Sixth International 
Conference on Ionization Phenomena in Gases (Paris, 1963), Vol. 
Pas Drummond, Boeing Scientific Research Laboratories 
Progress Review, First Six Months 1963 (unpublished), p. 95. 
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II. EXPERIMENT 


The circuit is the same as that employed previously." 
It consists simply of a triangular waveform delivered to 
a single crystal parallelepiped of p-type InSb at 77°K in 
the presence of a magnetic field which is applied essen- 
tially parallel to the direction of current flow. The 
plasma is produced by injection from indium solder 
contacts of various cross-sectional area. The electric 
field strength is determined by measuring the potential 
difference between two very small contacts spaced a 
known distance apart. The series of measurements re- 
ported here involves four samples of similar bulk 
properties but different dimensions and contact ar- 
rangements as recorded in Table I. The waveform of the 
2.5 V/Ig. div. 


Ehys = 34.2 Wem 


Fic. 2. An oscillogram show- 
ing hysteresis in the threshol 
electric field strength while 
the plasma is immersed in 4 
constant magnetic field in- 
tensity. 


15 0,5 usec / lg. div. 


1 B. Ancker-Johnson, R. W. Cohen, and M. Glicksman, Phys- 
Rev. 124, 1745 (1961). 
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Fic, 3. Dimensions of samples. (a) 2D-2, (b) 2D2-7, (c) 2D2-8, (d) 2D2-2. 


helical instability oscillations is markedly affected by 
the degree of parallelism (as also noted previously) be- 
tween the applied magnetic field and the direction of 
current flow, particularly in the cases of nonsymmetric 
current contacts (samples 2D2-7 and 2D2-8). By careful 
adjustment it was always possible to obtain essentially 
sinusoidal oscillations as illustrated by Fig. 3(d) of 
Ref. 11.15 

The use of a triangular input waveform enables the 
simultaneous determination of the following pertinent 
parameters: the electric field strength threshold £i 
at a given magnetic field strength Bi, and the difference 
in electric field strength at the onset and cessation of 
oscillations Enys, as Fig. 2 illustrates. A separate meas- 
urement of the current-voltage characteristics (see 
Figs. 1 and 4 of Ref. 8 and Fig. 5 of Ref. 14, for ex- 
amples) relates En to the total current and, by subtrac- 
tion of the extrapolated Ohmic current, to the plasma 
current. Hence the boundary between stability and 
instability may be delineated by plotting Bu as a 
function of Em with the ratio of the injected plasma 
density to the initial density of carriers (or background 
plasma density) £ as a parameter. (See Part IV for 
method of calculating £. 

Two examples of boundary diagrams are shown in 
Figs. 4 and 5. The correpsonding £ parameters for each 
measured point on the boundary are listed in Table I. 
As the electric field strength is increased with the 
plasma immersed in a constant magnetic field, oscilla- 
tions commence at the conditions corresponding to the 
circles. A line connecting these points defines the 
boundary, a line which has no characteristic locus since 
the third controlling parameter £ is arbitrarily fixed 
by the injection properties of the particular contacts. 
In practice, however, each boundary is of such a shape 
that the product of the threshold electric and magnetic 
fields is very approximately a constant. The conditions 
marked by the triangles on Figs. 4 and 5 indicate the 
observed hysteresis in electric field strength, i.e., the 
lower than Ey values at which oscillations are observed 
to cease. . 

A square-wave input was also employed whose rise 
and fall times (<13 nsec) are less than the period of 
these helical instability oscillations. With a circuit 
identical to that in which the triangular-waveform 
input is used, except for the absence of the capacitance 


iati i ; he 

15 Small deviations (e.g., ~1°) from the alignment between t 
direction of current flow and the magnetic field yielding the ms 
nearly sinusoidal oscillations do not noticeably affect Em and Ebys- 
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to ground which makes the square-wave triangular, 
oscillations do not persist after cutoff of the square 
wave from an electric field strength greater than Em 
to one less than that at which oscillations cease when a 
slowly falling Æ is applied, ie, E<(En— Eny) 
Thus, Enya cannot be attributed to a slowly damping 
oscillation. 

En is constant to within +5% when the magnitude 
of the peak value of the applied field strength is varied 
from En to 2Em. The magnitude of the field strength 
at which the oscillations cease is constant to within 
+3% for the same excursion into the supercritical 
region and, in fact, deviations can only be observed 
when the maximum field strengths are close to Eg. 
Variations in the time-rate-of-change of the applied 
field strength by as much as a factor of 4 also do not 
affect Em and Enys beyond the above-stated limits. 

By completing the loops as drawn in Figs. 4 and 5 
the enhancement in the magnetic induction caused by 
the formation of the helical instability may be deduced. 
The influence of the parameter £ in deducing this 
enhancement is discussed in Sec. IV. Figure 4 records 
more typical results than Fig. 5 which shows the 
occurrence of some very large induction enhancements. 

The larger magnitudes of the hysteresis electric field 
strengths, Enys and larger induction increases, Buys, 
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Fic. 5. A B-E threshold diagram exhibiting very large 
hysteresis loops. 


and, therefore, the larger hysteresis loops, accompany 
the smaller applied magnetic fields as Figs. 4 and 5 
and Table I show. The Enys decreases steadily with 


i increasing applied magnetic field but the magnitude 
3 of Buys (Table I) depends on the shape of the instability 
boundary. If the boundary is steep at the high B end, 
: ie, |dB/dE| is large, the Bi, values do not steadily 


decrease with increasing applied magnetic field, as in 

the case of samples 2D-2 and 2D2-2 whose |dB/dE| 

at the high B end of the instability boundary are 6.5 

é and 7.0, respectively, compared with 3.8 and 4.2 for 

£ 2D2-8 and 2D2-7. Independent of |dB/dE|, however, 

is the fact that the hysteresis magnetic fields are smallest 
at the highest applied magnetic fields. 

The extent of the B-E boundary over which hystere- 
sis loops can occur is limited by two factors: At high 
electric field strengths and, therefore, high currents, 
the magnetic (Bennett) pinch effect®:4 prevails which 
inhibits the helical instability. Oscillations may still 
occur when the pinch is well formed but these pinch- 
unpinch® and magnetothermal!* oscillations are very 
different and easily distinguished from helical instability 
oscillations.” As indicated by the wavy line in Fig. 5, 
pinching begins in this sample at ~73 V/cm so the 
high electric field strength extreme of this boundary is 
somewhat affected by weak pinching. A detailed com- 
parison with theory (see next section) shows that the 
onset of pinching tends to make the measured threshold 
_ deviate from the actual boundary toward higher Eu. At 

— the other extreme of the hysteresis range, namely at the 


——3€B. Ancker-Johnson, Phys. Rev. Letters 9, 485 (1962); 
J. E. Drummond and B. Ancker-Johnson, Bull. Am. Phys. Soc. 
8, 471 (1963); 9, 318 (1964). 

EC 11 M, C. Steele and T. Hattori [J. Phys. Soc. Japan 17, 1661 
E (1962) ] offer an explanation of some very small amplitude 
— — oscillations observed during pinching by M. Glicksman and R. A. 
= Powlus [Phys. Rev. 121, 1659 (1961)] in impact-ionization 
mas in #-type InSb which is related to the generation of sound 
es. Impact ionization is not occurring in the present experi- 
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low electric field strengths, the plasma density becomes 
vanishingly small along with the amplitude of oscilla- 
tions and the hysteresis effect. T he ratio of plasma-to- 
background-density £ at the lowest electric feld 
strengths for which hysteresis can be readily observed 
range typically from 1 to 3% (cf. Table I). 

Measuring directly the magnitudes of the magnetic 
fields produced by the helical paths of plasma current is 
difficult, principally because of the relatively small 
sample sizes and large currents. Inadequate electro- 
static shielding plus the unavailability of a sufficiently 
high current source which could be appropriately 
modulated prevents the use of pickup coils, a highly 
satisfactory method for such magnetic field measure- 
ments in electron-ion plasmas.’ Hall probes, however, 


HALL PROBE 
(389) 


(b) 


JB 1G. 6. (a) Experimental arrangement used to measure Buys 
with a Hall probe. The plasma-injecting contact at the probe 
end is made with a gold strip. The spacing between this strip an 
the probe is 0.27 mm. (b) Oscilloscope traces: D shows the helica 
instability oscillations and their cutofi while the current in the 
Hall probe C remains on. The lower oscillogram has a faster 
sweep time and shows both the beginning and end of the current 
pulse in the plasma FE. Traces A and B correspond to the voltages 
on the Hall probe side contacts. 
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Frc. 7. Oscilloscope diagrams illustrating some of the startling properties of the oscillatory behavior. 


do yield a direct estimate when the arrangement shown 
in Fig. 6 is employed. A small parallelepiped of n-type 
InSb is located orthogonally to a plasma-containing 
sample, so that the former senses the applied longi- 
tudinal magnetic field as well as the helix field, but not 
azimuthal fields produced by current flow. 

The Hall probe circuit is supplied with resistances 
such that the transmission line is everywhere matched 
and the voltage contacts A and B are effectively isolated 
from the current path. The outputs of A and B are fed 
into a storage oscilloscope (Tektronics No. 564) 
equipped with a sampling dual-trace plug-in unit (No. 
3876). These voltages, at a time selected with the manual 
scan of a sampling sweep plug-in unit (No. 3177) and 
indicated by the arrows at the bottom of Fig. 6(b), are 
taken from the output jacks of the dual-trace unit 
through two bucking voltages supplies and a pair of 
dc amplifiers. The difference between these signals is 
then read on a voltmeter which is directly calibrated 
in gauss by varying the applied longitudinal magnetic 
field. The bucking voltages are adjusted to maximize 
the sensitivity: No signal is detected when the applied 
magnetic field is on but no oscillations are present 
[right-hand arrow in Fig. 6(b)], so that only the field 
added by the helix is measured when the manual scan 
is located at the left-hand arrow. Although the sensi- 
tivity of this system is ~1 G, drifting of the zero makes 
the measurements tedious. The results are summarized 
in Table II. 

Certain observed aspects of the hysteresis effect are 
quite startling. Some are illustrated by the oscillograms 
of Fig. 7. The first, 7(a), shows how rapidly the ampli- 
tude of oscillations can grow to surprisingly large 
amplitudes in an otherwise perfectly typical hysteresis 
oscillogram. Oscillations can apparently cease as the 
electric field strength decreases to some magnitude and 
then with further decrease commence again, 7 (b).'5 Un- 


18 A possibly related and also as yet not understood effect was 
noted in Ref. 8: As the magnetic field strength was increased 
above threshold while the total current was held constant, the 


fe) 
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usually large amplitude oscillations occasionally c i 
suddenly with decreasing electric field strength, pro- 
ducing a discontinuity in the average E, 7(c). A sudd 
switch from moderate to larger amplitude oscillatio 
can also occur during decreasing £, 7(d). No hystei 
or a “reverse hysteresis,” an anomaly in itself, can | 

followed by a sudden discontinuity in the measur 
at some E well below (Ex,—Enys), 7 (e). Rare but rep: 
ducible is the oscillatory pattern displayed in 7(f). 1 
input waveform was monitored before each of tl 

oscillograms was made and found to be smooth as: 
Detailed observations regarding threshold freque 
(i.e., frequency at the instability boundary) cannot 
made from this type of data since the frequency: 
function of electric field strength in the supercri 
region.2:3-—10 : 
II. THEORY 


The helical instability*2-"is a growing helical dei 
perturbation superimposed on an unperturbed, s 
state plasma density and electric potential in a plz 
column immersed in parallel (or antiparallel) el 
and magnetic fields. The perturbed density anc 
potential are assumed to have the following forn 
respectively, 


n= nis f(r,2) expLitz-- imB-- iet ], 
U= Us g(r) exp(ikz-- im4- iet ], 


Taste II. Hall probe measurements on sample 2D2- 


Bulk properties Bu(G) I4) 


po=3.5X 104 cm? 340 1.9 
Ls = 8000 cm?/V-sec 320 23 


€ 


45 = 


d 
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where f and g are small quantities compared with 
nio the steady-state density and Uo, the steady-state 
potential, respectively. The quantity & is the wave 
number along the z axis, » is the wave nunber in the 
azimutual direction (m=1 for helix) and w is the 
frequency of the perturbation. 

The plasma is assumed to be collision dominated. 
Tt is described by the continuity equations 


(9n4,/90) EV - (Qro ni) ) = vna; (3) 
and the equations of motion 


Div na & (us/c) (ngt na) Ys x B 
py (mont nag) E— (noH nig) V 
— (8/3) (nox -na) vel. (4) 


The last term in (4) is negligible because the collision 
frequencies v4 are so large. The subscripts + refer to 
holes and electrons, respectively; y is the recombination 
and generation coefficient; 70 is the density of initial 
carriers, D is the diffusion constant, » the mobility and 
v the velocity. An essentially neutral injected plasma 
|nan | Klai] with equal electron and hole tem- 
peratures is assumed. Even at relatively elevated electric 
field strengths (~200 V/cm) equal temperatures seems 
to be the most reasonable assumption.” 

A dispersion relation for the frequency is derived 
from Eqs. (1) and (2) and the boundary between 
stability and instability occurs when Im(w)=0. Using 
the additional condition?" that the derivatives of Im 
(w) with respect to k must equal 0 at the onset of in- 
stability, the electric field strength (also the wavelength 
and frequency of oscillation) is expressed in terms of 
the threshold magnetic field strength. 

Holter? has taken into account that the flow of 
current causes plasma heating above the semiconductor 
lattice temperature, here Vo= 6.6X 10? eV. Hence the 
mobilities are expressed as 


I. — uos (Vo/V)*, (S) 


where uo, are the hole and electron mobilities, re- 
spectively, at Vo which is less than the plasma tempera- 
ture V. For InSb in the temperature range of interest 
a~0.5. Then dimensionless threshold magnetic and 
electric field parameters may be expressed, respectively, 
as 


y ye(Vo/ V) (6) 
and 
6— &o(y/yo)- (7) 
With the use of these definitions 
: JYo= Ho-Ho;+ Bin’, (8) 
o= EnR/Vo. (9) 


R is the radius of the plasma. Equation (7) is obtained 


by eliminating the temperature V when Eqs. (6) and 


2 M, Glicksman and W. A. Hicinbothem, Jr., Phys. Rev. 


S x 129, 1572 (1963). 
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Fic. 8. B-E threshold diagrams for plasmas of constant den- 
sities in the same sample as used to obtain the data in Fig, 4. 
Simplified hysteresis loops corresponding to constant £ values 
are shown. The experimentally determined threshold boundary 
(nonconstant £) is also indicated. The £ values for two experi- 
mental points are identified for an illustration (see text). 


(9) are combined. In the limit of small threshold mag- 
netic fields, i.e., y«&1, which is the case in these exper- 
iments, 6 has an asymptotic solution which may be 
expressed as 

(10) 


Ko is dependent on the injected-plasma-to-background- 
density ratio £ but not on magnetic field, hence, it 
may be evaluated for a particular y««1, e.g., yı= 10 7. 
Then the corresponding values of 6; are known as a 
function of £ and the plasma temperature is found by 
equating Eqs. (7) and (6) and using (9) and (10): 


gE 0 GST 
y- : 
yı 6i 

A nonlinear theory treating the hysteresis effect 
itself in electron-ion plasmas has been worked out by 


Holter and Johnson. Such a theory applicable to 
electron-hole plasmas is as yet lacking. 


&— Koy-!*. 


IV. DISCUSSION 


The parameter £ is estimated from the readily 
obtainable Ohmic and total current magnitudes, 
respectively, 


Io-—epwoEA , (12) 
Tn=Lol 1+ ((uo—/ o£) - 1)n/ fo], (13) 

as 
md reri (14) 


o Io((uo-/uos)--1). 
when A is the cross-sectional area of a semiconductor 


21 O. Holter and R. R. Johnson, Boeing Document D1-82-0256, 
December 1963 (unpublished). 


(11) 
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containing an intial density of holes pọ and a plasma 
of density n. The experimental £ values are listed in 
Table I. (The radius of the plasma assumed for the 
calculations is that of the inscribed circle within each 
semiconductor cross section.) The calculated tempera- 
tures of the plasmas at the instability threshold ranged 
from 5.9X10~? to 1.1% 10-2 eV while the lattice re- 
mains at the bath temperature. 

The influences of £ on the enhancement in B caused 
by the presence of an helical current path are considered 
in the following: As E increases so does the plasma 
density which is injected by the current contacts, and 
as E exceeds Em more and more of the plasma current 
goes into a helical path producing an enhanced B. When 
E decreases, the total B= BappticatBhetix remains 
greater than By, even though the plasma current J, is 
decreasing. Thus, a cessation of oscillation occurs at 
an /,XI, at threshold and, therefore, at a £ value 
« £u. This fact and its influence on the magnitude of 
the deduced Bhys is illustrated by Figs. 8 and 9. The 
lines labeled with numbers represent the threshold con- 
ditions for constant £ values. The experimental thresh- 
old points are represented by circles and the triangles 
refer to conditions at cessation of the oscillations, 
consistent with the symbols in Figs. 4 and 5. At both 
threshold and cessation the amplitude of the oscillations 
is vanishingly small (cf. Fig. 2) and, hence, the perturba- 
tion theory? is applicable. 

Hysteresis loops are drawn in Figs. 8 and 9 corres- 
ponding to constant £-value boundaries. The resulting 
Buys are smaller than those deduced for nonconstant £ 
boundaries, Table I. In general, the magnitude of 7; 
at some operating point in the supercritical region 
determines the B produced by the helix. If £ were made 
independent of E and constant, e.g., by supplying the 
plasma optically when noninjecting contacts carry the 
current, the smaller set of Bays magnitudes would 
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Fic. 9. B-E threshold diagrams for plasmas of constant density 
in the same sample as used to obtain the data in Fig. 5. Simplified 
hysteresis loops corresponding to & values are shown. Theexperi- 
mentally determined threshold boundary (nonconstant £) is also 
indicated. 
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Fic. 10. Magnetic field is plotted as a function of plasma current 
using: (1-4) the experimentally deduced Brys values (nonconstant 
£); (5-8) By, values derived from constant £ conditions; and 
calculated B values produced by solenoids of equivalent turns 
ratio. The two points with error brackets represent the results of 
Hall probe measurements. 


result. Conversely, the steeper the J-V characteristic 
and, therefore, the more rapid the increase of plasma 
density with Æ, the steeper the B-E boundary and the 
larger resulting Bays- Sample 2D2-7 compared with 
2D2-8 is an example of the latter case (Figs. 5 and 4). 

Reference to a particular set of operating condi- 
tions (see Figs. 4 and 8) clarifies these statements; A 
threshold for the helical instability in sample 2D2-8 
is at B=370 G, E=159 V/cm and £— 0.08. The plasma 
becomes quiescent again at E= Eq,— E£y47 117 V/cm 
and: (a) for a constant plasma density and therefore 
£—0.08 at B= But Brya= 3704-26— 396 G; (b) for a 
plasma density dictated by the properties of the current 
injecting contacts £—0.03 at B=370+91=461 G. 

The magnitude of the fields produced by the helices 
are plotted in Fig. 10 as a function of the plasma current 
at the threshold. Curves 1 through 4 refer to the experi- 
mentally determined threshold boundaries and 5 to 8 
to boundaries of constant plasma density. The shape of 
each of these curves is controlled, as stated earlier in 
connection with Figs. 4 and 5, by the arbitrary proper- 
ties of the injection contacts. The two points with error 
brackets represent the results of the Hall probe meas- 
urements. These two points are plotted at the threshold 
plasma current, consistent with the rest of the data 
points, although the measurements were made at higher 
currents, Table II. Also graphed in this figure are the 
magnetic fields that solenoids would produce which 
possess the same radius and number of turns per cm 
as the helical instability in the four samples. The turns 
ratios of the equivalent solenoids are derived from 
Holter’s calculation of the wavelength of the helix at 
threshold (Fig. 1, Ref. 7). The dimensionless wave- 
length \/R is nearly constant for all the present meas- 


R (Table I) has a range of less than a factor of 2, hence 


` 


E 


urements; it varies only between 3.2 and 3.6. The radius — 
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Fic. 11. The first quadrant of deduced B-H curves. The larger 
loops correspond to Fig. 4 and the smaller (hatched) to Fig. 8. 


the fields produced by the equivalent solenoids are 
essentially determined by the magnitude of the current 
they carry. The equivalent solenoid results in Fig. 10 
are obtained with the assumption that all the plasma 
current goes into the helix at threshold which, of 
course, is not compatible with the perturbation theory. 

Figure 10 indicates that the equivalent solenoids pro- 

duce far less B than their counterparts in the plasma at 
low I» even though it was assumed that they carry all 
the plasma current. The fields deduced from the bound- 
ary measurements, however, result from excursions 
into the supercritical region, hence curves 1 to 8 should 
be shifted toward higher currents for comparison with 
the solenoid curves. The fields sensed by the Hall probes, 
when plotted against the J, they carried during the 
measurements (Table II), agree well with the fields 
produced by equivalent solenoids conducting the same 
current. The agreement may be so good fortuitously 
since the À of the helix in the supercritical region may 
not be constant in the range of these threshold magnetic 
fields. There is experimental evidence to suggest it is 
not constant??? but a nonlinear theory for electron- 
hole plasmas, as stated earlier, is not available. 

The Hall probe measurements were made on a sample 
with radius somwhat larger than 2D2-7. At 7,—0.5 A 
the Bhys deduced from the experimentally determined 
threshold boundary is ~35 G whereas that deduced 
from the constant-plasma-density boundary is ~10 G. 
Since the Hall-probe value of ~26 G is likely to be too 
small because the measurements were made a finite 
distance from the end of the plasma helix [Fig. 6(a) ], 


2 The only published wavelength measurements known to the 
?author are in Ref. 9. These results yield a A/R 4 to 5 times Holter's 


- calculated values. This makes the equivalent solenoid fields even 
smaller. 
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agreement with the experimentally determined By, 
value is indicated. 

The magnitude of Buys is very large compared with 
the part of it associated with the diamagnetism normally 
exhibited by a plasma. Recently Moore and Kessler? 
have measured the magnetic moment of an optically in- 
jected plasma in Ge. For a plasma density equal to 
1015 cm-? and an essentially infinite surface recombina- 
tion velocity, a condition corresponding to the highest 
magnetic moment, they find a diamagnetic moment 
density of ~2X10~ cgs units at 1 kG. The plasmas 
utilized in the present experiments can be expected to 
exhibit an even smaller moment since the plasma den- 
sities are 1 to 23 orders of magnitude smaller and the 
surface recombination velocity is finite. Hence essenti- 
ally all of the deduced Byy, is associated with the para- 
magnetism produced by the helical instability. The 
first quadrant of the resulting B-H curves correspond- 
ing to Figs. 4 and 8 (the more typical set of hysteresis 
loops) is shown in Fig. 11. The larger loops correspond 
to the experimentally determined threshold boundary 
and the smaller loops (hatched) to the constant £ 
boundary. The full B-H diagrams for sample 2D2-7, 
which exhibited the largest loops, are shown in Fig. 12. 
'The loops are again drawn with straight sides for 
simplicity but their shapes cannot be ascertained by the 
present experiments. 

The sign of the enhanced induction associated with 
the helical instability is positive as stated in the intro- 
duction, according to Johnson's measurements in 
electron-ion plasmas!? and as verified by the described 
Hall probe measurements. This switch from diamagne- 
tism to paramagnetism is completely compatible with 


IEEE 
100 200 


Fic. 12. Deduced B-H diagrams. The larger loops correspond 
to Fig. 5 and the smaller (hatched) to Fig. 9. 


23 A. R. Moore and J. O. Kessler, Phys. Rev. 132, 1494 (1963): 


The author is grateful to Dr. Moore for discussing their results $ 


prior to publication. 
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Fic. 13. The hys- 
teresis loop energy 
as a function of the 
input energy at 
threshold for the ex- 92 
perimentally deter- 
mined boundaries, 
i.e., £ nonconstant. 
The solid points rep- 
resent hysteresis en- 
ergy in a weakly 
pinched plasma. The 0.05 
insert shows the 
range of hysteresis 
loop energy as a 
function of plasma 
radius. 0.02 
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the helical instability theory® which states that the 
the helical instability theory which states that the 
sense of the helix is positive (negative) with respect to 
the applied magnetic field when the applied Æ is 
parallel (antiparallel), and is independent of the drift 
direction of the helical density perturbation. Enhance- 
ment of the magnetic field results and, therefore, 
paramagnetism. That paramagnetism results is also 
evident from simple physical reasoning: Since the 
application of a B, large compared to Bu, causes a 
shift in the equilibrium position of the plasma toward 
its container walls, the magnetic pressure inside the 
helix must be greater than that outside, thus the mag- 
netic field produced by the helix must add to the 
applied B.*4 

The effect of the plasma radius on the hysteresis is 
illustrated by Fig. 13 in which a measurement of the 
“hysteresis loop energy” is plotted against the “input 
energy" required to achieve oscillations for the four 
different plasma radii. The input energy necessary to 
produce hysteresis is quite constant as expected from 
both theory and the experimental results shown in 
Figs. 4 and 5. The theoretical curves of Ey as a function 
of Bua? have a slope of —0.5, hence for any one value 
of ¢ the product Ex X Bi is a constant as illustrated by 
the hyperbolic boundaries for constant £ values drawn 
in Figs. 8 and 9. Since £ varies in the experiments 
(Table I), this product is only approximately constant. 
Of considerable practical interest is the fact that very 


24 The author is grateful to H. P. Furth for suggesting this line 
of reasoning. 
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little change in input energy is necessary for very large 
changes in loop energy. The greatest range in loop 
energy, defined as the ratio of the loop energy at the 
lowest (EX B) at which observable hysteresis occurs to 
the loop energy at the highest (X B) at which noise- 
free oscillations occur, is produced by the smallest 
diameter plasma, if pinching is avoided. The insert in 
Fig. 13 indicates that the range saturates with decreas- 
ing plasma radius at ~3 10-? cm. If weak pinching is 
allowed, there appears to be an optimum intermediate 
plasma radius for obtaining the greatest hysteresis 
range. 

The startling oscillatory behavior illustrated by 
Figs. 7(b)-(f) may be related to a complicated relation- 
ship between the direction of current flow and the 
applied magnetic field direction, since such behavior has. 
not been observed in the two samples investigated 
which possess symmetrically located plasma injectors, 
namely samples 2D-2 and 2D2-2. 


V. CONCLUSION 


The hysteresis in electric field strength E accompany- — 
ing the onset and cessation of the helical instability can 
exceed 45 V/cm and 50% of the applied E at threshold. 


The attendant hysteresis in magnetic field intensity B, — : 
deduced from measurements of the boundary conditions 


between a quiescent and a rotating plasma which agree 


well with theory, can exceed 160 G and 55% of the — 


applied B at threshold. Hall probe measurements of 
the B produced by the helical path of plasma current 
tend to verify the deduced Bhys values both in magni- 


tude and sign (paramagnetic). The resulting, novel - 


B-H curves consist of loops which are displaced from 


the origin. An infinite number of hysteresis loops can 


(Eu X Bin). The extent in E (or B) over which loops 


r 


occur is limited at the high E (low B) end by the onset 


of current pinching and at the low E (high B) end by 
vanishing plasma current and, hence, vanishing hystere 
sis. The existence of the largest loops at low magnetic 
fields is in agreement with Holter and Johnson's non- 
linear theory?! of the helical instability in electron-ion — 
plasmas. E 
These loops are easily and reproducibly obtained in 
p-type InSb at 77°K. E 
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b ; ; ; : a 
Stability of Longitudinal Oscillations in a Uniform Magnetized 
Plasma with Anisotropic Velocity Distribution 
LAURENCE S. HALL AND WARREN HECKROTTE 
Lawrence Radiation Laboralory, University of California, Livermore, California 
(Received 13 January 1964) E 
We consider the longitudinal oscillations of a uniform magnetized plasma in which the particles are | 
allowed to have anisotropic distributions in velocity space, corresponding to the two temperatures Tı and | 
Ty perpendicular and parallel to the magnetic field. It is shown that no instabilities can exist unless Ti» Tu L. 
for at least one of the species of plasma particles. An additional theorem, of use in surveying for possible 
unstable roots to the dispersion equation when T1» Ti, is also proved. 
! | ‘HE stability of longitudinal oscillations of a the magnetic field, so that m4 n$-1, we have 
uniform plasma with anisotropic velocity dis- 
vm 


tribution in the presence of a uniform magnetic field 
has been studied by a number of authors? in various 
approximations. We consider here a multicomponent 
nonrelativistic plasma in which the zeroth-order dis- 
tribution function may be written as a bi-Maxwellian, 
viz., 


foj* Qr«T/m;)! QT yu/m;)-!? 
mo? 


2xT yj 


DEUM 
xex - = | > (y 

2xTu; 
and show that no instability exists unless T; Tu; for 
at least one species. (Here || and L, respectively, denote 
directions parallel and perpendicular to the magnetic 
field Bo; j denotes the species of particle; and m, T, 
and x are, respectively, mass, temperature, and Boltz- 
mann’s constant.) An additional theorem of use in 
locating possible instabilities when T1;>Tuj is also 
proved. 

Assuming that all perturbations vary in space and 
time according to the form exp[i(k-r—w1) ], where k 
is real and w complex, and Im(w)>0, one can follow step 
by step the prescription of Bernstein‘ in obtaining the 
dispersion relation for longitudinal oscillations. After a 
straightforward but laborious computation, one may 


write 
12-225 op OPF;=0, (2) 


where w j= (40Z7en,/m;)\?=plasma frequency, Q; 
=|Z,e|Bo/mjc=cyclotron frequency, and the sum is 
over particle species. If we write wj=w/Q;; y= k(2kTu;/ 
m,Q;?)!2=k- (radius of gyration) ; t} = Tn;/Tı;; and take 
nı and m, respectively, to be the direction cosines of 
the propagation vector parallel and perpendicular to 


*This work performed under the auspices of the U. S. Atomic 
Energy Commission. 

1E. G. Harris, J. Nucl. Energy C2, 138 (1961), and Oak Ridge 
National Laboratory Report ORNL-2728, 1959 (unpublished). 

?'Y. Ozawa, I. Kaji, and M. Kito, J. Nucl. Energy C4, 271 
(1962), and references cited therein. 
i 2 ae and W. Heckrotte, Phys. Rev. 131, 2129 (1963) ; 
"EL. Bernstein, Phys. Rev, 109, 10 (1958), Appendix II. 
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r= f dx exp(iex —1y; LP nit? — pen? (1— cosa)) 
0 S nk ee 

Xin x+n? sing]. (3) | 
We now drop the subscript j as understood, and con- 


sider the integral in (3) noting that it may be rewritten 
after an integration by parts of the first term to read 
F= +F at (0 — £9)n Fs, (4) 


where 


Pamio | dx exp(iex— ux? —A (1— cosa)) , | 
0 
: © 
ra= | dx exp(iex— ux?—A (1 — cosx)) sing, 
0 


| 
and we have set u=} Pm? and Ao $y'nj. All of the | 
parameters are real except w which is complex and which, 
for an instability, has positive imaginary part. We 
propose to show that, when Im(w)>0, 


(o. £ 


Re(w)-Im(F4)2>0, Re(w) Im(Fg)z0. 
Since both Im(F4) and Im(/z) are antisymmetric in 
Re(w), it is only necessary to prove (6) for w in the first ^— w 


quadrant of its complex plane. We then use the fact 
that if f(») is an analytic function of w in the region of 
consideration, the map of any closed contour C in the 
c plane onto the f plane encircles all values of f at — | 
tained by letting w assume values within C. We choose | 
C as in Fig. 1, and note that F4 and Fg both vanish on 
the infinite quarter circle. Further, for w on the imag! 
nary axis both Im(F4) and Im(Fg) vanish. Thus, We 
need only show that Im(F4) and Im(Fs) are sign- 
definite for w real and positive in order to show they 
are sign-definite for w anywhere in the first quadrant. 
First, consider Im(F4) and use the identity 


e^ cosz — P» Tie", 


12—20 


where the J;(X) are Bessel functions of the first kind 


— EM TL 
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STABILITY OF 


W- PLANE 


Im(w) 


RADIUS — 0 


Fic. 1. Contour for 
the examination of 
various integrals. 


and imaginary argument, obtaining for w >0 


Im(F4)-e(r/Ag)? Y» eI) 
l 


Xexp{—(w+)?/4u}>0. (8) 


Similarly, with the use of (7) together with the funda- 
mental relation Z;3(X) — Za3(4) 22II1(N)/^, one can 
also write that for œ> 0 


Im (Fs) 2X71(s/p)!/? Z eM) 


i-i 
Xexp( — (w?+P)/4p}1 sinh(le/2u) 0. (9) 


According to our previous discussion, (8) and (9) 
establish (6). Now consider the dispersion relation (2) 
which can be satisfied only if either (a) Im(£) vanishes 
for all species or (b) Re(w)-Im(F)<0 for at least one 
species.? Condition (a) can be satisfied in general only 
when Re(w)=0, which, if unstable, corresponds to 
purely growing waves. However, as pointed out by 
Kammash and Heckrotte? Re(F) is positive definite in 
this limit and no purely growing modes can exist. Hence, 
an instability can only occur if Re(c)-Im(F)«0 for 
at least one species. From (4) and (6), then, we have the 
result that no instability exists unless ?— Tu/T, «1 
for at least one species in the plasma. 

It is also of interest to be able to determine whether 
or not roots to the dispersion relation occur when «1, 
and if they exist, where they lie. Tn this connection, the 
labor involved in exploring the w plane can often be re- 
duced by utilization of the theorem, which we refer to as 
(10): 

5 We consider that the plasma has at least two species of 
particles. If one species only is principally responsible for the 
motion, we always consider here that the small but vanishing 
contribution from the neglected particles is in the stabilizing sense, 
as will almost always correspond to the physical situation. This 
means that any instability arising as a limiting case of (b) must 
be such that Im(F) — 0- for the active species. 


LONGITUDINAL 


OSCILLATIONS A1475 
If Re(w)-ImLF (w)] is non-negative for w real 

and lying in the interval lc l4- 1, where Lis an 

integer, it is also non-negative in the entire strip 

LS Rew) € I-- 1, Im(s)2 0. (10) 


Hence, if one can show that Re(w)-Im(P)>0 for some 
integral range of w on the real axis, one also can eliminate 
the possibility of unstable roots appearing anywhere in 
the strip above that range. : 

In order to prove (10), we rewrite (3) in the form? 


P=npFetnyks , (11) 


where 


Fo= | adx exp(iex—ux?—A(1—cosx)), (12) 
9 


and investigate F along the contours Re(w)=/. We have 
already proved Re(o)- Im F5)» 0, and so we need only 
prove /-Im(Fc)>0 when Re(w)=1 to establish (10). 
Setting w=/-+ic and again using the expansion (7), 
we can rewrite (12) after a little manipulation to read 


n 


Im (Ec) = x. €? La) = Iga(X)] 


k=l 


x J xdxe-*7-"" sinky. (13) 
0 


Consider the integral K(w’)= Jy xdx exp(io'z— ua), 
Im(o’)>0, whose imaginary part is antisymmetric in 
Rew’). Evaluating K along the contour C in Fig. 1, 
again we find Im(K)=0 along the imaginary axis 
and the infinite quarter circle. Along the real axis 
k-Im[K (k) |=47' ku exp( —£/44) 20. Hence k 
times the integral in (13), which is just Im[K(A+ia) ], 
is non-negative. Moreover, the Bessel function (Aj) 


is a monotonically decreasing function of the absolute - 


value of its order, so that [7:_,(A)—Je.1(A) ]2:0. Hence 
Re(w)-Im(F¢)>0 and (10) is proved. 

In conclusion, we have shown that in a uniform 
magnetized plasma whose particles are distributed 
anisotropically in velocity space in zero order according 
to the bi-Maxwellian form (1), no unstable longitudinal 
oscillations exist unless T.» Ti for at least one species. 


In addition, instabilities can exist when T, 7, only 


for complex frequencies lying in the strip Im(w)>9, 
1<Re(w)</+1, where / is an integer, and where 
c Im(F) <0 for some real w lying in the range | <w «/-- 1. 


6 We may note in passing that since F may be written in either 
of the forms (4) and (11), the inequalities (6) show that there can 
be no instabilities whenever either m,=0 or 5-0. This is an 
extension of the result noted by Harris (Ref. 1) for the special 


case Ty =0. 
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Lattice Vibrational Spectra of Beryllium, Magnesium, and Zinc 
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The lattice vibrational spectra of Be, Mg, 
'The force constants used in the calculations were 
relations by slow neutron-scattering techniques. C 
agreement with measured values. 


I. INTRODUCTION 


N recent years the scattering of slow neutrons in 
solids has proved valuable as a tool for investigating 
the dispersion relations of lattice waves in crystals. In 
particular, measurements have been made on beryllium, 
magnesium, and zinc, all having the hexagonal close- 
packed lattice structure. In the first case (Be), Sch- 
munk e! al.! have determined the force constants fora 
central force model with interactions out to fifth nearest 
neighbors. For Mg, Collins? determined the force 
constants for both a tensor force model and a central 
force model with interactions out to fourth nearest 
neighbors. A similar experiment by Borgonovi et al.* has 
given the force constants of zinc for the tensor force 
model used for Mg.? Using these experimentally deter- 


and Zn have been obtained using the root sampling method. 
those determined from measurements of the dispersion 
alculations of the specific heat of each metal show good 


vibrational spectra and specific heat for each of the 
above metals. 
II. RESULTS 


The vibrational spectrum of a crystal p(w) is defined 
such that p(w)dw is the fraction of frequencies w be- 
tween w and w+dw. The method used here to approxi- 
mate p(w) is to sample the frequencies w(K) for a large 
number of values of the wave vector k in the first 
Brillouin zone of the crystal. The frequencies obtained 
must be weighted according to their degeneracy in the 
zone, and a histogram constructed of the number of 
frequencies falling in a given frequency interval. 

In the Born-von Kármán‘ theory of lattice dynamics 
the frequencies w are determined by the following 
secular equation: 


mined force constants, we have computed the lattice |D(k)- Me?I| —0, (1) 
giu ae =o T pg n 
35 
30 
25 
A 
F 
A us . H . 
E Fic. 1. Lattice vibrational 
H spectrum of beryllium for 
= a sampling of 10 261 points 
$5 in the reduced Brillouin 
2 zone. 
a 
10 
5 
o 
i 2 3 4 5 6 7 8 9 10 T B z 


w (IN UNITS 10'S sEc-!) 


1R. E. Schmunk, R. M. Brugger, P. D. Randolj 
2M. F. Collins, Proc. Phys. ou (London) 8 So doen) ee aie eek 28) S82 (1002) 
RC Borgonovi G. Caglioti, and J. J. Antal, Phys. Rev. 132, 683 (1963). 
M. Born and K. Huang, Dynamical Theory of Crystal Lattices (Oxford University Press, London, 1954). 
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Fic. 2. Lattice vibrational 
spectrum of beryllium for 
a sampling of 2413 points 
in the reduced Brillouin 
zone. 


pU) CIN UNITS 10714 SEC ) 
5 a 


9 
œ 


0.4 


where M is the atomic mass and 7 the unit matrix. The 
matrix D(k) is 6X6 for hexagonal close-packed crystals 
and depends upon the force constants of the lattice. 
Equation (1) has been solved for equally spaced points 
in that region which is one twenty-fourth of the first 
Brillouin zone, and irreducible under the symmetry 
operations that leave the roots of the secular equation 
invariant. The points are then weighted as to whether or 
not they are on the faces, edges, etc., of this reduced 
zone. For beryllium, the calculation was performed for 
10 261 equally spaced points (k values) in the reduced 


Fic. 3. Lattice vibra- 
tional spectrum of mag- 
nesium using the central 
force model. 


pw) CIN UNITS 10714 sEC ) 


nd A, "* E A 


60 80 
w (IN UNITS 10 sec-!) 


zone, and the spectrum shown in Fig. 1. Figui 
for comparison, the same calculation for 241. 
spaced points. We see that greatly increasing 
ber of points does not significantly change 
tional spectrum. 

In the case of magnesium, the calculation ` 
formed for a sampling of 2413 equally spaced 
the reduced zone, for both the central and 
models. Figures 3 and 4 show the vibration 
for central and tensor forces, respectively. 
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Fic. 4. Lattice vibrational 
spectrum for magnesium using 
the tensor force model. 
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from a sampling of 2413 equally spaced points in the 
reduced zone with the tensor force model, and is shown 
in Fig. 5. 


TABLE I. Cy for beryllium (cal/g). 


T (°K) Computed value Measured value 
100 0.070 
150 0.195 
200 0.311 
cm 293 0.455 0.466 
400 0.535 


ee 
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Fic. 5. Lattice vibrational 
spectrum of zinc using @ 
tensor force model. 


In order to make some comparison with experiment, 


the specific heat at constant volume Cy, which is given 
in terms of the frequency spectrum by 


TABLE II. Cy for magnesium (cal/g). 
re ae ee gee Ne ee es 


T (°K) Computed value Measured value 
123 0.1839 0.1767 
153 0.2023 0.2025 
223 0.2236 0.2228 
300 0.2329 
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Gy ene WN? wit 
—= tal Jn TE 
3R I (7) (eiT 1. 2i (2) 


where R is the gas constant, winax the maximum fre- 
quency of the crystal, and T the temperature measured 


TABLE III. Cy for zinc (cal/g). 


T (°K) Computed value Measured value 
72 0.0538 TOO TRIN 
173 0.0826 0.0814 
300 0.0882 
PHYSICAL REVIEW VOLUME 
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in energy units, has been computed for all three metals. 
Tables I-IIT compare the calculated specific heat with 
measured values,’ for the vibrational spectra of Be, 
Mg, and Zn, respectively. The agreement is quite good 
in all cases, although specific heat comparisons do not 
provide a good test. 
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Electron Spin Resonance Studies with Superconducting Magnets: 
The Spectrum of Dy’* and Sm^* in CaF, 
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An electron spin resonance spectrometer has been constructed using strong magnetic fields up to 46 000 G 
generated by a superconducting magnet. It is being used for the measurement of the magnetic properties of 


transition elements in ionic crystals such as (a) small g factors, (b) energy level separations up to 20 em™, 


(c) isotropic and anisotropic exchange terms in spectra of pairs of paramagnetic ions, (d) spin-lattice relaxa- 
tion times at high magnetic fields. The method is illustrated in the measurements of (1) the energy level 


separation of ';—T* levels of Dy** in CaF» which is found to be 7.352-0.2 cm™; (2) the g factors of the 
tetragonal spectrum of Sm** in CaF? which are found as gu =0.90740.01, g4 =0.544-£0.015. 


INTRODUCTION 


po ma spin resonance methods have been 
widely used in determining the magnetic properties 
of transition elements in ionic and semiconducting 
crystals.! Most of these experiments have been per- 
formed with magnetic fields of less than 20 000 G.? The 
availability of superconducting magnets with fields 
of the order of 50 000 G permit a number of additional 
experiments of interest. These experiments are of the 
kind: (a) Measurements of energy level separations 
of the order of 3-20 cm—. (b) Measurement of small 
g factors, i.e., gX1. (c) Measurement of some isotropic 


* Permanent address: The Hebrew University, Jerusalem, 
Israel. 

+ Supported in part by National Aeronautics and Space Agency 
through Grant NSG 330. EE 

+ Supported by the Air Force Office of Scientific Research. — 

1 See, for example, W. Low, Paramagnetic Resonance in Solids 
(Academic Press Inc., New York, 1960). A 

2S. Foner and W. Low, Phys. Rev. 120, 1585 (1960), have per- 
formed ESR experiments using pulsed magnetic fields of the 
order of 2 000 000 G. However, this method has the disadvantage 
of poor accuracy in the field measurements. Relaxation effects 
due to fast variation of the magnetic field as a function of time 
effect the intensity of the spectrum. It can be used, however, to 
some advantage in the measurement of relaxation times at 
high magnetic fields [L. Rimai (private communications) ]. 
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or anisotropic exchange splittings of neighboring 

paramagnetic ions. (d) Spin-lattice relaxation phe- 

nomena in high magnetic fields. 2 
Many iron group and rare-earth elements have an 


energy level scheme in which a few levels are closely — = 


E 
E 


bunched together. Examples of such ions are V9*, — — 
Cr+ (d?) in trigonal fields which have a singlet asa — 
ground state and a doublet at 7-10 cm™ above the ; 
singlet. A similar case is Fe?* (d*) in noncubic fields. z 
In some crystal hosts, transitions between the two 
levels of the non-Kramers doublet can be induced. 
However, transitions between the singlet and the - 
doublet cannot be seen with conventional resonance - 
spectrometers and magnetic fields. With very strong 
magnetic fields one of the doublet levels can be brought 
close enough to the singlet so that transitions between — 
these levels can be effected. One usually can find a 
suitable magnetic-field orientation with respect to th 
crystal axis which will permit transitions of the type 
AS,=-t1 of sufficient intensity. The appropriate spi 
Hamiltonian is 


5=g, GHS.+:8(HS.+H,S,) 
+D(S2—}S(S+1)) FE (S2—S/y 


` 
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with S=1. When Æ=0 the initial separation of |1) 
levels from the |0) level can be determined. When £0 
then the separation of one of the nearly degenerate levels 
from the |0) level can be measured. 

Of greater interest are the rare-earth and uranium 
group elements. Here the various J levels are split into 
a number of Stark levels through the crystal field. In 
many cases, the total amount of the splitting of 5-10 
Stark levels may be only of the order of a 100 cm™. 
There are levels separated by less than 20 cm-!. The 
position of these levels can be estimated by diverse 
methods such as by measuring (1) the magnetic sus- 
ceptibility or specific heat as a function of temperature, 
(2) the spin-lattice relaxation time as a function of the 
temperature and evaluation of the Orbach process,?* 
(3) the fluorescence in the optical spectra, (4) the 
Faraday rotation, and (5) the intensity of the spin 
resonance of the excited state as a function of the tem- 
perature, provided this resonance can be observed in 
the temperature range where this level is still sufficiently 
populated. Most of these methods are not very accurate 
and permit the evaluation of the level only to an 
accuracy of about 1 cm™. Others do not work well in 
dilute salts which are used in spin resonance methods. 

Since many of the g factors of rare-earth ions are 
large, one can bring levels, which in the absence of a 
magnetic field are far apart, close together and even 
cause them to cross. Assuming a magnetic field of 
about 50000 G and g factors of about 6 for the two 
levels, and operating K-band frequencies, one can 
determine energy levels separated by about 15 cm™. 

The g factors in the rare-earth ions depend on the 
crystal-field parameters A,”{r"). In general there are 
many more crystal-field parameters than the measured 
three or less g factors. Paramagnetic resonance measure- 
ments of these g factors on one single Kramers doublet 
does not permit the evaluation of these crystal-field 
parameters. These can be determined from the optical 
spectra which give the average value of the A,”(r") 
over all the optical levels. There are reasons to believe 
that the crystal-field parameters may depend to some 
extent on the various JT' levels because of the shielding 
and antishielding factors," ie, that 4,"(7")uue 
(1—5,"(J,D)), where pn” is a shielding factor. It is, 
therefore, of interest to measure sufficient g factors and 


REN 


| 


3 R. Orbach, Proc. Roy. Soc. (London) A264, 458 (1961). 

4C. B. P. Finn, R. Orbach, and W. P. Wolf, Proc. Phys. Soc. 
(London) 77, 261 (1961). 

5 An interesting experiment is to measure the fluorescence or 
absorption of light when the energy levels cross. The polarization 
of the light or the angular distribution of the light would be 
expected to change. This experiment is very similar to the optical 
atomic level crossing experiments except that in the case of a 
crystal you have the advantage of the symmetry planes and axes. 
Such an experiment should permit the accurate determination of 
g factors and splittings of both of the ground state as well as 
optically excited states of transition group elements in crystals. 
ue a 6 25 E. Watson and A. J. Freeman, Phys. Rev. 133, A1571 
| *2W. Low, Enrico Fermi Summer School, Varena, 1963 (to be 
published). 
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a separation of Stark levels of a given J level in order 
to determine the A ,"(r") for this particular level and to 
compare it with the weighted average over all optica] 
levels. 

Separation of energy levels in the range of 1-20 cm~: 
are also found in exchange coupled pairs of paramag- 
netic ions. For example the spectrum of exchange 
couplet Ir** in KelrPtCl® can be explained by the 
interaction spin Hamiltonian? 


ge JS SDSS S Si Sy'S,’) 
+E (CSA SaS?) ? 
with Si=S’=4, and Ji=8cm!. (2) 


The isotropic exchange parameter can be determined 
by one or more of the methods described above. Strong 
magnetic fields would permit a more accurate deter- 
mination of these parameters. 

Some of the g factors in paramagnetic resonance are 
smaller than one. An example of this is 8m?*. A magnetic 
field of about 50 000 G and at K, band frequency would 
permit the determination of g factors as small as 0.2. 

Finally, it would be very interesting to determine the 
spindattice relaxation in high magnetic fields, in 
particular, the magnetic-field dependence of the direct 
process, the Orbach process, and the Raman process 
both for Kramers and non-Kramers doublets. 

We report here the techniques of using superconduct- 
ing magnets for spin resonance experiments. We 
illustrate this by two types of experiments (a) the 
determination of the I's—I7 separation of Dy** in 
CaF, and (b) the determination of the small g factors 
of Sm?* in CaF». Experiments as outlined in (c) and 
(d) are in progress. 


EXPERIMENTAL DETAILS 


We briefly describe here the experimental setup. 
A fuller description of the apparatus will be given 
elsewhere. 

The experiments were made with a stabilized Ku 
spectrometer operating between 15 000-18 000 Mc/sec 
and at liquid-helium temperature. The cavity and the 
stainless-steel waveguide section were enclosed in a 
Lin. i.d. thin stainless-steel tubing. This tubing was 
inserted into the bore of the superconducting magnet. 
The modulation coils were wound around the stainless- 
steel tubing concentric with the cylindrical cavity- 
Fifty to 100 turns of thin pure copper wire were sufficient 
for a modulation depth of about 80 G. The heat dissipa- 
tion at such modualtion depth was not excessive: 
About 8 liters of liquid helium were required to cool the 
superconducting magnet and waveguide section to fi 
the Dewar for a 3-4-h experiment. 

The superconducting magnet was about 4 in. long 


8 J. H. E. Griffiths, J. Owen, J. G. Park, and M. F. Partridg® 
Proc. Roy. Soc. (London) A250, 84 (1959). 
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ELECTRON SPIN RESONANCE WITH 
and i.d. slightly larger than 1 in.’ The power supply was 
a standard stabilized supply. The magnetic field could 
be swept by putting a small de error signal in the sensing 
elements of the power supply. The rate of sweep could 
be controlled by varying the magnitude of the error 
voltage. Sweeps of several minutes to several hours over 
the range of 46000 G could be achieved without 
difficulty. A superconducting switch permitted the 
magnet to be operated in the persistent mode. 

The measurement of the magnetic field presents a 
little difficulty. The inhomogeneity of this particular 
magnet over a sample volume of 0.02 cm’ and a sample 
height of about 2 mm was of the order of 10-15 G at 
the center field of about 20 000 G. This inhomogeneity 
was estimated by putting small crystals with para- 
magnetic impurities of known linewidth and g factors. 
Calibrating the sweep, one could estimate the broadened 
linewidth in the superconducting magnet. The linewidth 
would probably be even larger for a proton resonance 
sample which has a larger volume and would not permit 
an accurate or easy determination of the magnetic field. 
(Since our spectrometer was inserted in the liquid- 
helium bath, a solid-state proton sample would have to 
be used. However, one could use also a liquid sample 
by using an inner Dewar so that the spectrometer and 
proton resonance could be operated at any temperature.) 
We have used a more primitive method. By placing 
various paramagnetic samples of known g factors in 
the magnetic field, we have made an experimental plot 
of the magnetic field H against the current 7. It was 
found that the magnetic field was not quite linear with 
the current, particularly at high magnetic fields. There 
is also a noticeable hysteresis. At center band, near 
20 000 G, and sweeping rapidly over the whole range 
back forth the hysteresis amounts to 100-150 G. In all 
measurements reported here were made with slow 
sweep rates and always in the same direction. The 
maximum error in the determination of the magnetic 
field is estimated to be about 0.2%. 

Once a line was observed on the scope or by phase- 
sensitive detection the superconducting solenoid was 
switched to the persistent mode. In this case the line 
stayed at a constant magnetic field with only a very 
small drift. 

Such a system, with a more homogenous magnet, 
could be used for magnet of frequency stabilization. It 
also permits the use of the continuous averaging tech- 
nique” to greater advantage. In this technique the 
signal and the noise are fed into a multichannel analyzer. 
The signal is additive, i.e., proportional to the number 
n of sweeps across the line, whereas the noise is pro- 
portional to n!, so that the signal to noise is propor- 
tional to n2. This is also true for phase-sensitive 


9 Manufactured by Magnion, Inc., Cambridge, Massachusetts. 
The power supply is made by Kepko Model KS-36-30M. ; 

10M. P. Klein and A. W. Barton, Jr., in Proceedings of the First 
International Conference on Paramagnetic Resonance, edited by 
W. Low (Academic Press Inc., New York. 1963), Vol. 2, p. 699. 
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detection. However, in practice it is found that low- 
frequency noise components are difficult to filter out, 
even with large time constants in the R.C. filter of 
lock-in detector. A system, such as a superconducting 
magnet, in which the magnetic field is constant and 
ripple is nearly absent, lends itself admirably to such 
a technique. 


THE SPECTRUM OF Dy?* IN THE 
CUBIC FIELD OF CaF, 


The cubic field of CaF, splits the J=15/2 of the 
°H 5/2 level into three I's, one T7 and one I's. The ground 
state is a I's") and the g factors within the I’, quartet 
have recently been determined by Bierig and Weber" — 
as 2.6340.05, 5.48-£0.15, and 14-E1. The next excited — 
state I'; has been measured by Low” as g- 7.472:0.08 5 
and by the above authors at 7.522:0.05. These authors — - 
estimate from the temperature dependence of the ` 
intensity of the T; level to be at 8.5-E1.0 cm~! above the — — 
Ts level. All other levels are far removed. In addition — 
Bierig and Weber observed many lines associated with — 
tetragonal or trigonal symmetries and with g factors 
varying from 1.50 to 16. ee es 


This spectrum, therefore, has a wealth of lines which — 
can be used for field calibration. We have confirn 
many of the g factors determined by these authors 
high g factor is found to be 13.7+-0.3). In addition w 
observe lines at near 29 000 and 34 000 G when H|| to 
the 100 axis. These two lines are identified to belong — 
to the cubic transition before and after crossover from — 
the 


Ts: @:|+15/2)+a2|+7/2) a 
+403| F 1/2) F9/2)to th 
T; al 13/2)+c:| 2455/2) 

--a|73/2)4-d,| F 11/2) leve 


x3 


From these measurements and the known g factors of — 


A (c x F(6) 
Ad’) 1—lx| FA) 


where 4,(r*) are the crystal-field parameter, F 4 

and F(6) multiplication factors, and we find x=0 55 

+0.01. From this we find Exc 
Ad) - —2424-20 cmt, 
As(r)) 2 412-5 cm. 


u R, W. Bierig and M. d Wee Phys. Rev. 132, 1 

12 W. Low, Proc. Phys. Soc. (London) 76, 307 (1960). — 

BK, R. Lea, M. J. Leask, and W. P. Wolf, 
Solids 23, 1381 (1962). 
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In these calculations we have not taken into account 
the admixtures from the J=13/2 level at about 3500 
cm}. 
THE SPECTRUM OF Sm** IN THE 
TEGRAGONAL FIELD OF CaF: 

The Landé g factor of the 5775; level is small, only 
2/7, and one would expect that the spectroscopic 
splitting factor of the Sm** in crystals would be small 
as well. 

In the cubic field the J=5/2 levels is split into 
a Ps and T;. The T; level in the absence of admixture 
of higher excited states gives rise to transitions at 
g=22/21, 2/3, and 2/7 with relative intensity of 
25/36, 2/3, and 9/8, respectively, when H|| (100). With 
the magnetic field parallel to the (111) direction the 
calculated g factors are 16/21, 2/3, and 2/12 with 
relative intensities of 16/9, 4/9, 25/18. However, the 
J=7/2 is only at about 1000 cm™ followed by J=9/2 
near 2000 cm~ and one expects that the various levels 
would be admixed considerably, reducing in general the 
g factors from the values given above. In the presence 
of an axial field the I's level splits into two doublets. 
Rabbiner™ has observed the fluorescence spectrum of 

Sm*+ supposedly in an axial field and has analyzed the 
spectrum. The interpretation, however, should be 
treated with caution because of the complexity of this 
spectrum. 

We have looked for the resonance spectrum of two 
CaF, with nominal concentration of 0.05 and 0.1%. 
One set of crystals was nearly colourless supposedly 
contained mainly trivalent Sm?*, and the other set of 
deep green color and containing a mixture of Sm*t 
and Sm?+. The spectrum of the green crystal was very 
rich in lines whereas the other showed only a few lines. 
The lines common to both samples could be shown to 
have tetragonal point symmetry with 


Lis 0.907+0.01 5 
i= 0.54440.15. 


i N. Rabbiner, Phys. Rev. 130, 507 (1963). 
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[Note added in proof. M. J. Weber and R. W. Bierig 
have observed a similar spectrum with 


g=0.57£0.05 


(private communication). | 

In addition there were many other lines (apart of 
hyperfine structure from the Sm! and Sm™, both with 
spins I=4). These lines have not been yet interpreted, 
some of which no doubt are of cubic and trigonal 
symmetry. The presence of the rich hyperfine structure 
complicated the identification of the spectra. For ex- 
ample along the (111) direction, lines were found with 
£7 0.893, 0.87, 0.70, 0.67 (tetragonal), 0.62 and 0.415. 
(It is likely that line at 0.415 belongs to the cubic set. 
Possibly the line at g=0.89 or 0.87 may also belong 
to the cubic set. However, this needs further con- 
firmation.) It is of interest to note that there are so 
many more lines in the mixed Sm*t—Sm** crystal. 
Possibly some of the Sm?* which presumably is in the 
cubic site oxidizes later to Sm?*. 

'The above g factor can be interpreted to arise from a 
wave function of the a|2:5/2)2-b|-F3/2) with a 
70.86. This is indicative of considerable tetragonal 
admixture and is consistent with similar results found 
for!5.16 Nd?* and for Ce?*.7 
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Paramagnetic resonance of Yb?* has been observ 


ed in single crystals of CaWO, at 20°K and 3 cm. A 


tetragonal as well as two rhombic spectra were found. The tetragonal spectrum is fitted to an axial spin 


Hamiltonian with gu =1.0580.002, 2,—3.9202-0.005, |1% 
em, |13P| «2.1073 em“, [TA | = (2792-3)107* emo, [B| 


(772:1)107* em, [B| = (28524: 1)107* 
(1034-£3)10-! cm^!. The rhombic spectra 


have the y and z axes in the {110} planes. One has the z axis at 6°-41° to the crystallographic c axis and 
£2 =3.012+0.005, gy =4.788+0.005, g.=0.975--0.003. The other has the z axis at 771° to the c axis and 
82=4.1550.005, g, =3.704+0.005, 2, — 1.1552-0.003. The spectra arise from Yb?* ions at Ca?* sites and 
the rhombic symmetry is caused by charge compensation at next-nearest-neighbor Ca?* sites. The possi- 


bility of W** substitution is discussed. 


I. INTRODUCTION 


RYSTALS of the CaWO, structure (sheelite), 

doped with rare-earth ions have been used ex- 
tensively in the past few years as solid-state lasers. 
This fact stimulated the optical as well as paramagnetic 
resonance studies of these crystals.!? 

Paramagnetic resonance in CaWO, has been reported 
on the trivalent ions gadolinium,’ terbium, and 
neodymium.* In all these cases the symmetry of the 
crystalline field at the sites of the paramagnetic ions 
was found to be tetragonal. Recently, orthorhombic 
spectra of Fe** and Nd* have been reported. These 
spectra were attributed to substitutional sites of W** 
ions in the WO, complex. 

The purpose of this paper is to report on the investi- 
gation of Yb** in CaWOy. A tetragonal as well as two 
rhombic spectra have been found and the results cast 
some doubt on the attribution of rhombic spectra to 
the W$* sites. 


IL. THE STRUCTURE OF CaWO, 


CaWO, (sheelite) is a tetragonal body-centered crys- 
tal which belongs to the C4, point group. The point 
symmetry at the sites of the Ca” ions is Sy. The struc- 
ture is shown in Fig. 1. The distorted oxygen tetrahe- 
dra which surround the W** ions, are represented 


schematically. 
II. THEORY 


The ion Yb?* has 4f? electrons and the ground term 
of the free ion is 2F 7/2. In the rare earths, the interaction 


Supported in part by the U. S. Air Force through the European 
Qu EE Grant No. AF-EOAR 63-61. 
1L. F. Johnson, J. Appl. Phys. 34, 897 (1963). 
? L. F. Johnson and R. A. Thomas, Phys. Rev. 131, 2038 (1963). 
3C. F. Hempstead and K. D. Bowers, Phys. Rev. 118, 131 
1960). 
| ny A. Forrester and C. F. Hempstead, Phys. Rev. 126, 923 
1962). 
: 5 € Ranon, Phys. Letters 8, 154 (1964). 
6R. W. Kedzie and K. Kestigian, Appl. Phys. Letters 3, 86 


(1963). 
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with the crystal field is usually much weaker than the 
spin-orbit coupling and therefore J is a good quantum 
number. In crystal fields of lower than cubic symmetry. 
the J manifold of the ground state of the free Yb^* 
ion, splits into four Kramers doublets. Paramagnetic 
resonance is observable at liquid-hydrogen temperature 
where only the lowest Kramers doublet is populated. 


IV. RESULTS 


Two crystals of CaWO,;, doped with Yb?*, were in- 
vestigated by paramagnetic resonance at 3 cm and 
20°K. They were obtained from Isomet Corporation 
and Semi-Elements Inc. The concentration of Yb?* in 
the former was nominally 0.01775 and higher in the 
latter. An intense tetragonal spectrum was observed 
in the two crystals and the hyperfine structure of the 


Fic. 1. Unit cell of CaWO,. For clarity the oxygen 
are presented schematically. 
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isotopes Yb 171(7=}) and Yb 173(7—$) was clearly 
resolved. Two spectra of rhombic symmetry which 
were much weaker, were also shown by each of the two 
crystals. 


The Tetragonal Spectrum 


The tetragonal spectrum fits the spin Hamiltonian, 


W= gn BH SHH SHH Sy) 
+ASI,+B(SeletSyly) 
4+PU2-1/31(I+1)), (1) 


where the z axis is parallel to the crystallographic c axis 
and the parameters are 


gu= 1.058+0.002, 
[724 | 5 (771)10- em, 
[73B] = (285-1)10- cm, 
[P| «2.107 em, 
41—3.9202-0.005 , 
[14 | = (2793-3)107* cm, 
[2| = (10342:3)10* cm™. 


These results give |gu4/gıB|=1 within the experi- 
mental error, and therefore the admixture of J —$ into 
the ground state is negligible." Accordingly, the Kramers 
doublet can be described by an eigenfunction within 
the J=% manifold. In a tetragonal field this eigenfunc- 
tion may be of the form a|23-2)--b|-Fà) or c|+3) 
+d|=3). Only the latter expression is compatible 
with the measured g factors, giving gı= 1.052 and 
£1—3.957 when c=0.700 and d— 0.714. 

The gradient of the electric field at the nucleus is de- 
termined by four contributions: (i) that of the 4f elec- 
trons, (ii) the shielding effect of the inner core on these 
electrons, (iii) the gradient of the crystal field, and (iv) 
the shielding of the latter by the inner core. The first 
two are proportional to the matrix elements of the 
operator (/2—3J (J--1)) while the other two have 
usually been neglected. The contribution of the quad- 
rupole moment in paramagnetic resonance is then given 
by the last term in (1) where" P= —9e’Q/4/ (214-1) 
x«3UlelJXv|J2—3J(J4-1)|W). In the case of 
Yb** in CaF, it was found that? (v |J2—1J (J4-1)| v) 
=—3.9 and P= (862:4)10-* cm™. In the present case, 
the eigenfunction gives for the matrix element the 
value of — 1.16, with the result that P—25.107* cm. 
This value is higher than the upper limit for P obtained 
from our experimental results. We have no satisfactory 
explanation for this discrepancy. Possibly the contribu- 
tions (iii) and (iv) to the gradient of the electric field 
are not negligible. 


IINE 


; 
j 


: -7R, J. Elliott and K. W. H. Stevens, Proc. Roy. Soc. (Lond 
= A218, 35 (1953). : Qu Canton) 
= 8W. Low and U. Rosenberger, Compt. Rend. 254, 1771 (1962). 
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The Rhombic Spectra 


The rhombic spectra can be fitted to the spin 
Hamiltonian 


5¢=H.g.S+S.A.1, 


where g and A are tensors with principal values g,, g,, 
gz and Az, Ay, A respectively. 

Each of the rhombic spectra shows the existence of 
four inequivalent sites. For both spectra, one of the 
orthorhombic axes is perpendicular to the c direction, 
and we call it the y axis. Two sites of each spectrum 
have y along the [110] direction whereas the other 
two have it along the [1 10] direction. The other two 
axes lie then in the (110) and (110) planes, respectively. 
The sites of the more intense of the two rhombic spectra, 
which we shall call spectrum one, have the z axis at an 
angle of 6°1° to c, while the sites which belong to the 
weaker spectrum (number two), have the z axis at an 
angle of 7?-E 1? to c. 

Two orientations of z with respect to c are possible in 
each of the (110) and (110) planes, giving rise to four 
inequivalent sites which are interchangeable in pairs 
by a rotation of 90? about the c axis. 

Spectrum number one has the parameters g2= 3.012 
+0.005, g,—4.7882-0.005, and g.— 0.9752:0.003. 

Spectrum number two has the parameters g2=4.155 
0.005, gy=3.70440.005 and g.=1.155+0.003. The in- 
tensity of the even isotope lines in spectrum number 
one, was approximately 5% of that in the tetragonal 
spectrum, and the intensity in spectrum number two 
was about 2% of that of the tetragonal lines. On 
account of the four inequivalent sites for each rhombic 
spectrum it is found that about 16% of the Yb** ions 
occupy rhombic sites which correspond to spectrum one, 
and about 6% occupy sites corresponding to spectrum 
two. 

The relative abundance of the two odd isotopes, 171 
and 173, of ytterbium is 14% and 17%, respectively and 
therefore the hyperfine lines of the rhombic spectra 
were masked by the hyperfine lines of the tetragonal 
spectrum in such a way that only a few of the former 
were observable. Therefore the parameters of A could 
not be evaluated. 

From the results of the tetragonal spectrum We 
assume that in the rhombic sites also, the admixture of 
J=$ is negligible. Since now the symmetry is rhombic, 
the eigenfunctions are of the form a| o| +3) 
+c|=-3)+d|+3). The three g values gz, £w 8: and 
the normalization of the function give four quadratic 
equations in the four coefficients (a,b,c,d). In this case 
therefore the existence of a solution of the four equa- 
tions does not imply that the admixture of J z$ 5 
negligible, as opposed to the tetragonal case where there 
are three equations in two unknowns. 

From the g values it can be seen that the deviation of 
the rhombic field from the tetragonal is not large- There- 
fore, we have looked for a function which is close to 


ew 


N 
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that in the tetragonal case, obtaining for spectrum 1, 
0.160| + 2)—0.262| -F3)--0.678| 7E 5)--0.6801--3) and 
for spectrum number 2, 


— 0.093 | 3- 2)-0.099 | F3) 4-0.708 | 375)4-0.693] 4-3). 
V. DISCUSSION 


The tetragonal spectrum is usually interpreted as 
being due to ions at substitutional sites of Ca?*. 
Kedzie and Kestigian® explained their rhombic spectra 
of Fe** and Nd* as being due to ions which occupy 
tungsten sites in the WO,- complex. The distortion 
from tetragonal symmetry is explained by them in 
these cases by assuming a deformation of one of the 
oxygen bonds along the bond direction in the sub- 
stituted WO, complex, giving rise to four inequivalent 
sites. 

We disagree with this interpretation of the rhombic 
spectra for the following reasons. 


(1) It seems unlikely that trivalent ions, such as the 
rare-earth ions should substitute for the hexavalent 
tungsten, which is part of a covalent complex. Further- 
more, the ionic radii of the rare-earth ions are much 
larger than the radius of W$*. 

(2) The W-O bonds in the WO;- complex are not 
situated in any crystallographic plane? A deformation 
of such a bond, therefore, cannot cause a rhombic dis- 
tortion with axes lying in the (110) plane, whereas our 
investigations, as also those of Kedzie and Kestigian, 
show that such rhombic distortion does actually occur. 

(3) There seems to be no reason why the deformation 
of the complex—which is situated in a tetragonal 
environment—will be such as to prefer one single direc- 
tion (namely, a bond direction) which does not coincide 
with the tetragonal axis. 

(4) In our case, /wo rhombic spectra are observed. 
Even if we assume the W* substitution, only one of 
these spectra could be explained. Furthermore, the 
crystal field at the tungsten sites will be completely 
different from the tetragonal field, while our results 
show only a small deformation of the tetragonal field 
at both rhombic sites. This is evident from the fact that 
the g values of the rhombic spectra do not differ con- 
siderably from those of the tetragonal spectra and that 
the trace of the g tensor is almost the same in the three 


cases. 
Therefore we assign the rhombic spectra of Yb" 


R. W. G. Wyckoff, Crystal Structures (Interscience Publishers 
Inc. New York 1957), Vol. 2, Chap. 8. 
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to ions at Ca** sites where the deformation of the 
tetragonal field is caused by charge compensation. It 
is well known that in crystals like CaFs, where also 
trivalent ions substitute Ca?*, different charge com- 
pensations are possible and these give rise to different 
local symmetries of the crystal field. 

In CaWO, two forms of charge compensation suggest 
themselves. These are: (i) Ca? vacancies, and (ii) sub- 
stitutional Ca?* sites occupied by monovalent cations 
when the crystals are grown with such ions. 

The nearest-neighbor Ca?* sites lie in the (100) and 
(010) planes while the orthorhombic axes lie in the (110) 
planes. The latter is consistent with charge compensa- 
tion at next-nearest-neighbor Ca? sites. Cases where 
charge compensation is affected at more distant sites 
rather than at nearest-neighbor sites, were observed in 
CaF3? as well as in SrF2 and BaF2.™ 

Hempstead and Bowers? and Kedzie and Kestigian® — 
found that Mn’ (which does not require charge com- 
pensation) is situated exclusively in a tetragonal field. — 
The latter authors found that the isoelectronic Fe 
(which does require charge compensation) shows only 
an orthorhombic spectrum. a: 

The fact that the spectra of Mn?* and Fe** are com- : 
pletely different is not, in our opinion, a conclusive Hes ^ 

Sf 


Sese 


dication that Fe** does not occupy Ca?* sites, as sug- 
gested by Kedzie and Kestigian. As these auth 
pointed out, the spectrum of Fe?* can only be explaine 
by assuming a strong axial field where the Stark split 
ting is larger than the Zeeman splitting. The follow: 
consideration shows that this may occur at the C EP. 
sites; it is well known that the Stark splitting in Fe is 
at least one order of magnitude larger than in Mi 
The splitting? between the closest doublets of Mn 
the Ca?* sites is 0.0275 cm. Thus the splitting in Fe 
should be of the order of the Zeeman splitting, or larg 

In view of the present results and the uncertainty 
the assignment of the rhombic sites, it seems wort 
while that further investigations be undertaken of othe: 
rare-earth ions in CaWO, as well as in other crys 
having the sheelite structure. 
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A Born-von Karman model has been use 
lattice. The model used contains noncentra 
and next-nearest neighbors. Frequency distri 
experimental frequency distrib 
found between the shapes of the experimental 
frequency for vanadium was within 2% 


sion curves for iron were also calculated, and were foun: 


INTRODUCTION 


HE frequency distribution of a body-centered 

cubic lattice of identical particles has been 
investigated using ihe Bom-von Karman model of a 
crystal lattice. The interatomic forces used include 
noncentral forces of the type introduced by Gazis, 
Herman, and Wallis! in treating surface waves in a 
simple cubic crystal. These noncentral forces arise from 
the resistance to deformation of certain angles formed 
by three lattice points. Also included are central 
forces between nearest and next-nearest neighbors. 
Noncentral forces have been employed in recent 
calculations by Hendricks, Riser, and Clark,’ which 
appeared after our work was essentially complete; 
however, the noncentral forces considered by them are 
different from those used here. 

One of the primary aims of the investigation was to 
examine the effect of the inclusion of noncentral forces 
on the frequency distribution. Comparison between 
the central-force model and the model employed here 
was made for iron and for vanadium which is one 
of the more anisotropic body-centered cubic crystals. 
The central-force calculations for vanadium were 
available from the work of Clark? The calculations 
with vanadium were also compared with the frequency 
distributions obtained by the slow neutron experiments 
of Stewart and Brockhouse,* and Eisenhauer ef al.5 


PO ee eee À 


* A part of this work was submitted by one of the auth 
(B. C. C.) in partial fulfillment for the degree Master of Saee 
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d to determine the frequency spectrum for a body-centered cubic 
| angular stiffness forces as well as central forces between nearest 
butions have been calculated for vanadium and compared with 
utions available from slow neutron experiments. Qualitative agreement was 
and theoretical distributions. The calculated maximum 


stribution. Disper- 
d to be in good agreement with experiment, 


Dispersion curves along specified directions were 
calculated for iron and compared with the experi- 
mentally determined dispersion curves of Low.? 


LATTICE EQUATIONS OF MOTION 


Although the equations of motion for central forces 
appear elsewhere, for example, in DeLaunay,' they 
will be repeated here in order to provide a continuity of 
notation. Taking a displacement vector with compo- 
nents 1, 9, and w in the x, y, and z directions, respec- 
tively, the force in the « direction on atom (j, m, n) due 
to the central-force interactions of nearest neighbors is 


given by 
) 


Fa-o[U —8uj mnt V+W], 


where 
w= (Oa 1543, m1 A Mj m—1, 41 
Fujimi, nF 1653, m41, nt F Uji, mel n=l 
SPO IEEE 131,1, m-1) , 
V= (5-41, mp1, nyi F U41, mtl, n—1— Vj m-1,041 
—ÜUjpisgn—1,.n—17— Uj—-1,m4-1,24-1 — Uj—1,m4-1,n—1 | 


SPO ior Drape) D (2) | 


W = lw- aa, 
W (Wii munda 1 UI med nelle UTI m—D TET 


— 99; JUST 
U c1, m—1,n—l 0; 1, ml, n1 Wj-1,m41,n—1 


—105 1, m-Ln 411 05 1,m-1,1) , 


where ais the nearest-neighbor force constant. Similarly; 
the central-force interaction between next-nearest 
neighbors gives rise to a force on the atom (j,,1) in 
the x direction given by 


(3) 


Fa p tite, m,n 165—2, m, n— 25, m,.]s 
where B is the next-nearest-neighbor force constant. 
* G. G. E. Low, Proc. Phys. Soc. (London) 79, 479 (1962). 


1J. DeLaunay, in Solid State Physi 7 neei 
Va ysics, edited by F. Seitz 
D. Turnbull (Academic Press Inc., New York, 1936), Vol. 2. 
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FREQUENCY SPECTRA OF BODY-CENTERED 


For the noncentral forces examined here it is assumed 
that a change in angle is related to a change in potential 
energy by a Hooke's law force constant. If V is the 
potential energy and ô¢ a change in angle, one obtains 


V= 17 (ô$), (4) 


where y is the force constant. In this treatment two 
general types of angles were considered; 36 angles of 
the type à; in Fig. 1 associated with a force constant 
Yı and 72 angles of the type $» in Fig. 1 associated with 
a force constant y: The expansion of the potential 
energy is quite tedious but straightforward, and the 
total contribution from the angles considered results in 
& force on the atom of interest, in the x direction, 
given by 


F a= 20u; m, d-2U — V —W --A(uj,2 m nt 4j 2, m,n) 
— (ttj, m nd Uj, m,n F Uj, m nta F Uj, m n2) ] 
—vo[— 30u; m, i -3U--$ (V+W) 

FG (t5, mec2 n 13, m2, 71-1, makr Uj m, n2) | (5) 


where U, V, and W are given by Eq. (2). These non- 
central forces are invariant under rigid-body rotations. 

The equation for the force in the x direction on the 
atom (j,m,z) due to the central and noncentral forces 
considered here may now be written as 


F= mi nn 
= —15,na[9o- 284- 20y: +3072] 
ar (8-+-471) (tzr min 152: mn) oe (= Yit $y23) 
X (ttj m2, nok Uj, n—2,n- Uj, EAT Mj, m, n—2) 
+U (ed-2y14-3y2)-- (V--W)(e—vit2Y2) , (6) 


where U, V, and W are given as in Eq. (5). The forces 
in the y and z directions on the central atom are found 
by a circular permutation of , v, and w, as well as the 
increments of their indices 7, m, m. 


SOLUTION OF THE EQUATIONS OF MOTION 


The solutions of the equations of motion are assumed 
to be plane waves, where the usual cyclic boundary 
conditions of Born’ are applied. The solutions for the 
motion of, say the nth, atom are written as 


1147 X exp2si(vt-- k-r;) 
n= Y exp2mi (vt-1- k- r,) , 
Wn=Z exp2ri(vitk- tn). 


Here X, Y, and Z are the components of the plane-wave 
amplitude, and k is a propagation vector in the recipro- 
cal lattice whose magnitude is the reciprocal of the 
wavelength. The values of k are restricted to the first 
Brillouin zone, as discussed by Brillouin.? Substitution 
of these solutions into the equations of motion gives 
three homogeneous linear equations in X, Y, and Z. 


3 M. Born and K. Huang, Dynamical Theory of Crystal Lattices 


Clarendon Press, Oxford, 1954). FE 
: LO Brillouin, Wave Propagation in Periodic Structures (Dover 


Publications, New York, 1953). 
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Fic. 1. Body-cen- 
tered cubic lattice 
showing primitive 
(di) and secondary 
(ó5) angles. 


hiako 


A nontrivial solution requires that the determinant of 
the coefficients of X, Y, and Z vanish. This condition 
yields the secular equation 


| A 1— mo? B QM 
| B Ame? D |=0, (7) 
| C D A 3T ma | 


where 


A= (8at 16y:4+ 2472) 1—cosh; coss cosh) 
+48 sin*bh;— 231(5 cos26;— cos20— cos2d; 
—cos2b;— 2)+372(2+cos2h;—cos2%; 
—cos2b,—cos2b;), (8) —— 
B2 8(a—*y t- $92)cosd; sin, sinz, 
C=8(a—1+372) cosh, sint; sin’, 
D=8(a— 1+ $y2)cosd, sind» sind; 


In Eq. (8), &;— zak; (i=1,2,3) where a is the distance 
between next-nearest neighbors for the body-centered 
cubic lattice. 

Symmetry considerations allow a restriction of the 
values of 6; to points within 1/48 of the first Brillouin 
zone. The limits on the , are then given by O€di dz, — — 
0<@,<47, and 0<®;<7x. This reduces the values of — 
$5; to those within the region in & space bounded by the - 
planes $,— $», P2=0, $1— 5; and $-$;—3. = 

For a cubic equation of the type resulting from the — 
expansion of Eq. (7) Blackman” has shown that —— 
maxima in me? can occur only when Py Pa, &; have — 
some combination of the values 0, 4x, v. The point 
@,=6,=6,=0 is excluded as ma’=0 at this point. 
Substitution of those values of Pr, P2, bz which are wit 
in the first 1/48 Brillouin zone showed that in m 
cases considered the maximum occurs at (m, 0, 47) 
This maximum frequency may be found directly fr 
Eg. (7) and is Z 

MO max = 4(4a--84- 551) +4272. 
It was found in actual calculations that the posit 
the maximum could be shifted to (0,0,7) when ce 
values of the elastic constants, such as those for sodi 
given by Quimby and Siegell," were assumed. In 
case the maximum frequency is given by : 
Mum = 16(a+2y1+ 32) - 


1 M. Blackman, Proc. Roy. Soc. (London) A148, 
1 S, L. Quimby and S. Siegell, Phys. Rev. 54, 293 ( 
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Fic. 2. Three calculated 
frequency histograms for vana- 
dium with y2/y1—0.0 (upper 
curve), v2/vi— 0.5 (middle 
curve), and y2/71=1.0 (bottom 
curve). Wote: For display 
purposes, while the scale of 
the ordinate is the same for all 
three curves, the origin for the 
middle curve is at 0.02 and is 
at 0.04 for the upper curve. 
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RELATIONSHIPS BETWEEN ELASTIC CONSTANTS 
AND LATTICE FORCE CONSTANTS 


In the limit of long wavelengths, when the wave- 
length is much greater than the lattice spacing, the 
plane waves are effectively propagating as in a con- 
tinuous medium. This allows the identification of the 
lattice force constants with the elastic constants of the 
medium found from measurements of the velocity of 
sound. These relationships are obtained by expanding 
the displacement components such as tj+p,m+a,n+r in 
a Taylor series about %;,m,n, substituting into the 
equation of motion and matching the coefficients of the 
second order derivatives with those of the continuum 
equations of motion. The results are: 


G611— 2 (a-4-B--6y14-3y2) ] 
26127 2(a— 3y 1 $2) 
QC44— 2(a--vit- $2) . 

If y2=0, the above three equations determine the 


lattice force constants uniquely; otherwise, the force 
constants are determined assuming, for example, a 


(10) 


— given ratio of y;/y: The value y2/71 is used as a 


parameter in the sequel. The elastic constants for 
vanadium are those of Alers.” It may be observed that 
the maximum frequency when it occurs at ($7,0,57), 
as is usual, is a function of the elastic constants only 
and is 

(11) 


v _ However, when the maximum írequency occurs else- 
- where it is also a function of 72/71. 


A. Alers, Phys. Rev. 119, 1532 (1960). 


r 


Momax = 2a (czt 2645+ C11) - 
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CALCULATION OF THE FREQUENCY DISTRIBUTION 


In order to obtain the true distribution of frequencies 
in a macroscopic crystal Eq. (7) would have to be solved 
for a very large number of points within the first 
Brillouin zone, equal to the number of degrees of 
freedom of the particles in the crystal. À systematic 
method was used for choosing a relatively small number 
of solution points within the first zone. It is assumed 
that solution points are uniformly distributed such that 
an allowed point exists within a volume element in ® 
space, and further that the frequencies at all points 
within the element are equal to the value of the fre- 
quency at the center of the element. The values of $; 
were chosen in a regular manner by stepping along the 
axes in intervals of 0.017. Thus the volume elements 
were cubes of volume 10-9 7°. t 

Some of the solution points were on the bounding 
planes and were not, of course, centers of a cube lying 
entirely within the first Brillouin zone. There was some 
question of the desirability of including these points 
without properly weighing their contribution to the 
frequency distribution in the first zone. A numeric 
investigation of the most extreme case, that of the 
frequency distribution obtained with all points on the 
bounding planes included with the same weight as e 
interior points, showed that the resulting frequency 
distribution was essentially the same as the distribution 
with no boundary points included. Due to this resu: 
only points in the interior of the first 1/48th Brilloum 


zone were finally chosen. These considerations resulte 
in the inclusion of approximately 39 000 solution 


points in the 1/48th zone. Histograms representing 
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frequency distribution were prepared from the cal- 
culated frequencies by finding the number of frequencies 
contained in a given frequency interval. The frequency 
interval chosen was 1.0% of the maximum frequency. 

The cubic equation for the square of the frequency 
was solved using the Cardan method to find the first 
root, and the remaining quadratic to find the other two 
roots. A check on the accuracy of this method of 
calculating the frequency showed that six figures of 
accuracy were present for most cases. In the case of 
two equal roots the accuracy could be reduced to five 
figures ; however, an error of this magnitude is not 
significant in the construction of the frequency histo- 
gram. The calculations were performed on an IBM-7090 
computer. 


RESULTS 


The elastic constants for vanadium at 300°K are 
given by Alers! as 
C= 22.795 X 10" dyne-cm, 
c= 11.870X 10" dyne-cm-?, 
C447 4.255 X 10" dyne-cm. 
The maximum frequency calculated from Eq. (11), 


using these elastic constants, is 8.8710" cps which 
compares well with the experimental value of 9X 10” 


— Experimental 
~----- Two constant 
ik — ——- Four constant 
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g(v) 


0.02 


0.01 
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M 12 al 
YU se ! 
i f ith 
Fic. 3. The experimental results of Eisenhauer presented wit 
the frequency disp bation of Clark and the model used here with 
ya/y1 5: 1.5. The results are normalized such that the area under 
the curves are the same. 
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Fic. 4. Dispersion 
curves for iron along the 
[101] direction, P: —44 
=, %,=0 where 4 
=7ak,, along with the 
experimental results of 
Low, and Low's best fit. 


ENERGY (MILLI-ELECTRON VOLTS) 


4 
Va (10,1). 
Ld 


3r 
8 


cps extrapolated from the curve given by Eisenha: 
The two constant central-force model gives a n 
frequency of 7.0710" cps. Thus the three cor 
model gives a better result for the maximum fre 
The calculated and experimental frequency d 
tions are, however, quite different with regard 
location of the two major peaks, with the calcu 
frequency distribution having more widely 
maxima. The parameter y2/y1 may be varied witho 
destroying the good agreement with the experimen 
maximum frequency. Figure 2 shows the frequ 
historgrams for vanadium for ratios of *2/*1 of 0.5 
1.0, along with the case for y2=0. Figure 2 ind 
that increasing the ratio of 72/7 reduces the inte 
of the first peak and moves the two maxima cl 
together producing better agreement with experim: 
In view of those results, the frequency histogram 
ya/y17 1.5 was also calculated and a smootl 
obtained from these results is shown in Fig. 3 
with the experimental curve of Eisenhauer. 
comparison the smoothed curve for the ce 
model of Clark is also presented. ; 
To indicate the effect that an uncertainty 
the elastic constants might play in the de 
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of the frequency distribution, vanadium distributions 
were calculated using elastic constants varied by +10% 
independently and assuming y2=0. Briefly, it was noted 
that increasing c1» and cj; by 10% tends to spread the 
peaks while increasing c44 brings the maxima together 
slightly. It was also apparent that increasing ci; and 
c44 by 10% increased the sharpness of the low-frequency 
peak while increasing ci. had the opposite effect. None 
of these frequency distributions seem to fit the experi- 
mental one appreciably better than the distribution 
with the unchanged elastic constants, and the calculated 
maximum frequencies are all in the neighborhood of 
the experimental maximum. 

In addition to the frequency spectra discussed above, 
the model was used to calculate frequency dispersion 
curves. The calculations were compared with dispersion 
curves for iron as determined by Low.® Using the 
elastic constants given by Low for iron at 16°C as 


£11— 2.332X 10? dyne-cm, 
63127 1.355X 10? dyne-cm, 
c= 1.180 10? dyne-cm, 


dispersion curves were calculated along the [101], 


[111], and [001] directions. For comparison the same 


t 


e 
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calculations were performed using a two-constant 
central-force model. The results of these calculations 
are shown in Figs. 4-6 along with the experimental 
results of Low. These calculations showed that the 
maximum frequency of 9.735X(10? cps for the three- 
constant model was higher than the maximum fre- 
quency of 8.711X 10? cps found for the two-constant 
model. Comparison with Low's curves shows that the 
three-constant model matches the experiment better 
than the two-constant model; however, both are, 1m 
general, well within the experimental error. The results 
of the three-constant model within the first zone, ate 
essentially indistinguishable from the fit to the expert- 
mental data given by Low. The largest discrepancies 
occur for dispersion curves along the [111] direction, 
and even here the difference is apparent only for values 
of $ outside the first zone. The inclusion of the angular 
stiffness terms has, in general, improved the agreement 
with experiment in this case. 


CONCLUSION 


The dynamical properties of a body-centered R 
usi 


lattice of identical particles have been investigated 
the Born-von Karman model of a crystal lattice. 
model employed contained moncentral as well 


as 


s 


n 
D; 
le 
id 
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Fic. 6. Dispersion curves for iron along the [001] direction, 
©, =,=0, where $;=7ak:, along with the experimental results of 
Low, and Low’s best fit. 


central forces, namely, noncentral angular stiffness 
forces and central forces between nearest and next- 
nearest neighbors. 

Frequency distribution histograms have been con- 
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structed for vanadium and compared with the experi- 
mental frequency distributions available from slow 
neutron experiments. The calculated maximum fre- 
quency for vanadium was in good agreement with that 
derived from the experimental frequency distributions, 
an improvement over the results of the central-force 
model, Our model containing a single angular stiffness 
interaction (y2/7,;=0) gives a frequency distribution 
which is very similar to that obtained by Hendricks 
et al? with their noncentral model. Both distributions 
yield a separation between the two major peaks which 
is significantly larger than that found experimentally. 
Inclusion of a second angular stiffness interaction into 
our model (y2/7140) gives a small improvement but 
not enough to produce agreement with experiment. 
The disagreement in the experimental and theoretical 
values of the separation between the two major peaks 
in the frequency distribution might possibly be removed 
by including additional interactions in the model. It 
may well be; however, that short-range forces are 
inadequate, and that long-range forces are required. 
Since vanadium is a metal, the effects of the electron gas 
may be significant as discussed by DeLaunay.’ The 
recent calculation of Hendricks, Riser, and Clark? 
however, indicates that the use of DeLaunay’s model 
does not resolve this difficulty. Failure of the adiabatic 
approximation in a metal might also be a factor which 
should be taken into account. 
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The energy level structures arising from the crystal-fie 


rare-earth ions in CaFs are determined from a survey o r reso 
t with existing paramagnetic resonance data. Estimates 


spectra. The predicted ground states are in agreemen 


are made of the magnitude of the cubic crystal-field parameters t h 
l and trigonal symmetries, due to nearest-neighbor charge 


bations on the cubic-field energy levels. The paramagnetic 
d and new resonances for Ce?*, Sm?*, Er?*, and Yb* are 
ystal-field matrix elements in terms of 3-7 and 6-7 symbols and 


earth series. Axial crystal fields of tetragona 
compensation, are found to cause large pertur 
resonance spectra of all rare-earth ions are reviewe 
reported. Expressions for the calculation of cr 


ld splitting of the free-ion ground states of trivalent 
f the optical and electron paramagnetic resonance 


and their variation throughout the rare- 


their relationship to the operator equivalent approach are given in an Appendix. 


INTRODUCTION 


ARE-EARTH ions of the lanthanide series can be 
introduced into a calcium fluoride host lattice 
substitutionally at divalent calcium sites. Although 
several of the rare earths can be stabilized in the diva- 
lent state in alkaline earth halide lattices, we shall be 
concerned almost exclusively with the trivalent rare- 
earth ions. In the case of the trivalent rare earths, a 
: question arises regarding the compensation of the ad- 
i ditional positive charge associated with a trivalent ion 
$ substituted in a divalent site. Several different charge 
compensation mechanisms have been found in CaF: 
which depend upon the growth conditions, impurity 
concentrations, and thermal treatments of the sample. 
The crystalline electrostatic fields at the rare-earth 
sites have correspondingly different magnitudes and 
symmetries. In this paper and the one to follow! 
(henceforth denoted as II), the electron paramagnetic 
resonance and spin-lattice relaxation properties of rare- 
earth ions in sites of different crystal-field symmetries 
will be examined and compared. 

The interpretation of the spin-lattice relaxation re- 
A sults in II requires knowledge of the strength of the 
S crystal fields, the energy level structure of the resonant 
; ion, and the wave functions of the lower states. Un- 
fortunately, such information for the trivalent rare-earth 
series in CaF» is still incomplete. We have made esti- 
mates of the cubic crystal-field parameters, their varia- 
tion throughout the lanthanide series, and the relative 
size and effect of axial field perturbations from a survey 
of available optical spectra of rare-earth ions in CaF». 
This information, while not precise, is nevertheless 
valuable in analyzing paramagnetic resonance spectra, 
calculating the magnitude of spin-lattice coupling, and 
estimating the crystal-field splittings when direct meas- 
urements are lacking. The predicted crystal-field split- 
ting, ground state, and excited states of the lowest J 
multiplet of rare earths in various symmetry sites are 
correlated with the findings of paramagnetic resonance 
E R. W. Bierig, M. J. Weber, and S. I. Warshaw, Phys. Rev. 
A1504 (1964). 
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studies. For this purpose a review of the paramagnetic 
resonance spectra is made which will also provide a 
basis for the discussion of the relaxation properties of 
the ions in II. 


CHARGE COMPENSATION AND 
CRYSTAL-FIELD SYMMETRY 


Calcium fluoride has the fluoride structure with Oj? 
space group symmetry. This may be visualized as a 
cubic lattice of fluorine ions in which every other body- 
center position is occupied by a divalent calcium ion. 
Rare-earth ions can enter the lattice substitutionally at 
Ca?* sites. At such sites divalent rare-earth ions experi- 
ence a cubic crystalline electric field of 8-fold fluorine 
coordination. Trivalent rare earths, however, experience 
crystal fields of differing strengths and symmetries 
depending upon how and where the additional positive 
charge is compensated. The occurrence of the different 
charge compensation mechanisms is dependent upon 
many factors associated with the total crystal growth 
process. Several resultant distributions of ions which 
generate crystal-field environments of cubic, tetragonal, 
trigonal, and orthorhombic symmetry are described 


ore earth ion 
9 Rented at 
Co?" site 


! o F TU 
[ut] @ F interstitial 


Fic. 1. A portion of the CaF; lattice showing the site of sub- 


situtional trivalent rare-earth ions and locations of charge C07 . 


pensating ions. 


of 


qti egg meii tite i ie 
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below and can be visualized by inspection of the CaF; 
lattice shown in Fig. 1. 

- 'The most commonly found trivalent rare-earth site 
in Caz is one of tetragonal (C,,) crystal-field symmetry. 
Charge compensation is attained by the presence of an 
F- ion in one of the six nearest-neighbor interstitial 
sites at a distance a/2— 2.73 A, where a is the lattice 
constant. The three crystal-field symmetry axes are 
the [100] crystallographic directions. This charge 
compensating mechanism has been well substantiated 
from investigations of density and lattice constants by 
x-ray methods;?? ionic conductivity,’ as well as by the 
early paramagnetic resonance results of Bleaney et al.5 
and others.* 

Paramagnetic resonance spectra exhibiting cubic 
symmetry are observed when charge compensation is 
not achieved locally. Since the excess F- in the inter- 
stitial site is not tightly bound to the rare earth, it 
can diffuse through the crystal at high temperatures. 
Friedman and Low" have used this mobility to convert 
from tetragonal to cubic sites. The sample is heated to 
disorder the rare-earth — F- pairs and then rapidly 
quenched. Whether the interstitial fluorine can diffuse 
to an energetically favorable site near a trivalent rare- 
earth ion or gets trapped elsewhere in the lattice de- 
pends upon the rate of cooling. Trivalent rare-earth 
ions of the second half of the 4f” shell have been found 
in sites of cubic symmetry more frequently than ions 
of the first half. The stability of the former in cubic 
sites may be associated with their smaller ionic radii 
and different local distortions. 

A crystal field of trigonal symmetry (Css) arises from 
the replacement of one of the eight nearest-neighbor 
F- at a distance V3a/4— 2.36 A by O?-, a mechanism 
discussed by Feofilov.9? The symmetry axes are the four 
[111] cube-body diagonals. The formation of this type 
of site is a function of the oxidation-reduction conditions 
during growth. Sierro'^! has studied the effects of 
thermal treatment and hydrolysis at high temperatures 
on the trigonal resonance spectra of rare earths in alka- 
line earth halides. If samples are heated in an atmosphere 
containing a small amount of water vapor, H;0 mole- 
cules dissociate at the surface, OH replacing F~ which 
goes off as HF. Trigonal spectra corresponding to OH- 
substituted for F- have been observed. With further 


2E. Zi d A. Udgard, Z. Anorg. Chem. 240, 150 (1939). 

3 Hi SEDGE id F. $. Martin, J. Chem. Soc. 349, 1847 
(1957). 

4 R. W. Ure, J. Chem. Phys. 26, 1363 (1957). 

oR Blears} P M. MERC and D. A. Jones, Proc. Phys. 

: d 69, 858 (1956). vee 
“ae M um W. Hayes, and M. C. M. O'Brien, Proc. Roy. 
A254, 273 (1960). $ 

enan and W. uu J. Chem. Phys. 33, 1275 (1960). 

3 P. P. Feofilov, Dokl. Akad. Nauk SSSR 99, 731 (1954). 

*T. V. Stepanov and P. P. Feofilov, Dokl. Akad. Nauk SSSR 
108, 615 (1957) [English transl.: Soviet Phys.—Doklady 1, 350 
1957) ]. 
: D D m J. Chem. Phys. 34, 2183 (1961). 
11 J, Sierro, Phys. Letters 4, 178 (1963). 
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thermal treatment, OH- dissociates, the resulting O*- 
diffusing through the crystal to compensate a trivalent 
rare-earth ion as above. 

A paramagnetic resonance spectrum having trigonal 
symmetry has been observed’? from U** in CaF, 
which is attributed to the replacement of two F- ions 
at opposite corners of the fluorine cubic by two O- 
ions. Although a resonance from a lanthanide series ion 
in a similar site has not been reported, Ce, Pr, and Tb 
are known to exhibit tetravalent states. 15 

Recently an orthorhombic spectrum of U** in CaF 
has been reported by Mahlab et al.!° The X-V axes are 
in a (110) plane, the X axes being at an angle of 19° 
with respect to [110]. There are twelve inequivalent : 
magnetic sites. Several models for the charge compensa- Er: 
tion are suggested, the most likely one being an F- in 
a fourth nearest-neighbor interstitial site. 

In addition to lines which can be explained by the 
above charge compensation mechanisms, other lines 
have appeared in both the optical and paramagnetic - 
resonance spectra of rare-earth ions in CaF», particu- 
larly at higher rare-earth impurity concentrations, From 
an inspection of Fig. 1, it is not difficult to conjure up - 
other charge compensating ion distributions at sites — 
more distant than described above which may possibly 
produce measurable deviations from cubic-field sym 
metry. For example, F- ions in second nearest-neighbor 
interstitial sites could produce a [111] trigonal spec- 
trum. [Note added in proof. Such trigonal spectra have — 
been reported for Yb?* in SrF2 and BaF: by U. Ranon 
and A. Yaniv, Phys. Letters 9, 17 (1964). ] The replace- : 
ments of two Ca?* ions by a trivalent rare earth and 
monovalent impurity," or three Ca?* ions by two 
valent rare-earth ions have been suggested. The 
appearance of many new lines as the rare-earth con- — 
centration is increased may be due to the presence of 
energetically favored pairs or clusters of ions. Some 
the lines may also be associated with other imp 
ions or crystal defects. 

Rare-earth-doped, alkaline earth halides exhibit ad 
tional interesting properties. For example, several 
trivalent rare-earth impurities have been converted 
divalent ions by exposure to ionizing radiation, "1% 
the resulting divalent ion being in a field of cubic 


Fe 


2 R. S. Title, P. P. Sorokin, M. J. Stevenson, G. D. Pettit 
J. E. Scardefield, and J. R. Lanekard, Phys. Rev. 128, 62 (19 

13 A, Yariv, Phys. Rev. 128, 1588 (1962). 

4 F, H. Spedding, International Science and Technology, 
39 (1962). 

15'W. Low and U. Ranon, Paramagnetic Resonance (Acade 
Press Inc., New York, 1963), p. 167. 5 

15 E, Mahlab, V. Volterra, W. Low, and A. Yariv, Phys. 
131, 920 (1963). : 

17 B, Bleaney, J. Appl. Phys. 33, um I, 358 (1962). 

18 N. Rabbiner, Phys. Rev. 130, 502 (1963). 

1 W. Hayes and J. W. Twidell, J. Chem. Phys. 35, 1521 (196 

» W. Hayes, G. D. Jones, and J. W. Twidell, Proc. Phys. S« 
(London) 81,371 (1963). —— : 

2: W. Hayes and J. W. Twidell, Proc. Phys. Soc. (London, 
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metry. Observations? of optical absorption and conduc- 
tivity due to free electrons in single crystals of CdF» 
doped with rare earths indicate charge compensation 
by means of electrons in conduction bands or shallow 
traps. Recently, optical detection of phototransfer of 


electrons in (Sm, Eu) doped Caf of the form, 
9, 3 h 5. 
Eut4-Sm?* 25 Ewt--Snr*, 


has been reported? and interpreted as due to electrons 
localized at vacancies in the vicinity of the trivalent 
rare earth. Photoconductivity of divalent rare-earth- 
doped CaF: has also been studied. 


CRYSTAL-FIELD PARAMETERS 


Information about the crystalline electric field split- 
ting of the free-ion states can usually be derived from 
studies of optical absorption and fluorescence spectra. 
For the trivalent rare-earth series in CaF», such data is 
in general incomplete, widely scattered, and a detailed 
interpretation frequently lacking or not warranted. 
Among the factors contributing to this status” sare that 
the spectra are complicated due to the presence of vibra- 
tional satellite lines, complex structure appears at 
moderate-to-high impurity concentrations, different 
types of spectra exist which are sensitive to changes in 
neighboring ion distributions and growth conditions, 
the centroids of crystal-field levels from a given state 
for different point symmetries do not coincide, the spec- 
tral intensities may be weak, and there is no polarized 
absorption to aid in the identification of cubic-field 
energy levels.?72$ 

The determination of crystal-field parameters from 
optical data requires that the site symmetry and energy 
level assignments be established. Subsequent fitting 
procedures and numerical computations employed to 
obtain consistent and meaningful parameters are fre- 
quently nontrivial for the multilevel rare-earth energy 
structures. It may be necessary to include crystal-field 
admixing between states of different quantum number 
J and the breakdown of ideal Russell-Saunders coupling. 
Such detailed calculations are, with a few possible ex- 
ceptions, not warranted for rare earths in CaF», since 
much of the published data is not complete and the site 
symmetries are not well established. We shall therefore 


(1965) D. Kingsley and J. S. Prener, Phys. Rev. Letters 8, 315 
23 P. P. Feofilov, Opt. i Spektroskopiya 12, 531 (1962) [English 
transl.: Opt. Spectry. (USSR) 12, 296 (1962). ‘ yes 
ic S H. Anderson and Z. J. Kiss, Bull. Am. Phys. Soc. 9, 87 
25 Feofilov, in a recent survey paper (Ref. 26) on luminescence 
of rare-earth ions in alkaline earth halides, suggested that a previ- 
ous assertion may again be à propos, namely, that “the variability 
and diversity of the spectral fine structure exhibited by many 
hosphors with line spectra led us to the conclusion that a general 
individual spectra cannot serve as the basis for interpretation." 
„# P. P. Feofilov, Bull. Acad. Sci., USSR 26, 437 (1962). 
21 Z. J. Kiss, J. Chem. Phys. 38, 1476 (1963). 
28 W. Low, Advances in Quantum Electronics (Columbia Univer- 


sity Press, New York, 1961), p. 138. 
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content ourselves in the next section with less refined 
estimates of the crystal-field parameters. 

The optical spectra of several divalent rare-earth ions 
in cubic sites in CaF: have been studied. Although the 
crystal-field parameters for these ions will differ from 
those of their isoelectronic trivalent neighbor, due 
principally to the influence of the larger effective nuclear 
charge, they can be of value if a suitable correlation can 
be found. Since some empirically derived guidance is 
available for extrapolating the crystal-field parameters 
of a divalent ion to the corresponding isoclectronic 
trivalent ion, we shall include several divalent ions in 
the survey of optical spectra. 

Crystal-field parameters can also be obtained from 
analysis of such factors as the g values, g-value shifts, 
and hyperfine constants observed in paramagnetic 
resonance experiments.” The results, however, are in 
general quite sensitive to the accuracy of the measure- 
ments; for example, a small uncertainty in g value can 
give rise to a large variation in the derived crystal-field 
parameters. We shall be interested principally in a 
qualitative correlation between paramagnetic resonance 
data and the crystal-field parameters determined from 
optical spectra. 

The Hamiltonians describing the crystal-field po- 
tentials of cubic, tetragonal, and trigonal symmetry 
may be written as 


JCcubic B4(0494- 5045) 4J- Be(O6°— 2106) ; (1) 
where Bg= SBU p= —21B,°, 
FCretra= Bs°O2?+ B4042- BsOe+B0e+O0s'0e', (2) 
and 


SC ig B20+ B490493- BO £-1- B06" 
-FBéOé--BVOs. (3) 


The parameters B," are related to the strength of the 
crystal-field components and, in the notation of Elliott 
and Stevens? are equal to (,4n™(r"), where £a is the 
operator equivalent factor, o, 8, y for n=2, 4 6, 
respectively. The O,"'s are angular momentum oper- 
ators transforming as the corresponding spherical 
harmonics. 

In a very useful paper, Lea, Leask, and Wolf?! 
(henceforth referred to as LLW) have determined the 
eigenfunction and eigenvalue solutions of Eq. (1) for 
various angular momentum J. Their diagrams of the 
eigenvalues are plotted as a function of a quantity * 
defined by 

x F(4)B, F(4)8A«(r) (D 
1—|x| F(6B. F(6)yAdr*) 


? W. Low, in Solid State Physics, edited by F. Seitz and D. 
TA (aradomie pes Inc., New York, 1960), Supply 2. do) 
. J. Elliott and K. W. H. St Proc. . Soc. (Lon 
A218, 553 (1953). evens, Proc. Roy. 50 
3 K. R. Lea, M. J. M. Leask, and W. P. Wolf, Phys. Chem- 
Solids 23, 1381 (1962). 
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| pou ae and the F’s are multiplicative factors Moe n Ce splitting " the er ground state of 
given by LLW. The orderi : RUE ;, trivalent rare-earth ions in a cubic field in Cal’. The x values are 
S fold is domum upon A ue ee ee vee using prane crystal-field parameters in Eq. (4). 
: ; ° ratio O € Iourtn- anc denotes ground state based upon estimated x value. 
= sixth-order cubic-field parameters By/Bs, and hence = oot —€— L————Á€— — áPu 
1 A from Eq: (4) upon x. The free-ion ground states of the — gare. Lowest 
9 trivalent rare earths and their decomposition in a cubic earth Configuration Decomposition Estimated state in 
r field are given in Table I. Also listed are estimated z 10" ground state pus i. ey e 
1 values obtained using approximate cubic-field param- — ce Afi, Psi PIS ^10 Ts 
S $ eters for CaF» and their variation throughout the Pe ape Di ET; ET ETS 09. Ful 
S rare-earth series, which are discussed later. The pre- pM 4 ru Ur er "o z 
5 2 x Y j^, 44 rvrberberts <U 
C dicted or experimentally observed cubic field ground — Sm** Af* Hus — DeETS 1.0 n 
y states are given in the final column of Table I. Eu" 4f', Wo Ti Ti 
r ? o x s T Th Af*, TFs Ditt sA r+, 09 Ta Fy 
The determination of the parameters 4,7(r")requires Dys* 47% Hia Titr +3, 0.6 Ts 
n the evaluation of various crystal-field matrix elements. —Ho* ^ 4f? J V;i--2P;--2P4-2T; -05 — 13518 
"pL z + Af Af. * 2 
» "ge This may be performed easily for the lowest LSJ state n iu n ; : P m 
c using the “operator equivalent" approach of Stevens?  yi»* ^ 4f9*F;, 0.7 I; 
n and existing tables of matrix elements; however, when 9 —— ~— ~— M MÀ a e 
x treating the crystal-field splitting of excited states 
n L'S'J' and matrix elements between different J states, ELECTRON PARAMAGNETIC RESONANCE 
d it is convenient to use a more general approach which AND OPTICAL SPECTRA 
a utilizes the tabulated 3-7 and 6-j symbols. This ap- 5 
e proach and its relationship to that of operator equiva- A summary of measured paramagnetic resonance g 
" lence are developed in the Appendix. A formula is also — values for trivalent rare-earth ions in various symmetry 
given, in terms of 6-7 symbols, which relates the oper- sites is given in Table II. Additional features of the 
x ator equivalent factors for two different J states. Thus, resonances together with reported optical spectra, 
y given x for a state J, it can be used together with Eq. estimated crystal-field parameters and energy level 
(4) to derive x for other states J’, and thereby, from structures, and expected resonances spectra for all 
LLW, the expected ordering of levels. rare earths are discussed below. 
| 
TABLE II. Paramagnetic resonance g values of trivalent rare-earth ions in CaF». Numbers in parenthesis 
2) following the g values denote references given in the text. 
Rare-earth : Crystal-field symmetry ; 
| ion Cubic? tetragonal Trigonal 
| Cet Ts: 2.00, 3.10.1 (34) gu =3.038+0.003 (35) gu =2.38-£0.03 (c) 
3) | Tz: 1.29740.001 — (34)(b) 1 1.396-0.002 (35) gı<0.1 c 
f . o Nds (42) (b) gı =4.412+0.008 (5) 
je i gı= 1.301--0.002 (5). 
it x Smt See (46) si1=0.907 £0.01 (16) 
; g =0.5H0.15 
st :1.991 0.002 49) gu=g.=1.992+0.001 (52) £17 1.992+0.001 (53) 
e Ù eT. i $u—17.7682-0.000 — (55) g= [2850.01 E 
2 ú gı <0.25 (54) £1«0.25 34 
jg at Ts: 2.6340.05 — (53) 8-730. - (55) (I) gu=16+1 (55) 
al ee © $48-015 — (3 81 =2.82:0.05 pl ar b 0340.05 ay 
i 7 46) ) £174. . 
re Tartoo GO ES MEUM 
Í + =6.7854 2 5 I =7.76+0.02 35) (I) gu=3- A LO 
p Er g=6.785+0.002 (35) O gin 6 5334-0.006 (35) 228512002 — (39) 
he | d) 21 6.762:0.02 (35) (II) gu=2.206+0.007 (59) 
(d) g = 
a | 1—9.112-0.01 (35) 4,78.843-E0.010 (59) 
i (II) gu 1.7462-0.002 (39) (b) 
he E a ae ee 1.3234:0,001 (65) 
t = 2 8 £u —2. (c) £1. i 5 
x l Yb queque 99) i381 (c) 1,—4.3992-0.004 (65) ; 
| í 
| — 
4) i » g values for Ts resonances are for Hlj[100]. 
à i b pce discussion in text. 
| d Eronvencited state at ~24 cm™. 
D. = 


2 i Proc. Phys. Soc. (London) A65, 209 (1952). E = r edi 
3 SK W. H. Stevens, Proc E NP Meiropolis, and J. K. Wooten, Jr, The 3-j and 6-j Symbols (Technology Press, Massachusetts 


Institute of Technology, Cambridge, 1959). 
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Cerium 


The cubic crystal-field ground state of Cet in Cals 
should be a Ts quartet. Resonances from an isolated Ts 
derived from J=§ would have g values of A, 7A /3, and 
11A/3 (A=6/7 for Ce*+) for a magnetic field H || 100]. 
Dvir and Low™ have reported broad, anisotropic reso- 
nances at 4°K and 1.2 cm with g— 2.00 and 3.1+0.1, 
which were interpreted as due to Ce** in cubic sites. 
At high fields a line was observed with gi— 1.297+0.001 
and was ascribed to the excited I'; doublet which has a 
predicted g value of 10/7. Since the line was seen at 
4°K, they conjectured that the cubic-field splitting 
must be very small, not much larger than the Zeeman 
energy. As we shall see, however, the cubic field split- 
tings of isolated rare-earth ions in CaF» are, in general, 
very much larger. We have examined several Ce: CaF 2 
samples but have not been able to positively identify 
any cubic-field resonances. 

The resonance of Ce?* in sites of tetragonal symmetry 
has been reported by Baker et a]3* The observed g 
values are derivable from a ground-state doublet of 

the form cos6| 2-3) — sin6 | F3), where cos#~0.91. Using 
this result and the tetragonal field Hamiltonian Eq. (2) 
yields 
tan20— (2X 512B43)/ (B$-- 20B$) 2 1.15. (5) 


This is very close to the cubic field value, tan26= (5)1/2/2 
=1.12, obtained by setting B2=0 and Bé—5Bf, 
thereby implying the tetragonal perturbation may be 


i Ifl | | iE 
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500 1500. 2500 3500 4500 — 
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! 
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E i i ^i f Fic. 2. Paramagnetic 
b s resonance spectrum of 
0% | — Ce:CaFs at 9.6 kMc/ 
500 1500 2500 3500 4500 sec and 4.2°K for vari- 
ous Ce concentrations. 
Spectrum (d) is from 
| an unannealed sample 
ot (see text). The mag- 
Š netic field, in Oe, is ap- 

(c) 0259; proximately ||[:100. 
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E d Dvir and W. Low, Proc. Phys. Soc. (London) 75, 136 
. 86 T, M. Baker, W. Hayes, and D. A. Proc. Ph; 
Lodlion] 73. 942 (1959). Jones, Proc. Phys. Soc. 
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very small. This, however, is in contradiction to results 
from other rare earths in CaF» where axial fields 
arising from nearest-neighbor charge compensation 
greatly perturb the cubic-field energy level structure, 
Also no evidence of a small energy level splitting has 
been found from the spin-lattice relaxation of Cg 
in tetragonal sites.’ Thus the above agreement may be 
coincidental, particularly since any admixing from the 
2F',,2 excited state, which should be important for 
Ce?*: CaF», has been neglected in obtaining this result 
and since we do not have sufficient information to evalu- 
ate B2, B4, and B+ separately. The above ground-state 
wave function and the eigenvalues for J= 2 in a tetrago- 
nal field, Eq. (6), are consistent with a case where the 
second-order crystal-field term does not dominate and 
As (7?) is positive. 

We have observed resonances arising from Cet 
in sites of trigonal symmetry. The g values given in 
Table II can be compared with those expected for a 
doublet of the form |-+$) for which g= 18/7—2.57, 
gı=0; the differences are probably due mainly to ad- 
mixing with states from */*7/2. The trigonal field eigen- 
values for an isolated J—4 state are given in Eq. (7). 

At higher rare-earth impurity concentrations, many 
additional lines frequently appear in the paramagnetic 
resonance spectrum as in the case of theoptical spectra.”? 
The increasing spectral complexity with concentration 
is illustrated in Fig. 2 where three spectra from samples?" 
containing cerium dopings, each differing by approxi- 
mately one order of magnitude, are shown. Such addi- 
tional spectra may be attributed speculatively to the 
simultaneous existence of several different charge 
compensation mechanisms and ion arrangements, crys- 
tal imperfections, or clusters of ions, but additional work 
is needed to clarify the situation. The effects of anneal- 
ing conditions upon the number of lines, linewidth, and 
line shape is also evident. Traces (a), (b), (c) are from 
samples which were annealed at 1200°C for 12 h and 
cooled at a rate of 20°C/h. Trace (d) is from an “as- 
grown” sample cooled at a rate of ~200°C/h. Note the 
presence of the two extremely low-feld resonances in 
Figs. 2 (a), (b), which correspond to the zero-field 
resonances at 9.37 and 9.48 kMc/sec reported by 
Kaplan.?? 

The optical spectra of Cet: CaF has been reported 
by several authors*!8?; however, no complete crystal- 
field analysis has been attempted. Whereas the free- 
ion ?Fs;2 and ?Fz;, states are separated by 2250 cm”, 
the infrared absorption spectrum? of Ce?*: CaF» 15 
characterized by two broad bands centered around 


36 See Ref. 1 for more information on sample preparation. 

31 D. E. Kaplan, Bull. Am. Phys. Soc. 8, 468 (1963). ish 

38 P, P. Feofilov, Opt. i. Spektroskopiya 6, 234 (1959) [Englis 
transl.: Opt. Spectry. (USSR) 6, 150 (1959) ]. 

3 Z. J. Kiss, Phys. Rev. 127, 718 (1962). 

“A, A. Kaplyanskii, V. N. Medvedev, and P. P. Feofilov, 
Opt. i Spektroskopiya 14, 664 (1963) [English transl.: OP! 
Spectry. (USSR) 14, 351 (1963) ]. 
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2200 and 3200 cm. Estimates of the fourth- and sixth- 
order cubic-field parameters for Ce**, based upon extra- 
polations of results for other rare-earth ions in CaF,, 
suggest an zx value for *£;;; which in LLW is near the 
crossing of the Ts and Ts energy levels. Assuming the 
cubic field is dominant, the above lines could correspond 
to transitions from the °F's,2 Ts ground state to the I's-I's 
and I’; states of ?F 7/2, respectively. At low tempera- 
tures several observers have noted a complex line 
structure about the 2200-cm^ region with a spread 
7-350 cm~. In addition to vibrational satellites, this 
structure could arise from a small I';-T'; separation or a 
combination of this and additional splitting of the Ts 
quartet of ?F;,; due to axial crystal fields, since the site 
symmetries in the samples are not reported. 

If one assumes the ~ 1000 cm™ spectral separation is 
due mainly to the cubic-field splitting of *Fz;; and 
equates it to the '7-T's energy difference and furthermore 
assumes «0.8, which is near the Ig-I's crossover, 
crystal-field parameters A 4(r4)~—320 cm and As(r^) 
~45 cmr! are obtained. Using these parameters, an 
energy separation of ~700 cm-! is predicted between 
the Ts and I’; states of °F5;2. The above results can only 
be considered as coarse estimates, however, and may 
differ greatly from the actual values because, assuming 
the transitions are assigned correctly: (1) the over-all 
°F, /2 splitting may not be due solely to the cubic field; 
(2) the x value used was only an estimate; and (3) no 
account has been made of the off-diagonal elements of 
the crystal field connecting the °Fs;2 and ?F7/2 states. 
The last is certainly not negligible if the crystal field 
is as large as estimated above. When the site symmetry 
and various transitions are established, the crystal-field 
parameters may be determined by solving the combined 
spin-orbit and crystal-field problem using a representa- 
tion in which Z; and S, are diagonal rather than using 
the more usual perturbation approach with a II 
basis. 


Praseodymium 


The cubic-field ground state of Pr** in CaF should 
be either a I’; or a Ts state depending upon the ratio of 
the fourth- and sixth-order crystal-field parameters. 
The estimated x value in Table I predicts that the mag- 
netic I's triplet is lowest. The cubic field wave functions 
describing the Ts state for J—4 are 


(2| 3e3)— @)'?| FL), 
3)^(2)—|—2»- 


This state behaves like a spin-1 triplet in a magnetic 
field yielding an isotropic g value of 2.00. 

We have searched for resonances at 9.6 kMc/sec and 
2-4°K in a CaF» sample containing 0.57% Pr. A weak 
signal was observed with an isotropic g value ~2 which 
consisted of a single line of width ~20 G superimposed 


u M. T. M. Leask, R. Orbach, M. J. D. Powell, and W. P. Wolf, 
Proc. mm Soc. (London) A272, 371 (1963). 
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upon a broad line of width ~100 G. Hyperfine struc- 
ture due to the one stable isotope Pr”, nuclear spin 
I$, which should be a distinguishing feature was not 
evident. It was not known whether Pr^* was actually 
present in sites of cubic symmetry in the sample studied. 
The absence of a cubic resonance could be due to the 
difficulty encountered in getting trivalent rare earths 
belonging to the first half of the 4f shell into cubic 
sites. An additional broad intense resonance correspond- 
ing to a zero-field splitting of ~0.3 cm"! was also ob- 
served which is being investigated further. 


Neodymium 


The cubic-field ground state of Nd?* in CaF; is a 
T, quartet for all x (x<0). The g values for the Ts 
resonances depend upon the ratio B,/B, and possible 
admixing from excited states. A cubic field resonance 
spectrum for Nd?*: CaF, was reported by Vincow and 
Low”; however, there has been some controversy? 
regarding the interpretation of the spectrum which 
indicates xc — 1.0. In a tetragonal field the ‘Toja free-lon 
ground state is decomposed into five Kramers doublets. 
Resonances for Nd?* in sites of tetragonal field symmetry — 
were found by Bleaney et al.° and fitted to a wave — 
function of the form a/+3)+6!+4)+c|%) with — — 
ce 0. mc 

The optical absorption and fluorescence spectra of 
Nd** in cubic sites have been studied by Kiss." The 
crystal-field parameters determined from the measured — 
splittings of the */; and */1y states, neglecting certain 
off-diagonal terms, are ae 


Adfy=—415 em, As(r)) 33 cm. 


Using these results in Eq. (4) yields x values of ~ —0.70. 
and ~—0.91 for ‘Is and “Tiya, respectively. The ob- 
served energy level spacings are similar but do not cor- 
respond exactly to those in the diagrams of LEW for - 
these x values, thereby indicating that consideration o 
the admixing of states of different J may be required to - 
determine satisfactory crystal-field parameters. The 
cubic field energy levels of */5, are T's” — 0, D,9—180 
cnr-!, and Te— 700 cm. 
We have examined the fluorescence spectrum“ in a — 
CaF. sample which, from an examination of the pa 
magnetic resonance spectrum, had Nd** predominantly 
in tetragonal sites. Transitions between ‘Fayz and taa 
were observed at 8680, 8750, 8940, 9048, 9078, 9220, 
and 9465 A at liquid helium temperatures (these mi 
include vibrational satellites). Lines were also observe 
at 8860, 9150, and 9335 A at room temperature which 
became much smaller or had disappeared at 77°K. In | 
an axial field, the upper fluorescence state 4Faj2 de- 
composes into two Kramers doublets separated by 6 : 
where B4 —a(4f*, 1F4,5)A ar"). If the disappearing 


«G. Vincow and W. Low, Phys. Rev. 122, 1390 (1960). 
a W. P. Wolf, J. Phys. Soc. Japan 17, Suppl. B-I, 442 (19 
“We thank B. DiBartolo for making these measurements, 
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are caused by thermal depopulation of the upper doub- 
let, this implies a relatively large tet ragonal field split- 
ting and an estimated |4 r?)| of several hundred 
cm^!, Such a field would greatly perturb the two Ts 
quartets of '/o/». Measurements! of the spin-lattice 
relaxation of Nd?* in tetragonal sites indicate an excited 
state at ~65 cm". 


Promenthium 


Neither the paramagnetic resonance nor optical 
spectrum of Pm?*: CaF», which does not occur naturally, 
has been reported. In a cubic field the lowest state from 
5T, is a T; singlet for all values of x (w<0). The remain- 
ing excited states should be relatively far removed. 


Samarium 


We have searched for resonances at 16 kMc/sec and 
4.2*K in an ~0.1% Sm-doped CaF: crystal. The sample 
was prepared so as to enhance the occurrence of triva- 
lent rather than nonresonant divalent samarium ions.'? 
The only resonance observed was one exhibiting tetrago- 
nal symmetry about the [100] axes which probably 
arises from interstitial fluorine charge compensation. 
The measured g values are gi. =0.9340.04, g,—0.57 
-E0.05, which can be fitted by an approximate ground 
doublet wave function cos0|--$)—sin0|-F$), where 
c0s67«0.88. The hyperfine structure of Sm? and Sm’ 

(nuclear spins 7=$) was observed for H||[100] where 
4172170x 10-4 cm. Low‘® has recently confirmed 


Approximate energy levels Sm?* (cm) 


1500 
Tg 
x 1356 
Hz, 
e T6 — — — 1043 
Iz 
986 
~1200 cm! 
I 
191 
E 
— —— 52 
T= 
Free ion Cubic field Axial field 


Fic. 3. Crystal-field splitting of the two lowest states of Sm3* 
in CaF 2. The cubic-field energy level structure is CURE "the 
axial field values are from Rabbiner (Ref. 18). : 


S I R. O'Connor and H. A. Bostick, J. Appl. Phys. 33, 1868 
^ 48 W. Low, Phys. Rev. (to be published). 
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these measurements and has also reported seeing addi- 
tonal resonances exhibiting cubic and trigonal sym- 
metry; however, their interpretation is not yet com- 
plete. In the cubic field of CaFs the Sm?* ground state is 
a Ts quartet. 

A complex fluorescence spectrum from Sm?* in a 
field of axial symmetry in CaF» has been observed and 
analyzed by Rabbiner.’* The energy levels for 9E 5; and 
51;,» are shown in Fig. 3. Although no optical spectra of 
Sm?* in cubic sites have been reported, the ordering of 
levels within the two lowest °H states is expected to be 
approximately as shown (no significance should be at- 
tached to the relative energy level spacings, however, 
which require knowledge of the cubic field parameters 
and treatment of crystal-field admixing between J 
states). From Fig. 3 it can be seen qualitatively that the 
axial field removes the degeneracy of the Ts states and 
produces a large perturbation on the cubic-field energy 
level structure. This is also evident from the resonance 
results where for tetragonal sites tan20 in Eq. (5) equals 
1.46 compared to the cubic-field value of 1.12. 

The symmetry of the axial field present in the above 
fluorescence spectrum was not established. In a tetrago- 
nal field, the energies of the three Kramers doublets for 
an isolated J=& state are, from Eq. (2), 


E(2-1)- —8B44-120B 2. 
E(2-5, 73) -4B4— 20B2 (6) 
+ 6[ (Bo+20B.9)?+20(B#)?}”. 


In a trigonal field the three doublets from Eq. (3) are 


E(43)=—2B—180B, 
E(k}, F4)=Bs+90B.! (7) 
+3[ (38$ — 10.5 9?-F 10 (Bg) ]^. 


These energies are for isolated J states. The crystal- 
field splitting for Sm**: CaF, in Fig. 3, however, are 
not negligible compared to the separations of the various 
J states. Crystal-field admixing must be taken into 
account in this case to obtain meaningful parameters 
B,™ and predicted g values. 

The fluorescence spectrum of Sm?* in a tetragonal 
field of CdF, has also been reported. Since CdF» has 
(1) an isomorphic lattice structure, (2) a lattice con- 
stant very nearly equal to that of Ca Fs, 5.40 À versus 
5.451 A, and (3) the Pauling ionic radii of Cd’? and 
Ca?* are approximately equal, 0.97 A versus 0.99 A, the 
crystal-field splittings and ordering of states in the tWo 
crystals should be similar. This is indeed found for 
Sm*+: CdF», where, in place of the energy levels in Fig. 
3, the corresponding levels in CdF» from 9H s are at 
0, 54, and 147 cm~! and from 5H; at 959, 1169, 1381; 
and 1586 cm™. We shall include data from CdF when 
similar results for CaF% are lacking. 


41 J. S. Prener and J. D. Kingsley, J. Chem. Phys. 38, 667 (1963). 
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Europium 


'The ground state of Eu** is a singlet "Fo. From the 
fluorescence spectrum of Eu** in isomorphic CdF;, 
energy levels from the first excited J state "F, are re- 
ported'5 at 307-309 and 400 cm~, In a crystal field of 
axial symmetry a J=1 state is split into a singlet and a 
doublet with an energy separation 3wA(r*). Setting 
this equal to the observed 92 cm~ separation and using 
a(4 f*, *Fj)  —£, yields Ao%r?)~150 cm. 


Gadolinium 


Resonances from Gd*+ have been observed in sites 
of cubic, tetragonal, and trigonal! symmetry. 
Since it is an S-state ion, the over-all crystal-field 
splittings are small, being 0.149 cm™ in a cubic field. 
In an axial field the 857/2 ground state is split into 4 
doublets with a total splitting of about 2 cm~! for 
tetragonal and trigonal (O?-) sites. 


Terbium 


The cubic-field ground state of Tb*+(7/,) should be 
either a T% singlet or a nonmagnetic I’; doublet, thus no 
cubic-field resonances are expected nor have they been 
reported. For x $0.8, as estimated in Table I, the ground 
state is a T; with a relatively near Ts excited state. 
Terbium resonances and hyperfine structure for ions in 
sites of tetragonal and trigonal symmetry have been 
investigated in detail by Forrester and Hempstead.” 
The resonances for this non-Kramers ion arise from 
states of principally |+6) character which have a gii 
of 124— 18. 


Dysprosium 


We have reported® the resonance spectrum of Dy** 
in sites of cubic, tetragonal, and trigonal symmetry 
elsewhere. The cubic-field ground state is a T's quartet 
with g values corresponding closely to those calculated 
from the J= 15/2 wave functions of LLW for Ps and 
x=0.6. The first excited state is a T; doublet located 
~8 cm~ above the ground state. Using this energy 
separation and the above x value, the cubic-field param- 
eters are 44(r1)— — (200-300) cm™ and Ag(r*)= 30-40 
cm~. The large uncertainty in the values arises from 
the fact that since we are near the crossover of the I’; 


8 T. D. Kingsley and J. S. Prener, Phys. Rev. 126, 458 (1962). 
0 i Low, Phys. Rev. 10: CS OO Vm 
80 C. Ryter, Helv. Phys. Acta 90, 3» 57). 
dg S R. Lacroix, Compt. Rend. 250, 2686 (1960). 
£V, M. Vinokurov, M. M. Zaripov, Yu. T. Pol’skii, V. G. 
Stepanov, G. K. Chirkin, and L. Ya. Shekun, Fiz. Tverd. Tela 4, 
2238 (1962) [English transl.: Soviet Phys.—Solid State 4, 1637 
1 z D , on 
l ve Vinokurov, M. M. Zaripov, Yu. E. Pol'skii, V. G. 
Stepanov, G. K. Chirkin, and L. Ya. Shekun, Fiz. Tverd. Tela 5, 
599 (1963) [English transl.: Soviet Phys.—Solid State 5, 436 
1963) ]. 
l 5 RU Forrester and C. F. Hempstead, Phys. Rev. 126, 923 


1962). 
: 55 d W. Bierig and M. J. Weber, Phys. Rev. 132, 164 (1963). 
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and I’; levels and are using the l';-T4 energy separation 
as a scale factor, a small change in x can give rise to a 
large change in the over-all crystal-field splitting of the 
ground state. Recently, Low’? has repeated measure- 
ments of the cubic-field spectrum at higher frequency 
and fields with accordingly more accuracy. He finds 
Aal )= —242:-20 em, 4e(r)— 41:55 em, where 
any admixing from “His at ~ 3500 cm! was neglected. 
Rabbiner’ has reported measurements of the fluores- 
cence spectrum of Dy**: CaF. The assigned cubic-field 
splitting of “Misis 2:615 cm^! with a I’; ground state 
and a first excited T's state at 53 cmt, The discrepancy 
with the above may arise from the presence of additional 
spectral lines from ions in different symmetry sites. 


Holmium 


The possible cubic-field ground states of Ho?* in 
CaF, are a nonmagnetic F,” doublet and a P5?! triplet 
(g value 2:5). In the region 0.5 € x, where the T; state 
is lowest, the first excited state is a P4? triplet which 
in the energy level diagram of LLW is nearly coincident 
with the P5?! doublet. 

We have searched for a resonance at X band and 
2-4°K in a CaF, sample grown by Optovac, Inc., 
containing a nominal Ho concentration of ~0.1%. 
No resonances were observed. Hayes et al^ have 
searched for resonances at K band and 4^K in 0.01 
and 1% Ho-doped CaF, samples which exhibited optical 
absorption spectra charactersitic of Ho?*. Again no 
resonances were found suggesting the ground state is a 
T; state. 

Dy** is isoelectronic with Ho?*. At 77°K the fluores- 
cence spectrum" of Dy** from ?/; to the °/, ground state 
exhibits a group of lines between 0-100 cm! and 
another group around 400-500 cm-!. Laser action has 
been observed to a terminal state 90 cm above the 
ground state. From an examination of the energy 
level diagram for J=8 in LLW, the above spectrum 
indicates that —0.6« x« —0.4. Assuming an over-all 
crystal-field splitting of 450 cm™ for 5/4 and an x value 
of —0.5, yields cubic-field parameters of A s(74)~ — 400 
cm~ and A¢(r*)~40 cm for Dy?*. 


Erbium 


Resonances from Er?* in sites of cubic,” tetragonal 


(1)*5-59 and (II), and trigonal (I) and (II)? symmetry — 


have been reported. We have examined several CaF2 
crystals at X-band frequencies containing 0.1-1.0% Er 
and have identified cubic, tetragona! (I) and trigonal 
(II) resonances. The spectrum at these frequencies and 
moderately high Er concentrations is complex and con- 
sists of many broad, anisotropic lines with overlapping 


26 W. Low, Proc. Phys. Soc. (London) 76, 307 (1960). 

57 N. Rabbiner, Phys. Rev. 132, 224 (1963); 133, 1 (1964). « 
5$ Z, J. Kiss and R. C. Duncan, Proc. IRE 50, 1531 (1962). 

9 U, Ranon and W. Low, Phys. Rev. 132, 1609 (1963). 
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Fic. 4. Low-field portion of the paramagnetic resonance spec 
trum of Ert: CaF arising from ions in cubic, tetragonal, and 
trigonal crystal fields. Hyperfine structure from Er? (I =3) is 
also present. Measurements were made at 9.58 kMc/sec, 4.2°K, 
and with Z||[110]. 


hyperfine structure, as evident from the low-field por- 
tion of the spectrum shown in Fig. 4. Studies of the 
angular dependence of the spectrum in both the (100) 
and (111) planes reveal the presence of several unac- 
counted for resonances which do not correspond to those 
reported above. One resonance exhibits trigonal sym- 
metry with gu=6.31+0.05 and g,—2.142-0.04; how- 
ever, these g values do not satisfy the relationship 
(gut+-2g1) = 3goubio(I'7) as do the axial symmetry Ert 
resonances observed previously. The total spectrum 
was examined with increasing temperature up to~S0°K. 
The trigonal (II) resonance exhibited rapid line broad- 
ening in the vicinity of 30°K. The cubic-tetragonal (T) 
group of lines also exhibited broadening, the cubic line 
showing the more rapid temperature dependence. By 
40°K the cubic and gu, gi tetragonal resonances could 
no longer be distinguished as separate lines. No definite 
evidence of the tetragonal (I) excited-state resonance 
reported by Baker eż al. was noted, although it may 
have been partially masked by other lines in the spec- 
trum. No other excited-state resonances were detected. 
Measurements! of the temperature dependence of Ti 
for the trigonal (II) resonance indicate the presence of 
an excited state at ~50 cm™. 

Paramagnetic resonance studies? have shown that 
the cubic-field ground state of Ho? in CaF» is a T's 
doublet in contrast to the T7 ground state found for 
isoelectronic Er?**. Since the change in the ratio of the 
fourth- and sixth-order cubic-field parameters is ex- 
pected to be small between a divalent ion and its iso- 
electronic trivalent neighbor, the above result suggests 
that for both Ho** and Er** we are in the vicinity of 
the crossover of the I's and T7 levels which for J= 15/2 
occurs at += — 0.46. 

The lowest excited crystal-field states in Ho?+: CaF2 
are? a quartet at 30.1 cm™ and a doublet at 33.8 cm! 


~ @H. R. Lewis and E. S. Sabisky, Phys. Rev. 130, 1370 (1963). 
ou H. Weakliem and Z. J. Kiss (to be published). 
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which from LLW suggest an x value = —0.52. Using 
these values, the crystal-field parameters for Ho% are 
A,(rt)= —190 cm and A¢(r*)~16 cm™, where any 
admixture from 1713/2 i$ probably small and has been 
neglected. Since x is smaller for Ert, the sixth-order 
term must be correspondingly more important. 

Pollack‘? has reported a total separation of about 
400 cmc for the 8 crystal-field levels of “15/2 for Er 
in an axial field. As an order-of-magnitude estimate, 
assume a total cubic-field splitting of approximately 
the same size, 400 cm™, and an « value of — 0.4. These 
yield 4461) —310 cm™ and Ac(r9) 5:44 cm7, which, 
while not based upon direct spectroscopic data, are, 
nevertheless, consistent with expected values extra- 
polated from results for other ions. 


Thulium 


The cubic-field ground state of Tm*t may be either 
T» or I's. The estimated x value in Table I predicts a 
T» singlet lowest. Hayes and Twidell® did not observe 
any resonances in a 0.05% Tm sample at X band and 
20°K, but did observe a Tm? resonance after x 
irradiation. It has been reported that an axial spectrum 


of Tm?* has been seen.!5.9 


Ytterbium 


The ground state of Yb** in cubic sites in CaF. is a 
T; doublet. The wave function describing the Ty 
doublet for J= is (V3/2)|+)— (3) | F$) which yields 
an isotropic g-value of 24/7— 3.429. Low?! has meas- 
ured g—3.426--0.001; Hayes and 'Twidell? report 
3.4432-0.002. We find a g value of 3.4414-0.004. This 
resonance is observable up to temperatures ~100°K, 
which implied a comparatively long spin-lattice relaxa- 
tion time.! 

We have also observed resonances due to Yb** in 
tetragonal sites. The g values in Table II are consistent 
with a doublet derived from the cubic I7, since (gu-t-2g) 
=3geuvic(I'7) and indicate that the other states of 
?F.;, are sufficiently far removed to have little effect 
on the g values.9 They can be fitted by an approximate 
doublet wave function 0.806|4-5)—0.591|-F2). Two 
trigonal field resonances have been reported by Low 
and co-workers.?!.65 

Kiss? has investigated the absorption and fluores- 
cence spectrum of isoelectric Tm?* in CaF». The 7F 7/2 
ground state is split into a I; doublet lowest, 8 Te 
doublet at 547 cm™, and a T's quartet at 556 cm™. The 
cubic-field parameters derived from these values are 
4A4()- —180 cm™ and 44(79)—35 cm”, which cor- 
respond to an x value of 0.74. i 

The absorption spectrum of Yb?* was also examined 


€ S. A. Pollack, Bull. Am. Phys. Soc. 8, 614 (1963). 

*5 W. Low, J. Phys. Soc. Japan 17, Suppl. B-I, 440 (1962. 30 
(3. Low, Phys. Rev. 118, 1608 (1960); J. Chem. Phys. 34, 

55 W. Low and U. Rosenberger, Compt. Rend. 254, 1771 (1962 
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briefly by Kiss. Two dominant lines were observed at 
10240 cm and 10870 cm, which have also been 
observed by others.?*/^5^5 Tf these lines are ascribed? 
to transitions to the two cubic-field levels of the ?F5/2 
excited state, an 4,(7*) value of —280 cm™ is found. 
Comparing the crystal-field splitting of isoelectronic 
Tmt and Yb?*, the larger value for the latter reflects 
the effect of the additional nuclear charge of the 
trivalent ion. Approximate cubic-field energy levels for 
Yb?*, based upon Kiss’ assignment and assuming 
x~0.7, are shown in Fig. 5 together with expressions 
from the eigenvalues. The estimated over-all crystal- 
field splitting of 22/2 is ~800 cm~. | 

Low has investipated the electron paramagnetic 
resonance spectra?! and the fine structure in optical 
spectra?! of Yb:CaFs and suggested that the cubic 
field was relatively weak with an over-all ground-state 
splitting of ~12 cm™. Interpretations based upon such 
fine structure are, in general, inconsistent with the 
present results and estimates of the magnitude of the 
cubic field for other rare-earth ions. The existence of 
such small splittings for Yb: CaFs are also not evident 
from measurements of the g value? and the rate and 
temperature dependence of the spin-lattice relaxation.! 
Measurements” of the g shift, temperature dependence, 
intensity, and infrared aborption of Yb?* in cubic 
sites in CdF» indicate an excited state at 250-70 cm™ 
and an over-all ?Fz;, splitting of 2:116 cm™. We do not 
understand why these values are so much smaller than 
those estimated for Yb:CaF». Further work to confirm 
the measurements and verify the interpretations in the 
two cases are needed. 

The eigenvalues for a J=4 state in a tetragonal field 
are given by 
E(+3, EE —b3$—8b4--2b9 

+[(2089— 550—758) +3 (5b — 7b |^, (8) 
E (2-3, F4) =02+ 8b — 200 
-E[ (659 — b030) -35 (b F30) T, 


where 
b? = 1260B6, 


bt =60B6. 


b)—3B$£, 
b4—60B4, 
b4=12B4, 
The observed ground state is of the form cos6| +3) 


—sin@|=F$) for which, from Eq. (2) and the doublet 
wave function found experimentally, 


tan20— V3 (5545 7b6)/ (252° — 55.9 — 759) —3.18. (9) 


The difference between this value and the cubic-field 
value tan2@=—vV3, corresponding to 54—0, bi—54, 
and 5442 — bę, again reveals qualitatively that nearest- 


56 P, P. Feofilov, Opt. i Spektroskopiya 5, 216 (1958). 
61 p. P. Pashinin and A. M. Prokhorov, Paramagnetic Resonance 
(Academic Press Inc., New York, 1963), p. 197. 
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fic. 5. Estimated cubic-field splitting of Yb** in CaF, 
assuming A 4(r+) = —280 cm™ and +=0,7. 


Free ion 


neighbor charge compensation produces a large change 
in the crystal field at the trivalent rare-earth site. 


CONCLUSIONS 


The cubic field in CaF, at trivalent rare-earth sites 
appears to be characterized by parameters in the order 
of magnitude 


~Ag()~250-430 cm7!, 
A¢fr®)~30-45 cm. 


The extrapolation of data from divalent ions to the 
isoelectric trivalent ions is made subject to two assump- 
tions based upon limited experimental evidence, namely 
that the ratio of the fourth- and sixth-order parameters 
of the trivalent ion is in the order of, or less than, that 
of the divalent ion, and that the trivalent ion param- 
eters are larger than those of the divalent ion. The 
axial field parameters for nearest-neighbor charge 
compensation are more difficult to estimate from exist- 
ing data. The A2%(r*) term, however, probably bas a 
magnitude of several hundred cm™ for both tetragonal 
(n.n. F~ interstitial) and trigonal (O°) fields and hence 
produces a large perturbation on the cubic-field energy 
levels. The data are usually insufficient to evaluate the 
Ag, Ad, Ad, and As! terms for the noncubic crystal 
fields. 

The parameters A ,"(r") are expected to vary through- 
out the rare-earth series. If one assumes the 4,"'s 
are constants determined by appropriate lattice sums, 
then any variation of the crystal-field parameters would: 
be due to (r^). In the early work of Elliott and Stevens, 
this variation was approximated by a multiplicative 
factor (Z—53)-^^, where Z is the atomic numbef. 
Application of this formula has not been universally 
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successful, however. This point has been discussed by 
Freeman and Watson® who, from analyses of data for 
rare earths in trichlorides and ethylsulfates and using 
calculated values of the (r") integrals, conclude that 
the constancy of 4," with Z is not valid. 

The data on the cubic-field parameters in CaF for 
most trivalent ions are too uncertain to derive any 
explicit variation with Z. Qualitatively, A .(r*) appears 
to exhibit a decrease with increasing Z not unlike that 
predicted from the variation of Freeman and Watson’s 
(r'), Ag(r9), on the other hand, does not show the large, 
monotonic decrease expected from solely the (r9) varia- 
tion, but rather has a moderately constant or slightly 
increasing value throughout the series. The x values 
for the trivalent rare earths in cubic sites in CaF» 
given in Table I are estimated by extrapolating the 
behavior of Aa(r‘) and A¢(r®). Although these v values 
must be accepted with considerable reservations, the 
predicted cubic-field ground states are not inconsistent 
with available paramagnetic resonance data, and they 
explain the absence of observed cubic-field resonances 
for Tb?*, Ho**, and Tm?*. 


APPENDIX: CALCULATION OF CRYSTAL-FIELD 
MATRIX ELEMENTS USING 3-j 
AND 6-j SYMBOLS 


The crystal-field potential V at a rare-earth ion 
site can be expanded as 


V=} V=} Arr") Y: (0,0) , (A1) 


k,q k.q 


where Y;*(0,9) is the spherical harmonic of degree k 
and azimuthal quantum number q, (7*) is the mean value 
of the kth power of the 4f-electron radius, and the 
A,?s are parameters related to the strength of the 
crystal field. Using the concept of operator equivalents, 
a general term in the crystal-field Hamiltonian may be 


AND R. W. 
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rewritten as 


ts" )0s, (A2) 


where the ¢; is Stevens’ operator equivalent fac‘ or 
a, B, y for k=2, 4, 6, respectively, and the O,"s are 
angular momentum operators, a complete list of which 
has been given by Orbach.” Tables of matrix elements 
of these operators within a J manifold and operator 
equivalent factors for the ground states of the trivalent 
rare-earth ions are located in various places in the 
literature. While this approach is convenient for the 
ground-state computations, when calculating the crys- 
tal-field matrix elements of excited states, crystal-feld 
admixing between different J states, and in the treat- 
ment of spin-lattice relaxation where all values of k and 
q may be allowed, the more general approach given 
below which takes advantage of the tabulated 3-j 
and 6-j symbols of Rotenberg e al. has been useful. 

Elliott, Judd, and Runciman” have shown how the 
crystal-field matrix elements of a rare-earth ion in a 
configuration f^ can be calculated using the tensor 
operator techniques developed by Racah.” Their ex- 
pression for the matrix elements of U,*, defined as 
(4x/2k4-1)?Y,*, rewritten in terms of 3-7 symbols 
(:::) and 6-7 symbols {:::} is 


(f^WUSLJJ.|U E| f"W'U'S'L'J7 7) 
—&(SS^)(—1)*5*"EQJ4-1) QJ 1) J^ 
jp EESTI ION 
26 i 
E et) 
x (f"WUSL|U*|f*W'U'S'L'), (A3) 


where for a rare-earth ion within a given configuration 
only k=2, 4, 6 need be considered. The reduced or 
double bar matrix element at the end of Eq. (A3) are 


"TABLE IIT. Reduced matrix elements of the lowest S-L term of the trivalent rare-earth ions. 
The first + or — sign corresponds to the first ion of each pair. 


Ion Ground term (rSL||U?||r.S.L) (rSL||Us||z.S.L) (S L||U*|| rS) 
4 2 

CUR F +1 +1 +1 

POM 3H +3(1113/2X7)12 £3 (13/7)? 44(5X17/7)” 

SERE 4T + (13/2X 3X 11)! = (2132 17/32 112) + (5/11) (17x 19/3X 7)! 
H- 

iw Di = (13/2X 3X 11)1/2 + (2X 132 17/32 112)1/2 = (5/11) 17x 19/3X 7) 
3+ 2 

ae i H =4}(11X13/2X7)12 +3 (13/7)12 F4 (5X17/7)"2 

Eus 

M LE, Tl Ti 1 


68 M. J. D. Powell and R. Orbach, Proc. Phys. Soc. (London) 78, 753 (1961). 


69 A. J. Freeman and R. E. Watson, Phys. Rev. 127, 2058 (1962 
1.7? R. Orbach, Proc. Roy. Soc. (London) A264, 458 EOM i 


71 J. P. Elliott, B. R. Judd, and W. A. Runciman, Proc. Roy. Soc. (London) A240, 509 (1957). 


7? G. Racah, Phys. Rev. 62, 438 (1942); 63, 367 (1943). 
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TABLE IV. Normalization factors N for Eq. (A4). Numbers in column F are multiplicative factors common 
to all clements in the row. Other column headings denote q values, 
Ny F 0 1 2 3 4 $ 6 
us BC UAE 1 1/Qx3)*. —— AM20x3p CP CAS 3 ee al 
Na X7/11)12 1 1/4(5)12 1/202 5): 1/4(5X 7)! X5XT1) 
Net —1600/3X11X13)/2. 1 ; Visas ites Oen 


tabulated in Table III for the lowest L-S term of the 
trivalent rare-earth ions, where r denotes the remain- 
ing quantum numbers required to specify the state. 
Many of the reduced matrix elements for the various 
ions are related by simple multiplicative factors deriv- 
able from the relationships discussed by Judd.” The 
reduced matrix elements of U* for other L-S terms may 
be calculated using Racah's fractional parentage coef- 
ficients as shown by Elliott, Judd, and Runciman. 


1/20X3XT)^ 1/(BKSK7)"? 1/2(X5xT)? 1/30X7)8 — 1/6(0X11y? 1/QXTX11)* 


equivalent factors, then (A2) should be replaced by 


NA, a(r) U ae . (A4) 


The normalization factors .V;* for the various £'s and 
q's are listed in Table IV. 

In terms of the 6- j symbols, the operator equivalent 
factors, (4, for different J states are related by 


c : (SLJ']gd]SLJ^) 2741 
Nielson and Koster™ have recently tabulated these — = (—1)/74- - 
matrix elements for f” configurations. (SLJ | S EJ) 2J-F1 


In Stevens' operator equivalence approach, certain 
normalization factors have been absorbed into the 
definition of the 4,1. Thus if Eq. (A3) is used to evalu- 
ate crystal-field parameters and the 4,?'s are to be 
compared with those derived elsewhere using operator 

3 B. R. Judd, Proc. Roy. Soc. (London) A250, 562 (1959). 

** C. W. Nielson and George F. Koster, Spectroscopic Coefficients 


for the $^, d^, and f” Configurations, (The M.I.T. Press, Massa- 
chusetts Institute of Technology, Cambridge, 1963). 
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‘The temperature dependence of the spin-lattice relaxation time 71 


of trivalent rare-earth impurities in i 


CaF» has been measured at 9.6 kMc/sec using pulse saturation techniques. The rare earths investigated | 


include Kramers ions (Ce?*, Nd:*, Dy**, Er**, Yb?+), one non-Kramers ion (T b3*), and one S-state ion 
tetragonal, and trigonal crystal-field symmetries have 


(Gd**). The relaxation of ions in lattice sites of cubic, 


been studied and are found to be different. At liquid-helium temperatures the relaxation usually exhibits a 


T^! temperature dependence; however, the magnitude 
than estimated single-phonon relaxation times based upon the o 


s are approximately two orders-of-magnitude shorter 


rbit-lattice interaction. Measurements of 


the Ce?* relaxation in five samples containing from 0.08 to 1.6% Ce combined with results from other ions 


show that Tı decreases with increasing rare-earth concentration. 
saturation are observed at low temperatures and are discussed with respect to possible cross-relaxation 


Multiple exponential recoveries from 


within inhomogeneous spin systems and to other faster relaxing systems. At higher temperatures examples 
of both Raman and Orbach two-phonon relaxation processes are found. Calculations of two-phonon relaxa- 
tion times using estimated values of the dynamic crystal fields, energy level structures, and ion wave func- 


tions yield coarse order-of-magnitude agreement with experimental values. 


INTRODUCTION 


PIN-LATTICE relaxation of rare-earth ions has 

been the subject of considerable attention in the 

past few years, prompted by the discovery of the im- 

portance of two-step relaxation processes involving low- 

lying excited states! and the subsequent theoretical 

" treatment by Orbach.? The relaxation of several dif- 

ferent rare-earth ions has been measured in ethyl sul- 

fate, double nitrate? and garnet crystals! and com- 

pared with theory with varying degrees of success. We 

have investigated the temperature dependence of the 

spin-lattice relaxation time, T, of trivalent rare-earth 

ions in Cals, including examples of Kramers (Ce?*, 

aE Nd**, Dy*+, Ert, Yb*+), non-Kramers (Tb*+) and S- 

3 state (Gd**) ions. This crystal is of additional interest 

since trivalent rare earths can enter the lattice substitu- 

tionally in sites having cubic, tetragonal, and trigonal 

crystal-field symmetry; thus one can compare the 

characteristics of the relaxation of ions in different 
symmetry sites in the same host lattice. 

Calculations of the rate of spin-lattice relaxation and 
predictions of the relative importance of various relaxa- 
tion mechanisms require knowledge of the energy level 
structures, ion wave functions, and strength of dynamic 
crystalline fields. Unfortunately, such information is 
not well established for most rare-earth ions in CaF». 
In the preceding paper® (henceforth referred to as I), 
available optical and electron paramagnetic resonance 
data were reviewed to obtain estimates of the cubic- and 
axial-crystal-field parameters and electronic energy 
levels. "These results are used to estimate the magnitude 


- 1C. B. P. Finn, R. Orbach, and W. P. Wolf ^ 
(London) T, 2i (1561). , an Wolf, Proc. Phys. Soc. 
2. Orbach, Proc. Roy. Soc. (London) A264, 458 (1961). 
3 P. L. Scott and C. D. Jeffries, Phys. Rev. 127, 32 (1902) and 
Bd ans z 
a [, Svare and G. Seidel, Paramagnetic Ri A i 
. Press Inc., New York, 1963), p. 430... cm Oden 
De Wd M. J. Weber and R. W. Bierig, Phys. Rev. 134, A1492 (1964). 
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and importance of various one- and two-phonon relaxa- 
tion processes. Pertinent details of the resonance spec- 
tra, g values, etc., are also given in I. 

The temperature dependence of T, for the above 
rare-earth ions has been measured from 2°K to as high 
a temperature as the time resolution (73 usec) and 
signal amplitude would permit. These results are pre- 
sented following a brief review of the theory of spin- 
lattice relaxation in CaF? arising from modulation of the — 
crystal field by thermal lattice vibrations and a discus- | 
sion of the experimental approach. The Tys at liquid- 
helium temperatures usually exhibit a T^ temperature 
dependence characteristic of one-phonon relaxation | 
processes; however, the magnitudes are about two | 
orders-of-magnitude shorter than estimated relaxation — | 
times based upon the orbit-lattice interaction. In the | 
case of Ce?*, the low-temperature relaxation times are — | 
found to be a function of rare-earth concentration. Some ^ 

i 


possible mechanisms which may account for the en- 
hanced and concentration-dependent relaxation rates 
are discussed qualitatively. At higher temperatures Ti 
decreases rapidly with increasing temperature. In most 
cases the magnitude and temperature dependence of 
the relaxation times are in reasonable order-of-magD* 
tude agreement with those for two-phonon Orbach an 
Raman processes calculated by using estimates of the 
crystal-field parameters and energy level splittings 
obtained in I. 


THEORY 


We are interested in the spin-lattice relaxation rate 
of the lowest resonant doublet of a rare-earth ion 1 7 
crystal field. Since no evidence of a phonon bottleneck 
was observed in our experiments, we shall be concerne 
with the true spin-lattice rather than the Jattice-bat 
relaxation rate. If the excited states are relativey p. 
removed from the resonant ground doublet, then, © 


lowing a short intense rf pulse applied to excite the a 
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system from its equilibrium Boltzmann population, this 
two-level system will exhibit a simple exponential re- 
covery with a characteristic time T. 

There are several spin-lattice relaxation mechanisms 
which may be operative for rare-earth ions. The theory 
of rare-earth-ion relaxation has been developed by Or- 
bach? and reviewed by Scott and Jefíries? We shall only 
quote the final results here; the reader is referred to the 
above papers for more detail. 

The simplest relaxation mechanism is a one-phonon 
direct process in which the spin-relaxation transition is 
accompanied by the absorption or emission of a single, 
energy-conserving phonon. The expression for T, due to 
the interaction between a spin and the fluctuating elec- 
tric crystal field is given by 


1 BN? 
TT =; V»™(|b)|? coth(6/2kT 1) 
Tip s Kal > [P)|* coth(6/2kT), (1) 


where p is the mass density of the sample, v an average 
phonon velocity, 6 the energy of the resonant transition 
between spin states |a) and |b), and the V,"'s are 
components of the dynamic crystal-field potential. The 
orbit-lattice interaction does not connect time conjugate 
states; consequently, if |a) and |b) are zero-order state 
functions, Eq. (1) is applicable only to non-Kramers 
ions, since for a Kramers ion the ground state is at 
least doubly degenerate and the crystal-field interaction 
between these states obviously vanishes. A Zeeman field 
may, however, admix states of the ground doublet with 
an excited state |c} at an energy Ae. Spin-lattice relaxa- 
tion transitions are now allowed at a reduced rate 
governed by the magnitude of the Zeeman perturbation 
and given by 


1 3 /0N3/28AN* 
s o ) (el Hd eKel varl) 
Tip 2npv'h\h A* Ee 


Hlal E Valo e|H-J]5)|}? coth(6/2kT), (2) 


n,m 


where one must sum over all effective states |c). At 
temperatures above ~ 1°K, 5 is usually <2kT and hence 
the direct process relaxation time, Tıp, is characterized 
by a T3 temperature dependence. 

An important relaxation mechanism for many rare- 
earth ions is the so-called Orbach process. This is es- 
sentially a two-step process involving real transitions to 
and from an excited state |c) accompanied by the crea- 
tion and annihilation of two high-energy phonons. For 
this mode of resonance relaxation between the phonons 
and crystal-field splitting, one requires A.<k@ where 0 
is related to the cutoff frequency for phonons and is 
approximated by the Debye 6. The relaxation rate for 
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and is characterized by an exponential temperature 
dependence at low temperatures, T (A,/L). 

A two-phonon Raman relaxation process is one in 
which the energy of the spin transition is conserved dur- 
ing an inelastic phonon scattering. If the first-order 
term in the dynamic crystal-field interaction is used in 
second-order perturbation theory to calculate the transi- 
tion probability, one finds that for non-Kramers ions 
at low temperatures, where 7««6, 


—S 


(al Vnde] Vw 15)? 


Tir Ap!g A 2 nm ET 
nm b ó : : 
x(=) 0. 
h : 


For Kramers ions, again due to the consequences of 
time-reversal symmetry, there is a partial “Van Vleck 
cancellation"? such that in place of Eq. (4) one has — 


1 oth? 3 ARIN 
—~=——_—| *; (al Vrlo) V7 Dies E (5) 
Tig wm pA? m h s 


Another two-phonon Raman contribution arises from 
the use of the second-order term, in the variation of th 


crystal field caused by lattice vibrations, in first-ord 


perturbation theory and is given by 


1 9x6! NAJA = 
—=——_| 5 vin) — 


Tig Apr? nm L 


for non-Kramers ions at low temperatures. This term 
is the same order of magnitude and additive to the result 
in Eq. (4). As for the direct process, the above interac- 
tion is active for Kramers ions only via the eigenstate 
admixing caused by the Zeeman field. This introduces a 
multiplicative factor, (24@H/A.)*, into Eq. (6). In. 
general, the dominant Raman relaxation process for 
Kramers ions is given by Eq. (5) with a Tır propor- 
tional to 7-? compared to the T~” temperature depend- 
ence for non-Kramers ions. 

When a four-level Ts quartet is lowest or in the case 
of an S-state ion, the effective spin system may not. - 


eJ. H. Van Vleck, Phys. Rev. 57, 426 (1940). 
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consist simply of a single isolated doublet. The relaxa- 
tion modes in such multilevel systems are, in general, 
more complex. At low temperatures the dominant relaxa- 
tion process between pairs of levels may occur via 
multistep direct processes and exhibit a transient time 
dependence proportional to a linear combination of 
several exponential terms.’ Also, at the low-temperature 
end of the Raman relaxation region, a Tı mechanism 
proportional to T-* may be dominant.’ These cases, 
however, are somewhat specialized examples of rare- 
earth-ion relaxation. 
To evaluate T, from the expressions in Eqs. (1)-(6) 
the matrix elements of Vn” are required. For the rare 
earths, one need only consider z even and <6. Which 
components of V," enter into the calculation of the 
relaxation time in a given crystalline environment can 
be determined from a normal-mode analysis? of the 
vibrations of the paramagnetic ion and the nearest- 
neighbor surrounding charges, these being assumed to 
generate the principal orbit-lattice coupling. The vibra- 
tional modes transform as some combination of the ir- 
reducible representations of the crystal symmetry 
group. For CaF», which has cubic symmetry, the vibra- 
tional motion of the rare-earth ion and the eight nearest- 
neighbor F- ions transform according to the following 
irreducible representations of the O, group: 


T'vib (0x) =A 1t A 2u t IBAF Jar Da rate 2T» Tou D (7) 


where for transitions within a given electronic configura- 
tion only the even vibrations need be considered and the 
translational and rotational terms have been subtracted 
out. Similarly, for the tetragonal and trigonal site sym- 
metries described previously,’ 


Tyin (Cp) =5A 14+ 42+2B,1-4B2+ 6E , (8) 
and 
Tyin(Ca»)= 6A +424 7E. (9) 


Comparison of the irreducible representations in (7), 


(8), and (9) and those according to which the V,"'s: 


transform, shows that in all cases the vibrational motion 
is factored into a number of representations such that all 
V,”s are allowed. Thus there is no reduction and simpli- 
fication in the number of terms which must be considered 
in applying the above theoretical expressions to the 
evaluation of T, for ions in the different symmetry 
sites in CaF». 

Attempts to make explicit calculations of spin-lattice 
relaxation times are plagued by the necessity of making 
numerous assumptions and approximations. Given 
expressions for T, as in Eqs. (1)-(6), which are derived 
using a simplified treatment of the lattice vibrations, 
estimates must be made of the dynamic crystal-field 
components. Two approximations which have been 
used?? in making such estimates are that (1) the orbit- 
lattice interaction parameters may be approximated by 


e 


1R. W. Bierig and M. J. Weber, Phys. Rev. 132, 164 (1963). 
8 R. Orbach and M. Blume, Phys. Rev. Letters 8, 478 (1992) 
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static crystal-field parameters, ke, |(x8A s"/àx)] 
z |A 4" (xo)|, where « is a ligand coordinate displace- 
ment, and (2) |An™| ~|A,°|. Although the latter rule 
was based upon empirical data, it is not universally 
satisfied. Other approaches? include estimates of the 
strength of the orbit-lattice parameters obtained from 
uniaxial stress and ultrasonic spin-phonon absorption 
experiments. Convincing calculations of the parameters 
would require knowledge of the nature of the ligand 
fields, shielding effects, and vibrational motion. In the 
following we shall consider those combinations of matrix 
elements of the orbit-lattice interaction which appear 
in the expressions for the direct, Orbach, and Raman 
Tys as parameters to be determined experimentally. 
These are compared later with rough order-of-magnitude 
estimates using the above approximations and the crys- 
tal-field parameters and energy level splittings found 
in I. 

For a resonance frequency of 9.6 kMc/sec and taking 
p=3.18 g/cm? and a mean phonon velocity, v— 4X 10° 
cm/sec for CaFs, Tı for the direct process in Eq. (1) 
becomes 


Tip= (1.5/TM p°) (sec), (10) 
where for non-Kramers ions, 


M yp?z |(a| X; V«"|b)|*(cm?), (11) 


n,m 


and for Kramers ions, 
Mp2= QBA/A(|(a] H-J|oXc] E Va" ld) 


Hal X V,"|o(c| H-J|b)|y(em-). — 02) 


n,m 


For the Orbach process, from Eq. (3); 


exp(A./RT) 
vo (sec) , (13) 
4A3 (cm-3)M o? 
where 
| X (al V4" |oel] V," |b)? 
Moz xr (cm™). (14) 
| 3: (alvale X €] Venn |0) | 


A summation is to be made over all effective excited 
states |c) in (12) and (14). For Kramers salts, the T^ 
Raman relaxation mechanism in Eq. (5) is usually 
dominant and reduces to 


Tin = (107A4/T9M x!) (sec) (15) 
where 
Me Y (al Verlotel Va" ayl (nc. 09 


n,m 
nm 


? M. Blume and R. Orbach, Phys. Rev. 127, 1587 (1962). 
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Scott and Jeffries? found from the actual computation 
of sums of the type Ð n,m| (a! V,7|c)|? (assuming the 
individual terms to be incoherent), that for several 
rare earths these sums were of the same order of maeni- 
tude. Thus one might expect that Mp? (allowing for 
Zeeman admixing), Mo, and M? would not differ 
greatly. Exceptions are always possible, however, and 
would depend upon the actual state wave functions. 
Note, also, that if the magnitudes of [x (al V." 1e)? 
and |2 (c| Va”|b)|? in Eq. (14) are very different, 
Mo will be determined by the smaller of the two. 


EXPERIMENTAL 


A schematic representation of the pulse saturation 
X-band spectrometer used in measuring spin-lattice 
relaxation is shown in Fig. 1. The principles of operation 
are straightforward. A VA 153 reflex klystron is used as 
a main rf power source. The klystron environment is a 
temperature-stabilized oil bath, an important improve- 
ment for obtaining good low-noise klystron operation. 
The VA 153 is frequency stabilized to the sample cavity 
resonance. Relatively high-power rf pulses are generated 
by two states of X-band amplification obtained using 
VA 824B klystron amplifiers driven by the VA 153. 
The rf pulse width can be varied from 2-500 usec and 
peak output power is ~20 W. Klystron amplifiers have 
the advantage that their output pulse is monochromatic 
to the extent that the driving element is frequency stabi- 
lized, hence there are no high-peak power FM signals 
reflected from the sample cavity to interfere with detec- 
tion of the low-level transient relaxation signals. A 
portion of the VA 153 output is used as a low-level cw 
monitor of the sample cavity reflection coefficient. The 
cw power level incident on the sample cavity is norm- 
ally kept between 0.1 and 10 „W. 

The transient signals generated by relaxation of the 
spin system, following a high-power saturating pulse, are 
detected in a standard mixer; subsequent amplification 
is performed at 30 mc, the difference frequency at which 
the local oscillator is stabilized. The transient relaxation 
signals are displayed on a Tektronix oscilloscope and 
photographed. The VA 153 stabilization loop has a 
variable bandwidth (1-100 kc/sec) to permit rapid cor- 
rection of the output frequency so that the sample 
cavity may be transiently tracked. If this is not 
done and care is not exercised so that the sampling 
frequency is at the center of cavity resonance, a mixture 
of time-dependent absorption and dispersion may be 
detected. The decay times of the real and imaginary 
components of the reflection coefficient are not neces- 
sarily the same, hence considerable confusion in the 
data may occur if this admixture is not taken into ac- 
count. An alternative to rapid stabilization is to meas- 
ure the decay time of the observed transient signal far 
out in the tail where the dispersion is small. 

Diminution of pulse power incident on the mixer is 
accomplished by synchronously switching a double- 
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SYSTEM BLOCK DIAGRAM 


Fic. 1. Schematic diagram of pulse saturation 
X-band spectrometer. 


diode switch to reflect pulse power back into an absorb- 
ing isolator. The time resolution of the spectrometer, 
which is a few microseconds, is determined by the “turn- 
off” time of the klystron pulse amplifiers and by the 
recovery time of the preamplifier “front end" from 
saturation. 

The sample cavity resonates in the TEO11 circular 
electric mode and has a loaded Q= 8000-10 000. The 
sample under study is mounted axially in the resonant 
chamber on a sapphire tube. At low temperatures the 
sapphire provides excellent thermal coupling to the 
cavity which, in turn, is in thermal contact with the 
helium bath via helium exchange gas. The cavity mass 
acts as the thermal reservoir to which the spin system 
relaxes. The cavity temperature is carefully monitored 
by a set of 3 thermometers; two 47-2 carbon resistors 
and a platinum resistance thermometer. The carbon 
resistors were individually calibrated between 1.5 and 
20°K; the platinum resistance thermometer was cali- 
brated from 4.2-300?K. These three thermometers 
were used together with a gold-cobalt differential 
thermocouple connected between one of the carbon 
resistors and the sample mount to verify that the sample 
temperature did indeed track the cavity temperature in 
the range 2-50°K. The maximum temperature dif- 
ferential was found to be less than 0.1°K in this tem- 
perature range. This was important, since all relaxation 
measurements at temperatures above 4.2°K were made 
by simply allowing the cavity structure to warm toward 
77°K. The total time available for measurements be- 
tween 4.2 and 50°K is typically 3.5 h. 

Most of the CaF- crystals used in these experiments 
were grown in this laboratory by the Czochralski 
method! or by a modified Stockbarger' technique. 
The melts were contained in molybdenum crucibles and 
the growth ambient consisted of dry argon gas [dew 
point « —100^F]. Other samples were purchased from 
Optovac, Inc. The crystals were annealed by sealing 
them in evacuated quartz ampoules [10-5 mm Hg at 


9 (a) K. Nassau, J. Appl. Phys. 32, 1820 (1961). (b) D. C. 
Stockbarger, J. Opt. Soc. Am. 39, 731 (1949). 
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20°C], heating them to 1200°C for 12 h, followed by 
slow cooling at ~20°C/h. In general, the samples had 
rare-earth dopings of the order of 0.1 wt.%; undesirable 
impurities were present 20.001 wt.% as determined 
from spectrographic analysis. Each sample was x-ray 
oriented to within +2° of the desired orientation. The 
typical sample size was approximately 0.1X0.1X0.3 
cm, which gave a cavity filling factor of about 0,001. 
Except where noted, the observed transient relaxation 
signals exhibited a single exponential time dependence. 
Tı was obtained from a semilog plot of signal amplitude 
versus time and was defined as the time at which the 
signal had recovered to (1— —) of its equilibrium value. 
The experimental uncertainty in the individual Tı 
determinations is typically ~10%. The electron para- 
magnetic resonance (EPR) spectrum was examined as 
a function of cw power level to insure that the sampling 
signal intensity was always at least 20 dB below the 
power level at which steady-state saturation could be 
observed. For a few of the samples (Ce?* and Gd?) the 
incident power had to be reduced to a few tenths of a 
microwatt. The saturating pulse width 7, was always 
adjusted such that 5 Z T:. 


RESULTS 
Cerium 


Spin-lattice relaxation of Ce** in tetragonal sites in 
CaF. was investigated in the temperature range 2-15°K 
for five samples having different Ce concentrations 
ranging from 0.08 to 1.6 wt.% cerium. In all samples the 
tetragonal site resonances were the most intense. Addi- 
tional lines were usually present in the resonance spec- 
trum, particularly at the higher Ce concentrations (see 
Fig. 2 of I); however, they were generally too weak for 
relaxation measurements. T, measurements were made 
with the magnetic field H applied in a (100) crystallo- 
graphic plane. 

Figure 2 shows the over-all temperature dependence 
of T, for several different Ce?* concentrations. The 
transient recovery from saturation of the signal from 
the 1.6% sample displayed a single exponential time 
dependence with a characteristic time which was ap- 
proximately temperature-independent at low tempera- 
tures. The recovery of the other samples, however, could 
not be fitted by a single exponential term at low tem- 
peratures but rather by a sum of two or three exponential 
terms. An initial rapid recovery was usually observed 
which was more or less temperature-independent but 
concentration-dependent. This component had a charac- 
teristic time varying from 2100 psec for the 0.34% 
sample to 2:300 usec for the 0.08 and 0.18% samples. 
When saturation pulses of several hundred microseconds 
duration were applied, this component was diminished. 
Thus it appears to be associated with the establishment 
of internal equilibrium within the spin system" and not 


x uk D. Bowers and W. B. Mimms, Phys. Rev. 115, 285 (1959). 
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Fic. 2. Temperature dependence of relaxation time T, of Ce?* in 
tetragonal sites in CaF, for several different Ce concentrations. 


spin-lattice relaxation. Following it was a temperature 
dependent recovery composed of one or two exponential 
terms. The dominant temperature-dependent com- 
ponent of the recovery is concentration-dependent and 
is plotted as T; in Fig. 2. A final rather weak tempera- 
ture-dependent component of the recovery is also 
plotted in Fig. 2 for the 0.08% sample. Except for the 
most concentrated 1.6% sample, the relaxation in the 
low-temperature region is characterized by a T~ tem- 
perature dependence and a concentration-dependent 
magnitude. 

At higher temperatures, T>10°K, the data for the 
different samples tend to converge to a value independ- 


temperature dependence can be fitted to a T-? law for 
Raman relaxation of a Kramers ion using Tir~4% 1 
T- sec. The data can also be fitted to an exponent! 
law with A~150 cm~; however, the fit is poorer. 
For several of the rare-earth ions studied it is difficult 
to establish an unambiguous temperature dependence 
at high temperatures since the magnitude of T, after 
subtracting out the 7—! dependence at low temperatures 
is not known accurately over a sufficiently broad range 
of temperature. : 
If the asymptotic 7; temperature dependence 8 
fitted to be a T-? law for Raman relaxation, we find fror 
Eq. (15) that M g?2: 5A*. The first excited state for Ce 
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in tetragonal sites should be of form |-£1). Surns of the 
type È | (e| V™|c)|? connecting this state to the ground 
state have values ~ 1.5 10 cm? for A2™(72)~ 200 cm~! 
and A a” (r*)o- 400 cm™, (More correct wave functions 
including admixtures of states from ?Fzj9, would intro: 
duce terms involving Vs"; however, they are not ex- 
pected to change the order of magnitude of the above 
result.) An M 5? of this magnitude, however, implies a 
small A which is inconsistent with the dominance of 
Raman relaxation. If we consider Orbach relaxation 
via an excited state at ~150 cm-!, we find from the 
data and Eq. (13), Mo?~10! cm. Although the fit to 
an exponential temperature dependence is poorer, the 
magnitude agreement is better. A comparable Raman 
relaxation rate for A~150 cm™ requires orbit-lattice 
matrix elements M 5*—105 cm™. Although there is a 
large uncertainty in our estimates of the V,"'s, the 
above values of Mo? and Mg? are not unreasonable. 
Also an excited-state separation of a few hundred cm~! 
for Ce?* is possible if the tetragonal fields are as large 
as they appear to be for other rare earths in CaF;.* 
'Thus the above estimates are inconclusive in establish- 
ing whether the high-temperature relaxation is Raman 
or Orbach. The former, however, requires the larger 
orbit-lattice interaction. 

The angular variation of T, with magnetic field direc- 
tion in a (100) plane is shown in Fig. 3. The measure- 
ments were made using a 0.34% Ce sample at 2°K. 
The same ratio of Tys for the gi; and g, resonances, i.e., 
for the magnetic field H applied along [100] and [010], 
was observed from the 0.18% sample, while for the 1.6% 
sample the two 73's were equal. Since the gı resonance is 
weaker than the g, resonance, the relaxation could not 
be measured in the former orientation for the 0.08% 
sample. 

If the first excited tetragonal state is |), direct 
relaxation should be highly anisotropic since there is no 
Zeeman admixing for 6—0. The over-all shape of the 
curve in Fig. 3, however, does not correspond exactly 
to that expected for an excited state of either (+4) or 
(+8, -F$) and may be related to the concentration- 
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dependent effects. The dip in Tr near 0— 80? occurs fac 
the vicinity of the crossover of the resonances from Ce* — — 
in sites of tetragonal and trigonal symmetries. This sug- 
gests the possibility of cross-relaxation effects, although — 
such effects are not observed when cubic and tetrago 
Yb?* resonances are overlapped, as we report lat : 
Within experimental error, the relaxation at this orien- — — 


tation exhibited a simple exponential recovery. “ge 


Neodymium > 


The temperature dependence of T; for Nd** in 
tetragonal sites is shown in Fig. 4, where the g, resonance — - 
for H L[100] has been measured. Several weak broad 
(740-100 G) lines whose origin and nature were not 
established were present in the resonance spectrum. 
At 2°K the relaxation time of the gi; resonance is abou 
twice as long as that of the g, resonance. At low tempe 
tures T, exhibits a T-! dependence. At the higher te 
peratures, T>4°K, a relaxation mechanism decreasing — 
more rapidly than T-? with increasing temperature - 
begins to dominate. It can be fitted to an exponential 
temperature dependence given by T,-2.2X10- 
exp(93/T) sec. This suggests relaxation via an excite 
state or states effectively at ~65 cm above the grou 
state. The optical fluorescence data in I indicates a 
excited state at ~90 cm~. Within experimen’ 
the two results are probably compatible. A tet: 
crystal-field component 42(7*) of a few hundr 
would greatly perturb the two cubic-field Ps quarte 
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Fic. 5. Temperature dependence of relaxation time Tı of Gd** in 
tetragonal sites in Cal's. Gd concentration 1s 0.03%. 


0 and 180 cm~, hence an energy level at 65-90 cm™ is 
not unexpected. For Orbach relaxation via an excited 
state A=65 cm, we find from the Nd* results and 
Eq. (13), Mo! e 4X 10! cm-?. 


Gadolinium 


The relaxation of Gd?* ions in tetragonal sites was 
investigated. At liquid-helium temperatures the relaxa- 
tion times of all transitions observable at X-band fre- 
quencies were essentially equal. Extensive measurements 
from 2 to 45°K were made of the temperature depend- 
ence of T, for the 3.» —} transition with H||[100] 
and are shown in Fig. 5. The over-all temperature 
dependence of the T; data can be fitted by T1— 1.15€ 10? 
T-12--4*X 10? T-5 sec. The transient recoveries were 
essentially single exponentials. Gadolinium ions were 
also present in sites of trigonal symmetry in the sample 
used; however, the relaxation was not examined. 

Since an S state is unperturbed by a crystal field in 
first order, the crystal-field splittings and relaxation 
rates are usually very small. This is indeed observed for 
Gd: CaF», although the relaxation times below ~10°K 
are unexpectedly short, again, as in the case of other 
rare-earth ions in CaF», less than 1 msec. This rapid 
Tı and its T2 temperature dependence suggests that 
the low-temperature relaxation rate is not associated 
with the normal direct process and orbit-lattice interac- 
tions. The relaxation times at higher temperatures are 
longer than those for other non-S-state ions. The T~ 
dependence may arise from two-phonon relaxation 

processes involving excited states within the Sz; 
multiplet? or relaxation of time conjugate states as- 
sociated with a lattice defect site.? Since the measure- 


Pa 


12 J. G. Castle, D. W. Feldman, P. G. Klemens, and R. A. Weeks, 
Phys. Rev. 130, 577 (1963). : 
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ments were made in a range where T 20.10 and 65 for 
CaF, is ~480°K, a T1« T~ region is not expected at 
these temperatures for normal Raman relaxation 
processes. 

Orbach and Blume? have shown that for a multileve] 
spin system at low temperatures, Raman relaxation 
via excited states within the multiplet will be propor- 
tional to 7-® and can be more important than 7-7 and 
T— processes involving higher excited states. In genera], 
when the excited-state energy is very small compared to 
the energy of the most important phonons for relaxa- 
tion, the denominator of the second-order Raman pro- 
cess is ~ (wp)? rather than A? as in Eq. (4). Thus one 
has in place of Eq. (4), 


1 9 AlykI\® 
= EC M x‘. (17) 
Tig 4m) pho hN h 


M x* is again given by Eq. (16), but now |c) is a state 
within the ground multiplet. 

The energy levels for Gd** in a tetragonal field in 
CaF, and for a resonance frequency of 9.6 kMc/sec 
for the 1«» —1 transition can be derived from the 
parameters given by Vinokurov ef al. (see I). They form 
a multilevel system in which the Orbach-Blume mecha- 
nism may be operative. If the effective dynamic crystal- 
field interactions have the same relative importance as 
the static crystal-field parameters in the spin Hamil- 
tonian, then the 5?" terms should dominate and Raman 
relaxation would proceed principally via the |E) 
states. Comparison of Eq. (17) with the experimental 
Tvs for Gd: CaF; yields M g? =~ 1.4X 108 cm™. This value 
while smaller than for Kramers ions, is, however, several 
orders of magnitude larger than that predicted using 
the static crystal-field parameters. 


Terbium 


Trivalent terbium is the only non-Kramers ion 
studied. A plot of T; versus T for an HL[111] sample 
orientation is shown in Fig. 6. The relaxation times 0 
each of the four Tb"? hyperfine components were es- 
sentially equal. In the T— region the Tb** relaxation 
times are much faster than those found for the other ions 
Above 4°K, T, exhibits a gradual departure from 4 Ta 
law. By 10°K, 7 has decreased to a value approaching 
the limiting resolution of the apparatus; and, accor- 
ingly, the data in this region has a larger experimental 
error. The data, after substraction of the T^ depend- 
ence, can be fitted to an exponential law with Ac 
cm; however, one cannot attach too much signi 
cance to this result due to the large experimental un- 
certainties involved. The location of the excited levels 
of "Fg in a tetragonal field have not been reported to our 
knowledge. 2 

The angular variation of Tı at 2°K and for a magnete 
field applied in a (111) plane is shown in Fig. 1: proa 
anisotropic resonances previously reported by Fort ear 
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and Hempstead? in Tb: CaF, were also observed in the 
sample used. The presence of Th* in sites of trigonal 
symmetry, which are not observable at X-band fre- 
quencies, was not established. 

Terbium, being a non-Kramers ion, is expected to 
relax faster at low temperatures than. Kramers ions, 
which is indeed observed. The Mp? determined from 
Eq. (10) is 3X 10! cm-?. The resonant doublet, how- 
ever, is composed principally of J,=-£6 states which 
are not connected by the orbit-lattice interaction. 
Relaxation may be allowed, for example, by admixing 
of J-=+2 states. This small admixture would reduce 
the value of M p? relative to that for other ions. 

The above measurements were made in a sample 
containing 0.5% Tb. The relaxation from a 1.4% Tb 
sample was also measured between 2-4?K, where T 
~5 usec, independent of temperature. 


Dysprosium 


Two different samples of dysprosium-doped CaF» 
(~0.1-0.2% Dy) each containing Dy** in sites of cubic 
tetragonal, and trigonal symmetries were examined. 
No measurable relaxation recoveries were observed from 
any of the resonances for temperatures 2-4?K. Based 
upon the limiting resolution of the apparatus, this result 
suggests that the relaxation times of the Dy?* reso- 
nances in the samples studied are <3 usec. 
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Fic. 6. "Temperature dependence of relaxation time T; of Tb?* in 
tetragonal sites in CaFs. Tb concentration is 0.5%. 


3 P. A, Forrester and C. F. Hempstead, Phys. Rev. 126, 923 
(1962). 
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Fic. 7. Angular variation of T, for Tb** in tetragonal sites in 
CaF;. The magnetic field is applied in a (111) plane. 


The Dy* relaxation rates may be faster than those 
of other rare earths because of the presence of a rela- 
tively low-lying excited state. In a cubic field the relaxa- 
tion of pairs of levels within the ground-state I’, quartet 
may occur via an Orbach process involving the FP; 
excited state at A~8.5 cm. A T, of 1 psec, which would 
account for the unresolved relaxation transient, is 
predicted at 4°K for such a process if Mo? 10! cm. 
Corresponding times for Raman relaxation require 
M già 2: 4X 10° cm7?. If this process was dominant, how- 
ever, lifetime broadening of the T; resonances should be 
evident by T 2 10°K, which is not observed. At T=2°K, 
the major relaxation mechanism may be two-step relaxa- 
tion processes involving states within the I's quartet, as — 
discussed elsewhere.’ - 

The relaxation rate of the cubic field T7 excited state — 
would also be fast if the strength of the orbit-lattice 
interaction is as strong as required above, the temper: 
ture dependence of an Orbach process changing from a 
constant at low temperatures, TA, to a T law at: 
high temperatures. At temperatures <15°K, lifeti 
broadening of the F7 resonance linewidth is observed. — 
From the estimated magnitude and slow temperature ' 
variation of the l'; Orbach process, the broadening 
ably arises from the dominance of Raman relaxation - 
at these temperatures. From Eq. (15), a Tig of 1 : 
sec, and hence line broadening, is predicted at T= 
if M24 10* cm™. 


Erbium 


The temperature dependence of the relaxation of 
in sites of tetragonal (I) and trigonal (II) symn 
are recorded in Fig. 8. The position and relative in 
ties of the resonances in the low-field portion 
spectrum of the 1.0% Er?* sample used are show! 
Fig. 5 of I for H||[110]. Due to the strong overlay 
relative intensities of the tetragonal (I) and c 
resonances, only a common transient decay was. 
served when the magnetic field was set in the 
= 950-1070 Oe, the amplitude varying as the mas 
field was adjusted to line center. The transient 
which appears to be dominated by the tet 


9. 
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resonance, is characterized by a temperature-independ- 
ent component (~100 usec) and a strongly tempera- 
ture-dependent component which is plotted as T 
(tetragonal) in Fig. 8. The variation of 7; with angle 
of the magnetic field in a (100) plane is at most 20%; 
there is no measurable variation of the initial decay 
component. At low temperatures, ~2°K, a third decay 
component is evident having a characteristic time ~2-5 
msec. The amplitude of this signal, however, is so small 
that the experimental uncertainty in the decay time is 
very large. 
The high-temperature portion of the tetragonal (I) 
Tı versus T curve can be fitted by an exponential law 
given by Tı=~ 10" exp (27/T) sec. At low temperatures 
the data deviates from the exponential curve; however, 
the data is not sufficient to establish a reliable tempera- 
ture dependence for the competing relaxation process, 
which might be expected to have a 1/T law. An effective 
tetragonal excited state at ~27°K from relaxation data 
‘is not unreasonable in view of the crystal-field splittings 
in I and the excited state at ~35°K reported by Baker 
et al. from measurements of the temperature dependence 
- of the intensity of the tetragonal excited-state resonance. 
Assuming an Orbach process is operative, the Mo? 
derived from the above result and Eq. (13) is ~ 103 cm™. 
Line broadening of the cubic-tetragonal (I) group of 
esonances occurs for 7$30°K; by 40°K the individual 
es are no longer resolvable. 
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The trigonal (II) relaxation was measured with the 
magnetic field applied in a [110] direction to separate 
the gı trigonal resonance as far as possible from the 
other spectral lines. The results in Fig. 8 show a T4 
region and a rapidly decreasing T; at higher tempera- 
tures which can be fitted to an exponential law. The D. 
over-all temperature dependence can be fitted by 


T,29.6X 10-1T71-- 1.3 107? exp (78/T) sec. 


Comparing this result with an Orbach process in Eq. 
(13) yields Mo ~ 10° cm ?. The linewidth of the trigonal 
(II) resonance increased rapidly for temperatures 
$25°K and became unobservable above 30°K indica. 
tive of lifetime broadening and 71<10- sec. 

T; measurements were made earlier? in another 
Er: CaF, crystal which was less heavily doped than the 
above sample and in which Er*+ ions in cubic sites were 
approximately twice as numerous as those in tetragonal 
(I) sites. T; data for ions in trigonal (II) sites, which 
were also present, agreed with the data presented above. | 
Data taken in the vicinity of the isotropic cubic reso- l 
nance is shown in Fig. 8. For this sample the tetragonal | 
(I) resonances could not be sufficiently well isolated to | 
yield unambiguous results. Between 2 and 15?K, T; 
for the cubic resonance exhibited very little tempera- 
ture dependence and can be fitted by Tic T-^, where 
n=. This rather unexpected result was reproduced 
under varied experimental conditions and appears to be 
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Fic. 9. Temperature dependence of relaxation time Tı of x in 
cubic and tetragonal sites in CaF2. Yb concentration is 0.11%- 


( ES J. Weber and R. W. Bierig, Bull. Am. Phys. Soc. 8, 259 
1963). 
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intrinsic to the sample. Since there should be cubic- 
field excited states within 100 cm"! of the ground state 
such long 77's for T7 10?K are very unusual. i 

The relaxation of Er^* in tetragonal sites in CdF, 
has been measured by Zverev et al)? at X-band fre- 
quencies, and the results are reviewed here for com- 
parison. The Er concentration was ~0.1%. For 
T «4.2?K, T, varied inversely with temperature, where 
T,25.5X10-* T—! sec. Between 4.2 and 9°K, Ti 
exhibited a rapid decrease from a T-! law; lifetime 
broadening was observed above 16°K. Combining the 
last two data the authors conclude that 7, cannot be 
described by a power law of the type Tı œ T-". Instead, 
the relaxation appears to be influenced by the presence 
of a nearby excited state. 


Ytterbium 


In Fig. 9 are shown the temperature dependences of 
Tı for Yb?* in cubic and tetragonal crystal fields. The 
resonance spectrum for this 0.17% Yb sample was very 
clean, consisting only of approximately equally intense 
cubic and tetragonal resonances. Both the gı and gi 
tetragonal resonances exhibited nearly equal relaxation 
rates inversely proportional to T at the lowest tem- 
peratures. With increasing temperature the g, reso- 
nance T decreases rapidly reaching a T dependence 
by 10°K. Within experimental error, the high-tempera- 
ture data may also be fitted by an exponential law with 
an exponent ~125/T. T, for the gi, resonance shows an 
anomalous behavior for 3.5« T « 10?K, which we do 
not understand at present. 

The cubic resonance T, gradually departs from a T~! 
law with increasing temperature. The data for 5°< T 
<20°K, after subtraction of the 7 dependence, can, 
within experimental accuracy, be fitted to either a 
T-6.529-3) or exp(38/T) temperature dependence. The 
cubic Yb?* resonance is unique among the rare-earth 
ions studied in that it is observable up to ~115°K; 
the tetragonal Yb*t resonances are not observable at 
77°K. The temperature dependence of the cubic reso- 
nance linewidth was examined between 50-105°K. At 
low temperatures the peak-to-peak linewidth of the 
derivative of absorption is temperature-independent 
with a value of ~25 G. The additional temperature- 
dependent linewidth AH (T) aboye 50°K is plotted in 
Fig. 10. If this width is attributed to lifetime broaden- 
ing, its temperature dependence should reflect that of 
T; in this temperature range. The data in Fig. 10 shows 
a slow departure from a T? law. The temperature de- 
pendence for Raman relaxation of a Kramers ion, as- 
suming a simple Debye lattice and characteristic tem- 
perature 0p, is T? Seen? (a8e7dx/ feta Typ the T* limit 
applying only when T «0.105. Since 0p 2480? K!5 for 

15 / 1 Xorni: A. N. Prokhorov, and A. I. 
Se aed qua 17592. (1963) [English transl.: Soviet 


Phys.—Solid State 4, 284 (1962) ]. 
BD. R. Huffman and M. H. Norwood, Phys. Rev. 117, 709 


(1960). 
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Fic. 10. Additional temperature dependence component AIC 
of the linewidth of Yb** in cubic sites in CaF2, 1 


F 


CaF,, the deviation from T~’ is expected; however, | 
should be more pronounced. The above integral expres- — 
sion with 8p ~680°K provides a better fit to the data. - 
There are several possible sources of this discrepancy; - 
the validity of considering AH (T) as a simple ad 
contribution, the inadequacy of a simple Debye 
tional frequency spectrum for CaF, and the above 
tegral expression," and the possible enhanced Raman 
relaxation rate due to the mass excess of the rare: l 
impurity." 

From estimates in I (see Fig. 5), the first excited 
for Yb?* in a cubic field in CaF, may be ~ 600-700 
above the ground state. Thus Tre, which is propor 
to A‘, would be much longer than for other rare € 
This would account for the cubic-field resonance 
observable at such comparatively high temperature 
Since a A of this size is probably greater than the hi 
energy branch in the phonon spectrum of CaFs, 
simple Orbach process would not be allowed. Comparis 
the observed broadening with Raman relaxation, 4 
of M;2~105 cm? is obtained, which is necessaril 
certain due to the lack of good agreement in fitting ! 
temperature dependence. Extrapolating this relaxati 
mechanism for T=10—20°K yields T, values 
orders of magnitude longer than observed. Ii the 


17 M. J. Weber, Phys. Rev. 130, 1 (1963). 
15 J, G. Castle, Jr., D. W. Feldman, and P. G. K 
vances in Quantum Electronics (Columbia University ` 
York, 1961), p. 414. r >, 
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temperature 7';'s for the tetragonal resonances are fitted 
to a T-? law, Eq. (15) gives Mr? 304". A tetragonal 
field is not expected to split the Ts excited state of ?Fzj 
by more than a few hundred cm™. This would again 
suggest a relatively large first excited-state separation 
for the tetragonal ion relaxation and would require very 
large values of M r’, X 105 cm-7?, to account for the ob- 
served T's. Orbach relaxation, on the other hand, via 
an excited state at = 100 cm~, as suggested from the Ti 
temperature dependence, would require M o! — 105 cm™. 
Good agreement was generally obtained for Tı meas- 
urements from several different Yb:CaFz samples. No 
effects were observed which suggested the presence of 
any cross-relaxation between the Yb resonances. For 
H|[100], all transient signals displayed a single ex- 
ponential time dependence. By varying the angle of H 
with respect to [100], it was possible to gradually super- 
impose the cubic and tetragonal resonances, at which 
point a short decay attributable to the cubic line and a 
longer decay attributable to the tetragonal line were ob- 
served. Although the uncertainty in measuring the 
decay times in this case is greater than when either of 
the transients is observed separately, the measurements 
are of sufficient accuracy to permit identification. 
Measurements of the relaxation of Yb** in cubic 
sites n CdF? at X-band frequencies have been reported.” 
At liquid helium temperatures the relaxation of a 0.1% 
Yb sample is described by T1= (6.70.7) X 10 * T= sec. 
No resonances were observed at liquid hydrogen tem- 
peratures. More recent investigations” at 3.2 and 9.1 
kMc/sec in samples containing 0.01, 0.05, and 0.1% Yb 
demonstrate that resonances are observable at 57°K. 
The low-temperature relaxation times are concentra- 
tion-dependent as we have noted for Ce?* and show an 
unexpected and unexplained dependence uponresonance 
frequency, Tı «à. From g shift, intensity and optical 
measurements, an excited state at ~50-70 cm™ and 
an over-all 27/2 splitting of ~ 116 cm- is inferred. These 
values are much smaller than those estimated for 
Yb: CaF. 


DISCUSSION 


The magnitude and temperature dependence of T's 
for Kramers ions at higher temperatures, where T, 
deviates from a T= law, have been compared with 
theoretical expressions for two-phonon relaxation pro- 
cesses in Eqs. (13)- (16). A single excited state was con- 
sidered with an energy A obtained either from the ex- 
ponential temperature dependence for Orbach relaxa- 
tion or from estimates in I. The experimental results 
indicate that for Orbach and Raman relaxation, the 
effective sums of orbit-lattice matrix elements M o? and 
M g are typically in the order of 101-105 cm-?. These 


1 V. K. Konyukhov, P. P. Pashimin, and A. M. Prokhorov, 
Fiz. Tverd. Tela 4, 246 (1962) [English transl.: Soviet Phys.— 
Solid State 4, 175 (1962) ]. 

2» p. P. Pashinin and A. M. Prokhorov, Paramagnetic Resonance 
* (Academic Press Inc., New York, 1963), p. 197. 
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TABLE I. Comparison of low-temperature relaxation times and 
impurity concentration for several rare earths in tetragonal sites 
in Cal's. The rare-earth concentration is the total concentration 
not the number in the particular axial site. , 


Ion Concentration (%) TıT (sec °K) 
Ce 0.08 3.21073 
Yb3t 0.17 1.21073 
Cest 0.18 1.7107 
Nast 0.28 74X10 
Cet 0.34 3.1X10 
Tb3t 0.5 5.51075 
Cat 16 AX 10-5 (2-4°K) 


values are comparible with those obtained using the 
static crystal-field parameters in I and the approxima- 
tions discussed earlier relating them to the strength of 
the orbit-lattice coupling. The accuracy of the deter- 
mination of M? is dependent upon estimates made in 
arriving at Eqs. (10), (13), and (15). For example, since 
the average phonon velocity v enters to the fifth and 
tenth powers, respectively, in the Orbach and Raman 
Ty's, a small error in the choice of an effective v can 
greatly affect the magnitude of the predicted Ti. Be- 
cause of this and the uncertainty in A, only the magni- 
tude of M? is significant. 

The above value of M? should be useful in estimating 
the magnitude and dominant relaxation process for 
other rare earths in Cal’, when the excited-state ener- 
gies are known. It should be noted, however, that some 
apparent discrepancies exist. Mo? for the tetragonal (I) 
and trigonal (II) Er resonances are very different and 
are beyond the extremes of the above range of M* 
values. The large Mo? for the trigonal resonance may 
arise from the strong crystalline fields associated with 
the O?- substituted at a nearest-neighbor F- site; 
however, Er was the only ion in a trigonal site whose 
relaxation was studied, and hence this conjecture could 
not be verified. The relaxation of Yb in cubic sites at 
temperatures 10-20°K is inconsistent with the magni- 
tude and temperature dependence of normal two- 
phonon processes, if our picture of the energy levels is 
correct. Enhanced relaxation rates having different 
temperature dependences arising from the presence of 
lattice defects and paramagnetic empurities have been 
discussed!?; however, for a Kramers salt a T? de- 
pendence is expected which is beyond the estimated 
error of the temperature dependence for Yb?*. The 
relaxation may be associated with the same mechanism 
responsible for the low-temperature relaxation discussed 
below. 

When Eqs. (10)-(16) are compared with measured 
Ty's for Kramers ions, it is immediately apparent that 
the present theory is insufficient to account for the fast 
relaxation times observed in the low-temperature region; 
T 24°K, where T; is usually proportional to T+. As an 
estimate, consider a Zeeman admixing coefficient 
(28\/A)(a|H-J|c)~10-2 and sums lle] V=" 
7105 cm. Substitution into Eqs. (10) and (12) yields 
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ITI. SPIN-LATTICE RELAXATION 


Tip~0.1 sec for Kramers ions at 2°K compared to 
observed values of =1 msec. If one assumes that the 
effective summations of dynamic crystal field matrix 
elements in direct, Orbach, and Raman relaxation have 
approximately equal values, i.e., Mp? (non- Kramers) 
7M o^ M i, then the predicted relative magnitudes 
of the one- and two-phonon relaxation proce: es are 
incompatible with experiment. In the case of € ^*, the 
value of T; at low lemperatures is concentration- 
dependent. A tabulation of TT for other rare earths in 
CaF: is given in Table I and shows a correlation with 
rare-earth concentration. (Concentration-dependent Ty's 
have been observed in other crystals containing rare- 
earth and iron transition group elements?!) Finally, 
equally fast low-temperature T's have been reported 
for trivalent rare earths in CdF.2!*!9 and for divalent 
rare earths in CaF». Therefore, although the relaxa- 
tion times at low temperatures generally exhibit a T~ 
dependence characteristic of one-phonon processes, 
their magnitude and concentration dependence indicate 
that they are not derived from normal direct relaxation 
arising from orbit-lattice interactions. 

An adequate explanation of the fast low-temperature 
relaxation rates of trivalent rare-earth ions in CaF, 
must account for the T~ dependent of 7, the concentra- 
tion-dependent effects, the variation of 7; with angle of 
the applied magnetic field, and the absence of any ap- 
parent phonon bottleneck. Also, the fast relaxation has 
been observed for Kramers ions in lattice sites having 
cubic, tetragonal, and trigonal crystal-field symmetries. 

The possibility exists that some divalent rare-earth 
ions may be introduced into CaF» along with the triva- 
lent ions. If they are non-Kramers ions, they should 
relax quickly and could possibly influence the relaxation 
of trivalent Kramers ions via cross-relaxation. This 
explanation is insufficient for several ions studied, 
however, since the accompanying non-Kramers ions 
have nonmagnetic ground states, an example is Yb** 
which is isoelectric with diamagnetic Lu** (150). Divalent 
ion resonances were not readily apparent in the reso- 
nance spectrum. (Cross-relaxation could occur to excited 
state doublets, however, depending upon A, they would 
be depopulated at low temperatures which would reduce 
their effectiveness.) Finally, the orbit-lattice coupling 
for divalent ions would probably be less than for equiva- 
lent trivalent ions due to the smaller effective nuclear 

ge. 
PO relaxation rates may be caused by local 
vibration amplitude variations associated with the in- 


ers in Quantum. Electronics (Columbia 
gue nc rea i 1960); Advances im Quantum 
Electronics (Columbia University Press, New M ; A 
Peskovatskii and A. N. Cheng ae Dus 1 on ( : 2 
i l.: Soviet Phys.—so ate 4, 454 2) ]; A.A. 
Pene end A. M. Prokhorov, Zh. Eksperim. 1 Teor T e 
729 (1960) [English transl.: Soviet Phys.—JETP 11, 527 (1960)] 
AS i IEEE 51, 53 (1963); 
B . Sabisky and H. R. Lewis, Proc. a ; 
H. E and E. S, Sabisky, Phys. Rev. 130, 1370 (1963). 
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troduction of the paramagnetic impurity. In our case, 
this lattice defect has both a mass and charge difference 
and, for axial sites, has related substitutional or inter- 
stitial impurity defects. The relaxation would exhibit a 
T^ temperature dependence and be a function of im- 
purity concentration in certain ranges. Although some 
studies have been made of relaxation associated with 
defect sites, 225 detailed calculations applied to specific 
examples are needed to establish the importance of this 
mechanism at low temperatures where one-phonon 
processes are dominant. 

Multiexponential transient recoveries were frequently 
observed at low temperatures in the T+ region. We 
shall consider two cases where such behavior may be 
expected, cross-relaxation within an inhomogeneous 
spin system and cross-relaxation to another faster 
relaxing spin system. If a short, intense rf pulse applied 
to an inhomogeneously broadened spin system excites 
only a portion of it, then a decay time associated with 
the attainment of internal equilibrium within the total 
spin system may exist in addition to the time required 
to attain equilibrium with the lattice. The linewidths 
of rare earths in CaF, are usually larger than calculated 
dipolar widths and are believed to be inhomogeneously 
broadened. Several ions show transient recoveries from 
rf saturation having an initial fast temperature-in- 
dependent but concentration-dependent exponential 
component. As the rf excitation pulse is lengthened, the 
initial recovery component became less prominent rela- 
tive to the longer temperature-dependent component 
ascribed to spin-lattice relaxation. For Ce* at low tem- 
peratures, the initial component has characteristic 
times of 7300, 130, and 40 usec, respectively, for Ce 
concentrations of 0.18, 0.3, and 1.6%. This dependence 
is qualitatively as expected from the cross-relaxation 
theory of Bloembergen et al. 

If some paramagnetic species having a faster spin- 
lattice relaxation time is present, rare-earth ions in 
CaF, may lose their energy more rapidly via cross-relax- 
ation to the other system than by relaxing directly to 
the lattice. Depending upon the nature of the fast re- 
laxing system and the relative spin-lattice and cross- 
relaxation rates, an observed 7, having a variety of 
concentration and temperature dependences can be 
obtained.7-? Rannestad and Wagner have discussed 
some simple rate equations and time-dependent solu- 
tions pertinent to the expected behavior of such coupled 
spin systems. The reader is referred to their article, 


* B. I. Kochelaev, Dokl. Akad. Nauk SSSR 131, 1053 
(1960) [English transl.: Soviet Phys.—Doklady 5, 349 (1961) ]. 

25 P, G. Klemens, Phys. Rev. 125, 1795 (1962). 

26 N. Bloembergen, S. Shapiro, P. S. Pershan, and J. O. Artman, 
Phys. Rev. 114, 445 (1959). 

£ J. H. Van Vleck, Advances in Quantum Electronics (Columbia 
University Press, New York, 1961), p. 388. 

75 J. C. Gill and R. J. Elliott, Advances in Quantum Electronics 
(Columbia University Press, New York, 1961), p. 399. 

2N. Bloembergen and P. S. Pershan, Advances in Quantuas 
Electronics (Columbia University Press, New York, 1961), p. 373. 

9? A. Rannestad and P. E. Wagner, Phys. Rev. 131, 1953 (1963). 
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pp. 1958-1960, for an example of the type and range of 
arguments that can be presented to explain T results. 
'Their general extension to rare-earth-ion relaxation in 
CaF» is straightforward and will not be repeated here. 

Although qualitative agreement. might be obtained 
for our CaF» results using the above approach and ap- 
propriate models, it would be less than convincing 
without some knowledge regarding the origin and nature 
of the fast-relaxing species. The presence of other para- 
magnetic impurities, particularly fast-relaxing iron 
transition-group elements, could be important for 
relaxation. These impurities, however are definitely 
present in concentrations of less than one part in 10! 
by weight as determined by spectrochemical analysis. 
No distinct resonances characteristic of such ions were 
generally observed, although they may have been broad 
and hence escaped detection. Pairs, possibly exchange 
coupled, or clusters of rare-earth ions may be present 
and may account for some of the additional spectral 
lines observed in CaF, at higher rare-earth concentra- 
tions. Relaxation transitions between relatively near 
non-time-conjugate states associated with such clusters 
could be very rapid. Finally, fast relaxing centers may 
also be associated with the zero-field resonances noted! 
in Ce: CaF». 

If the temperature-dependent relaxation times at 
low temperatures are characteristic of some fast-relaxing 
species, then the cross-relaxation time must be very fast 
and not the over-all relaxation bottleneck. The results 
for Ce*+, however, show that for 1.6% Ce, an approxi- 
mately temperature-independent time of 245 usec is 
observed at 2-4°K suggestive of a cross-relaxation bot- 
tleneck. When the spin-lattice and cross-relaxation rates 
have comparable magnitudes, the relaxation of the 
coupled spin systems is expected to exhibit a multi- 
exponential decay in which the relative importance of 
each term and its characteristic decay rate will be a 
function of the lattice temperature. 

Until more is known about the numerous, concentra- 
tion dependent lines in the EPR and optical spectra of 
rare-earth ions in CaF, and the importance of other 


31 D. E. Kaplan, Bull. Am. Phys. Soc. 8, 468 (1963). 
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paramagnetic impurities, the above explanation of the 
fast low-temperature relaxation times via cross-relaxa. 
tion to other spin systems is only speculative, albeit a 
reasonable possibility at present. 


CONCLUSIONS 


We have surveyed the temperature dependence of 
spin-lattice relaxation of several trivalent rare earths in 
CaF». The relaxation times of Kramers ions at low tem- 
peratures, which usually vary as T-! and exhibit a 
concentration dependence, are about two orders of 
magnitude faster than predicted for orbit-lattice coup- 
ling. The nature of the operative relaxation mechanism 
which explains these results remains to be established, 
More intensive, detailed study of selected examples 
would be helpful. The magnitude and temperature de- 
pendence of two-phonon relaxation processes are gener- 
ally in reasonable order-of-magnitude agreement with 
predictions of relaxation times using estimated crystal- 
field interactions. As expected, ions in lattice sites hav- 
ing different crystal-field symmetry and strength have 
different relaxation properties. The comparison between 
theory and experiment must of necessity be only approxi- 
mate since current spin-lattice relaxation theory in- 
volves a simplified treatment of the lattice vibrational 
properties, but more important for rare earths in CaF, 
detailed knowledge of the energy levels, ion wave func- 
tions, and strength of the dynamic crystal field are 
lacking. The interpretation of the present results will 
benefit greatly as spectroscopic data on the crystal-field 
energy levels becomes available, at which time many 
uncertainties in the present comparison of theory and 
experiment will be removed. 
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Measurements 


are presented of the ultrasonic absorption in MgO and A1;05 for transverse and longi- 


PEN waves in the low-temperature region. In contrast to some theoretical expectations, the absorption 
o ongitudinal waves is not negligible but even stronger than for transverse waves. In the theoretical dis- 
cussion it is shown that this strong longitudinal absorption probably is caused by three-phonon processes. 


INTRODUCTION 


HE interaction of ultrasonic waves with thermal 

phonons in dielectric crystals shows a very differ- 
ent behavior at high and at low temperatures. At high 
temperatures when the ultrasonic period 2r/w is much 
larger than the lifetime 0 of most termal phonons the 
measured absorption can be well described by a mecha- 
nism first proposed by Akhiezer.!-? However, in the low- 
temperature region, defined by (w9>1) experimental 
difficulties have prevented until recently a careful 
study of the hypersonic absorption, although a theory 
for the absorption process was already published by 
Landau and Rumer‘ more than 25 years ago. They 
pointed out that through the anharmonic cubic terms 
in the elastic lattice energy transverse acoustic waves 
or phonons can interact with thermal phonons in proc- 
esses involving three phonons. For the absorption of 
longitudinal acoustic waves these three-phonon proc- 
esses were considered unimportant on the basis of 
energy and momentum conservation as will be discussed 
later. A finite absorption was thought to result only when 
according to Pomeranchuk? fourth-order terms in the 
elastic energy were taken into account (so-called four- 
phonon processes), or when according to Herring? 
the elastic anisotropy of the crystal was properly con- 
sidered (Herring mechanism). 

In the present paper we present measurements of the 
absorption of transverse and longitudinal ultrasonic 
waves in magnesium oxide and sapphire in the low- 
temperature region (> 1). The interesting result is 
that not only the transverse but also the longitudinal 
absorption follows the pattern expected for a three- 
phonon process. The magnitude of the rM 
absorption as well as its dependence on frequency an 


* This work was supported by the U. S. Atomic Energy 


Co ralo on leave of absence from Physics Department, 


iversity of Palermo, Palermo, Italy. 

MD J. Phys. (USSR) 1, 277 (1939). 7 1245 - 
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3T. O. Woodruff and H. Ehrenreich, Phys. Rev. 123, 155. 
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temperature are not compatible with a four-phonon 
process or the Herring mechanism mentioned above. 

By these results we are led to believe that longi- 
tudinal ultrasonic waves also are able to interact strongly 
with thermal phonons in three-phonon processes. We 
have already shown in a classical calculation,’ referred 
to here as paper I, that these “collinear” processes lead 
to a strong ultrasonic absorption, and thesame result will 
be derived here by a quantum-mechanical argument.** 

This strong absorption of longitudinal waves in three- 
phonon processes also solves the problem of a diverging 
thermal conductivity which was discussed in the litera- 
ture? under the assumption that longitudinal waves 
cannot be absorbed in three-phonon processes. Since 
they are, however, absorbed, their contribution to the 
heat conductivity remains finite, and no problem arises 
in this context. 


EXPERIMENTS AND RESULTS 


We have measured the attenuation of sound waves in 
magnesium oxide and ruby at the two frequencies of 
500 and 3000 Mc/sec between 4.2 and about 100°K. 
The samples were single crystals of rectangular shape, 
4X4X25 mm, with the 4x 4-mm faces polished opti- 
cally flat and parallel within a few seconds. The rod 
axes were oriented within one degree, parallel to the 
[100] axis of MgO and to the c axis of ruby. The MgO 
samples were colorless and quoted” as pure while the 
ruby rods contained up to 0.01% of Cr;O;. Care was 
taken to avoid resonant absorption of sound by the 
paramagnetic ions.!! 

For the generation and detection of the ultrasonic 
waves we used two methods: At 500 and at 3000 
Mc/sec two x-cut quartz disks with a fundamental 
resonance frequency of 10 Mc/sec were bonded to the 
sample and excited in a high harmonic. For some experi- 
ments at 3000 Mc/sec both sides of our samples were 
plated with films of permalloy, 90% Ni, 10% Fe, 


* R. Nava, R. Arzt, I. S. Ciccarello, and K. Dransfeld, Phys. 
Rev. 134, A581 (1964). 

*e Nole added in proof. Arguments for this mutual interac- 
tion of parallel phonons have also recently been given by S. 
Simons [Proc- Phys. Soc. (London) 82, 401 (1963) ]. 

3 P. G. Klemens, in Handbuch der Physik, edited by S. Fligge 
(Springer-Verlag, Berlin, 1956), Vol. 14. p. 1. 

10 Supplied by R. Meller Co., Providence, Rhode Island. 

u E. B. Tucker, Phys. Rev. Letters 6, 183 (1961). 
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about 5000 À thick for the generation and detection of 
sound by ferromagnetic resonance.” By using standard 
pulse methods sometimes more than 100 acoustic echo 
pulses were received. 

Figure 1 shows a typical pulse pattern, the time 
between two consecutive pulses corresponds to the time 
for one round trip in the rod. The decay of the pulse 
pattern is hardly ever exponential. The reasons for the 
irregular amplitudes of the various pulses are several : 
the end faces may not bé sufficiently parallel, the rod 
may not be elastically homogeneous, etc., all of these 
effects being temperature-independent. The absorption 
at a temperature T was determined from the decrease 
of an echo pulse relative to its magnitude at 4.2°K. 

With ferromagnetic excitation we occasionally ob- 


1.0 


0.1 


Absorption (dB/cm) 


t 
20 30 40 50 60 70 80 90100 
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Fic, 2. Attenuation of longitudinal waves in ruby as a 
function of temperature. 


? H. Bommel and K. Dransfeld, Phys. Rev. Letters 3, 83 (1959). 
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Fic. 1. Pulse pattern of 3-kMc/sec 
longitudinal waves in ruby at 4.2°K 
The waves are excited and detected 
ferromagnetic resonance in thin films 
of permalloy (90% Ni-10% Fe) on 
each side of the ruby rod. 


served a drastic change in the relative amplitude of the 
peaks on varying the intensity or direction of the mag- 
netic field. We believe that this is caused by inter- 
ference effects which arise from mechanical or magnetic 
nonuniformities of the films, or the excitation of Walker 
modes. By improving our deposition methods and by 
using films of small cross section these phenomena 
could be avoided. 

Through our long ruby rod only longitudinal waves 
could be transmitted well. The propagation of trans- 
verse waves proved to be considerably more difficult. 
In general, only one transverse echo could be seen. This 
difficulty probably arise from the acoustical birefring- 
ence, so that along the c axis the acoustic wave vector 
and Poynting vector are not parallel, and the acoustic 
energy does not travel parallel to the rod axis. The ab- 
sorption of longitudinal waves versus temperature is 
plotted in Fig. 2. Several runs at 3000 Mc/sec, with 
different ruby rods carrying different magnetic films 
gave the reproducible results of Fig. 2: The absorption 
increases with temperature approximately as T'. For 
the investigation of the frequency dependence, meas- 
urements at 500 Mc/sec, which agree with the results of 
Fitzgerald and Truellj? are also included in Fig. 2; 
the frequency dependence is close to linear. 

"The absorption of longitudinal and transverse waves 
in MgO is plotted versus temperature in Figs. 3(a) and 
3(b). The data for longitudinal waves, although not as 
accurate as the transverse data, clearly indicate that 
the frequency dependence is again close to linear, 2D 
the temperature dependence not far from T". 

With longitudinal waves the acoustic power level 
was kept low enough to avoid the nonlinear effects Ic 
ported by Shiren,“ and for transverse waves such à 
power-dependent absorption could not be observed even 

13T. M. Fitzgerald and R. Truell, research report, Brown 


University, 1963 (unpublished). 
MN. S. Shiren, Phys. Rev. Letters 11, 3 (1963). 
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al considerably higher power levels than those used by 
ind is n knowledge, for shear waves in the 

Mc/sec range, these are the first data on the tempera- 
ture dependence in a material other than quartz. 

Both for transverse and longitudinal waves in MgO 
a noticeable additional absorption occurred around 
15 K, when the absorption was measured in zero mag- 
netic field. Figure 3(a) also shows this extra absorption 
at 3000 Mc/sec, which could be removed by applying 
a magnetic field of the order of 1000 Oe. A similar 
phenomenon was observed by Bolef el al? at lower 
frequency and lower temperature in a more drastic way 
in heavily doped MgO. More experiments are planned 
for a detailed investigation of this anomaly which is 
probably caused by paramagnetic impurities. 


DISCUSSION 


Our experiment results may be summarized by saying 
that at low temperatures for both transverse and longi- 
tudinal waves the absorption increases linearly with 
frequency and with the fourth power of the temperature. 
In perfect crystals the absorption is due only to scat- 
tering by other thermal phonons, and is possible only 
if the potential energy developed in powers of the strain 
contains cubic or higher order terms. Landau and 
Rumer‘ considering the cubic terms only calculated the 
absorption of transverse waves for low temperatures, 
when «6721. They found F 

~4(20+6R)?(1—Vi/V1)2——#T', (l 
ar 1 (20+6R)?( i V z) ETAT ) 
where Q and R are constants of the order of 5X10” 
for ruby and MgO, £ is the Boltzmann constant, and 
p the density. The agreement with our results on MgO 
is satisfactory. The absorption is of the right order of 
magnitude and varies with temperature as T*, as ex- 
pected. Expression (1) was derived only for transverse 
waves, with the observation that the conservation of 
energy and momentum rules out three-phonon proc- 
esses for the absorption of longitudinal waves. 

Pomeranchuk? invoked the much smaller fourth- 
order terms of the elastic energy to account for the scat- 
tering of longitudinal phonons. The absorption accord- 
ing to his calculations should be proportional to cT", 
which is in strong disagreement with our experiments, 
both for ruby and MgO. In addition, Orbach’? has 
already estimated, for the case of quartz, that Pomeran- 
chuk’s calculations give an absorption which is several 
orders of magnitude smaller than the observations. We 
may therefore safely conclude that a four-phonon 
process is not responsible for the observed absorption 
of longitudinal waves. : esto 

On the other hand, the Herring mechanism also 
cannot account for our results. The absorption due to 
this process was calculated by Simons? for cubic crystals. 


15 D. I. Bolef, J. DeKlerk, and R. B. Gosser, Rev. Sci. Instr. 
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For a longitudinal wave propagating parallel to the (100) 
axis he found 


a= AWT. (2) 


The quadratic frequency dependence is not compatible 
with our observations. The factor A depends on the 
crystal and can be estimated for MgO, with the result 
that for our frequencies and temperatures the absorp- 
tion according to (2) is about three orders of magnitude 
smaller than the measured values. For ruby the ab- 
sorption by the Herring mechanism has not been calcu- 
lated numerically, but should be proportional to «7?, 
which agrees even less with our experiments. 

In view of the fact that our data with longitudinal 
waves behave as if they are caused by three-phonon 
processes, and in view of the difficulty in explaining 
them in any other way, we proceed to re-examine in 
more detail the absorption of longitudinal waves by 
three-phonon processes. We have already shown classi- 
cally in paper I that in isotropic solids the absorption of 
longitudinal waves by three-phonon processes is of 
similar magnitude as the absorption of transverse waves, 
But the fact that a longitudinal acoustic wave can 
interact in three-phonon processes with almost as 
many thermal phonons as a transverse wave may be 
shown in a simpler argument, which will be presented 
here: 

The conservation of energy and momentum requires 


611-9277 ws, (3) 
k, +k, me k; D (4) 


where the index “1” refers to the ultrasonic phonon. 
Since in our temperature range |Ail«&|Es;| the mo- 
mentum conservation (4) can be expressed in terms of 
the angle a between the acoustic wave vector ky and 
k, (see Fig. 4). 


cosa (|| — | £s] )/ LA] , 
or, because of 
[kı] =w/ ; 


we can write for (4) 


vı f/93—02 5 
cosa = ET ) ; (5) 


Tbh €1 


| ka,a! =w2,3/Tph » 


where a is the angle under which a thermal phonon 
with velocity vj, can interact with an acoustic wave of 
velocity v; with conservation of energy and momentum. 

But if energy is not exactly conserved because of the 
uncertainty in the energy arising from a finite relaxation 
time of 0 of the thermal phonons, we find 


63—u3—01 € 1/8 


and cosa, according to (5), can assume any value be- 
tween the two limits 


71 1 71 1 E 
COSA max = =(1 -—) 3 COSmin= = (t= . (6) * 
Tph Q1 Tph o. 
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Clearly, because of the uncertainty in energy, a thermal 
phonon can interact with an acoustic phonon between 
the angles amax and amin without violating the conser- 
vation of energy and momentum. For transverse waves, 
for example, we find from (6) 


2cosa 1 
Q max — min Aa= DEO (7) 
sino «10 
"This corresponds to a relative solid angle 
AN 2mrsina 2cosa 1 1 
SS CE (coso)— . (8) 
N 4r sina wl w0 


For longitudinal waves (v1— tph, &min=0, collinear case) 
we find from (6) 
1 (Aa)? =1/%10, 


and for the relative solid angle 


AN Tama (>) 
aC ORD 


(9) 


On comparing (8) and (9) it is evident that the two 
solid angles are almost identical for any relaxation time 
0, as long as «307 1, and the dispersion is neglected. 


en 
= [N 
KS |kl- Ik 


Fic. 4. Momentum conservation for the absorption of an 
acoustic phonon with wave vector k, by a thermal phonon of 
wave vector ke. ks is the wave vector of the final phonon in this 
three-phonon process. 
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Fic. 3. Attenuation of ultrasonic 
waves in MgO as a function of 
temperature. The dashed curve 
gives the attenuation in zero ex- 
ternal magnetic field; the continu- 
ous ones in a field of about 4500 
Oe. (a) Longitudinal waves, (b) 
transverse waves. 


3000 Mc/sec 


100 400 1000 


Temperature (°K) 


(b) 


One should therefore expect that a quantum-mechanical 
treatment of the ultrasonic absorption due to three- 
phonon processes should give very similar results for 
transverse and longitudinal waves. We made such a 
calculation, following the method of Landau and 
Rumer, and we found for the absorption of longitudinal 
waves due to the interaction with collinear thermal 


phonons: 
ki 


ar~3[6P P} oT". (10) 
p? V 7°38 


P is a constant of the same order of magnitude as Q 
and R in Eq. (1), and therefore the absorption of 
longitudinal waves is not too different from the ab- 
sorption of transverse waves. 

In his argument we have neglected the dispersion. 
As shown in paper I this neglect is justified as long as 


wð 2r (Tp/T , 


with Tp being the Debye temperature of the crystal. 
For our samples below 60°K 2x(Tp/T)?~1500 and 
for our frequencies w0 is indeed smaller than 1500, if 
the mean free path of the thermal phonons remains 
below 1 mm. Thus, the dispersion can be neglected. 

It seems to be a characteristic feature of cubic crys- 
tals that the absorption of longitudinal waves is notice- 
ably stronger than for transverse waves. This is evident 
from our measurements on MgO and was also found in. 
Ge by Dobbs eż a]. 


11 E. R. Dobbs, B. B. Chick, and R. Truell, Phys. Rev. Letters 
3, 332 (1959). 
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Measurements of the magnetic susceptibility 


b range where this allo of some dilute Zn-Mn alloys are reported in the temperature 


€ loy system is known to exhibit low-temperature resistance anomalies. All of the alloys 
Rene ur E with the most concentrated alloy (0.43-at.% Mn) exhibiting deviations in the 
RL . The Curie constant indicates that manganese when dissolved in zinc has a spin value 
prs NS rM qi a between 5 and 273°K. Calculations of the exchange integral J 
e ge r properties of dilute Zn-Mn alloys have been made from the Néel temperature, the magne- 
bcd : s a function of field strength and the suppression of the superconducting critical temperature 

sed on theoretical equations which assume a scalar s-d interaction between the paramagnetic ions and 
conduction electrons. All of the derived values of J lie in the range of (1.42-0.2)X 107?? erg which would 
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to 
» s n that the magnetic interaction giving rise to these various effects is of the same nature. 
ds ue for RE. the resistance minimum from the total resistivity, in the presence of a 
3 produced by magnetic ordering in the Zn-Mn system, is presented. 
INTRODUCTION ciently great to result in a resistive maximum. It is the 
HEORY predicts? and some experiments sub- Purpose of the present paper to present more complete 
stantiate?-5 the fact that the critical temperature — dat? on the spin value of manganese in zinc and to 
is suppressed in superconducting elements containing Calculate and compare values for the s-d exchange 
small concentrations of transition metal impurities. Integral J found from electron transport and magnetic 
Recently, measurements have been reported? on the Properties. A comparison of the value of J found for 
suppression of the superconducting transition tem- the Zn-Mn system will be made with the values of 
perature in a system that had been earlier reported to J found fora system possessing only a resistive anomaly 
| exhibit a low-temperature resistive anomaly.’ It has (Cu-Mn) and only a superconducting transition 
been shown theoretically^? and illustrated experi- (AI-Mn). 
mentally for the? Zn-Mn system that a sharply energy- 
dependent conduction-electron relaxation time in the MAGNETIC SUSCEPTIBILITY RESULTS 
neighborhood of the Fermi energy will produce a resist- AE 
| ance rising with decreasing temperature to some finite Table I lists the results of the magnetic susceptibility 
Í value at absolute zero. It is observed in the Zn-Mn sys- measurements. 
i tem that the resistive characteristic flattens off following Nominal manganese concentration was obtained 
| the so-called resistance minimum. A preliminary re- from a plot of “relative residual resistance" versus 
port? of the magnetic properties of this system show a — *weighed-out" concentration. Figure 1(a) shows the 
| weak antiferromagnetism and it is believed that al- collected data from several series of alloys. In Fig. 1(b) 
3 though the resistivity is decreased by the decrease of the room temperature susceptibilities of some rolled 
spin scattering on ordering, the decrease is not suffi- strips and the cast pieces used in the present temper- 
~ CERE EF ature runs are plotted against concentration obtained 
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1 W. Baltensperger, Helv. Phys. Acta., 32, 197 (1959). 

2 A. A. Abrikosov and L. P. Gorkov, Zh. Eksperim. i Teor. Fiz. 
12, 1243 (1961) [English transl: Soviet Phys.—JETP 39, 1781 


(1960). 


from relative residual resistances of rolled strips. The 
susceptibility measurements were regarded as being the 
most accurate and deviations of the points from the 
best straight line were attributed to concentration 
errors. Figure 1(b) was therefore used to obtain the 
sample concentration used in subsequent calculations. 

In the temperature runs, relative susceptibility was 


j aa Boato, G. Gallinaro, and I. Rizzuto, Phys. Letters 5,20 measured and converted to susceptibility from the 
{ ( 4 a8) "Merriam and D. P. Seraphim, Colgate Superconductivity separately-measured room temperature values. Figure 
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3 C. A. Domenicali, Phys. Rev. 117, 984 (1960). " 
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2 shows the alloy susceptibility plotted against re- 
ciprocal temperature in the region between room 
temperature and liquid nitrogen. In Fig. 3 the inverse 
of the manganese component of susceptibility (AX-*) 
is shown as a function of absolute temperature in the 
temperature range between liquid nitrogen and liquid 
helium. An insert shows the behavior of the most con- - 


centrated alloy in the vicinity of the Néel point. If one ~ 
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TABLE I. Magnetic susceptibility samples with the derived concentrations, Curie constants, and calculated magneton numbers, 


Effective magneton 


Nominal Concentration " 1 7 
concentration from AX data 10% X at room 105X Curie constant r number; Post 
from Fig. 1(a) [Fig. 1 (b)] temperature Room temp. Nitrogen Room temp. Nitrogen 
Sample weight (%) weight (%) emu/gm to nitrogen to helium to nitrogen to helium 
Pure Zn $i —0.140 
VII 0.042 0.046 — 0.077 1.5; 1.6; 3.8; oe 
. p 
IV 0.18 0.20 +0.121 7.55 1.65 4.03 4.1, 
II 0.43 0.40 +0.381 18.0 15.2 4.45 4.1 


» The effective magneton number for this sample is also determined from the zero susceptibility value as shown in Fig 


considers that (dX—1/dT)ry=0 defines the Néel tem- 
perature, a value for Tw of 6+1°K is obtained for 
sample II. From the linear portions of Figs. 2 and 3, 
Curie constants were calculated and using concentra- 
tions obtained from Fig. 1(b) these were converted into 
effective magneton numbers. The desired results are 


o Ol o2 03 “04 0.5 
Mn CONCENTRATION (wt %) 


Io6Ax (emu/g) 
ZB Be 
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Fic. 1. (a) Resistance ratio as a function of weight percent 
manganese determined from spectroscopic analysis or quantity of 
manganese added to the melt. The solid circles represent data 
from the thesis of W. B. Muir, University of Ottawa. (b) The 
increase in susceptibility of pure zinc as a function of manganese 
concentration as determined from the resistance ratio of the alloys 
measured. = 


r 


z CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


221 


shown in Table I from which it can be seen that the 
spin state of the manganese ion in zinc corresponds to a 
value of 2 which would give an effective magneton 
number of 3.9.!! 


ANALYSIS OF THE MAGNETORESISTIVITY DATA 
FOR Zn-Mn SYSTEM 


In an earlier publication? it was shown that the 
magnetoresistivity of dilute Cu-Fe alloys can be ex- 
pressed as the sum of a normal magnetoresistive term 
p, which is temperature-independent and obeys 
Kohlers’ rule and an anomalous term p, which is nega- 
tive and decreases with decreasing temperature. The 
anomalous term p, can be theoretically accounted for 
semiquantitatively for these alloys although the theory 
itself does not contain an explanation of the resistive 
minimum.“ The magnetoresistive behavior for an 
alloy containing 0.11-at.% Mn is shown in Fig. 4 where 
it can be seen that the magnetoresistance is negative in 
sign and saturates at a field value of approximately 
32 kOe. Shown in the insert of Fig. 4 is the magneto- 
resistance plotted as a function of H? where it can be 
seen that the magnetoresistance beyond the saturation 
value is a function of H?. This we attribute to the 


normal magnetoresistive component, pn. Using this. 


device we can then separate p, from prota and obtain 
ps aS shown in Fig. 4. 

Using Kasuya's expression for the term p, and neglect- 
ing small terms, we can write: 


Prt _ (b--C) n, / (b--C) Heo (1) 
PH (b+ GAA (blja) Bese : 
and 
3am*c (b-+(92)ay)? 
a pa (2) 
2htNe b+C 
where 
COE UD 
—Jz av — 2/)nV o TET Em 
I e ny 


1 t should be pointed out that the earlier reported (Ref. 10) 
spin value of 2 for a concentrated Zn-Mn alloy resulted from # 
determination of the Curie constant too near the Néel temperature: 
(1964) Muto, K. Noto, and F. T. Hedgecock, Can. J. Phys- 42,1 

13 K, Yosida, Phys. Rev. 107, 396 (1957). 

uT, Kasuya, Progr. Theoret. Phys. (Kyoto) 72, 228 (1959). 
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ROLE OF S-p EXCHANGE 


, Fic. 2. Total susceptibility of the 
zinc manganese alloys and pure zinc 
as a function of reciprocal temperature 
between room temperature and the 
temperature of liquid nitrogen. 


Fic. 3. The inverse of the suscepti- 
bility difference between the alloy and 
pure zinc as a function of temperature. 
The inset is an enlargement of the 
temperature interval below 40*K for 
sample II. 
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Fic. 4. Resistivity as a function of magnetic field strength for a 
Zn-Mn alloy containing 0.11 at.% Mn as determined from Fig. 
1(a). The inset is the resistivity as a function of H? showing the 
normal magnetoresistivity component p, as a dashed line. This 
normal component is also shown as a dashed line in the resistivity 
versus H plot and the dotted line is the difference (o—p;5) and 
hence represents ps. 


and b= (4/J)?; J is the s-d exchange integral; A is the 
Coulomb interaction constant; »*, e is the effective 
mass and charge of the electron, respectively; ( is the 
Fermi energy of the solvent; c is the concentration of 
paramagnetic ions per unit volume; NV is the total 
number of atoms per unit volume; « is gusH/kT; 
(jz)av is the average of the Z component of the para- 
magnetic ion angular moment vector j; (j2). is the 
average of the square of the Z component of the para- 
magnetic ion angular momentum. From Fig. 4 the 
saturation value of the magnetoresistance normalized 
to the zero-field resistivity value is 


PEya1/PHo= PH./ pu, 0.960. 


Then from Eq. (1) and using the value of /—5—2 from 
the susceptibility data a value of b can be found and, 
hence from Eq. (2), A and J calculated. The values of 
A and J are found to be 8.7X10-" and 1.4X10 erg, 
respectively. 


ESTIMATES OF J FROM THE NEEL TEMPERATURE 


Owen et al.1* have given a molecular field derivation 
of the Curie point in terms of the s-d exchange integral 
J.15 By writing the conduction electronand paramagnetic 

‘ion magnetization in terms of an s-d exchange inter- 
action constant they find a nonzero magnetization when 
the temperature is equal to, S(S+1)J°C/2ks. If we 
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identify this temperature with the experimentally 
observed Néel point!" we may then estimate a value of J, 
Using the most concentrated alloy which shows a Néel 
point of 6°K and using the Fermi energy of 1.64 10-1 
erg for zinc yields a value of J equal to (1.52-0.2) x 10-2 
erg which is in good agreement with the value derived 
from the magnetoresistivity data. 


SUPPRESSION OF THE CRITICAL TEMPERATURE 
IN DILUTE ALLOYS 


The experimental situation at the present time indi- 
cates that if a transition metal impurity possesses a 
localized moment which corresponds closely to the 
free-ion value, then a suppression of the superconduct- 
ing critical temperature will result. Baltenperger! has 
considered the influence of an s-d exchange interaction 
on the superconducting properties of a dilute para- 
magnetic alloy. The expression derived by him is 


dT, /dc— —2.49J?s (s--1)vo/kN, (3) 
where vo is the density of states at the Fermi level and 
the other symbols have the same meaning as before. 
The above expression is only valid if there is no spin 
ordering of the conduction electrons due to a coopera- 
tive magnetic interaction of the paramagnetic ions. 
Whether this condition is satisfied for the Zn-Mn 
system can be investigated by requiring that the field 
0" seen by the nonsuperconducting electrons is smaller 
than the thermal energy corresponding to the super- 
conducting transition temperature. Baltensperger has, 
in fact, shown that 


d0^/dc- voH?s (s+1)/6kN . 
The inequality to be satisfied is then that 
907/0c« 9T ,/dc. 


Using a value of J found from the previous analysis and 
assuming vo to be concentration-independent, yields à 
value of approximately 10°K/at.%. This value 1$ 
certainly less than the value of 170°K/at.% reported 
for the Zn-Mn system in studies of the suppression 0 
the critical temperature by Boato et a1? Having shown 
that this inequality is satisfied we can then use Eq: (3) 
derived by Baltensperger to calculate a value of the 
exchange integral J from the suppression of the critical 
temperature. This equation yields a value of 116100 
erg, which is in good agreement with the previously 
derived values. 


15 J. O. Owen, M. Browne, W. D. Knight and C. Kittel, Phys: 
Rey. 102, 150 (1956). " the 

16 This derivation neglects the nonuniform polarization of x 
conduction electrons and the resulting influence on the paramag” 
netic ion magnetization which undoubtedly forms a most SI 
nificant aspect of any such calculation. end 

1 Only for the simplest antiferromagnet will the Curie Jid 
Néel temperatures be equal. See C. Kittel Introduction to 190 X 
State Physics (John Wiley & Sons, Inc., New York, 1953), p. 7^ — 
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RESISTANCE ANOMALY AND THE 
s-d EXCHANGE INTEGRAL 


According to Yosida? and Kasuya* it should be 
possible to obtain a value for the s-d exchange integral 


in a dilute alloy exhibiting an antiferromagnetic order- 


ing from the change in resistance above and below the 
Néel temperature 


«msc 


2e8hcN. 


Ap=py—po= J^. (4) 


This graphical method for evaluating Ap is shown in 
Fig. 5(a) and in applying this method Yosida has 
assumed that the resistivity in the absence of a coopera- 
tive interaction would be temperature-independent. If 
the resistivity below the Néel temperature is the re- 
sultant of two opposing effects, viz., the resistance 
minimum mechanism and the cooperative interaction, 
the true Ap would be (pi—po) of Fig. 5(a). (For the 
Cu-Mn alloy referred to in Vosida's theoretical paper, 
application of this idea would lead to more than twice 
his value of Ap and consequently a 50% increase in the 
calculated value of J to 3.0X10-? erg resulting in 
better agreement with the theoretically free manganese 
ion value of 3.5X 10-? erg.) The true situation for this 
model, however, is not clear when the region over which 
the cooperative interaction sets in severely overlaps the 
region in which one would otherwise expect a resistance 
minimum resulting in the resistivity characteristic of 
Fig. 5(b). In estimating Ap for a Zn-Mn alloy it is 
better to select an alloy sufficiently dilute that the 
antiferromagnetic transition is more clearly separated 
from the minimum. As indicated earlier we are propos- 
ing that in the Zn-Mn system the influence of the 
cooperative magnetic interaction exactly compensates 
the increase in resistance with lowering temperature 
characteristic of the resistivity minimum alone, result- 
ing in a flattening-off of the resistivity curve below 
Tx [Fig. 5(b)]. In order to obtain Ap we produce the 
resistivity curve as a circular arc having the same radius 
as the solid curve above Ty resulting in a value of pi 
from which Ap=pi—po can be obtained. Applying this 


P 


Fic. 5. (a) The resistivity as a function of temperature for a 
typical high concentration Cu-Mn alloy. (b) The resistivity as a 
function of temperature for a typical high concentration Zn-Mn 
alloy. The dashed lines represent the extrapolated curves. 
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Fic. 6. The resistance ratio as a function of temperature of a 
0.15 and 0.17-at.% Zn-Mn alloy. The resistivities of these two 
alloys is 6.7 and 6.9 4Q-cm, respectively. Tw is the corresponding 
temperature indicated in Fig. 5. 


method to the Zn-Mn alloys of concentration 0.15-at.% 
Mn and 0.17-at.% Mn (Fig. 6) yields an average value 
for J of 1.510-” erg. 

There is good agreement between the value of J so 
calculated and that calculated from the magneto- 
resistance and the Néel temperature. This can be taken 
as evidence that the extrapolation procedure and its 
implications are valid, particularly since a direct 
application of the unmodified Yosida method would - 
yield a Ap of 0 for Zn-Mn alloys. 


CONCLUSION 


An estimate of J from the decrease in spin scattering 
on magnetic ordering in the Cu-Mn system has been - 
estimated by Yosida to be 2.1X10-” erg for a man- 
ganese ion of spin $. This value of J is higher but of the 
same order of magnitude as the value of 15%10-# —— 
found for the zinc-manganese alloys reported in the 
present paper. An estimate for J from the suppression 
of the superconducting critical temperature for the - 
Al-Mn system? yields a value of 0.23X 10-" erg for an 
estimated spin value for manganese of 4. The fact that — 
manganese possesses a very small moment, if any, wi n 
dissolved in aluminum! would indicate that tl 
mechanism responsible for the suppression of Tein the — 
Al-Mn system is due to a cause other than that re- — | 
sponsible for the observed effects in the Zn-Mn system. 

Calculations of the exchange integral from the mag- 
netic properties of dilute zinc alloys containing sma 
concentrations of manganese have been made from th e 
Néel temperature, the magnetoresistivity as a function 


of field strength and the suppression of the supercon- 
ducting critical temperature. The theoretical equatior 
used for these exchange integral determinations are 
based on a conduction electron-paramagnetic ion 
change interaction of the form 2/ S.s. The analysis € 
the experimental results based on this interaction 

yield the same value for the exchange integral wł 


‘SE. W. Collings and F. T. Hedgcock, Phys. Rev. 126, 16 
(1962). € : 
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Pd 
could lead one to conclude that the nature of the mag- total resistivity in the presence of a resistance maximum 
netic interaction that leads to these phenomena is of the produced by magnetic ordering. 
same physical origin. Graphical analysis of the low- 
temperature resistance characteristics of dilute Zn-Mn ACKNOWLEDGMENTS 
alloys, assuming the sum of two terms; one due to - 
magnetic ordering and one due to resonance scattering The authors would like to gratefully acknowledge T 
results in a value of J in agreement with that derived discussions with Dr. M. Merriam and Dr. W. B. Muir 
from other transport phenomena. This then suggests a and correspondence with Dr. I. Rizzuto, Dr. T. S. 
method of separating the resistance minimum from Dugdale, and Dr. M. Bailyn. a 
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Covalency in Crystal Field Theory : KNiF; 
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The theory of covalency in crystal field phenomena is examined, using, as example, the Ni-Fs complex in 
KNiF;. The Hund-Mulliken-Van Vleck molecular orbital-linear combination of atomic orbitals treatment 
is followed. The role of the antibonding azd bonding electrons in the complex is discussed from a multi- 
electron point of view. The exact selí-consistent one-electron Hamiltonian is discussed in some detail. 
Emphasis is placed on elucidating the source and nature of the covalent effects appropriate to the various 
physical phenomena. We find that it is the covalent mixing of those bonding electrons having no antibonding 
partners which contribute to all experimental observables (including the crystal field splitting 10 Dg, 
transferred hyperfine interactions, neutron magnetic form factors, and superexchange interactions). This 
view of covalency differs markedly from the one followed by Sugano and Shulman, in that the covalency of 
the antibonding electrons, which are assigned the sole role in their approach, is totally irrelevant. Quantita- 
tive numerical estimates (using approximations to the exact Hamiltonian) are given for the two models 
of the covalent effects in KNiF;, i.e., “unpaired” bonding and antibonding; they are shown to differ strongly. 
The relative roles of overlap and covalency are discussed; covalency is found to play an important but | 
by no means dominant role. Numerical agreement between the present inexact cluster theory and experi- 


^ 


ment is found to be poor. The various sources of this disagreement are reviewed. 


I. INTRODUCTION 


RYSTAL field theory!” has had a long and varied 
history: when treated as a semiempirical theory, 

with the crystal field splitting 10 Dg considered as an 
adjustable parameter, it has been highly successful in 
fitüng experimental data; when considered as a funda- 
mental theory for the behavior of transition metal ions 
in crystalline fields, it has been strikingly unsuccessful 
in predicting, from first principles, the fundamental 
parameter 10 Dg. Following the pioneering computa- 
tions of Van Vleck? and Polder,‘ a series of theoretical 
investigations’ succeeded in pinpointing the basic 


* Supported by the U. S. Air Force Office of Scientific Research. 
1H. Bethe, Ann. Physik 3, 133 (1929). 

2 J. H. Van Vleck, Phys. Rev. 41, 208 (1932). 

3 J. H. Van Vleck, J. Chem. Phys. 7, 72 (1939). 

4D. Polder, Physica 9, 709 (1942). 

5 W. H. Kleiner, J. Chem. Phys. 20, 1784 (1952). 
^9 H. S. Jarrett, J. Chem. Phys. 31, 1579 (1959). 

1 Y, Tanabe and S. Sugano, J. Phys. Soc. Japan 11, 864 (1956). 
8 J. C. Phillips, Phys. Chem. Solids 11, 26 (1959). 
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shortcomings of the theory, and indicated the need for a 
multielectron many-center molecular approach. 

The recent work of Sugano and Shulman” (henceforth 
denoted as S&S III), representing the most detailed 
computations undertaken to date, attempted to obtain 
a quantitative basis for the theory by including all the 
terms in the ionic model considered by their predeces- 
sors, as well as the effect of metal ion-ligand covalent 
mixing. A cluster model consisting of the metal ion and 
its nearest ligand neighbors in an external Madelung- 
like potential was invoked. Considering KNiFs, they 
obtained a theoretical cubic field splitting parameter 
(10 Dg) for Ni which gave the first quantitative indica- 
tion that covalency plays an important role in the 
crystal field interactions of salts as highly jonic as 


? A. J. Freeman and R. E. Watson, Phys. Rev. 120, 1254 (1960); 
This reference also contains a useful review of the theoretic? 
investigations prior to the work of Ref. 10. : 

10S, Sugano and R. G. Shulman, Phys. Rev. 130, 517 (1963); 
henceforth denoted as S&S TII. 
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COVALENCY IN CRYSTAL 
KNiF». In addition, they considered the role of the 
computed covalent mixing in the F? 
hyperfine interactions." 

While the role of covalency in superexchange,! and 
transferred hyperfine effects!t42—7 have been particular 
objects of study, overlap and covalent mixing signif- 
icantly affect almost every observable in an iron series 
salt. Covalent behavior must be understood in detail 
before matters such as the theory of superexchange can 
be put on any sort of quantitative basis. ; 

Sugano and Shulman’s modification of crystal field 
theory to include covalency in the cluster model may 
be viewed as an extension of the traditional approach 
to crystal field effects in ionic salts. In KNiFs, the 
familiar triply (tz) and doubly (ej) degenerate 3d 
orbitals, describing the Ni** ion in its ground-state 
configuration (¢;%,%e;"), in the ionic model, become 
antibonding orbitals in covalent theory. These orbitals 
are shown schematically in Fig. 1. Also included are the 
bonding orbitals, which are formed predominantly from 
ligand 2s and 25 functions. Since their energy is lower 
than that of the antibonding orbitals, they are com- 
pletely occupied. According to S&S, it is the covalency 
of the antibonding electrons which must be considered 
in order for the experimental observables to be repro- 
duced. Thus, in the theory of the transferred hyperfine 
effects only the covalency of (spin) unpaired antibond- 
ing 3d electrons of majority spin contribute, while the 
crystal field splitting is associated with the 3d antibond- 
ing electrons of minority spin. Sugano and Shulman 
utilized the conventional approach of treating electrons 
of either spin identically and therefore obtained a 
common estimate of this covalent mixing for the two 
phenomena. In this approach, the bonding electrons are 
considered to play no role (other than to provide 
orthogonal partners for the antibonding electrons). 

In this paper, the theory of covalency in crystal field 
phenomena is examined using the cluster model for 
KNiF, as example. We find the covalent mixing of 
those bonding electrons which have no antibonding 
partners, to be the appropriate manifestation of 
covalency in experiment. Unlike the antibonding version 
described above, it is the covalent mixing of the same 
(unpaired bonding) electrons (of minority spin in 
KNiF;) which contributes to both 10 Dg and the 


transferred 


1 K. Knox, R. G. Shulman, and S. Sugano, Phys. Rev. 130, 
512 (1963); denoted as K, S&S II. R. G. Shulman and S. Sugano, 
ibid. 130, 506 (1963); denoted as S&S I. R. G. Shulman and 
K. Knox, ibid. 119, 94 (1960) [in particular, see the discussion 

7 . (1) and (4)]. 
EEG ee OF elem Solid State Phys. 14, 99 (1963); 
of the references cited therein, see, in particular, J. Kondo, 
Progr. Theoret. Phys. (Kyoto) 18, 541 (1957). — * ; 
13 F. Keffer, T. Oguchi, W. O'Sullivan, and J. Yamashita, Phys. 
553 (1959). = = 
Soe aa Be Roy. Soc. (London) A236, 549 (1956). 

15 A. M. Clogston, T. P. Gordon, V. Jaccarino, M. Peter, and 
L. R. Walker, Phys. Rev. 117, 1222 (1960). M 

16 A. J. Freeman and R. E. Watson, Phys. Rev. Letters 6, 343 


1961). 
l 17 * Marshall and R. Stuart, Phys. Rev. 123, 2048 (1961). 
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Fic. 1. A schematic representation of the antibonding and 


bonding electrons grouped into bonding-antibonding pairs and 
unpaired bonding electrons. For each, the predominant atomic 
orbital, from which the molecular orbital is formed, is indicated in 
parentheses; the F ligand 2s, 2p,, and 25. orbitals are denoted 
simply by s, ps, and pr 


transferred hyperfine interactions and, in fact, to all 
experimental observables. The covalency (but not the 
overlap) of the antibonding electrons is entirely 
irrelevant because it is exactly compensated by the 
covalency of their bonding partners. This agrees with 4 
several features of the role of bonding electrons in 
transferred hyperfine theory which have been described. 
in the context of the Heitler-London method by Keffer 
et al? and for the molecular orbital (MO) method by 
Clogston ef al.5 These matters are important for if we 3 
solve the Hartree-Fock equations for the two types of 
covalent mixing we find them to differ markedly. This 3 
difference does not imply different orbitals for different 
spin in the sense of unrestricted Hartree-Fock (UHF) 
theory, because the antibonding covalency (which has 
no physical or variational meaning) may have any 
value—including that appropriate to the unpaired 
bonding electrons. (We briefly discuss, for the case of — 
Cr?*, the implications of “UHF covalent mixing.”) 
Two basic aspects of this crystal field theory are E 
considered in what follows: (1) the quantitative impli- — 
cations of the approximations made in the treatment of — 
a Hamiltonian, its matrix elements and related matters 
(2) the source and nature of the covalent and overlap — 
effects appropriate to the various physical phenomena. — 
We concentrate on the latter in some detail in the 
present investigation and leave some aspects of 
quantitative calculations to a future paper. Our stu 
of covalency are described in terms of the occup 
one-electron orbitals of the cluster. An alterna 
approach to crystal field problems consists of di 
with the unoccupied antibonding orbitals, i.e. 
antibonding holes (which are associated with 
unpaired bonding electrons). Such a treatment 
covalent mixing, and the resulting estimates of 10 Dg 
and other experimental parameters, is exactly e 
valent to that associated with the unpaired bond 
orbital treatment provided that certain restrictions 
be discussed later) are maintained. For this reason, 
will consider only one approach in detail but will m 
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brief contact with the unoccupied orbital prediction 
when describing results. 
In Sec. II we describe the covalent mixing of interest 
to us. A discussion of the one-electron Hamiltonian 
appropriate to the system follows (in Sec. III). Sections 
IV and V discuss the covalent mixing appropriate to 
transferred hyperfine and crystal field splitting effects, 
respectively. The role of the unpaired bonding electrons 
becomes apparent in these sections. Throughout, the 
traditional (and incorrect) antibonding model is ex- 
amined in parallel. In Sec. VI, we have cause to return 
to the matter of our Hamiltonian and the crucial role it 
plays in causing the two types of computed covalent 
mixing to differ. Since we use an approximate Hamil- 
tonian, as in Ref. 10, certain “self-energy” problems 
arise which lead to serious internal inconsistencies in 
the results. The nature of these inconsistencies are 
explored. Their resolution requires the use of a better 
approximation to the true one-electron Hamiltonian, a 
matter reserved for the future. The various matrix 
elements appearing in our calculations are discussed, 
and their numerical values are reported, in Sec. VIII. 
Results, a discussion of their implications for other 
physical parameters, and conclusions follow. 


Il. THE MOLECULAR ORBITAL APPROACH 


The molecular orbital approach is by now well known; 
hence, we will but briefly recapitulate some of the 
definitions and ideas required by us here. The antibond- 
ing molecular orbitals are defined by 


WA=N .(Ge—NsX2s—AoX2p0) , 
WA=Ni(G:—AxX2px) , 


(1) 
where the N; are normalization constants, the o's are 
Ni?*3d orbitals of / (xy, yz, and zx) and e (3?— y? and 
327—177?) symmetry, and the x's are appropriate linear 
combinations" of 2s, 20, or 2px atomic orbitals (y,’s) 
associated with the six nearest-neighbor F- ions. The 
N's are the covalent mixing parameters. The normaliza- 
tion constants are given by 


N [1-2 5,2, ENIHE2NNSSI ST oy 
NS [1—2X.5,H-,2]22, 2) 


where an S; is the overlap integral between X; and the 
gza Orbital with which it is being combined, and Sss is 
the overlap integral between X, and X,; it need not 
be zero valued, as is often assumed. 

One can alternatively speak of the covalent mixing 
associated with the bonding molecular orbitals, 


V.P = N (xu tY: PeF YsaX2pa) ? 
A V.P =N eo. (Xapo F Yope F oos X22) 3 
YP =N; (Xp Yi) D 


2 e 


(3) 
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where 


ING E [1 xs PENNE 2Necd ot 2ryeysoSe Ye Ya" ] 1 ? 
N= [1--2y,S -- 2NesO sol Z'Y Y e9 e 


AER RR) B (4) 


Ni = [14 dy Styx]? 9 


The bonding and antibonding orbitals are to be ortho- 
normal, so to lowest order 


N—ycb Ss, 
Ne=YetSe, 
Ar= Yr tSr, 
Yso = — Yos Sos- 


(5) 


The bonding and antibonding orbitals are to be 
eigenfunctions of the Hartree-Fock (H-F) equations 
for a (Ni— Fo) cluster in KNiF;, i.e., they are to satisfy 


(6) 


where h is the one-electron self-consistent H-F Hamil- 
tonian. In the present paper, we will follow Sugano and 
Shulman and approximate the effects of the crystal 
external to the cluster by an electrostatic Madelung-like 
potential V ext, i.e., we assume that there are no covalent 
or overlap effects, between the cluster and its environ- 
ment, which significantly affect the metal-ligand 
covalency of interest here. This approximation is 
necessary in order to make the problem tractable. (One 
can improve on the approximation by making Vex. the 
most accurate description of environmental effects 
obtainable with a local effective potential, but it is 
well to note that there are shortcomings in such an 
approach—shortcomings which are particularly signif- 
icant when one considers superexchange or any other 
interaction between ions in different clusters.) The use 
of such a Vex is compatible with a self-consistent H-F 
treatment for the cluster itself. 

One automatic result of a self-consistent H-F treat- 
ment is that a set of orthonormal one-electron eigen- 
functions is obtained. We are thus supplied with a test 
of the seriousness of any approximation we make in our 
Hamiltonian (or in evaluating matrix elements) which 
causes us to fail to have a proper self-consistent theory: 
The test consists of independently obtaining an anti- 
bonding orbital, Y4, and its bonding partner, V, and 
checking their orthogonality. We will see that the 
present approximate cluster theory does not always 
meet this orthogonality requirement. 

We will solve Eq. (6) for Y within a very limited 
function space, namely, one spanned by the molecular 
orbitals constructed from Ni? and ligand-free 10? 
orbitals. (One well might wish to allow greater vana 
tional freedom in a future calculation.) However, what 
is important to us here, is that the limited nature of this 
space in no way effects eigenfunction orthogonality or 


hY; — eV, A 


the self-energy problem to be discussed in Sec. VI- Letus - 
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now consider, in detail, the nature of the one-electron 
Hamiltonian. 


III. THE ONE-ELECTRON HAMILTONIAN 


We are interested in obtaining the exact one-electron 
H-F Hamiltonian, appropriate to Eq. (6), derived from 
a many-electron Hamiltonian consisting of kinetic and 
electrostatic terms, where the environment external to 
the (Ni— Fs) cluster is approximated by a Madelung- 
like one-electron electrostatic potential V ext (1;). The 
process is made trivial by the fact that the two H-F 
many-electron states of interest to us are single deter- 
minants constructed from orthonormal one-electron 
orbitals (¥4’s and W’s). For such states the one- 
electron Hamiltonian /;, for electron orbital i, is simply 


La 
hri) -iveex(—) 
a Tal 


1— Piz 
W;(r2)dt2 (7) 


+V ext (7s) + [re 


T12 


in atomic units. The œ summation (over nuclear 
potential contributions) is limited to the one Ni and 
six F nuclei for we have already introduced the potential 
accounting for all electrons and nuclei (presumed) 
external to the cluster. The Pi» operator permutes 
coordinates 7; and rs giving us the interelectronic 
exchange terms, which occur only between electrons of 
common spin. The sum over interelectronic terms is 
limited to the cluster and consists of contributions from 
all electrons in the cluster including the ith electron 
itself. This is allowed in a self-consistent H-F theory 
because the ith electron Coulomb contribution is 
exactly canceled by its exchange term. We will have 
occasion to return to the matter of including or exclud- 
ing an electron’s self-energy contribution to Eq. (7) 
later. 

We now with to re-express the interelectronic po- 
tential of 4 in terms of one-electron functions localized 
on the seven nuclei in the cluster, for it is in terms of 
these that the potential, of Eq. (7), can be evaluated. 
In this simple procedure the V's are expressed as linear 
combinations of the local atomic orbitals (LCAO’s), 
ie. V is broken up into Ni e and ligand X; orbitals 
[with the X,’s themselves expressed in terms of local 
orbitals (Vs) at the various fluorine sites], whereupon 
Eq. (7) may be written as 


h; (rı) = — iV Vex(71)+ Vix (r1) T Vz(ri) 
4-9 (r)4- V, ()-EVs (r) 
-VQ9(r)--Vsz(r). (8) 


As we shall see, the terms of the second line account 
for the important fact that the local one-electron 
functions of one center are nonorthogonal with those of 
another (i.e., S0) and that there Is covalent mixing. 
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Let us now consider the individual terms separately. 
Fa: This term is simply the H-F nuclear and inter- 
electronic Coulomb plus exchange potential for am 
isolated Ni** ion; 


Žyi 1— P; 


ENO e A pi (ra) -gj(ra)dr2. (9) 
yı j-all Nit* 
electrona 


Ti? 


Vi: These are similar contributions from the ligand 
ions; 


6 (=—Ly I1—P 
va E|—e x owe 
a=) Tia j =all electrons 712 
on ion a a 
3 
xus rid . (40) : 4 
E 


The above terms would make up the interelectronic — 
contributions coming from the cluster if there were zm 
neither covalent mixing nor any overlap between 
orbitals on different sites. The omission of overlap terms — E 
may be thought of as yielding a potential of the fom — 
one would have when the ions of the cluster are infinitely —— 
separated. It is important to note that such a potential — 
[actually the full first line of Eq. (8)] was used 
Sugano and Shulman in their work. The consequenc 
of this approximation will be discussed later. 
Next we consider the overlap contributions to the — 
potential. Vg? (rı) and Vs? (ri): The most obvious — 
overlap effects are associated with metal-ligand non- 
orthogonality. In the case of zero covalency, this as 
nonorthogonality is resolved by having \;=5S; w : 
y4;—0 for the bonding-antibonding pairs. Setting 
and y: to these values, we can inspect the overlap 
terms arising in Eq. (8). There are terms linear in 5, 


1 —P 
X;(r3)d va " 


Vs (ri) D2 25; f eno 


T12 


where the sum is over all bonding-antibonding pi 
Note that Eq. (11) is written in terms of the multi 
ligand molecular orbitals, X;. There are also higher or 
overlap terms in Sj, i.e., 


1— P 


gi(rz)dr2 


Vs (r)- 4X s| f «teo 


Tiz 


1—Pis : 
+ [xe x(n} +015. 
T12 Z 
These second-order terms are of the same order 
importance as Vs which involves an S; times. 
overlap charge density. Note that a e, appears t 
in the summations in these equations to a 
its two partners, X, and X,. Third and hi; 
terms are, of course, less important. 


z 
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E form 
1— Px à = 1—P1. 
Sut] fne (r2) X, (rz)dv24- | X” (ra) e| 
T12 *19 

E 20 31—Pu 
2 =25,, [ee X;(ro)dva (13) 
gj mae arsi [92 ext jese ux) re 
E 9 similar to what we have seen in Eqs. (11) and (12). 
me Finally, the simultaneous nonorthogonality of e, with 


k Fares eee se n 1 
as) ie Sg 
Ni B t 


Fic. 2. The charge density of 4m, of the u(-3z—r) 
orbital compared with the P electron overlap charge den- 
sity Az (Se Lec-xX2]—-25.eX«) and covalent charge density 
Ax(y!Lez—2x«]H-2LeX«— Sex]? plotted along the z axis from 
Ni?* to F-. Densities are all in the same units, normalized such 
that S exit (r)dr —1; the free ion S value was used (see Table VIT) 
and S-+y was chosen so as to reproduce the experimental (Ref. 11) 
anisotropic F- hyperfine term. 


In Fig. 2 we plot the overlap charge density, giving 
rise to the sum of the Vs“ and Vs“ terms, along a 
metal-ligand axis, and for comparison, the charge 
density associated with an antibonding electron prior 
to the introduction of overlap and covalency [1.e., just 
| es(r)|2]. We see that the antibonding charge density 
has been delocalized, by the addition of the overlap 
terms, in the sense that there has been a buildup of 
charge on the ligand (which is one of six so affected), 
but, there has also been an increase of charge on the 
metal as well. These increases have been compensated 
for by a decrease of charge due to the — 25;[ e;* (x) X;(r) ] 
term, in the region between the ions. In other words, 
one is not dealing with a shift in charge (or spin) simply 
from one ion to another but onto the ions from the 
region between. The often cited picture of overlap 
(and covalent) effects delocalizing charge and spin off 
the metal ion is, therefore, not valid, for we are dealing 
with a process which is more complicated. Whether the 
overlap effect acis as if there is a shift onto, or off, the 
metal ion, depends strongly on the nature of the 
operator whose expectation value is in question. 

Vszr: These are additional overlap potential contribu- 
tions due to the nonorthogonality between ligand 
orbitals. First, ligand-ligand (J—w) overlap enters into 
the evaluation of the second line of Eq. (12) (where we 
bave X2), and elsewhere. We will include the latter 
terms as part of Vsr but will not write them out 
explicitly. It should be noted that the normalization of 
the X;'s as defined in S&S I [ Eq. (2.4) ], is affected by 
this nonorthogonality; however, for the purposes of 
this paper, we assume their normalization covention. 
Secondly, X,— X, nonorthogonality leads to additional 
potential terms, the lowest order ones being of the 


m 


m 
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X, and X,, which leads to terms involving the X,x, 
charge density, have been arbitrarily included in Vg; 
The lowest order term is of the form 


2 


— Pis 


“Xa (ra)drs (14) 


25.8. [x (r2) 


T12 


and is of higher order than the contributions written out 
explicitly in Eqs. (11), (12), and (13), i.e., it is the 
product of two overlap integrals times an overlap 
charge density potential. Finally we come to covalent 
contributions. Vy® (rı), V;,O(rs): These are most 
simply obtained by evaluating the interelectronic term 
Eq. (8) for nonzero y; (with \j=y;+S;), and sub- 
tracting off the overlap terms already accounted for in 
Vs® and Vs@. One obtains linear terms of the form 


4 I—HEn 
nog- Xo df et02— ue) 
j=antibonding T12 

orbitals 
1 Pie : 
& X ow[ete2— eate. 08 
j bonding Tio 
orbitals 


It is immediately apparent that contributions from 
members of an occupied bonding-antibonding pair 
cancel one another, leaving us only with contributions 
coming from unpaired bonding orbitals, i.e., 


00 1=Pie 
VO r) =F p» ays | ej (r) 
j =unpaired bonding 119 


orbitals 


XX; (ro)dr2. (16) 


Such cancellations also occur for the covalent mang 
involving (a nonzero) Yes ina V,P — V,? pair. A similar 
cancellation occurs in second order giving us 


1— Pi: 
2i | ej* (r2) ej (rdv 


T12 


VM M=+ X 


j =unpaired 
bonding 


l= Pig 
ET Í Xj" (r2) X;(rə)dr 1 


712 


1— Pi: 


8) Í x (n) x (rods 
T12 


-Fhigher order terms. (17) 
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The cancellation illustrated in det 
simply from a well-known 
functions: 


ail above follows 
^ property of determinantal 
| Any linear combination of occupied orbitals 
ina Slater determinant leaves unchanged the expecta- 
tion value of an operator—and hence does not affect 
the prediction of any physical observable. 

The first line of Eq. (17) is associated with an obvious 
physical effect, namely, the transfer of charge from 
ligand to metal ions due to covalent mixing. The 
number of electrons so transferred is simply E 


N- 2 a. (18) 
i =unpaired bonding 
orbitals 


Antibonding-bonding pairs do not contribute to this 
transfer because the covalency of one is compensated 
for by that of the other, as is required, to second order, 
by the orthogonality conditions [Eqs. (5)]. The third 
line of Eq. (17) is coupled with V.,9 which is 2y; times 
an overlap potential. The two terms represent a first- 
order (in y;) shift of charge off the ligands into the space 
between ligands and the Ni** ion. It should be noted 
that as the unpaired bonding orbitals are of minority 
spin, the covalent terms involve a shift of minority spin 
onto the metal ion, effectively reducing the net spin 
localized at the Ni” site. These covalent charge density 
terms are plotted for a bonding 2e electron in Fig. 2 
(their negative should be taken when considering spin 
effects) where y was chosen so as to reproduce the ex- 
perimental" anisotropic transferred hyperfine interac- 


tion. This choice causes these covalent effects to be 


greater than the overlap terms for the same e, electrons. 
Such a dominance need not always occur. Note that 
there are also f2, and Ni closed shell overlap charge, but 
not spin, density effects. The overlap and covalent e; 
terms act cooperatively in their contributions to the 
Ni** charge and F- spin densities while making opposing 
contributions to the Ni** spin and F- charge behaviors. 

We have seen a covalent shift of charge associated 
with unpaired bonding electrons and no such shift for 
bonding-antibonding pairs. This fact is of great interest 
to us for reasons beyond the mere matter of estimating a 
potential for it indicates the profoundly different 
nature of the two types of covalent mixing. The mixing 
of a bonding-antibonding pair conserves the charge on 
the metal ion while that of an unpaired bonding electron 
involves the shift of charge from ligands to metal. This 
observation suggests that the computed covalent 
g, i.e., the y; differs for the two cases and is the 


mixin ‘ : : 
or our preoccupation with bonding electron 


reason f 
behavior in this paper. 

In the above discussion of the overlap and covalent 
contributions to the Hamiltonian, h, we deliberately 
neglected to specify exactly which orbitals were so 
involved. As covalent terms only occur in low order for 
unpaired bonding orbitals we need only consider those 
electrons which can enter into bonding with the open 
3d shell; i.e., the 2s, 25, and 1s shells of the ligands. 


A1531 


The 2s and 2p electrons are expected to make small 
contributions, and the 1s electrons insignificant contri- 
butions, to V, and Va. 

We also expect the overlap terms to make a 
more significant contribution to the potential terms 
than do the covalent terms. Now, any shell which 
significantly overlaps another ion shell should be 
included in Vs, Vs®, and/or Vss. Hence, when 
including overlap terms in the potential one must 
inspect terms other than the Ni 3d and ligand 2s 
and 2p electrons, to which consideration is normally 
limited. In particular, one should, at the minimum, 
explore the role of the Ni 3s and 3p shells since their 
overlap with the ligand electrons is only moderately 
smaller than that of the 3d orbitals. 

In the present paper we will follow the Sugano and 
Shulman treatment and mostly limit ourselves to a 
Hamiltonian (denoted by ko) consisting of the first line 
of Eq. (8) alone for two reasons. First, we believe it 
imperative to fully explore the implications of their 
approximations, approximations which at first glance 
seem to yield excellent agreement with experiment, 
before going on to a more complete (and more compli- 
cated) treatment. Secondly, a more complete treatment 
appears to require far greater care in the numerical 
evaluation of matrix elements than does a treatment 
based on ho alone. Greater care is not only necessary in 
the handling of overlap and covalent terms but also in 
the evaluation of the ks matrix elements. Approxima- 
tions which were apparently adequate for the simple 
case must be re-examined before one attempts the more 
detailed treatment; this is due, in part, to the severe 
numerical differencing which occurs in the course of 
evaluating matrix elements. Our reasons for being 
cautious on this matter will become obvious when we 
inspect the results of the present paper. 

As Sugano and Shulman have emphasized, the use of 
ho can be viewed as the first iteration of a self-consistent 
theory for the (Ni— Fs) covalent mixing problem. This 
is so, providing that overlap as well as covalent terms 
are included in a later iteration. As we shall see, the 
results of Sec. VIII suggest that a proper evaluation of 
covalent effects cannot be had without the inclusion of 
the overlap and covalent terms in the Hamiltonian 
[i.e., the second line of Eq. (8) ]. It is our intention to 
investigate the effect of such terms in a future paper. 

In going beyond the use of ho, one is involved in an 
ever increasing commitment to details of the cluster 
model, details which do not reflect the situation in 
the actual crystal. The use of kə and the MO’s as 
defined earlier for the cluster already imply a strong 
commitment. 


IV. UNPAIRED SPIN COVALENCY AND 
TRANSFERRED HYPERFINE EFFECTS 


We now consider the role of covalent mixing in 
transferred hyperfine effects. In doing this, we assure 
that the orthogonality relations of Eq. (5) hold and 
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that orbitals differing in spin may have different 
covalent parameters. 

The isotropic F hyperfine parameter is considered 
to be primarily associated!" with the unpaired 2s 
spin induced by nonorthogonality and covalency on the 
F- site. After a variety of other contributions'®”’" are 
accounted for, the parameter is written as the product 
of the hyperfine constant associated with a single F- 2s 
orbital times the fraction, fs, of induced unpaired 2s 
spin character at the F- site. Making the standard 
assumption of constant covalency for electrons of 
either spin, the value of f, in KNIFE; is 

TN AE AS) (19) 
and arises from the antibonding electrons alone. The 
X accounts for the fact that à is defined in terms of 
molecular orbitals and that we are interested in the 2s 
character induced on a single F- site (via both antibond- 
ing er orbitals). The anisotropic hyperfine interaction 
is complicated by the presence of (1) both p, and pr 
conbributions and (2) a substantial correction for 
classical spin dipolar effects (which must be made 
prior to estimating covalency from experiment). The 
p-electron interaction is entirely spin dipolar (i.e., not 
orbital) and given the experimental covalent hyperfine 
parameter, the fractional difference (f.— fx )between 
induced pe and p, behavior can be inferred. 

The coventional assumption of common covalency 
for either spin leads to f.—0 for KNiF; since all ! 
orbitals are occupied in the ground state. Experiment, 
therefore, would seem to yield information concerning 
the 2s and 25, covalent mixing in the antibonding et 
orbitals from the observed isotropic and antisotropic 
interactions. 

The above discussion assumes that the transferred 
hyperfine interaction is entirely due to the spin density 
associated with a ligand's y orbitals. However, the tails 
of the free ion Ni esa (and neighboring ligand) orbitals 
overlap into the region of a F nucleus and contribute'®”” 
to the spin density and hyperfme interaction via the 
c? and ex terms. These terms are not negligible. 
Unfortunately the use of free ion orbitals is inappro- 
priate for this evaluation since these orbitals do not 
take account of the presence of neighboring ion nuclei. 
We will encounter similar problems, introduced by 
inappropriate orbital tail behavior, when discussing 
orbital reduction effects in Sec. IX. These matters will 
be ignored in the remainder of this section. 

Let us now consider how the picture changes when 
one accounts for all covalent mixing while maintaining 
the necessary orthogonality of bonding and antibonding 
orbitals to second order in S; X; and y;. We will again 
consider the 2s effects as they are the least complicated. 
The implications for anisotropic effects will then be 
immediately apparent. 

"Consider the contributions to f, from the bonding et 
and ej, and antibonding e+ electrons. Writing out the 
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normalization constant in Eq. (17) and making use of 
Eq. (5), the antibonding contribution becomes 


fa^ {SHS yi y) HOCS). (20) 


(We will henceforth use superscripts A and B to denote 
antibonding and bonding, respectively, whenever 
appropriate for clarity.) In the absence of covalent 
bonding and Xs—Xe overlap, the f, contributions are 
4+1 for spin t and —1 for spin |, and hence cancel. 
Covalency affects these f, contributions via (1) the 
normalization of the Yes? orbitals [cf. Eq. (4)] and 
(2) the x, term of Yes”. With differing covalency, these 
contributions need no longer cancel. The individual 
spin contributions are 


fet g= IN HAN ot Yost 


=4{1— 2y Yt H Sse HOSH, (21) 
faj- —3(1—2y4S,—y.44-5,2) 0 (59) 1° (22) 
Combining all these contributions results in 

f, AG 2y4S 7017} +0(S%). (23) 
Similarly, one obtains 
fo=HS2+ 2S cotter} 059). (24) 


In other words, if bonding and antibonding orbitals of 
one spin are simultaneously occupied, there will be an 
S2 overlap contribution, but their covalent terms cancel 
exactly. Hence, it is the covalency of the bonding es}? 
electrons which contribute to the isotropic transferred 
hyperfine interaction in KNiFs, and if there is any 
question as to the constancy of 2s covalent mixing 
within the set of e orbitals, it is the mixing estimated 
for the e, electrons which must be compared with 
experiment. 

The coventional cancellation associated with spin 
pairing has been lost, but the more fundamental pairing 
cancellation between bonding and antibonding orbitals 
has been seen. The immediate effects of this for the 
anisotropic F- hyperfine interaction are (1) that /1 and 
ty electrons cannot contribute, no matter what their 
respective covalencies, since antibonding-bonding pairs 
of both spins are occupied; (2) that there will be S*, but 
no y contributions from the majority spin er? and er' 
electrons; and (3) that covalent contributions come 
solely from unpaired e,2 electrons of minority spin. 
It is clear from the identical form of Eqs. (20) and 
(23) that the usual phenomenological approach; 
which uses the antibonding expression [Eq. (20), 
actually determines! the unpaired bonding covalent 
mixing parameter. 

Complications occur for almost empty d shell ions 
such as Cr?+(t;4)3 which has but three antibonding 
d-like electrons. There are no e4 electrons in the groum 
state and differences in e+? and ej? covalent mixing 
can lead, for example, to an isotropic F- hyperfine 
term which would be zero in the traditional analysis- 
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NICE ae effects have been observed!’ in 
This either implies that the e? covaleneg eee 
very small here or that ee is 
oba Hue mia ue near cancellation 
diffraction ese eet u e has occurred. Neutron 
e electron character i UC adis. guit idus 
in the cluster's spin density. 

Ex o os computed differences in 

y, for this case, come from a different 
source than do the differences between the covalency of 
an occupied Ni?+ bonding-antibonding pair of ‘one 
spin and that of an unpaired bonding orbital of the 
other. The covalent “unpairing” of the Cr* orbitals 
has its source in the unbalanced exchange terms of the 
open 3d shell. By this we do not mean the HF exchange 
polarization of the F- ion by the 3d shell, a matter 
discussed elsewhere.? Instead, we mean the imbalance 
in covalent mixing caused by the imbalance in 3d 
exchange contributions to žo and, in turn, the various 
matrix elements appropriate to the determination of 
the covalent mixing. Although these exchange terms 
also contribute to the Ni’ case, a more fundamental 
role is played by the intrinsic differences in the character 
of the antibonding-bonding pair covalency, on the one 
hand, and that of the unpaired bonding electrons on 
the other. Note that the unbalanced exchange terms 
cause the single determinant description of the Cr?* 
cluster to be an improper eigenfunction, and of the 
Ni? cluster, to be a proper eigenfunction, of the spin 
operator (S?). The Ni?* description preserves its 
symmetry because of the presence of unpaired bonding 
orbitals of but one spin.” 


V. THE CRYSTAL FIELD SPLITTING 


We use the conventional definition of the crystal 
field splitting, namely, 


10 Dg= 6[(74)*(e74)*]— 6[04)*(e^)], 25) 


where the latter is the energy of the ground state and 
the former is the energy of the excited state (hence the 
superscript E to designate this state) obtained by 
replacing the /(xy)14 electron of the ground state by an 
e(32—9?),4 electron. If we make the common, but often 
unjustified," assumption that all electrons but the 
promoted one maintain constant orbital behavior in 
the cluster states, we have 


10 Dg= del- y) t] Eey), — Q9) 


where the es are the eigenvalues obtained with Eq. (6) 
for the one-electron Hamiltonian appropriate to the 


z ^ , 60). 
18 . Shulman and K. Knox, Phys. Rev. Letters 4, 603 (19 
19 A d UN and R. E. Watson, J. Appl. Phys. 34, 1032S 
j; a ublished). -ý ; 
DE eon is subject to Hec tbat Bo E pe 
d from a set of free ion e's and vs having z 
ELR per shell (although different ge and e, radial 


beka nior i R D R. E. Watson, Phys. Rev. 118, 1036 (1960); 
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particular cluster state involved. To second order, 


e^ 7 (pil h| en 28d h| X) — 2p gal h| Xo) 
ar (Seco h| Xo--LS?— v4 Y ei | h! ent y (27) 


which is obtained by multiplying Eq. (6) from the left 
by Y, reexpressing the Ws in terms of e's and x's, 
integrating and keeping all terms to second order. 
The first term of e is simply the diagonal energy and 
includes the classic point charge crystal potential, 
Kleiner's correction? to such a potential, and ligand 
exchange terms; the second term is the first-order 
contribution to the overlap energy of the sort inves- 
tigated by Tanabe and Sugano’; we then have the 
first-order term of the covalent contribution, which 
Sugano and Shulman concluded was important; and 
lastly, the second-order terms in overlap and covalency. 
The expression for e,“ is of the form of Eq. (27) except 
that there are first- and second-order overlap and 
covalent contributions from both the Xg and X, mixing, 
i.e., 


eA=(y.|h| p — L QGorve. 


[LN 


xQun| X)--ES2— v2 Yo, A] 92) . (28) 


hi XJ+LS ty? 


One may obtain alternative expressions” for the e’s, for 
example, by multiplying Eq. (16) by x instead of W, in 
which case e^ is given by 


at= (eh e)— (Sete) ee] A] Xo 
TSGeEY)GIdHR pi. (29) 


Note the omission of the two multiplying the linear 
term.” Unlike Eq. (27), this relation only holds if v is 
an eigenfunction of Eq. (6) (within our e, x subspace), 
in which case Eqs. (27) and (29) difter from one another 
in third and higher order in S and y. This point is of 
some importance for we shall be evaluating Eq. (25) 
[and thus Eq. (27) ] using noncovalent orbitals (y=0). 

We have already seen indications that the computed 
covalent mixing appropriate to a bonding-antibonding 
pair differs from that of an unpaired bonding electron. 
This suggests that the covalent mixing of the paired 
t(xy):? [and unpaired e(x?—5*);7] orbital [s], appear- 
ing in the ground state, might very well undergo 
substantial changes on becoming an unpaired (and 
paired) electron in the excited state. We must then 
inspect the role of bonding orbital crystal field energies. 
The one-electron energy of a bonding ¢ electron is, for 
example, 


e? =(X,| h| X)3-2y«el h| X) (Ql e) 
— QviSety)0un|x)y. (30) 
z2 Their counterpart of the second line [e.g, in S&S MI, 


Eq. (2.11)] shows a (e«|/n]e.) matrix element but we find 
(gelol e) more appropriate for this case. 
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The energy of a bonding-antibonding t orbital pair, 
Qa, in a crystal field is then 


Q= 4 Ftet— (y Aya | V ,Pyr,9) - AW? | W AW?) 
Die —S«qx pn oot (1-52) (God) e) Qu Xj) 
—(eieil XQ (oil eiXi)) 5 (31) 
where 


(V AV p| Vol) 
1 
= | [eme (ri) s) —3 c (r2)W p* (12) . (32) 
Yi? 


The two-electron terms of the first line of (31) account 
for the fact that the interelectronic interaction between 
the pair of electrons has been counted twice when 
summing e? plus «4. An equivalent expression is 
obtainable for the e bonding electron; its derivation is 
but slightly complicated by the simultaneous presence 
of s and c mixing. 


WATSON AND A. J. 
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In Eq. (31) as in Eqs. (16) and (17) we see the effect 
of the antibonding-bonding cancellation. The covalency 
of a bonding-antibonding pair plays no role in the 
cluster's energy!5 and hence in the crystal field splitting. 
Of course these pairs still make diagonal and overlap 
contributions to both the crystal field energy and to 
the Hamiltonian. 

It is clear that only the unpaired bonding electrons 
make covalent contributions to the energies of Eq. (25). 
These total energies (6) can be evaluated by a simple 
summation over one-electron e’s, providing that we 
account for the fact that this procedure introduces each 
interelectronic interaction twice by subtracting inter- 
electronic Coulomb and exchange terms [as we did in 
Eq. (31)]. Here, we wish to concentrate on those es 
which contribute to a nonzero 10 Dg, namely all 
unpaired bonding electrons (which contribute covalent 
terms) and the bonding-antibonding pairs involved in 
the transition. (which make diagonal and overlap 
contributions). The resulting ó's are 


&L(t4)8(e4)2 = exi t egi B+ esp t €s,ut eo Crete 6 EX GV, V) Gr; | Ya) J+ Ee (33) 


i>] 
and 
EL (424) 5(e#4) 3 ]= ype Ce ene an Gaya Gy PEt € ui EP > LU; |v;v)— GA; | Y A) J+ 6c " (34) 
ij 


where 6, is common to both energies and the sums go over all pairs of orbitals represented by the es. The subscripts 
v and z designate the (a2—?) and (37?—7?) orbitals of e symmetry, respectively. The v orbital is involved in the 
crystal field transition (ty — e2*)24) and the « orbitals are the “odd” unpaired bonding orbitals common to 
both crystal field states. Evaluating Eq. (25), rearranging terms and affecting appropriate cancellations, we have, 


10Dq= {(be| hE— V aE — V, 1 7— V, 7|) — (0i h=V st — Von — V sut — Vou" |$)} 


{SPEX | hE Vey? — Waites Vow 5 V Ra Xie) E (0; | hE— V.YE— VE 


Viena 3 Vine | $;)] 


—2S KX; v| hE — pu Veu Vou — 3 E = {S,2[(X, | h— V,.— Vent Veu — Vea? | X) 


(by — Vee? — Vee? — Vist Vea? |óz)1— 29406 | i— View? —Ven¥—Vieut—Voe,ut|\bry} 


ve ( (aut EB X | hë— Vs u — E X) (bay | hE Wot? 


Vou” | b2y) | 


= Dy ays? BL (X. | hE — Veu” — Wo” | Pey) — Xs | hE— Vu — Voa X 


H D (inaha u = Ven — BV 5,07| X9) — Go| =V a= Ve wt 3V 5,07 | $0) ] 


J 


$j-e 
o,j=s 


onde s Vaut— Voru’ —3V 5,07 lpo) —SA%,0| h= atl Va = cl Xi») ]) 


k E aee 9a Xe a) — (hu PB VG au) Deve uv BLOGs [ES Y 3, [Xs 


ias jar 
i-oj-s 


—SAX;,u]h—4V5,u7| bu) ]} E 
i=s, j= 
i=0,j =S 


2 COE E] Xiu) (Pu h*—3 jut? lu) ] 


—2yiu PP [OG |h” jw | Xiu) SXi u| hE—3 jT 2); (35) 
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where cach sum has but two terms (i=s and j=o and vice versa) and where 

ane See 1— Py» (= Ps 
/S= Si? Qu (r2): qs Ee erar [xe (r2) AS xen. |- 28; | gi (ra)— — Xi(roydra , (36) 
112 712 J Ti2 
i E 1— P " 1— Pis 
Vi =+2yi fero xr Ss Ptr nnns | 
` Y12 T12 
1— Pi 1— Piz E 
trl [oco eae [xe nn 7 2) 
T12 Yiz 


and A£ is the one-electron Hamiltonian defined for the excited state. The (| V |) integrals in Eq. (35) are the sole 
remaining interelectronic contributions from Eqs. (33) and (34), all others have cancelled. The first line of Eq. 
(35) consists of the diagonal crystal field terms; the next two lines are antibonding orbital overlap contributions; 
lines 4 and 5 are covalent contributions from the unpaired bonding / and e, orbitals involved in the transition”; 
and the last two summations are covalent terms associated with the odd unpaired e, bonding electrons. Within 
the cluster approximation Eq. (35) is an exact expression, to second order, for 10 Dg subject to the following 
three conditions: 


(1) The Hamiltonian must be self-consistently defined for the set of V's actually used, in which case Eq. (35) 
is correct independent of whether the W’s are, or are not, eigenfunctions of Eq. (6). 

(2) The radial behavior of the y and y atomic orbitals (not necessarily equal to that of free ion H-F functions), 
must be the same for the two crystal field states, otherwise there are additional contributions to 10 Dg. 

(3) It is assumed that the ligand 15 shells do not enter into the bonding. Otherwise, there are additional covalent 
contributions from unpaired bonding Y1,” orbitals. Such contributions are straightforwardly added to Eq. (35). 


As already noted, the last four lines of Eq. (35) arise from the odd unpaired W;,., bonding orbitals common to 
both crystal field states. These and the V;,.7 contributions to the other lines sum to zero if an excited state param- 
eter y;,.%% (and its associated matrix elements), equals its ground state, y:,.%, counterparts. Assuming this 
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equality to hold, Eq. (35) becomes more simply, 


10 Dg— {(eelh®| e)— God] e) HES EOG AE] X) H- Ces A7] Ge) ]— 25.05 A7] eo] 


HS X| h| x? (e 


h®| e)]—2S.0| hE, e) — {SAX | 8] X ))9- God 1] e)]— 25 X a] eo) 


T ( (va EB) (X,| | X.)—(ei | AP | e) ]— 2v, (X. | h*| c) 2yr BS AG RF] X)) 
REN Qa BPLOG | h| Xs)— (eel h| Ge) ]—2v 7 (X, | h| pe) tH 24 PS. X, | | x) 
Ha) Exe x) pelh] e] 209 xol h| Ge) 2v P Sce HL x2) 


The three (|V|) terms have not been written out 
explicitly, in anticipation of the fact that we will 
evaluate this equation in terms of the approximate 
Hamiltonian, /, which does not contain V, and V, 
terms. Hence, the (|V |} corrections of Eqs. (35) and 
(38) need not be made since these interelectronic effects 
were not counted twice in our sum over e's. We will thus 
omit the last line when evaluating Eq. (38) with /o. 


.Without these terms, Eq. (38) would be identical 


with the traditional one given in terms of antibonding 
covalency, provided that (1) the ground-state unpaired 


33 When treating the covalent mixing of three orbitals (e.g., 
the e14, e,,2, e,,8) only two of which (the e;47 and e;12) are 
occupied, there are covalent contributions to a quantity such as 
10Dg. These include, first, the y, and ye terms of Eq. (35) and 
second, nonzero yss terms [see Eqs. (3)-(5) and related text] 
caused by nonzero s—o mixing. The yss contributions enter in the 
third and higher orders in y and S and were therefore not listed in 
Eq. (35). In addition, there are contributions arising from any 
nonzero overlap of closed Ni?* shells with ligands, but these appear 
in third (and higher) order in 10 Dq. 


H Va DEV DEVi] (G8) 


ya and yi? values and their associated matrix 
elements are identical with those of the excited state 
e,,£4 and e,,74 electrons and (2) a similar equality 
holds for the unpaired 5.,* and p,14 contribitions. 
The present calculations indicate that these require- 
ments are not met. 

Before closing this section, we should note that, sub- 
ject to two requirements, theunpaired bonding e electron 
covalent mixing contributing to 10 Dg is exactly that 
which is appropriate to the transferred hyperfine 
interactions. This differs with the antibonding orbital 
picture where ground-state antibonding spin f covalency 
contributes to the hyperfine interaction and excited 
state antibonding spin | covalency contributes to 10 Dg. 
The two requirements are: (1) that the covalent mixing 
of the two ground-state unpaired e? orbitals be identical, 
a requirement fulfilled by symmetry considerations 
alone; and (2) that the “odd” e? orbitals maintain 
constant covalency in the two cluster states, for 
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otherwise we have Eq. (35). It is reasonable to assume 
that this last requirement holds in a lower order theory 
but that, at some point, we must revert to using Eq. (35) 
where these orbitals contribute to 10 Dg. 


VI. THE EVALUATION OF THE COVALENT MIXING 
AND THE ROLE OF SELF-ENERGY TERMS 


We are interested in obtaining the unpaired bonding 
orbital covalency which contributes to 10 Dq and to 
transferred hyperfine effects. Also, as noted in Sec. II, 
independent estimates of the mixing occurring for the 
members of an antibonding-bonding pair supply us 
with a test of the internal consistency of our theory. 
With this factor in mind, we will estimate the cova- 
lency of all the occupied bonding and antibonding or- 
bitals in the ground cluster state and of selected ones 
in the excited state. Let us now consider the process of 
estimating y. 

As already discussed, we wish to solve the H-F 
equations [Eq. (6)] defined?! for the cluster, in a two- 
function space consisting of e, and X; for the / mixing, 
and a three-function space, made up of pe, X, and X, 
for the e mixing, ie., we solve 2X2 and 3X3 secular 
equations, respectively, for our Hamiltonian /. For 

the 3X3 solution, we need the matrix elements (X, | | Xe} 
which were not considered by Sugano and Shulman. 
As these are intimatelv associated with the cluster 
approximation and also include important three- (and 
four-) center integrals and thereby divert us from the 
purpose of this paper, we shall not obtain these in the 
present treatment, deferring this matter, along with 
others which have been brought up, for a future effort 
involving the proper Hamiltonian, /, not fo. We will 
instead explore the consequences of following the 
traditional view of assuming X,— X, mixing and overlap 
to be zero valued (i.e., Yss=Yss=0). In this approxima- 
tion we obtain the following relation for any one of the 
y values appropriate to Eq. (3): 


Sn x)-— Gh Xi) v 8] Xylol] ei) J 
Hre: Al e)— Se^] e)]-0. (39) 


There are two roots to this equation: The |y| «1 root 
is appropriate to the bonding orbital; the other, with 
|y| 21, is appropriate to its orthogonal antibonding 
partner. Upon dropping the quadratic term, we obtain 
the Sugano-Shulman relation for y; namely?» [S&S IIT, 


Eq. (2.13) ], 
(el h|x3— Six h] xs) 


qim ; 
{X;| | Xi)—(eil h| eo 


24 Since the antibonding-bonding pair mixing does not contribute 
to the cluster’s total energy, we cannot properly derive the H-F 
equation appropriate to that mixing for such a derivation must 
follow from application of the variation principle to the total 
energy. One can write a one-electron H-F Hamiltonian [Eq. (7)] 
by inspection and solve for the pair mixing. This we will do but 
we must remember that the solution has a mathematical but not 
a,physical meaning. 
25 Note the typographical error in the sign of the S(x|/|x) term 
in Eq. (2.13) of S&S ITI. 


(40) 
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They also give a relation for Az [S&S III, Eq. (2.12)] 


(eid | X8) Ss ot] eoo 
Aem —— 
(x11 x) — (esl ex) 


(41) 


which we find is closely associated with the inverse of 
the second root of Eq. (39). We will obtain y values 
appropriate to both bonding or antibonding mixing. 
We expect the smaller root of Eq. (39) and the result 
obtained from Eq. (40) to differ negligibly since the y’s 
are small ; this will prove to be the case. 

As discussed in Sec. III, in computing the matrix 
elements appearing in Eqs. (38)-(40), our H-F one- 
electron Hamiltonian, /, will be replaced by the 
approximate Hamiltonian /j. Now, when evaluating 
the matrix element of a one-electron H-F Hamiltonian, 
one often may or may not, as one wishes, include the 
interelectronic Coulomb and exchange potential terms 
due to the electron for which the matrix element is 
being evaluated (in this case V;) because these terms, 
being equal, cancel one another. The inclusion of such 
self-energy terms leads to a common Hamiltonian for 
all electrons in the system, an important and often 
exploited feature of self-consistent H-F theory." If 
by any chance the cancellation does not occur, these 
terms musi be omitted from the Hamiltonian. Such a 
cancellation does not occur for /, since it is not a 
self-consistently defined Hamiltonian, and some am- 
biguity arises concerning the appropriate form of the 
self-energy term to be subtracted. Consider the evalua- 
tion of a matrix element appropriate to an antibonding 
orbital Y;^. The exact self-energy contribution to the 
potential is 


20 1-Pr 
| Va (r2) Yr;4 (ra) drs, (42) 


719 


but in view of the fact that /;o is defined in terms of the 
(zero overlap) ionic (Ni— Fe.) cluster [i.e., the first 
line of Eq. (8)], one would be subtracting out terms 
which did not appear in our Hamiltonian if we sub- 
tracted out Eq. (42). A possible choice, consistent with 
using the interelectronic potential of fo, is to omit the 
Coulomb and exchange potential terms associated 
with the parent 3dq; (i.e., the overlap ionic counter- 
part of Y;4), as was done by Sugano and Shulman in 
the course of estimating antibonding covalency. 
Likewise, the equivalent X; contributions would then 
be omitted when estimating bonding covalency. The 
use of an approximate Hamiltonian has led to this 
choice of an approximate self-energy term. We will 
consider the effect of going to the more exact term after 


26 For example, see A. J. Freeman and R. E. Watson, Treatise on 
Magnetism, edited by G. Rado and H. Suhl (Academic Press Inc., 
New York, to be published). 

?' Providing that we do not constrain spin pairs of bonding or 
antibonding orbitals to have the same covalent mixing. Such a 
requirement would involve us with the nonorthogonality assoc 
iated with the conventional Hartree-Fock theory (Ref. 26). 
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first considering the implications of this (zero-overlap) 
ionic approximation. 

The question arises, as to whether the py bonding and 
antibonding matrix elements, for example, evaluated 
with different self-energy terms will, on insertion into 
Eqs. (40) and (41), yield \,4 and y,” values which 
satisfy the orthogonality relation [cf. Eq. (5)]. A 
failure to satisfy Eq. (5) may arise from approximations 
in our Hamiltonian, in the treatment of the self-energy 
terms or in evaluating matrix elements; but, whatever 
its source, this failure implies a breakdown of theory. 
The matrix elements, for bonding and antibonding 
orbitals, are best compared by writing them for the 
proper ionic Hamiltonian, /t [the first line of Eq. (8) 
minus the appropriate o or x self-energy terms] and 
for a Hamiltonian, /o’, without the self-energy subtrac- 
tion [literally the first line of Eq. (8)]. (Up to this 
point, we have not considered the role of self-energy 
terms in discussing the //; Hamiltonian. We here redefine 
ho to include the above self-energy term so as to conform 
to existing usage.) Upon doing this, we have the 
relations 
(ei| d | X)^—(ei| ho| X^ — (eies] e3X)— (eie eX), 
(e;| ho | X)? — (e; ho] X5 — (XX: Xio)— (XX, | Xie), 
(gil hv! | e)4—(ei|ho| ei) — (eie gigi) — (pigil Vipi), 
(X;| ho’ | xX;)— (X;| hol X) = 0GX;|x;x)— (xx; X;X;. 

(43) 
The integrals on the right-hand side of Eq. (43), 
coming from the Coulomb and exchange self-energy 
terms, obviously cancel, indicating that these matrix 


elements are independent of whether such terms are 
included in the Hamiltonian. However, we also have 


(eil ho’ | e)2— eil hol e27 — (eieil XiX)— Gol eiX) ; 
(x hy | X4— (Xil ho] X24 = (eieil XiX— (eiXi] eX); 

(44) 

and here the cancellation of terms does not occur. 

Noting that the bonding and antibonding matrix 

elements of io’ are, by definition, identical, gives us the 

following relationships between bonding and antibond- 

ing ko behavior for a bonding-antibonding pair: 

Cei o|x:)4 — (e| ho] Xs)? , 

(eil ho| e34— (ei| hol e)? + Ceses| XX) — (eX CX), 

| 8o | x) 9 G6 || X2 — (eieil XXH Xl e2)- 
(45) 


The two-electron integrals are most definitely nonzero.% 


*8 The fact that self-energy terms cause (|/:|) matrix elements 
to differ from their (|Z|)4 counterparts is, in itself, a necessary 
but not sufficient condition for a computed ^ to differ from y”. 

.computation with the exact one-electron Hamiltonian and 
With the exact self-energy terms [Eq. (42)] will have (|A])2 
A(|h|)A for the (x|h\x), xilo) and (e|A| e) matrix elements, 
one will obtain identical y values. In the present case, the dif- 
ferences, in (e|Zo| x) matrix elements will be seen to cause severe 
differences between a y4 and its partner 7. 
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But, what is more important, we will see that the 
([/5] )'s, on insertion into, say, Eq. (39) yield a different 
y than do the (|/o|)#’s. This means that the theory, 
with its present approximations, fails to yield the 
required bonding-antibonding pair orthogonality. We 
will see this failure to be severe. 

Not only does the theory suffer this internal incon- 
sistency but it is difficult to ascertain which of the 
approximations is most at fault. Use of the full Hamil- 
tonian of Eq. (8), the exact self-energy term [ Eq. (42) ], 
and extreme care in the evaluation of matrix elements 
would resolve this problem but would also be beyond 
the scope of this paper. We do wish to inspect one 
feature of the theory here, namely, given a Hamiltonian 
(in this case ho’), what part do the self-energy terms 
play in the behavior of individual matrix elements and, 
in turn, on the resulting estimates of covalency. Such an 
investigation will further indicate the nature of the 
computed covalency and its sensitivity to computa- 
tional details (including assumed covalency) while 
enabling us to avoid the serious numerical problems 
associated with a proper treatment with the proper 
covalent Hamiltonian. We believe this to justify a 
partial handling of the problem. 

Let us define a Hamiltonian, H, which is equal to Aj 
minus self-energy terms defined for the covalent Vs 
in Eq. (42). One may then obtain equations similar to 
(43) or (44) such as 


Cei |Xi4)—(¢:| [o^ 

-—Ni((eieil e)—(eieil eX2—2Nexl ex) 

TAXCexl exit NCGebood MN Gxl ex) 
—A&xelxo) 

——Xl(ex! ex)i-(eelxx))o-t- 007) (46) 
Or 
(oil H|X4 — (eil ho] X4 —Ns((oieil XX) 

—(exlex). (47) 


For the other matrix elements we obtain 


(gil HX) — (eil tol XB+: leigi XX 
—(eXi| eX)}, 
(ei Hl ed (eil hol e)4—X(eieilXioxi) 
—(eXi| eX} , 
eH gx) 0G] X)4+(1-—N 7] 
X(CeieiXX)—(o25|eX)) (48) 
(es Hlg)” = lolh! e39--[1— (V 7)?] 
X(CoieiXX)—(G2|e2X)) , 
(Xil H(X) — GG |o] X) — Y* Gio XX) 
—(eXil eX}, 
where terms or order A5, y?, or higher have been dropped. 
We will report solutions of Eq. (39) utilizing the linear 
covalent contributions to the off-diagonal (e|H|x)4 - 
and (e| H |x)” matrix elements (omitting second and - 
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higher order corrections). We will see the computed 
covalency to be sensitive to the estimates of covalency 
used in evaluating these matrix elements. 

We have so far considered the differences which do, 
but should not, occur in the estimates appropriate to 
members of a bonding-antibonding pair. Let us now 
consider how these do (and should) differ with that 
appropriate to an unpaired bonding orbital. When 
obtaining the (¢;|/| e;) and (¢;|/|Xi) matrix elements 
for either member of a bonding-antibonding pair, the 
Coulomb and exchange contributions of o; to Vm 
[see Eq. (9)] cancel after the manner of the right-hand 
sides of Eqs. (43). The Coulomb repulsions of the 
seven other Ni 3d electrons remain, and the potential 
contributed by the Ni ion to these matrix elements is 
that characteristic of Ni?+. However, no such cancella- 
tion occurs when evaluating the same matrix elements 
for an unpaired bonding orbital because the c; involved 
in the bonding is not one of the eight contributing to 
the Ni potential. Eight 3d electrons make Coulomb 
contributions in this case, causing the potential to be 
characteristic of Nit, not of Ni**. [Such a 3d electron 
Coulomb contribution to (e| e) which is ~1 a.u., 
is to be compared with (1) the denominators of Eq. (40) 
which are as small as 0.25 a.u., (2) the (e| xx) Coulomb 
self-energy terms of Eqs. (44) and (45), which are 
70.3 a.u. and (3) 3d—3d exchange interactions, which 
are the order of 0.1 a.u.] It must be emphasized that 
this difference occurs whether we evaluate matrix 
elements with fo or with the exact self-consistent 
Hamiltonian 7. In other words, such an effect on the 
current results involving /o, is indicative of what 
actually occurs for the exact self-consistent LCAO 
treatment of the cluster. 

From our experience with the potential terms of 
Sec. III, we should have anticipated the differences in 
the Ni potentials appropriate to the various (o|A| ¢) 
and (e|Z|x) matrix elements. The fact that these 
differences occur is a key to the intrinsically different 
nature of the covalency of a bonding-antibonding pair 
on one hand and an unpaired bonding orbital on the 
other. The magnitude (1 a.u.) of the difference in the 
{y|h|¢) matrix elements has observable quantitive 
repercussions on one's theoretical estimates and makes 
it imperative that the pertinent covalent mixing be 
investigated. 


VII. THE MATRIX ELEMENTS 


In this section we evaluate the žo matrix elements 
appropriate to the various bonding and antibonding 
cases of interest, and examine the effects due to the 
net 3d spin on the Ni?* ion. Wherever possible, we will 
rely on the one- and two-electron integrals already 
obtained in S&S III and the same approximations have 
been made when evaluating the V; matrix elements. 
The additional required integrals have been evaluated 
with conventional (or restricted) analytic H-F (hence- 
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TABLE I. The one-electron energies (Ref. 32) of the RHE 3d 
orbitals, for the * (Mr, L, M ,75) state of Ni?*, as defined by 
Eq. (52). In these tables, the appropriate m. value is denoted by 
the arrow (1 and | denote m=} and —1, respectively). Also 
included is the errr value. All quantities are 1n atomic units. 


63d m,-— 1 l is 
my=2 — 1.46199 — 1.39472 
1 — 1.44377 — 1.34007 
0 — 1.41399 — 1.34007 
—1 — 1.44377 — 0.38057 | unoccupied 
= — 1.46199 — 0.38057 J orbitals 


enur = — 1.41254 


forth denoted as RHF) functions for? Ni** and” F-, 
However, no additional three- and four-center integrals 
were obtained for this work although they are required 
in a proper treatment of the theory. The Switendick- 
Corbato IBM 7090 program?! was used for such two- 
center integrals as arose. 


A. (ei|ho| e2 


It is in this matrix element that 3d shell exchange and 
Nit versus Ni?** potential repercussions are most 
important. We are constructing our molecular orbitals 
from free ion H-F orbitals and this suggests that we 
make use of the familiar” one-electron Hartree-Fock 
equation, 

SCxiei(r,o) = €itfi (1,0) 5 (49) 


when evaluating (e;|/to| p). Here p: is a one-electron 
spin orbital (i.e., a function of space and spin), e; is its 
H-F energy eigenvalue, and 3€w; is the one-electron 
H-F Hamiltonian for Ni?*, consisting of kinetic, nuclear 
potential, and interelectronic Coulomb and exchange 
terms. With the self-energy terms included, dCi is 
independent? of which Ni?* orbital it operates on and 
is related to ho’ by 


ho = Fenit V(r) ò (50) 


Since our 3des are RHF functions this suggests that 
we simply have 


loil ho | p= enu (eil V Lle, (51) 


where ernr is the 3d eigenvalue. Unfortunately, this 
equation does not hold, for the RHF Ni?*e's are not 
exact eigenfunctions of Eq. (49). RHF theory requires” 
a single radial function per shell. For an open-shell ion 
this is only obtained by constraints, for there exists no 
single radial function which, when inserted into the 
occupied os, will satisfy Eq. (49) for all electrons in a 
shell. The immediate implications of this can be seen by 
first inspecting Table I where the values? of 


e= (g: xil e:) (52) 


2R, E. Watson, Technical Report No. 12, Solid-State and 
Molecular Theory Group, MIT, 1959 (unpublished). 

2 C. Sonnenschein (unpublished). 

3LA. C. Switendick and F. J. Corbato, Quarterly Progress 
Report No. 34, Solid-State and Molecular Theory Group, MIT, 
October 1959 (unpublished). 

32 See p. 229 of Ref. 29. 
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Tare II. One-clectron energies for the cubic 3d orbitals of 
Ni** as defined by Eq. (52) for the configurations indicated in 
the text. All quantities are in atomic units. 


et = —1.42742 es} = —0.38057 

er, = — 1.35829 €, AP = — 1.35829 

est = —1.47226 e, E = —0.38057 
enm — 1.41254 


for the various occupied and unoccupied 3d orbitals of 
the RHF Ni?* ion are listed. Also included is ernr, 
which was used in S&S III when evaluating Eq. (51) 
for (e;|/to| e). We see that the occupied orbital e,’s 
vary by as much as 0.12 a.u. Energy denominators as 
small as 0.22 a.u. occur in Eqs. (40) and (41), suggesting 
that these variations can observably affect the evalua- 
tion of these equations. One notes that the smaller e;'s 
occur for occupied 3d orbitals of minority spin as one 
would expect since exchange lowers an electron's energy. 
The unoccupied orbital ¢’s lie approximately 1 a.u. 
higher than the others and show (cf. the discussion in 
the preceding section) the Coulomb energy penalty of 
attempting to place a ninth 3d electron on the ion. 
This penalty, which the unpaired bonding orbital 
covalency must pay, greatly influences results. 

One may rigorously use Eq. (51) providing one inserts 
the e, defined by Eq. (52) for the orbital and multi- 
electron Ni state of interest. The es of interest are 
readily obtained with the available?? integrals and are 
listed in Table II. The resulting (¢:|/o| ¢:)’s evaluated 
using Eq. (51), with self-energy terms properly ac- 
counted for and utilizing the S&S III approximate 
values for the (¢;|Vz|¢.)’s are given in Table III. 
Also listed for comparison are the matrix elements as 
obtained by Sugano and Shulman; these are (c;|/to| ei) 
values. 


B. (X;| Rol ei) 


If one assumes Eq. (49) to hold, then these matrix 
elements are obtained by first evaluating 


OG | ho’ | e) Xi] nil e324- 0] Vc] e3 
— euieS ic - Gl Vgl e- (53) 
Tase III. The matrix elements (e|/t| e) for bonding (B), 


antibonding (4), and excited configuration (E) covalent mixing. 
All quantities are in atomic units. 


S&S III 

values 
tt ; 0.1216 (0.1365) £8 —0.1358 
ty ; 0.1907 (0.1365) (i — 0.0667 
ett (sor pe) 0.0604 (0.1202) et? (s) — 0.2058 
E S et? (pz) —0.2178 
ey? 0.1744 (0.1202) eF? (s) —0.0918 
eE (po) — 0.1038 
ey (s) 0.8859 
e,” (pe) 0.8739 
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But since the ¢,’s are not exact eigenfunctions of that 
equation, the (X;/3€x;| ¢;) contribution should not be 
estimated in this way. This contribution has, with one 
approximation, been evaluated explicitly and the results 
appear in Table IV. The one approximation, neces- 
sitated by numerical inaccuracies in the 1s integral, 
consists of omitting the Coulomb potential term due to 


Tanrg IV. Values of /X;13€xil ei) and enues: (with 5; taken 
from S&S III) for ground and excited (Æ) configuration covalent 
mixing. All quantities are in atomic units. 


(X;|S€xi] ei? ente; 

ly —0.10528 —0.1067 
ty —0.10406 —0.1067 
etls) —0.11751 —0.1150 
es (De) —0.15150 —0.1564 
eyF(s) —0.11751 —0.1150 
ey (pe) —0.14605 — 0.1564 
e, (s) — 0.08478 e 

es (po) —0.09572 

4E — 0.0739 


the 1s? shell,and compensating for this with a nuclear po- 
tential term for a nuclear charge two less than that of Ni. 
Otherwise, the Switendick-Corbato program was used 
to obtain all integrals occurring in the (X;|9€xi| e;)'s. 
The ernrS: products are included in Table IV and we 
see that these differ but slightly from the exact values, 
indicating that assuming Eq. (42) has less significant 
repercussions here than it does for (¢|/|¢) matrix 
elements. 

Using the S&S III estimates of the (X| Vr] e 
contributions, the (X;|/ts| ¢;) matrix elements are given 
in Table V. As we see from Eqs. (45), these matrix 


'TanLE V. The matrix elements (%;|/o|¢) for ground and 
excited (£) configuration covalent mixing.* All quantities are 
in atomic units. 


Present 
calculation S&S III 
it — 0.0428 (—0.0443) 
ty —0.0416 
et(s) —0.1185 (—0.1160) 
et (pe) —0.0937 (— 0.0986) 
e,” (s) —0.1185 
e1? (pa) —0.0891 
e, (s) — 0.0858 
es (pa) — 0.0379 
ly? —0.0115 


? These matrix elements are independent of whether we are dealing with 
bonding or antibonding effects (see text). 


elements, unlike (x | /o| x) and (¢/ ko! e), are independent 
of whether we are dealing with bonding or antibonding 
orbitals. The matrix elements are sensitive to whether 
they are for an antibonding-bonding pair or for an 
unpaired bonding orbital [e(s)i, e(po)1, and 45]. We 
see values for the latter to be appreciably smaller in 


magnitude. E 
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(Ch (x; | ho| X) 


The various values of this matrix element are readily 
obtained using the integrals appearing? in Table II of 
S&S III. The results appear in Table VI. Two features 
of the matrix element behavior should be noted. First, 
as expected, unlike the cases of (x| hol e) and (¢| tol e) 
matrix elements, the (x|/to| x) matrix elements appro- 
priate to the unpaired bonding orbitals differ but 
trivially from the paired bonding orbital values. This 
small difference indicates that these matrix elements 
cannot compensate for the shifts, appearing in the 
unpaired bonding («|/to| x) and lol ho| e) values, due to 


'TABLE VI. The matrix elements (X;|1o|X;) for bonding. (B), 
antibonding (4) and excited configuration (Æ) covalent mixing. 
All quantities are in atomic units. 


ht —0.083 nu? +0.1737 
(ties —0.0835 (—0.0835) ty? +0.1739 
etA(s) — 0.9965 e}? (s) —0.7303 
et (Po) —0.1659 et? (po) +0.1123 
ep EA (s) — 0.9959 (—0.9959) e, EP (s) —0.7297 
(EA) — —01608 — (—01628) — ejE»(p)  +0.1154 
eyP (s) — 0.7300 
e}? (po) +0.1162 
fy FB +0.1753 


the Nit potential. Secondly, we see a sign reversal, 
similar to the one in Table II, on going from the anti- 
bonding to the bonding member of a p, or pz pair, which 
is again due to the self-energy terms of Eqs. (45). 
Their effect is to reverse the sign of the denominator of 
Eq. (40) and of the linear term of Eq. (39), a feature 
which will be of great importance to us when discussing 
results. 


D. The Overlap Integrals, S; 


As already noted, the present investigation has 
utilized analytic HF functions in its evaluation of 
integrals. Sugano and Shulman used the same Ni 3d, 
but different F-, orbitals. They used simple two- 
exponential analytic approximations to the F- functions 
of Froese, a choice which was necessitated by the 
extensive nature of the computations. S; values which 


TABLE VII. Comparison of overlap integrals obtained by 
S&S III and in this calculation. 


This 
calculation S&S III 
St 0.07077 0.07557 
Sels) 0.07977 0.08143 
Se(po) 0.10499 0.11071 
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were obtained for the current set of functions, along 
with their S&S III counterparts, are listed in Table VII; 
the differences serve as a measure of the error, due to 
wave function uncertainty, that should be attached to 
such integrals as these. The S; obtained with Son- 


nenschein's F- orbitals will be used in the following 
section. 


E. (|H]) 


From the definitions of Eqs. (47) and (48), we see 
that these matrix elements are functions of covalency 
and as we will want them for varying y and ^ values, we 
list in Table VIII the terms 


Koigi XX) — Cel eX: ], 


which are necessary for evaluating these equations and 
the corresponding terms of Eq. (45) (used in the 
evaluation of Tables III and VI). These were obtained 
from integrals appearing in S&S IL, which only include 
two-center integral contributions. It should be noted 
that by basing these terms on two-center integrals 
alone, we are making ligand x self-energy corrections 
[the first equation of (44) ] which are strictly compatible 
with the V; in our 4o Hamiltonian. Here too, the inclu- 
sion of three-center contributions to these integrals 
may have appreciable quantitative effects on these 
terms. In the calculations which follow, we will keep 
only the linear covalent contributions occurring in the 
(e| H |X;) matrix elements; the other elements will then 
keep their /r values. In view of our decision not to use a 
proper covalent Hamiltonian in this paper, a more 
detailed treatment seems unjustified. 


VIII. RESULTS 


Equations (39) and (40) have been evaluated with the 
ho matrix elements of the preceding section for the 
various bonding and antibonding y’s. The results 
appear in Table IX. Only the y’s, which are less than 
1 in magnitude, have been reported for Eq. (39); as 
stated earlier, the larger roots are redundant. 

An important feature of the results in Table IX 
concerns the internal consistency of a theory relying on 
li. We see that the y:4 and y;?, of a pe or pr anti- 
bonding-bonding pair, differ in sign while those for s 
bonding differ in magnitude. As we have already 
stressed, a y;1 must equal its partner yi? if the ortho- 
gonality requirements of Eqs. (5) are to be realized. 
Any failure to meet these requirements implies à 


Taste VIII. The [(eie;| XiX;) — (e| e%)] values necessary 
for obtaining the covalent contributions to the (|J |) matrix 
elements. All quantities are in atomic units. 


33 Two additional integrals are needed. Their values, in the 
notation of the S&S III Appendix, are: (de, pr*||d6*, px) 
= —0,000255 a.u. (de, pr*\|px-, d5*) =0.000039 a.u. The (x|/tolx) 
integrals are the most seriously affected by the (almost) complete 
omission (Ref. 10) of three and four-center integrals. 

a C. Froese, Proc. Cambridge Phil, Soc. 53, 206 (1957). 
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'TABLE TX. The y: covalent mixing parameters, as estimated with Eqs. (39) and (40), for bonding (B), antibonding (4) and excited 


configuration (Æ) cases, and compared with the S&S IIT values. 


Eq. (40) Eq. (39) S&S III values Eq. (40) 

11^ 0.180 0.171 ty? —0.178 —0. 174 

144 c 0.130 0.127 (0.1727) 1,2 —0.224 —0217 

er^) d 0.037 0.037 ex" (s) 0.114 0.112 

et^ (po) d 0.337 0.298 e1” (pa) — 0.319 —0.287 

eyPA(s) c 0.033 0.033 (0.0313) e” ” (s) 0.094 0.093 

e, FA (pa) c 0.214 0.201 (0.2848) eP” ( pa) —0.462 —0.404 
e” (s) a, b 0.0171 0.0160 
ei (pz) ab 0.0661 0.0654 
no, E 0.0325 0.0322 


* Contribute to 10 Dq. 
b Contribute to transferred hyperfine effects. 
e Contribute to 10 Dq in the traditional antibonding model. 


4 Contribute to the transferred hyperfine effects in the traditional antibonding model. 


breakdown of the theory. The deviations seen in Table 
[X lead to antibonding-bonding orthogonality integrals 
as large as 0.6 instead of the required zero. Thus, the 
severity of this orthogonality breakdown renders any 
quantity estimated with these y’s at best suspect. 

The y; sign reversals of Table IX have been caused 
by the reversal in sign of the [(xl/to|x) — e Us| ¢)] 
term of Eqs. (39)- (41), due to self-energy effects. We 
expect a similar sign reversal in this term if we go toa 
theory utilizing the exact self-consistent Hamiltonian 
(and the exact self-energy expressions). The only way 
in which this exact theory can then yield a common 
y,^ and yp? will be if (e|A|x) reverses sign as well 
(note that here we refer to a matrix element of //, not hy). 
We will shortly see indications that this does indeed 
occur. 

Three other features of the results of Table IX should 
be viewed before we leave them. First, Eq. (40) and 
the more exact (39) give results of similar character 
but occasionally differ by as much as 10%. Secondly, 
the difference in y’s, for pairs of orbitals differing only 
in spin, are not negligible. These do not affect our 
predictions for Ni?* but such effects are of sufficient 
magnitude to observably affect the transferred hyper- 
fine spectrum associated with an ion such as Cr (cf. 
Sec. IV). Finally, we see the unpaired bonding orbital y 
values to be appreciably smaller than their paired orbital 
antibonding counterparts, suggesting that one must not 
assume them equivalent and that one must utilize the 
unpaired bonding results when estimating 10 Dq or 
some other experimental parameter. The large uncer- 
tainty which one must attach to the antibonding 7’s 
(due to the antibonding-bonding pair breakdown) 
makes this comparison tenuous. The results which we 
will now inspect, also indicate substantial differences 
between the y4 and unpaired y? behavior. 

We are not prepared in the present paper to attempt a 
calculation involving the full Hamiltonian, /t, of Eq. (8), 
but we do wish to gain some idea of the sensitivity of 
the predictions to covalency. To do this we will go 
Over to the covalent treatment of the self-energy 
Correction, utilizing the Æ matrix elements of Eqs. (47) 
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and (48) (actually we will only consider the linear 
contributions to the (e| I|x)'s). Computing yrs as a 
function of assumed +’ values (appearing in (e| H xj); 
we obtain the results of Table X. We see that the 
bonding-antibonding pair results are quite sensitive to 
this covalent contribution and that the negative y’s 
turn positive with increasing covalency because of a 
sign reversal in the (e| H |x) elements. Since we have 
not used the proper Hamiltonian, we cannot expect 
complete convergence of the p electron bonding- 
antibonding y: pairs. The table also suggests that, given 
the Hamiltonian, we have underestimated the bonding- 
antibonding covalency if we use the antibonding results 
of Table IX. 

In contrast with the paired bonding behavior, the 
unpaired bonding 7’s of Table X show a tendency of 
(e|H|x) to reverse sign. The ((x| tola) — <el kol e)) 
term does not have a negative sign (as do the paired 
bonding elements), and this causes the trend toward 
negative "s. The tendency towards increased paired 
orbital y's and decreased unpaired bonding y's serves to 
accentuate the different nature of the two types of 
covalent mixing. 


Tate X. The covalent mixing y; as estimated using Eq. (39) 
with (@;|H|X) matrix elements for bonding (B), antibonding 
(A), and excited configuration (E) cases as a function of assumed 
covalency. 


M 


Assumed y? 


0 0.05 0.1 0.2 0.3 OA 0.5 
et A(s) 0.057 0.069 0,081 el E n tee 
et B(s) 0.112 0,033 0.064 ee M 
et Alo) 0.38 e 0.44 0.50 0.55 0.59 0.62 
e(a) | —0.30 —0.23 —0.15  —0.07 0.01; 0.10 
tp 0.24 0.34 0.42 048 0.53 0.58 
t72 —0.17 —0.000 — —0.112  —0.07 0.15 0.22 
m 0.19 0.27 0.33 0.39 O44 0.49 
145 —0.22 —0.12 —0.10 | —0.00, 0.19 6.28 
eLEA(o) 0.27 oo 0.33 0.38 0.43 047 0.50 
ey£B(c) —0.40 z —0.31 —0.20 . —0.08 0.05; 0.16 
ey£4(5) 0.051 0.062 0.073 + en boe oe 
ey FB(s) 0.003 0.073 0.053 see sua 
ea) 0.017 0.009 0.0008 — —0.016 - 
eL 8o) 0.065 0.047 0.0295 —0,007 s 
1,28 0.032 0.015 0.0030 — —0.033 Do eee. 
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Tasir XI. The covalent mixing parameters, Yi, aS estimated 
using Eq. (39) with leil I| X) matrix elements for the unpaired 
QEP, ey P (s), and ej" (a) orbitals for assumed y,° values. 


Input y? Computed yi 


ey? (s) 0.01 0.015, 
ey? (o) 0.05 0.047 
[E^ 0.025 0.0236 


In Table XI we have reevaluated the unpaired 's 
with input (e|Z7|x) covalent contributions suggested 
by Table X. The results show internal consistency and 
are in crude agreement with the noncovalent estimates 
of Table IX. This is largely due to the fact that the 
covalency is small and hence the covalent self-energy 
repercussions are small (but not insignificant). 

So far we have only considered the occupied orbitals 
of the cluster and have not used the alternate approach 
of dealing with the antibonding holes. As stated earlier, 
such an approach is exactly equivalent to the unpaired 
bonding method provided that one maintains various 
restrictions of the present paper (e.g., common radial 
behavior for the ¢’s and x's appropriate to an anti- 
bonding-bonding pair on the one hand and to the 
unpaired bonding orbitals and antibonding holes on 

the other) In a less restricted treatment, the two 
approaches need not be equivalent but one is then 
involved with an unrestricted H-F theory and its 
associated difficulties? Estimates of antibonding hole 
y values involve zo self-energy corrections [e.g., in 
Eqs. (44), (47), and (48) ], and the results must equal 
(to second order) their unpaired bonding counterparts 
in order to maintain orthogonality requirements. Using 
Eq. (40) and the matrix elements and integrals of 
Tables III-VIII, we obtain values of y equal to 0.014;, 
0.048, and 0.024 for the e(s), e(o), and !" orbitals, 
respectively. These are in remarkable agreement with 
the values listed in Table XI. 

While we seem to have converged on “self-consistent” 
unpaired bonding values in Table XT, this by no means 
implies that they are in detailed agreement with what 
will be yielded by the more exact theory. On the other 
hand, they differ strikingly with those bonding-anti- 
bonding results obtained either by Sugano and Shulman 
or in Tables IX and X. This is due to the Nit—Ni’* 
Coulomb cancellation differences and will remain in 
the exact self-consistent field cluster theory. 

In order to facilitate the discussion of 10 Dg and 
transferred hyperfine effects, the y values, appropriate 
to the unpaired bonding and antibonding models, have 
been extracted from Tables IX to XI and listed 
together in Table XII. The S&S III antibonding results 
are included for comparison, as are the y4 values 
suggested by Table X, since these would be appropriate 
to 10 Dg and the transferred hyperfine effects if we had 
adhered to the traditional (but incorrect) antibonding 
model. Different y4 values are reported for 10 Dg and 
for the transferred hyperfine effects as spin | parameters 
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are appropriate to the former and spin f to the latter 
in the antibonding model. Computed 10 Dg or f; values 
based on these y^ will not be reported since this would 
be incorrect. We merely note that the use of these y4’s 
would predict even stronger covalent effects than those 
apprearing in S&S III. 


A. Transferred Hyperfine Effects 


Values of f; were determined by using Eqs. (23) and 
(24) for the various sets of unpaired bonding y?s 
reported (and defined) in Table XII. The separate 
spin 1 and spin | contributions to the f; values are 
listed in Table XIII along with the experimental’! 
values of f, The experimental f, value includes the 
so-called 1s—2s cross-term correction." Under the 
antibonding column, we list the S&S III predictions 
and give their pure overlap contribution separately 
from the covalent terms [cf. Eq. (20) ]. 

The predicted f, values are not in good agreement 
with experiment; the S&S III antibonding prediction is 
in somewhat better, but by no means good, agreement. 
What agreement there exists for the unpaired bonding 
results, arises largely from the spin f electron overlap 
term rather than from the spin | covalent terms. 

The experimental f, value lies between the S&S III 
antibonding prediction and those of the present calcula- 
tions. The theory is again seen to be in poor agreement 
with experiment. Quite aside from the fact that the 
S&S III predictions involve the inappropriate anti- 
bonding ’s, we do not find them in marked numerical 
agreement with experiment. The present results con- 
sistently and appreciably underestimate both f; and 
fo: We will see a similar, and perhaps related, low value 
for 10 Dg. Let's consider that case now. 


B. Crystal Field Splitting Parameter 


Our expression for 10 Dg [ Eq. (38) ] was derived for 
the exact self-consistent Hamiltonian /r, but, as discus- 
sed earlier, we shall use //j matrix elements to estimate 
the diagonal, overlap, and covalent contributions. 
Before proceeding to this task, it is instructive to 


Taste XII. The y values appropriate to the unpaired bonding 
and the antibonding models for 1029 and the transferred hyperfine 
effects as obtained from Tables IX-XI. Included are the S&S IIT 
antibonding estimates. 


Unpaired bonding Antibonding 
A B C D E 
Table Table 
IX IX Tables S&S Table 
Eq. (40) Eq. (39) X, XI TII x 
For transferred hyperfine effects 
y42 0.017, 0.017. 0.015 yət 0.031 0.05-0.08 
Yet” 0.066 0.065 0.047 ort 0.285 0.4 -0.6 
For 10Dg 
yay? 0.017, 0.017. 0.015;  y,4E4^ 0.031 0.05-0.07 
yat” 0.066 0.065 0.047 474 0.285 0.2 -0.5 
YEB 0.0325 0.0322 0.024  y44 0.173 0.2-0.5 
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'"TAnLE XIII. Comparison of unpaired bonding [cf. Eqs. (23) 
and (24)] and antibonding (S&S IIT) transferred hyperfine 
parameters, f: with experiment. The various calculations are 
defined as in Table XII. 


Antibonding 


Unpaired bonding 
A B S&S III 


fs 


a 


Spin ¢ overlap =0.0022 
Spin f covalent =0.0020 


Spin t overlap =0.0021 0.0021 0.0021 
Spin } covalent —0.0010 0.0010 0.0009 


Total fs=0.0031 0.0031 0.0030 Total fs =0,0042 
Experiment =0.0054 


to 
Spin f overlap =0.0037 0.0037 0.0037 


Spin f overlap =0,0041 
Spin 4 covalent =0.0060 0.0059 0.0040 


Spin ¢ covalent =0.0480 


Total f, =0.0097 0.0096 — 0.0081 Total f,=0.0521 
Experiment —0.0378 


compare this estimate with the results appearing in 
S&S IIT. Such a comparison, however, cannot be done 
directly since their equation for 10 Dg [S&S III, Eq. 
(2.11) ] differs with Eq. (38). For example, they have 
an e, overlap term, 


—Sedlh POSSA Pe) (54) 
and a covalent contribution 
—HPMedhix)- Sas ekle). (65) 


Here, the second-order contributions appear in the 
“renormalization” term of their equation. Equations 
(54) and (55) obviously differ with lines 1 and 4 of 
Eq. (38) which are their counterparts here. Prior to 
comparing our results with S&S III, we must then ask 
whether the difiering equations for 10 Dg in any way 
affect our observations. With this in mind, we will first 
evaluate Eq. (38) with the antibonding y’s and matrix 
elements of S&S III. [ Equation (38) is readily converted 
for use in their antibonding model by replacing the 
values of y and the matrix elements of the fifth line 
(for the ¿£2 electron) by the corresponding /4 values 
and similarly replacing the e,” and e,? of the sixth and 
seventh lines by their counterparts e,7F4 and e,FP4, 
respectively.] The results of this calculation are given 
in Table XIV along with the values obtained by using 
Eq. (2.11) of S&S III, with and without second-order 
terms. Sugano and Shulman identified the linear terms, 
alone, as specific overlap or covalent contributions. 
With the inclusion of second-order terms, we see from 
Table XIV that the two equations yield the same fofal 
contribution from a particular electron but differ 
appreciably as to the relative roles of *covalent" and 
"overlap" effects. This implies that one equation or the 
other has incorrectly identified these contributions. 

As already discussed, Eq. (38) was obtained from 
one-electron energy expressions [e.g., Eqs. (27), (28), 
and (30)] by the explicit evaluation of (V|Z|W), a 
process which correctly yields the energy whether or 
not Y is an eigenfunction of 4. The matrix elements of 
the resulting equations were then in no way manip- 
ulated. Equation (38) correctly yields a value of 10 Dg 
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in the limit of zero covalency by the simple expedient of 
setting the y values equal to zero: the terms which 
remain are considered to be diagonal or overlap contri- 
butions; those which are introduced in the process of 
allowing nonzero y’s are considered covalent effects. 
Thus, we conclude, Eq. (38) gives the proper break- 
down of covalent and overlap effects. 

Equation (2.11) of S&S III is based on one-electron 
energy expressions such as Eq. (29) which in turn was 
obtained from 

(olal wy Celi) e, (56) 


a relation which holds only if Y is an eigenfunction of 
the H-F equation [ Eq. (6) ], i.e., only for the one value 
of appropriate to that equation. One can alternatively 
obtain Eq. (29) by using Eq. (10) to remove a linear 
— (q-FS)e|hix) term from Eq. (27). However, in 
doing this, one assumes a specific relation between y, 
S, (e|h| e), Gel lx), and (x| 4| x), namely, that y be 
appropriate to the H-F solution. The resulting expres- 
sion is simpler than Eq. (38), but it yields correct results 
only if the H-F y value is inserted, and we can no longer 
correctly separate the covalent and overlap contribu- 
tions by simple inspection, as we did for Eq. (38). 
From Table XIV we see that in S&S III the role of 
overlap effects was underestimated and that of cova- 
lency overestimated. 

Computed unpaired bonding values for 10 Dg are 
listed in Table XV: Cases A and B (described earlier) 
have been merged since they vield identical results; 
the S&S III antibonding values of Table XIV have 
again been included for comparison. All of these results 
have been obtained by using Eq. (38) and all indicate 


TABLE XIV. The overlap and covalent contributions to 10 Dg 
obtained using Eq. (38) [modified as indicated in the text ] and 
by using Eq. [S&S III, (2.11) ] when evaluated with the matrix 
elements and y4’s of S&S III. All quantities are in cm™. 


S&S III Eq. (2.11) 


Linear plus 
Linear terms second-order 
Eq. (38) only terms 
s contributions 
overlap 2870 2070 2250 
covalent 240 790 855 
total 3110 3105 
bo contributions 
overlap 4680 2400 2720 
covalent 5035 6170 7000 
total 9715 9720 
pr contributions 
overlap E1595 — 130 —905 
covalent —1440 —1680 —2075 
total —2975 — 2980 
Total overlap 6015 3740 4065 
Total covalent 3835 5280 5180 
Total 9850 9020 ` 9848 


35 The total antibonding value for 10Dg differs bv 70 cm! with 
m appesting in S&S III. This occurs because e differ ae 
the matrix element appropriate to the second-order bital 
contribution (see footnote 22). Rae s 


s 
. 


* 
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TABLE XV. The diagonal (as estimated in S&S TI), overlap 
and covalent contributions to 10 Dg evaluated with Eq. (38) 
and the Jo Hamiltonian for the unpaired bonding »y's of sets A-B 
and C, and for the antibonding y's of S&S TI. All quantities 


are in cm™. 


AND A. J. 


S&S III 


Case A-B Case C 
Diagonal —3570 —3570 —3570 
ion EA 3000 3000 2870 
[E 4135 4135 4680 
pis —1410 —1410 —1535 
Total overlap 5725 5725 6015 
Covalent 
Unpaired es}? 105 100 es EA 240 
Unpaired es}? 725 665 Co 5035 
Unpaired 1y28 —170 —160 142 —1440 
Total covalent 660 605 3835 
10 Dg= 2815 2760 6280 


Experimental 10 Dg= 7250 


that overlap effects dominate. The covalent contribu- 
tions play a small role in the present calculations and 
the resulting 10 Dg values are observed to be substan- 
tially smaller than experiment. 

Upon seeing such a disagreement with experiment, 
one might well inquire into the role played by uncertain- 
ties or errors in the y values. From a given set of matrix 
elements (say, the unpaired bonding (|/o|)’s), an un- 
paired bonding y, which is obtained variationally, gives 
the lowest one-electron (and in turn cluster) covalent 
crystal field energy obtainable for the orbital in ques- 
tion. Any deviation in the y eigenvalues, for electrons 
appearing in the cluster ground state, will raise the 
ground-state energy, hence decreasing 10 Dg. Any 
similar deviation for a y? will raise the excited state 
energy, thereby increasing 10 Dq. If we utilize the 
unpaired bonding (|/1o|) integrals, any variation in the 
ye? or Ys”, from the values of Table IX, will, therefore, 
decrease 10 Dg, while a variation in y,7? will increase it. 
Consider the effect of varying Ys”, since it contributes 
the dominant covalent term in Table XV. If we assume 
Eq. (24) to be rigorously appropriate to the experi- 
mental f, value, we can obtain an experimental oe 
which is four times the value appearing in Tables IX. 
and XI. Inserting this into Eq. (38) yields a e, covalent 
term of ~—1100 cm-! which, holding all other contribu- 
tions constant, leads to a 10 Dg of only ~1000 cm™. 
Thus, a mismatch between a y value and a set of matrix 
elements can severely affect a computed 10 Dg. The 
behavior of these matrix elements determines the result ; 
they could be of sufficient accuracy to yield qualitatively 

correct values while being quite unsatisfactory for an 
estimate of 10 Dg. 

As stated, Eq. (38) should be evaluated for the exact 
self-consistent Hamiltonian. In the absence of such 
computations, we can again gain some indication of the 
sensitivity of the results to matrix element behavior 


e 
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by evaluating Eq. (38) for 10 Dg with the bonding 
(¢|H|x)2 matrix elements of Eq. (48). Replacing a 
Ce|Ao|x)? integral by (e| H |)? affects both covalent 
and overlap contributions to 10 Dg. The results of 
Table X indicate that (¢| Hl | x)? is smaller in magnitude 
than its partner (¢|holx)? for small covalent mixing 
(eventually reversing sign for greater covalency), and 
imply, in turn, a decrease in 10 Dg. Using (|H |x)? 
values appropriate to the computed y’s, decreases the 
covalent and overlap contributions by ~ 2000 cm™ and 
yields a 10 Dg value ~800 cm, again increasing the 
numerical disagreement with experiment. 

As was the case for the f;’s, the present calculations 
yield a 10 Dg which is appreciably less than experiment. 
These underestimates may be related; if so, more than 
a simple underestimate of y values has occurred. We 
are involved with the more subtle, and more difficult, 
matter of matrix element behavior. Unfortunately, we 
have little evidence of whether a treatment involving 
the exact one-electron Hamiltonian, will or will not 
improve the numerical agreement with experiment. 
It is important to note that approximations were made! 
in estimating the Vz contributions to the (e| ho x) 
and (x|/to| x) matrix elements, but it is presently not 
obvious what repercussions, if any, these had on the 
results. This will be investigated in the future. Of 
greatest interest, of course, is the question of whether a 
treatment, such as the one outlined above, with the 
exact h, yields reasonable agreement with experiment. 
It need not. 


IX. OTHER PARAMETERS AFFECTED BY COVALENCY 


Other parameters are affected by covalency, including 
the orbital reduction factors appropriate to the inter- 
electronic Slater F* (or Racah B and C) integrals of 
multiplet theory*? and to g shifts and spin-orbit coupl- 
ing,” and the determination of superexchange™ interac- 
tions and neutron form factors.*? 

In the antibonding approach to the expectation 
value of a one-electron operator, Op, one compares 
(v4|Op|w;4) with the free ion (e;|Op|ej? value. 
Quite often, a reduction factor defined by 


k=(W.4|Op|;4)/(e:l Op| e2 (57) 


is introduced to account for observed differences. Such 
a factor is, in principle, straightforward to obtain but is, 
in practice, almost never evaluated because of difficul- 
ties associated with the (x|Op|e) and (x|Oplx) 
matrix elements. The one case where Eq. (57) has been 
evaluated occurs for Stevens’ orbital reduction factor, 
k,314 appropriate to 
GV [Ls |) — Es m psaillm| 225) 5 (58) 
% See, e.g., J. C. Slater, Quantum Theory of Atomic Structure 
(McGraw-Hill Book Company, Inc., New York, 1960), Vol. I. 
© K. W. H. Stevens, Proc. Roy. Soc. (London) A219, 542 


953). 
38 J. Hubbard and W. Marshall (unpublished). 
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where m is some component of the orbital angular 
momentum I. The ex and xx contributions to & can be 
included and & tends to be less than one. 

For other operators it is often assumed that the 
reduction effects are adequately predicted by the 
approximate relation 


(4 |Op|V;4) — NN esa Op| paa) (59) 


where the jV's are the normalization constants of Eq. 
(2). The (¢|Op|x) and (x|Op|x) terms are neglected, 
because they are inconvenient to evaluate and are, 
hopefully, negligible. Detailed inspection of the Stevens’ 
orbital reduction factor suggests that this approxima- 
tion is inadequate. Nevertheless, necessity led Tinkham 
to rely on such a normalization estimate when discussing 
spin-orbit effects. Sugano and Shulman also used this 
approach when dealing? with the experimentally 
observed! reduction of the Racah B and C integrals 
from their free ion values (here, the normalization 
enters in the fourth power because two-electron terms 
are involved). 

In the (multielectron) unpaired bonding approach, 
the simple one-electron picture of the traditional model 
is lost. As discussed earlier, the overlap effects arise 
from the bonding-antibonding pairs and covalent 
contributions come from the unpaired bonding electrons. 
Now, if one ignores the ['e*(x)x(r)] overlap density 
terms, overlap effects increase the charge and spin on 
the Ni?* ion. Covalent mixing also increases the charge 
but decreases the spin (ie., increases minority spin). 
Our picture of reduction effects therefore depends on 
whether we are dealing with a charge or a spin-depend- 
ent effect. 

The complications introduced by the multielectron 
considerations increase the size of the computations, but 
introduce no formal difficulties. When dealing with a 
diagonal one-electron operator matrix element one need 
only sum over paired bonding-antibonding, and un- 
paired bonding contributions. The neutron magnetic 
form factor is an interesting example of this case. For 
an off-diagonal matrix element [e.g., between the ground 
and excited states of Eq. (25) ] we must remember that 
as one changes the occupation of antibonding orbitals, 
the bonding orbitals are also changed by the change in 
covalency [it was this which invalidated Eq. (26) as an 
expression for 10 Dg]. Such variations cause the orbitals 
of one state to lose their orthonormality with respect 
to orbitals of the other. These complications are easily 
managed computationally by using a formalism such as 
Lówdin's?? for dealing with nonorthogonal functions or 
by evaluating the matrix element in terms of the anti- 
bonding holes (s). The difficulties associated with 
(e|Op|x) and (x|Op|x) matrix elements, mentioned 
above, also occur in the present approach. Detailed 
estimates of orbital reduction effects will not be at- 


* P. O. Löwdin, Phys. Rev. 97, 1474, 1490, 1509 (1955). 
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tempted in this paper. Let us briefly consider an ex- 
ample using the incorrect normalization correction to 
make an estimate of a reduction effect. In Sugano and 
Shulman's calculations, the competition between the 
overlap and the (dominant) antibonding covalent contri- 
butions, led to reductions in the Racah B and C integrals 
equal to those actually observed. If one were to consider 
such simple normalization corrections for the unpaired 
bonding results, one finds that the antibonding overlap 
terms lead to a 3 and 4% enhancement of the Racah B 
and C integrals and a negligible covalent contribution 
from the unpaired bonding electrons. 

Similarly, the small magnitude of the unpaired bond- 
ing covalent mixing leads to poor agreement [using, 
say, Eq. (59) ] with the various experimental “k” values. 

On several occasions in this section, we have cited 
difficulties associated with evaluating (x|Op| e) and 
(x|Oplx) integrals. As this matter has implications for 
any more exact theory of crystal field effects, let us 
consider it briefly here. One is frequently interested in 
the highly singular 77? operator appropriate to fine 
structure and hyperfine interactions. Matrix elements of 
this, and other, operators often nearly diverge because 
the LCAO wave functions were not required to have 
the proper singular behavior in the vicinity of each of 
the nuclei present (cf. the previous discussion of trans- 
ferred hyperfine effects). Thus, even if the (e|Op|x) 
and (x| Op | x) integrals are well behaved, their computed 
numerical values could be quite unrealistic. To our 
knowledge this matter has not been investigated and 
correcting for it promises to be a most interesting 
problem. Until this is done, one cannot claim complete 
quantitative understanding of the various orbital 
reduction effects. 


X. CONCLUDING DISCUSSION 


A large number of observations, some major and some 
minor, have been made, concerning the application of 
the LCAO molecular orbital method to crystal field 
theory, in this paper. Perhaps the most important of 
these is the fact that it is the unpaired bonding electrons 
and not the antibonding electrons, which make covalent 
contributions to physical observables. The importance 
of this observation is largely due to the intrinsic (hence, 
quantitative) difference in the nature of the two types 
of covalent mixing (based on diagonalizing the one- 
electron Hamiltonian). In addition, we saw indications 
that a theory based on the incomplete Hamiltonian so, 
is internally inconsistent, hence inadequate. 'The results, 
obtained with /o, were also seen to be in poor agreement 
with experiment, the covalent contributions being signifi- 
cant but by no means dominant. We are not optimistic 
that the same theory, evaluated with the exact H-F 
Hamiltonian, will recapture good agreement with all ex- 
perimental parameters. (These observations are, of 
course, made for the molecular orbital theory. The altér- 


native Heitler-London approach to the crystal field prob- > 
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lem? is being studied by Rimmer and Hubbard.” The 
Heitler-London method introduces covalency through 
configuration mixing and thereby loses the simplicity 
of the single determinant description of the MO 
theory.) 

The fact that the unpaired bonding orbitals make the 
only covalent contributions to an observable, is not 
surprising. It has been recognized for some time that, 
when dealing with Hartree-Fock single determinants, 
any mixing between a pair of occupied orbitals in no 
way affects the expectation value of an operator. 
This immediately implies that the covalency of the 
simultaneously occupied orbitals of a bonding-antibond- 
ing pair, cannot in any way, affect the prediction of 
any observable, for the (Nils) cluster states considered 
here. 

Our understanding of orbital reduction effects and 
of the covalent “delocalization” of charge and spin 
differs in the two approaches. In particular, the 
covalency of a bonding-antibonding pair conserves 
charge on the metal ion while that of the unpaired 
bonding orbital involves a shift of charge from the 
ligands to the metal. Such differences in the nalure of 
the two types of covalent mixing, have quantitative 
repercussions on the estimates of y values (cf. Table 
XII). This is the most immediate quantitative reason 

ior our preoccupation with unpaired bonding effects in 
this paper. As noted, the differences between computed 
paired and unpaired covalent mixing do not destroy the 
required spin symmetry of the cluster (Le., it is an 
eigenfunction of 5?). The unpaired bonding approach 
may be viewed as the proper application of restricted 
H-F theory to the current cluster model for KNIF;. 
Spin symmetry is maintained and what is more, if we 
apply the variation principle to the total energy of the 
cluster (with respect to the covalent mixing param- 
eters), subject to the (RHF) requirement that the two 
types of covalent mixing be identical, one of necessity 
obtains the unpaired bonding (or antibonding hole) 
values. (The discussion for Cr?* has, of course, en- 
compassed UHF theory and the breakdown of spin 
symmetry.) 

We also noted that a theory based on the mixing of 
free ion metal e and ligand y orbitals and based on the 
approximate Hamiltonian, /, appeared unsatisfactory 
(quite aside from the fact that quantitative agreement 
with experiment was not obtained) for we saw serious 
internal inconsistencies in the covalent mixing estimated 
for bonding-antibonding pairs. The effects did not seem 
so severe for unpaired bonding y values, an observation 
which is probably misleading. 


0D. Rimmer and J. Hubbard (unpublished), who use the 
configuration interaction method of Keffer ef al. (Ref. 13). 
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Having failed to produce even fair agreement with 
experiment, the question arises whether the same 
theory, evaluated for the exact self-consistent Hamil- 
tonian A, wil or will not produce agreement with 
experiment. We do not expect the current (free ion V 
and e) cluster model to yield good agreement (say to 
20%) for all the parameters considered in this paper. 
Should such a failure prove to be the case, there will 
again arise the old question of the variation of atomic 
orbital character away from the free ion behavior 
assumed in the present LCAO approach. Such a varia- 
tion can take several forms: first, there is the matter of 
bulk expansion or contraction of the metal?! ion (or 
ligands) ; secondly, there is the question of the extent 
to which 3de, and cs orbitals differ radially”; thirdly, 
there can be variation in orbital behavior from one 
multielectron state to the other; fourthly, one can allow 
different bonding and antibonding y’s to have different 
radial parts for their e and x basis orbitals (involving us 
almost inevitably with a UHF theory); lastly, we have 
to consider the variation of orbitals centered on one 
nucleus when in the immediate vicinity of another, a 
matter important to orbital reduction and hyperfine 
effects. A bulk expansion of the metal e orbitals may 
be expected with the covalent increase of charge 
occuring at the Ni site. This would improve numerical 
agreement with some experiments but it remains to be 
seen if this effect is fundamental to the current disagree- 
ment with experiment. Other questions which arise, 
include the role of correlation effects, one’s ability to 
adequately define Vex, and the justification of the 
cluster model itself. 

Finally, it is imperative that a proper crystal field 

cluster treatment, involving the exact Hamiltonian, be 
done. In this way, a number of the questions of the 
present paper would be resolved. We believe a calcula- 
tion of this sort to currently be attainable with reason- 
able accuracy. 
_ Note added in proof. In a recent publication, E. 
Simánek and Z. Sroubek [Physica Status Solidi 4, 251 
(1964)] have considered unpaired bonding effects for 
the same system. 
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The distribution of magnetic moments in the Fe-Rh system has been investigated by the neutron dif- 
fraction technique in the composition range between 35 and 50 at. Rh. These alloys have chemical order 
of CsCl type; the body-corner positions are occupied by Fel atoms and the body centers by Rh and Fell 
atoms. The magnetic moments in ferromagnetic alloys containing 35, 40, and 48%, Rh are pres =3.1 ug, 
prot =2.5 up, and urnn=1.0 ug at 25°C. The moment of the FeI atom in the 50% Rh alloy, which is anti- 
ferromagnetic at room temperature, increases to 3.3 ur. A detailed study of the unpaired spin density 
distribution in the 48% Rh alloy was made by the polarized beam technique. While the Fel atom has a 
nearly spherical density distribution, that of the Rh atom shows a strong tendency towards e; symmetry. 
The coherent nuclear scattering amplitude of Rh was determined to be 0.585--0.005 X 107? cm. 


I. INTRODUCTION 


N a previous paper! we have reported a systematic 
investigation of the Fe-Rh alloys by the Móssbauer 
technique. Up to 52% Rh these alloys possess a body- 
centered cubic lattice in which there is chemical order 
of CsCl type when the Rh concentration exceeds 20%. 
In the ordered alloys, iron atoms located at the body 
corners are labeled Fel and those in the Rh sites at the 
body centers Fell (see the insert to Fig. 1). The system 
exhibits two interesting magnetic properties. First, 
fairly large moments are observed over a wide concen- 
tration range,^? as shown in Fig. 1. Second, a ferro- 
magnetic-antiferromagnetic transition? occurs in alloys 
containing around 50% Rh. The transition temperature 
increases with increasing Rh concentration, and is 
observed to be 65°C in the 50% Rh alloy. The room 
temperature phase changes from ferromagnetic to 
antiferromagnetic between 49% and 50% Rh. 

The Móssbauer measurements! revealed two distinct 
hyperfine fields in ordered alloys, corresponding to Fel 
and Fell. For example, H;(I) and H,(I1) were found to 
be 277 and 384 kOe, respectively, in an alloy containing 
48% Rh. It was suggested that Fell, which is sur- 
rounded by 8 Fe nearest neighbors, may have a smaller 
magnetic moment than Fel despite its higher hyperfine 
field. The main interest in this alloy system therefore 
lies in the composition dependence of both the moments 
and the hyperfine fields of Fel, Fell, and Rh. 

A neutron diffraction study of the FeRh alloy was 


* Work supported by the auspices of the U. S. Atomic En- 
ergy Commission and the Advance Research Project Agency. 
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first carried out by Bertaut, de Bergevin, and Roult.* 
Using powder data, they reported pre=2.8 yp and 
Un — 0.8 up for the 53% Rh alloy in the ferromagnetic 
region above 60°C. The present work describes neutron 
diffraction measurements of the magnetic moments of 
Fel, Fell, and Rh in ordered alloys, and a detailed 
study of the spin density distribution of Fe and Rh in 
the 48% Rh alloy. A study of the 50% Rh alloy, which 
is antiferromagnetic at room temperature, was also 
made. 


II. EXPERIMENTAL 
A. Samples 


The alloys used in this work are those previously 
investigated by the Móssbauer technique. All alloys 
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Fic. 1. Average atomic moments of Fe-Rh alloys at 300*K. The 
values in the antiferromagnetic phase were obtained by extrapola- 
tion of the data obtained for the high-temperature phase. — 
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'T'ABLE I. Physical properties of F e-Rh alloys. The moment value 
of 50% Rh (*) is obtained by extrapolation from the ferromagnetic 
phase above 65°C. 


Composition 48% Rh 5095 Rh 
Lattice constant 2.989 A 2.986 A 
Curie temperature 500°C 405°C 
Magnetic moment 2.07 un 1.94 up" 

per atom, 300°K 4 
Néel temperature 65°C 


were prepared from high-purity metals by levitation 
melting with subsequent heat treatments as described 
in the previous paper.’ The chemical analysis agreed 
with the nominal composition within 0.5 at.%. Single 
crystals large enough for neutron diffraction study were 
obtained by levitation melting of the 48 and 5095 Rh 
alloys. Efforts to grow single crystals of the 25% Rh 
alloy by the electron beam technique were not success- 
ful. This may be due to the existence of a y phase at 
high temperatures. An attempt to obtain single crystals 
of the 35% Rh alloy by the stress annealing technique 
was also unsuccessful. 

Thus polycrystalline samples containing 35 and 40% 
Rh were used to study the moments of Fel and Fell. 
Detailed studies of the ferromagnetic and antiferro- 
magnetic phases were carried out on the 487% Rh and 
50% Rh single crystals, respectively. The magnetic 
properties of these two alloys are listed in Table I. 

Interpretation of the magnetic diffraction data 
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Fic. 2. The observed magnetic structure factors Fa, of 
Fes;Rhos. Reflections with 4+-k-+-/=2n are shown by solid 
circles, those with /i--&--1—2n-F-1 by open circles. The point at 
sing/A=0 corresponds to the observed bulk moment of 4.14 up. 
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requires a precise knowledge of the chemical order in 
these alloys. Previous Móssbauer measurements, 
coupled with x-ray powder patterns, had revealed 
pronounced order in all the alloys containing more than 
20% Rh. A careful study of the 50% Rh alloy, to be 
described later, showed that ordering is complete. 
Somewhat less accurate measurements on the 35% and 
48% alloys gave a similar result. 


B. Polarized Neutron Technique 


Except for the study of the antiferromagnetic phase 
containing 50% Rh, all the measurements were made 
with the use of the polarized neutron technique, which 
has already been described in detail.” By reversing 
the incident neutron polarization, we measure the 
polarization ratio 


Todes EE Y», 


where y is the ratio of the magnetic to the nuclear struc- 
ture factor (Far/Fx). The observed ratios were cor- 
rected for incomplete polarization of the incident beam 
(0.98) and for incomplete polarization reversal (0.98). 

It is also necessary to make allowance for multiple 
scattering processes within the crystal. It has been 
showr? that multiple scattering can introduce serious 
errors in the measured polarization ratios. In order to 
detect and eliminate this effect, the crystals were 
mounted on a goniometer in a magnetizing field of 
10 kOe, and the polarization ratio was studied as a 
function of the azimuthal angle around the scattering 
vector. 


II. SPIN DENSITY IN FERROMAGNETIC Fe,.:;Rho.is 


Single crystals were cut into the shape of pillars, 
roughly 2X3 mm in cross section and 15 mm in length, 
with their long axes along [110] and [100], respectively. 
In addition, a smaller cylindrical [110] crystal (1-mm 
diam), which was cold worked, was used to check for 
possible extinction effects in strong reflections. Extinc- 
tion effects were also investigated by mis-setting the 
Bragg angle.? Even for undistorted crystals, extinction 
corrections were proved to be negligible except for the 
two strongest reflections. 

Intensity measurements were made out to an angle 
corresponding to sin0/A— 0.85 A“, with a wavelength 
of 1.05 A. The number of reflections totaled 26, and all 
except two, (321) and (421), are contained in the two 
zones. The results are shown in Fig. 2 and Table II. In 
calculating the magnetic structure factor Fa from the 
relationship y=Fm/Fy we used by.=0.951? and 
brn= 0.585 X 107? cm, the latter value being determined 
from the 50% Rh alloy (see Sec. V). The errors quoted 


ER. Nathans, C. G. Shull, G. Shirane, and A. Andresen, Phys. 
Chem. Solids 10, 138 (1959). 
TR. Nathans, Suppl. J. Appl. Phys. 31, 350 (1960). 
i J. Pickart Bi ene Phys. Rev. 123, 1163 COP 
=G. and Y. Yamada, J. Phys. Soc. 7 . 
BL 1 (1992). mada, J. Phys. Soc. Japan 17, Supp. 


^ua? 


= 


ud 


MAGNETIC 


in Table II are the standard deviations of repeated 
measurements. 

In the CsCl type of structure, the structure factors 
Fx are divided into two kinds, 


Fy=pretpen for h+k+l=2n, 
Fu=proe— prn for h+k+l=2n+1, 


where pre and fm, are the magnetic scattering ampli- 
tudes of Fe and Rh, respectively, and are equal to the 
product of the magnetic moment » and the form 
factor f, 


Pre—Hurefre, PRh=HRhfRh. 


Figure 2 clearly shows that these two groups of reflec- 
tions lie on two separate curves which cross each other 
around sin8/A —0.45 A~. This implies that the rhodium 
form factor frn becomes negative at this point, in 
agreement with recent calculations by Watson and 
Freeman.” Thus it falls off much more rapidly as a 


TABLE II. Magnetic structure factors Fw, in units of 107? cm, 
per CsCl type unit cell of Feo.s2 Rho,as. 


hkl F y,(+0.001) hkl F3 (220.002) 
100 0.516 011 0.658 
111 0.410 200 0.400 
120 0.275 211 0.237 
300 0.136 022 0.141 
122 0.144 130 0.090 
311 0.104 222 0.053 
230 0.076 321 0.035 
322 0.039 400 0.052 
140 0.037 d 0.009 
133 0.023 411 0.030 
421 0.016 240 0.012 
500 0.003 233 0.000 
430 0.004 422 — 0.010 


function of the scattering angle than the form factor 
of Fe, which becomes negative only above sin6/A 
=0.80 A3. 

These data were analyzed in two ways. The first is 
the direct comparison of observed and calculated 
structure factors, and the second involves the Fourier 
inversion of the data. We shall describe the second 
approach first, since it gives a more visible picture. 
This process of Fourier inversion was used by Pickart 
and Nathans? for Fe;Al and Shull and Yamada? for Fe. 
Our preliminary analysis was made with a Fourier 
projection on the (110) plane, but a heavy overlap due 
to an extended wave function of Rh made the three- 
dimensional Fourier analysis desirable. 

Following conventional crystallographic notation, 
we have 


p(xyz) — (1/V) £ Fui cos2a(ha-tky+lz) , 


where p is the magnetic spin density and V is the unit 
cell volume. The summation has been carried out on 


uR. E. Watson and A. J. Freeman (unpublished). 
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Fic. 3. Magnetic moment density distribution in the (110) 
plane. The scale is taken to give 100 at the Fe nucleus, which 
corresponds to 6.82 ug/ A?. 


the computer. Figure 3 illustrates a portion of the (110) 
plane intersecting the two atoms in the unit cell. This 
Fourier map clearly illustrates the lobe along the [001] 
axis for Rh, the characteristic of e; symmetry. On the 
other hand, the Fe density distribution corresponds 
more closely to spherical symmetry. Figure 4 illustrates 
the moment density along two principal directions, 
[001] and [111], the latter being directed towards the 
nearest neighbors. This figure shows that the Fe spin 
density also shows some deviation towards e; symmetry, 
in that the density along [001] is slightly higher than 
that along [111] at the equivalent distance. 

It is emphasized that the large difference in the 
angular dependence of the form factors, fre and fen, has 
a significant effect on their Fourier contours. Available 
data within the given angle of sin#/A=0.85 A+ allow 
an excellent resolution of the Rh spin density distribu- 
tion to be made since its form factor crosses zero at 


MAGNETIC MOMENT DENSITY 


Fé 05 LO 1.0 05 Rh 
DISTANCE FROM NUCLEUS (À) 


Fic. 4. Magnetic moment density distribution in the [001] and. 
[111] directions as a function of distance from the nuclei. The 


scale 10 corresponds to 0.68,5/À3. [111] corresponds to the 


nearest-neighbor direction. _ 
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sind/A=0.45 Ao. We may note that the magnetic 
moment density at the Rh nucleus is almost zero as 
would be expected if the magnetic moment arises from 
the presence of unpaired d electrons. On the other hand, 
the densities near the Fe nucleus are considerably 
distorted by the series termination error. 

The magnetic moments of the Fe and Rh atoms were 
estimated in the following manner. Since the density 
distribution shown in Fig. 3 does not show any overlap, 
the integration of the densities around each of the 
atoms, namely > p(n4m«r, gives their magnetic 
moments. This method gives pre=3.1740.10 up. and 
prn=0.97=E0.10 us. These values are subject to errors 
which result from the fact that only a finite number of 
reflections are used in the construction of a Fourier 
map. The effect of this series termination error on 
magnetic density was studied in detail by Pickart." 
From this study we can estimate that in the present 
case the error is not more than a few percent for Fe. 
Another estimate was made directly from the data 
shown in Fig. 2. We draw a smooth curve through the 
observed points, taking into account the asphericity. 
Then for a given observed value of prot prh, that of 
pre—prn can be estimated with reasonable accuracy. 
In this way we can obtain pro and prn uniquely as a 
function of sin0/A. Then pro=urefre were compared 
with the form factor of Fe metal. 

If we now assign a value of 3.14 ug to the Fel moment, 
it may be seen from Fig. 5 that the form factor thus 
obtained is almost identical to that of Fe metal as 


e Fe 314 pp 
9 Rh 1,005} Feosz hoe 


— Fe METAL OBSERVED), 
=--Rh2* CALCULATED Xio? 


FORM FACTOR 


o 0.2 0.4 0.6 0.8 1.0 
sin 8/X (X!) 


Fic. 5. Estimated form factors of Fe and Rh, as described in 
the text. The observed form factor of Fe metal (Ref. . 9) is shown 
by the solid line and the calculated form factor (Ref. 10) of Rh** 
by the broken line. The calculated aspherical terms (74) are shown 
at the top. 


11 S, J. Pickart, Acta Cryst. 16, 174. (1963). 
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determined by Shull and Yamada? over a quite wide 
angular range. This Fe moment, combined with the 
observed bulk moment of 4.14 ug, gives ure 3.14 ug, 
urn= 1.0 us. 

At present, theoretical form factors’ for Rh are 
available only for Rh?* and Rh?*. The observed form 
factor fg, turns out to be in excellent agreement with 
the calculated value for Rh?*. The almost perfect agree- 
ment shown in Fig. 5 is probably fortuitous. In the 
above process, we have neglected the fact that a small 
fraction (4%) of the Rh sites are occupied by Fell 
atoms. If we assume the moment of Fell atoms is 
2.5 up, as will be shown in the next section, the true 
rhodium moment in Feo. ssRho.4s should be 0.94 up, and 
the form factor should be slightly modified. 

The aspherical moment distribution, particularly 
pronounced in Rh, was already apparent in the Fourier 
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Fic. 6. Graphical determination of the fractions of e; character 


for Fe and Rh in Feo.s2Rho.4s. 40% eg corresponds to spherical 
symmetry. 


densities shown in Figs. 3 and 4. A qualitative estimate 
was made by comparison of observed and calculated 
structure factors of three pairs of reflections, shown in 
Fig. 6, and (400) which shows the largest deviation 


from the spherical part. The form factor for e, symmetry 
is given by 


f (eo) = (Jo) HA (33) , 
EERE N BIO) 
(P+R+P)? 


The calculated values!” for (74) are shown in the top 
part of Fig. 5. By a graphical method, the best solution 
for the percentage of e, character in the spin density 


2R, E. Watson and A. J. Freeman, Acta Cryst. 14, 27 (1961). 
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distribution was 
e, (Rh) = 62-43%. 

Spherical symmetry corresponds to 40975 e, and 
60% (s, character. 


found to be e,(Fe)=47+1%, 


IV. MOMENT DISTRIBUTION IN ALLOYS 


Having established the values of the magnetic 
moments in the 48% Rh alloy, it is now of great 
interest to study the composition dependence of these 
moments, and in particular, that of FeII, namely Fe 
atoms on the Rh sites. These features were studied 
with polycrystalline samples of the 35 and 40% Rh 
alloys consisting of rectangular pillars 2X 5318 mm. 
Both the polarized and unpolarized beam techniques 
were employed in these experiments. In the case of 
the former, a correction for depolarization of the 
neutron beam within the polycrystalline samples is 
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Fic. 7. Moment distributions in Fe-Rh alloys 
at room temperature. 


required. With these in an applied field of 11 kOe, this 
correction turned out to be relatively small. The degree 
of chemical order in the 35% Rh sample was checked, 
and the order parameter was found to be at least 0.95. 

The data were collected from a few reflections, 
(110), (200), (211), and (100). From these we can 
determine Fr (4-) = prat prert pr and Fi (—)— pret 
—(pren+prn). These, combined with the observed 
total moment by magnetic measurements, give the 
individual moments uniquely if we assume that the 
form factors of Fe and Rh determined for the 4805 Rh 
alloy are unchanged in the other alloys. 

The results are shown in Fig. 7. Large uncertainties, 
particularly in the moment of Fell, were introduced 
because of poor statistics in the powder diffraction data. 
It is quite clear, however, that uror is definitely smaller 
than ura despite its higher hyperfine field. 
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Fic. 8. Form factor of Fe in the antiferromagnetic phase of 
Feo, soRha,s at room temperature. The assumption that umi = 1.0 ug 
does not introduce any noticeable change in the Fe form factors 
except for the (111) reflection. 


V. ANTIFERROMAGNETIC PHASE IN Fe;.;Rh»;; 


The basic magnetic structure of this phase was 
reported by Bertaut ef al.* and is shown as an insert to 
Fig. 8. This is the so-called G-type structure," in which 
every positive Fe spin is surrounded by six negative Fe 
spins. The magnetic unit cell edge is thus double that 
of the chemical cell. The Rh atom, which is located at 
the body-centered position, is now surrounded by four 
positive and four negative Fe spins. Thus the Rh 
moments may not be ordered. Even if they are ordered 
in the same G-type structure, the magnetic intensity 
is proportional to 


Fay? = (urefre)?+ (unn fun). 


Since the angular dependence of fre is quite different 
from that of fea, it is possible in principle to solve for 
both pre and urr. In practice, however, Fw” is extremely 
insensitive to small changes of urn when gmnfna« 0.1 
HFe, f Fes 

The specimen consisted of a single crystal in the shape 
of a cylinder 1 mm in diameter and 5 mm in length with 
its axis along [110]. The magnetic peaks are indexed 
as hkl all odd by doubling the unit cell. Accurate 
intensity data were collected for both magnetic (%,%,} 
all odd) and nuclear peaks using an unpolarized beam. 
The absorption of the sample was relatively high 
(u=4.2 cm™) because of its large Rh content. A pre- 
liminary comparison of the observed structure factors 
with those calculated from the expressions 

Fai (bye-zbgy)e 2 einen , ^ 


9 E. O. Wollan and W. C. Koehler, Phys. Rev. 100, 545 (1955). ~ 
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Tape III. Nuclear structure factors of Feo,sRho.s, in units 
of 10-2 cm. Statistical errors in Fa» are about 1%. Temperature 
factor B=0.433 A? is included in Fa. 


hkl Fop Feal hkl Foy Feat 
1.479 1.50 100 0.356 0.362 
200 1.46 1.46 111 0.358 0.354 
211 1.43 1.43 300 0.332 0.329 
022 1.39 1.40 122 0.326 0.329 
222 1.32 1.33 311 0.324 0.321 
400 1.28 1.26 322 0.293 0.298 
033 1.25 1.24 
411 1.24 1.24 


a Not included in the analysis because of a probable extinction effect. 


in which? bye=0.951<10- cm and 5g -—0.60x 107 
cm," revealed a systematic deviation opposite to what 
would have been expected in the case of imperfect 
chemical order. Therefore bg, was treated as a param- 
eter to obtain better agreement. Assuming the chemical 
order is perfect, the new value of bra =0.5850.005 
X10 cm, together with a temperature factor 
B=0.433 A?, give the excellent agreement shown in 
Table III. 

Tf chemical order is not complete, or the Rh con- 
tent not 50%, it is clear that dp, would become even 
smaller. The above value of the temperature factor B 
was used to correct the observed magnetic structure 
factors. Magnetic intensities were also corrected for 1/2 
contamination (70.295). Intensity data given in Table 
IV are average values of equivalent reflections. In view 
of the nuclear intensity agreement, it was assumed that 
no extinction correction was needed. 

The magnetic reflections were first analyzed on the 
assumption that urn=0. The Fe moment was deter- 
mined from the (311) and (133) reflections using the 
form factor of metallic Fe. This gave 


Lo 3.30 ug. 


The form factor fre was then calculated for all the 
reflections using this moment value. There appears to 
be a slight systematic deviation of the observed form 


TABLE IV. Magnetic structure factors Far, in units of 10? cm, 
of antiferromagnetic Feo. sRho.s per chemical unit cell. Indices are 
based on the magnetic unit cell (2—5.972 Â). fre is calculated on 
the assumption that uro — 3.30 up and uns —0. 


hkl Fu (0.002) Re 
111 0.594 0.817 
311 0,396 0.545 
133 0.258 0.355 
333 0.178 0.245 
511 0.191 0.263 
533 0.089 0.123 

DR 0.072 0,099 
711 0.078 0.107 


45 C. G. Shull and E. O. Wollan, Solid State Physics, edited by 
¥. Seitz and D. Turnbull (Academic Press Inc., New York, 1956), 
Vol. 2, p. 137. 
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factor from the solid line representing that of metallic 
Fe as shown in Fig. 8. A part of this might be due to a 
systematic error introduced by an inaccurate tempera- 
ture factor correction. (The polarized beam technique 
is free from this error since it is always the ratio, 
F/F'n, which is measured.) 

Some estimate of the Rh moment can be made on the 
basis of the innermost magnetic peak. If we were to 
assume that the Rh moment remains 1.0 ug in the 
antiferromagnetic state, then we obtain the value of 
the Fe form factor indicated by the cross in Fig. 8, 
which falls definitely below the extrapolated curve. 
Hence, our data does suggest that if the aligned moment 
on the Rh atoms in this magnetic state is not zero, it is 
at least lowered. However, the determination of the Fe 
moment is almost unaffected by this uncertainty in 
the Rh moment. 

The asphericity of the Fe density distribution was 
estimated from two pairs of reflections, (511) and (333), 
and (711) and (155). The amount of e, character is 
482-305, in good agreement with the value for Fel in 
the 48% Rh alloy. 


VI. DISCUSSION 
A. Magnetic Moments 


Our results on the 48% and 50% alloys can now be 
compared with the moment values reported by Bertaut, 
de Bergevin, and Roult® on the 53% alloy. Since the 
alloy compositions and temperature of measurements 
are different, only the ratio of the moments in the two 
magnetic phases are of any significance. They have 
reported pre=3.3 ug in the antiferromagnetic phase at 
room temperature and pre=2.84 up and urn=0.80 us 
at 100°C, when the alloy is ferromagnetic. If allowance 
is made for the fact that the Moët form factor!® was used 
in the derivation of the Rh moment, our present result. 
are seen to be in good agreement with these values 
The rhodium form factor determined in the present 
work drops off, as a function of scattering angle, some- 
what more slowly than that of Mo**, as might be ex- 
pected from their respective positions in the 4d series. 

It is interesting to compare the magnetic moment of 
Fe in the antiferromagnetic phase with the Fe moment 
extrapolated from the ferromagnetic region. At room 
temperature, the extrapolated total moment is 3.88 us 
per unit cell (Fig. 1). Assuming that the ratio of ure to 
urh retains the value found in 48% Rh, we obtain a 
value of 3.0us for the Fe moment. The observed 
moment is 10% higher than this value. This tendency 
may also be seen in Fig. 7, in which the moments were 
plotted as a function of composition. 

At the transition from the ferromagnetic to the anti- 
ferromagnetic phase, the unit cell volume decreases? by 


15 M, K. Wilkinson, E. O. Wollan, H. R. Child, and J. W. Cable, 
Phys. Rev. 121, 74 (1961). 
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1% and the hyperfine field at Fe? decreases by 697.1416 
This is another illustration of the fact that, in this 
system, the hyperfine field is not related in any simple 
manner to the magnetic moments. We have already 
shown that Fell in the ferromagnetic phase has a 
smaller moment, despite its considerably higher hyper- 
fine field, than Fel. We believe this is due to a different 
degree of polarization of the conduction electrons, with 
respect to which the hyperfine field is quite sensitive. 
At present there is, however, no direct experimental 
evidence to verify this assumption. 


B. Form Factors 


The form factors of Fe in the rhodium alloys are 
shown in Fig. 9, together with the previous results given 
for metallic Fe by Shull and Yamada? and for Fe;Al by 
Pickart and Nathans.’ The first striking characteristic 
is that the spherical parts of the form factors are very 
similar to each other, although the magnetic moments 
of the Fe atoms in their various surroundings are quite 


TABLE V. Magnetic moments and their asphericities in several Fe 
alloys. 40% eg character corresponds to spherical symmetry. 


Moment (uz) eg (Yo) 
Fe Metal 2.18 0.53 
Fe;Al {Fel 2.18 0.60 
Fell 1.50 0.48 
Feo.s2Rho. 43 Fel 3.14 0.47 
Feo.sRho.s Fel 3.30 0.48 


different, as shown in Table V. However, the asphericity 
of the density distribution differs considerably. 

Let us now compare the spherical part of the observed 
form factor with the calculated part. The most recent 
calculations available are the Hartree-Fock calculations 
with exact exchange polarization treatment by Freeman 
and Watson" for the two configurations 345 and 3d*4s*. 
Somewhat surprisingly, the form factors for these two 
configurations turn out to be almost identical.!® This is 
in striking contrast to previous calculations by the same 
authors! in which they obtained practically identical 
results for 3d94s? and 3d*(Fe?*), but considerably lower 
values for 3d. 

Shull and Yamada compared their observed form 


16 F, E, Obenshain, L. D. Roberts, and H. H. F. Wegener, Bull. 
Am. Phys. Soc. 8, 43 (1963). 

17 A. J. Freeman and R. E. Watson (unpublished), quoted by 
Shull and Yamada (Ref. 9). 

18 Tf the “true” calculated values of 34545? and 3d? are really so 
close to each other, we cannot hope to distinguish the 3d con- 
figuration from the observed form factors. 

19 A. J. Freeman and R. E. Watson, Acta Cryst. 14, 231 (1961). 
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Fic. 9. Iron form factors determined for various alloys. 


factors of Fe to the latest spin-polarized calculations 
and obtained a constant “ratio” between the two, 
namely, that the observed values were 10% higher than 
the calculated values. They attributed the difference 
to the polarization of 4s electrons assuming the calcu- 
lated form factors were correct. Our data for the FeRh 
alloys as well as the previous results for Fe;Al deviate 
from this constant ratio relation at the low-angle 
reflections with sind/A<0.2 A. If we include these 
points for comparison, better over-all agreement is 
obtained with the earlier calculations of Weiss and 
Freeman? We believe that, at present, the proper 
procedure for obtaining information about the conduc- 
tion electron polarization is to depend only on the 
experimental intensity observations and their Fourier 
transform. Our present data for the Fe-Rh alloys are 
not sufficient to draw significant conclusions concerning 
this point. 
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A statistical analysis is made of spin transitions induced by dij ¢ € 
The first-order effect of the dipole operator can be described 


magnetization while exactly conserving energy. 


by a function P(w), which is related to the level broadening observ : ri 
the power spectrum of the dipole operator. ‘I he cross-relaxa- 


lution of these two functions. Wer is calculated explicitly in 


effect leads to a function x (v) which represents 
tion probability Wor (w) is given by the convo 


various approximations, without appeal to moments. For single-spin flips in m T 
e concentration 7. There is a very sharp peak at w=0 with a 


ance width and of the nearest-neighbor dipole energy. 


the magnitude of Won depends linearly on th 


width proportional to the geometric mean of the reson 


I. INTRODUCTION 


S cross relaxation was first proposed’ as an 
explanation of intermediate relaxation times? much 
theoretical and experimental work has amplified and 
confirmed the occurrence of such a relaxation process. 
As a starting point, we summarize the present state of 
the theory. One considers a Hamiltonian 


5C=5Co+-ICaip , (1) 


where 3€, includes Zeeman and crystal energies, and 
Stai is the dipole interaction energy. The cross- 
relaxation transition probability is given by 


W @)= (2/1) Mg) , (2) 


where M is the transition matrix element arising from 
Faim g(w) is a probability function peaked at w=0, 
and fiw is the energy that is exchanged between 3Co 
and 3Cgip. In terms of a representation in which 3€, is 
diagonal, the g(w) embodies the effect of the diagonal 
and semidiagonal parts of 3Caip, which rearrange the 
entire dipole lattice in such a manner as to conserve 
over-all energy. 

Two prescriptions have been used to calculate g(w). 
The “exact” method evaluates moments of g(w) by 
means of the usual trace formulas. ? The “approxi- 
mate” method considers g(w) as the convolution of the 
resonance lines between which cross relaxation occurs. 
For instance, let gi() describe the line shape for ab- 
sorption between states A and B, and let g2(w) de- 
scribe the absorption line shape involving states C 
and D; then the g(w), for a process in which one spin 
flips from A to B while another flips from D to C would 
be given by 


sto | ato eoe (3) 


where œw corresponds to the “leftover” energy, AFEag 
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vole interactions which change the total 


ed in resonance lines. The second-order 


agnetically dilute systems, 


—AEgp. Since gi and gz are quite sharply peaked 
around w=0, g will also be peaked at w= 0 in agreement 
with experiment.‘ 

A number of difficulties have arisen, however, with 
the more detailed quantitative development of the 
theory. Among these we may mention the following: 


(1) The "exact" moment calculations of Kopvillem;? 
Kiel,? and Hirono!” indicate that the shape function 
which seems consistent with these calculations does not 
resemble the shape function which emerges from the 
"approximate" method. For 1- and 2-spin processes, 
Kopvillem predicts a square shape with a width of the 
same order as the resonant frequency, and essentially 
independent of concentration. Similarly, for a 3-spin 
process, Kiel and Hirono obtain leading terms for g(w) 
which are in first-order concentration and frequency- 
independent. Equation (3), on the other hand, predicts 
a sharply peaked resonance function with a width and 
a concentration dependence similar to the absorption 
signal. 

(2) The relationship of experimentally observed fea- 
tures to the predictions of either method seem some- 
what unclear. For a concentrated system like LiF,‘ the 
convolution scheme provides a satisfactory interpre- 
tation for much, but not all, of the data. For dilute 
systems like ruby, the effects are also sharply resonant? 
and, at least for two-spin processes, appear to have a 
concentration-dependent width. On the other hand, 
this width seems to exceed considerably the absorption 
width expected from the convolution scheme. Further- 
more, the intensity of the effect shows a concentration 
dependence unaccounted for by any calculation. 

(3) There are also two conceptual difficulties. One 
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involves the sometimes ambiguous relationship of a 
function to its moments (to be more fully discussed 
in a future publication); the other concerns the precise 
origin and meaning of the shape function g(w). Clari- 
fication of this latter question is one of the main ob- 
jectives of the present work. 


We shall re-examine the physical premises of the 
theory, construct a new mathematical formalism, and 
derive closed analytic expressions for the transition 
probability. While our theory is applicable to general 
cross-relaxation processes, we begin with the simplest 
possible process, in which a single spin flips. 


IL ELEMENTARY PHYSICAL CONSIDERATIONS 


We consider the Hamiltonian of Eq. (1). 
IC= FoF Caio 


Without the interaction 3C4;5, each particle is inde- 
pendent, and its energy has eigenvalues /745, Æp’, etc. 
The eigenvalues of 3€ are then simply sums of the 
single-particle eigenvalues. Because the particles are 
coupled by SCaip, their individual energy is no longer 
a constant of the motion. Nevertheless, as long as 
5Caip is small, it is still convenient to choose the eigen- 
states of 3Co as the zeroth-order eigenstates of 3C. The 
diagonal and off-diagonal elements of Iai are then 
defined with respect to a representation in which 3€, is 
diagonal. In first order, the diagonal elements of 5Caip 
can be thought of as shifting the energy of each par- 
ticle, by an amount A4», from its unperturbed value 
E. The energy shift will be different for each ion, 
because each ion sees a different local field. The ener- 
gies of all the particles in a given Zeeman state will 
form a distribution centered on the Zeeman energy. In 
this sense, one speaks of the dipole interaction as 
“smearing” the Zeeman levels. The energy shift of an 
individual ion will depend not only on the specific 
environment in which the ion finds itself, but also on 
the Zeeman state in which it happens to be, since the 


` diagonal elements of 3Caip are not all equal. 


In higher order, transitions between states can be 
considered as induced by the “rotating” off-diagonal 
dipole matrix elements. Just as each ion sees a different 
diagonal perturbation, so it sees a different transition 
operator. Both are effects of the same local field. 

We wish to calculate the probability that a particle 
will make a nonradiative transition from Zeeman state 
A, with unperturbed energy £.°, to Zeeman state B, 
with unperturbed energy Æg’. Such a transition is 
possible only for a spin which happens to see a dipolar 
configuration such that, by flipping, it automatically 
changes the dipolar energy by the right amount to 
compensate for the change in Zeeman energy. Energy 
conserving transitions between separated Zeeman states 
A and B are possible because (1) the actual energy of 
a spin in state A or B will, in general, deviate from 
Es or Eg; (2) the magnitude of this perturbation 
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Fic. 1. Energy conserving transitions between 
different Zeeman states. 


will be different when the spin finds itself in state A 
and when in state B. 

Figure 1 shows a distribution of states around 
Zeeman energies E4° and Ey. We are interested in 
transitions between points like A and B. Such transi- 
tions are nonradiative, since they fulfill the requirement 


Ean! = Eje f (4) 
where 
piod E+ Eto, (5) 


At the same time, such transitions change the relative 
populations of levels A and B, as well as the total 
magnetization of the system. 

We do not require that each spin flip be accompanied 
by a dynamic rearrangement of the entire lattice to con- 
serve energy, and in this respect our picture seems some- 
what simpler than the one suggested by Bloembergen. 

Stated in the present terms, the crux of the problem 
is to pick out those dipole configurations which meet 
the rather stringent requirement of Eq. (4). For each 
such configuration, we calculate the transition proba- 
bility and then sum the result. The probability of 
finding a suitable dipole configuration can be con- 
sidered as a density function. The total transition 
probability can then be interpreted as the average of 
the transition operator over this density function. 


II. GENERAL EXPRESSION FOR 
CROSS-RELATION PROBABILITY s 


We now derive in integral form a general expression 
for the probability of a dipole-induced single-spin flip. 
For clarity's sake, we somewhat arbitrarily divide the 
discussion into three steps. First, we set up a formalism 
to project the suitable dipole configurations. Then we 
calculate the transition probability, summed over these 
configurations. Finally, we discuss the significance 6f- 
the resulting formula from some alternate viewpoints. 
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The Density Function 


We first calculate the total number of configurations 
that can result from putting V spins randomly into 
M lattice sites. If the average volume per lattice site 
is u then M is simply the total volume in units of v. 
We define the spin concentration 


n=N/M=vN/V. (6) 
The total number of configurations is then 
C-MWVNYUM — N)!. (7) 
By use of Stirling's approximation 
xlx"e-t, (8) 


and after some elementary computations, we obtain, 
for small 2, 


C= (4/N)EM (1—2)]N. (9) 


The factor (1—m) accounts for the fact that we allow 
only one spin per lattice site rather than an arbitrary 
number. Stated in terms of the volume V 


1 si—ny\® 1 si—n\* 
C= ( ) yNa ( ) [om : dry. (10) 
NIN v N!\ v 


Let us consider the simple question: What is the 
probability that a spin in state A will have some 
specified energy E4'?*!? Now 


E4581— Ea- [air , (5) 


Eat"! and E4* are numbers, but Eai? is a function 
of the coordinates of every spin in the system. When 
we count the number of configurations, C (Eat), 
which fulfills the requirement (5), we proceed as before, 
except that we need to introduce the constraint 
[2(x—2) is the Dirac delta function] 


SEAST (4 49-]- E div) ] 
into the volume integrals of Eq. (10): 


1 /1—n. 
cam -— ) ics o -dtn 


NIN v 


Xs E, total — (E 40-]- Ea?) J. (11) 


The fractional number of configurations that meet this 
constraint is evidently 


à 1 
TQ) C my IS «dry 


Kol E, 6t91— (E+E) ]. (12) 


In similar fashion, the condition that a particle in 
state B has an energy Egt"! is expressed by the 
constraint 

&[ Up #!— (En?+ Ein) ] . 
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The condition, Z4'"*l— Ej*^!, is expressed by the 
constraint 
&[r ia tote Ete. 


Finally, to find the total number of "overlapping" 
states, one must integrate over-all values of teta 
and Egt, Thus, the fraction of configurations that 
satisfy our requirements is given by 


1 E 
=—— [ow 9 -dr yô[ Ea! — (Ex -- E dip) ] 
«0 [E 5:0 — (go - Egi») ] 
Xd E, tots1— Ept |f E geld E total $ (13) 


Using the elementary properties of ô functions,” and 


the notation l 
hw= E49— Ep? , (14) 


hAw= Eig» — Ea, (15) 
the expression (13) can be written more simply 


1 
DN [ow - dryó[ 9 — Aw ]. (16) 


Replacing the 6 function by its Fourier transform, we 
have 


1 ib em 
di) [ow : dex | gielado. (17) 
VN 2T J—o 


The Aw in Eq. (17) is a function of all the r's. We 
assume that the contributions of all the particles to 
the energy shift Aw are additive: 


Aw= 2 Ao(r;) (18) 


1 
ae 2 ge"? (r;)—ICy 49iP(r;). (19) 
ae 


The approximation of additivity Eq. (18) involves 
an error of the order of the ratio of the dipolar energy 
to the Zeeman energy, which is small except for very 
low magnetic fields. With Eq. (18), the r integrals 


implied in Eq. (17) are now separable, and we may 
write 


1 [7 
f@)=— Í c (p)e-i"*dp (20) 
: Dip y= 
with 
1 N 
(p)= lim |- Í arceseto | ; (21) 
Nw V 


The function (p) and its Fourier transform f(w) have 
been discussed by many authors.4—7 


13 H. Heitler, Quantum. Theory of Radiation (Oxford University 
Press, New York, 1957), 3rd ed., p. 66 ff. 

1 H. Margenau and J. Watson, Rev. Mod. Phys. 8, 22 (1936). 

15 H. Margenau, Phys. Rev. 48, 755 (1935). 

16 P. W. Anderson, Phys. Rev. 76, 647 (1949). 

11 P, W. Anderson, Phys. Rev. 82, 342 (1951). 
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We are thus able to calculate, in terms of known 
functions, the fraction of configurations for which an 
energy conserving transition between two Zeeman states 
is possible. This fraction is a function of the energy 
separation fiw of the Zeeman states. We have referred 
to it as a density function because it gives the density 
of available configurations as a function of c. 


The Transition Operator 


The introduction of the transition operator is 
straightforward. This operator is (>> M;), where M; 
is the off-diagonal dipole matrix element connecting 
Zeeman states 4 and B, and involving the ith spin 
and the reference spin we are considering. The M; are 
functions of the coordinate variables. Consequently, 
the transition matrix element is different for each 
allowed configuration of spins. This means that in the 
process of picking out suitable configurations from the 
ensemble, we weight each configuration separately with 
its own transition operator. In other words, the transi- 
tion matrix element goes inside the coordinate integrals 
of Eq. (13). Analogous to Eq. (16), we now obtain for 
the actual transition probability 


2r 
W(e)-2— aos dri:-:drw[ + M?(r)) f(e— ^e). (22) 
À V? i 
We note that 


(X My-Y M2. (23) 


The cross terms vanish in the understood integration 
over time because of the different time dependence of 
each M ;. 

Each term in the sum over 7 gives an identical 
integral, so that 


2m n [? F 

Wes | LOBo) iedo, (24) 
2. 9 Jo 

where 


1 
x6)—— J dM), (25) 


T 


and ®(p) is defined in Eq. (21). Actually, the JV in 
Eq. (21) is here replaced by N— 1, but this makes no 
difference whatsoever since 


1 
lim È Í dremeo |i. (26) 
yo V 


We remark parenthetically that our discussion is 
applicable rather more simply to resonance lines. The 
differences are the following: Our constraint A£tetsl— 0 
is replaced by the constraint AE!etsl— A EZeemas, which 
merely shifts the center frequency. The transition 
operator is changed to S,*, without any coordinate 
dependence, which eliminates x(e) in Eq. (24). In 
particular, (p) remains the same. Thus, d(p) is 
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essentially the Fourier transform of the resonance line 
shape. The function ®(p) is therefore accessible to us 
through experiment, even if we were not able to cal- 
culate it. 

Equation (24) is our general formula for the single- 
spin cross-relaxation probability, as a function of the 
Zeeman level separation, hw. 


Alternate Interpretations 


Equation (24) can be given a variety of physical 
interpretations. First of all, the function x(p) is closely 
related to the autocorrelation function of the transition 
operator, summed over the lattice. The autocorrela- 
tion of a function F(t) is defined as 


C(p)= (F(t+p)F*(t)) time average (21) 
If F isan operator, then in the Heisenberg representation 
F (+p) = exp (i3¢p/h) F(t) exp(—i3Cp/h). (28) 


When F and 3€ refer to the off-diagonal and diagonal 
parts of 3C4'?, we obtain 


M (t-- p) 9 efe?* M (t) (29) 


C(p)—( 


Summing C(p) over the lattice yields our x(p). We 
have defined, in Eq. (27), an unnormalized autocorre- 
lation function, which is not usual. In point of fact, 
our x(p) is unnormalized. It accounts for the magnitude 
of the transition matrix element as well. Indeed 


5 ; 2m n 
Í W (w)da=— — fiw. (31) 


Av 


and 


M (t) Bee verno n E (30) 


This can be seen immediately by evaluating the Fourier 
transform of W (w), that is, x(p)®(p), at p—0. 

If x(p) is the autocorrelation function of the transi- 
tion operator, then its Fourier transform is the power 
spectrum of this operator. The Fourier transform of 
the function $, as we have seen, is essentially the 
resonance line shape. The transition probability is thus 
given by the scaled power spectrum of the transition 
operator convoluted with the spectrum of the broadened 
levels. If the transition operator were a 6 function in 
w space, then we would surely expect that energy could 
not be conserved in a single-spin flip unless the Zeeman 
level separation were of the order of the dipolar broad- 
ening. This assumption is implicit in the calculation 
of the cross-relaxation probability according to the 
scheme represented in Eq. (3). Conversely, if the 
levels were sharp, they would see a resonant compo- 
nent of the interaction mechanism only when the 
level separation fell within the Fourier spectrum of 


this mechanism. This latter consideration, in fact, pro- — 


vided the basis for the calculation of spin-lattice Tc- 
laxation in Bloembergen, Purcell, and Pound’s classic 


tG 
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nuclear relaxation paper.’ In our case, the levels as 
well as the transition operator have a spectrum, and 
the resulting width is obtained by convolution. 

Alternatively, one can interpret Eq. (24) simply as 
an application of the “Golden Rule”: 


W= (2r/ħ) |5Ctransition | (E) s 


We must be somewhat careful in our interpretation of 
this formula. Any particular single spin, in flipping 
from Zeeman level 44 to level B, does not have available 
a continuum of states, but exactly one well-defined, 
albeit perturbed state. This state is either degenerate 
with the initial state or it is not; accordingly, p() is 
either 1 or 0. We must then sum these Kronecker-6 
probabilities over the discrete manifold of N spin 
states. If we replace the discrete manifold by a con- 
tinuum of levels, and the Kronecker 5 by a Dirac 6, 
we are immediately led to the equations that we have 
already derived. 

In a third interpretation, we could regard p(E) in 
Eq. (32) as referring to the lattice as a whole. It would 
then be a true density function. Given two Zeeman 
states separated in unperturbed energy by AE, and 
broadened into a continuous set of states by the dipole 
interaction, then p would represent the average oc- 
cupancy of that subset of states for which A Fitotal — (). 
Thus, p(.E) would correspond to our f(») in Eq. (21). 
When averaging the transition matrix element over the 
distribution p, one must evidently integrate over all 
the random variables involved. In the present case 
these include the space coordinates. This is precisely 
what we have done. This interpretation is implicit in 
the spin-spin line shape formula first given by Waller,” 
and quoted in Ref. 18. We note that in Ref. 1, how- 
ever, the spatial average of the matrix element and 
the level density function are calculated separately. 

We could indeed have written down Eqs. (24) and 
(25) almost immediately by invoking any of these 
three considerations. Our more pedestrian presentation, 
while perhaps less elegant, has been aimed at giving 
as clear a justification as possible for the mathematical 
expression of the theory. 

Finally, we remark that the concentration depend- 
ence of the shape of W (w) resides entirely in 9, not in 
X. The level broadening described by © depends linearly 
on the concentration for low concentrations and is 
proportional to the square root of the concentration 
for high concentrations, if the broadening is due to the 

dipole field. This, of course, is well known. The power 
spectrum of the transition operator, described by x 
depends only upon the spatial integrals of the relevant 
coordinate functions. The relative occupation proba- 
bilities of the lattice sites are clearly not affected by 
the concentration; hence, the functional form of these 


(32) 


tN: TTE E. M. Purcell, and R. V. Pound, Phys. 
Rev. 73, 679 (1947). 
19 T, Waller, Z. Physik 79, 370 (1932). 
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integrals is not affected. Of course, the concentration 
enters also as a scale factor in Eq. (24) governing the 
absolute magnitude of W (c). 


IV. EVALUATION OF THE INTEGRALS 


Our next task is to evaluate the function x and its 
Fourier transform. The transition probability itself will 
be proportional to the convolution of this transform 
with the transform of ®. 


Definition of Ao and M 


As a preliminary, we must define the functions 
represented by Aw(r) and M(r) in Eqs. (24) and (25). 
In Bloembergen's tabulation of dipole matrix elements, 
his A and B functions correspond to our Aw(r) and his 
C, D, E, and F functions correspond to our M (r). Ab- 
breviating his notation, we write 


Aw(r)= q(3 cos'0— 1)/5, (33) 
M(r)-g(sin0cos0)/^, if Am==1, (34) 
M (x)= g(sin?0)/r? , if Am=-2. (35) 


Here g and g contain appropriate diagonal and off- 
diagonal spin-matrix elements. These expressions are 
valid only for two special cases: (1) There is no crystal 
field, or, (2) if there is a crystal field, the magnetic 
axis coincides with the crystal axis. In the more gen- 
eral case, when the magnetic field makes an arbitrary 
angle with the crystal axis, the expressions are gen- 
eralized as follows: 


2 1 
Aw(r)= ?, GP x= —f(8) j (36) 
k=0 f 
i 2 1 
M(r)=— È 8: P2.=—m(0). (37) 
yi k=0 05 


Here Ps», are the Legendre polynomials P20, P21, P22, 
and the qx and g; now represent exceedingly compli- 
cated functions of the spin-matrix elements, the mag- 
netic field, and the crystal field parameters. But even 
in the most general case, Aw and M resolve into a 
product of an angular function and rz. 


Evaluation of x 


Instead of evaluating x(o) directly, we consider first 


( aul more general integral than the one in Eq. 
25): 


(38) 


! o — mn? (0) 
I(pn)= | dcosd| rdr elias (@)I73) 
—1 TO. n us 


so that 
x(p)=I (0,2). (39) 


The lower limit of the r integral is the nearest-neighbor 
distance ro. This distance can never be zero since two 
particles cannot occupy the same spot. 
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To perform the integration over r, we put 
a pf(0), (40) 


t= —ia/r. (41) 
We obtain 


3 


: m? (0) —ialro 
d cos9 —————— [ edi. (42) 
0 


gis =i 3(—ia)". 


The ¢ integral will be recognized as defining the in- 
complete y function” so that 


! m0) Ny(n—1, —ia/r$)) 
I(pn)—]| d eos ) ; (43) 
ET 


3rg QD (— ia/rd)) n—l 
l m (0)Ny (1, —ia/rg) : 
x(p)= d eos J—— ; (44) 
=1 Sr —iafrò 


After a slight manipulation of the expansion”? 


o (— ) natn 2 
Ayes M (45) 
n=2 ni(a+n) 


the result of Eq. (44) may be written 


1 m? (B) /1— etr 1r 
x(p)— d eo X) . (46) 
ci 3r gc ipf (0)/r 
'The result (46) could, of course, be obtained directly 
from (38), but we shall need the general expression 
(43) later on. 

We must still do the angular integral and then take 
the Fourier transform. These operations cannot be 
performed analytically for the most general angular 
functions (6) and f(0) defined in Eqs. (36) and (37). 
We shall, therefore, consider a number of special cases, 
amenable to calculation, from which the significant 
features of the result will become apparent. 

Let us assume, as is invariably the case for a transi- 
tion probability W (w) that is symmetric in œw, that the 
diagonal perturbations Aw and — Ac occur in pairs, 
the perturbation of either sign being associated with 
the same off-diagonal M. Under this assumption, we 
shall make, in turn, a sequence of approximations: 


Case a. We replace both angular functions in Eq. (46) 
by some constant average value, (m°) and (f). 

Case b. We replace m?(0) by some average value; 
replace f(8) by a function defined only by the property 
that, as cosÓ goes from —1 to +1, the function as- 
sumes with equal likelihood all values from —L to L. 

Case c. We replace m(@) by g(sin@ cos0) and f(8) by 
q(3 cos?@—1). These functions can be considered as 
Specializations of Eqs. (36) and (37) by setting q1— q2 


? A. Erdely, F. Oberhettinger, W. Magnus, and F. G. Tricomi, 
Higher Transcendental Functions (McGraw-Hill Book Company, 
Inc. New York, 1954), Chap. 6. 
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= go=2=0. They are the physically correct functions 
when the magnetic field is parallel to the ¢ axis. 

Case d. Finally, we consider the possibility that Ac 
does not occur in pairs, so that W(w) will not be 
symmetric. We shall evaluate the integrals with the 
physically exact functions m(@)=g(sin9 cos) and 
f (8) q(3 cos?*0— 1). 


The actual computations are quite involved. They 
are outlined in the Appendix. Here we only present 
and discuss the results. 

It is convenient to define the quantity 


(ic 1/rj . (47) 


Then expressions like ge, Le, (fje all represent, at least 
to order of magnitude, the effective nearest-neighbor 
dipole energy. 


Case a 
An (m?) w w 
xlo) =— due) (21)], (48) 
arde f) ef) elf) 
wy(x)=0, x«0 l 
(49) 
=1, x0. 


Equation (48) represents a square shape of range 
—&f)«e«(f)e and magnitude 4z(m/)/3rj e f). The 
area is 8(n?))/3rj. In ruby, the nearest-neighbor dipole 
energy in the absence of exchange would be about 2.3 
kMc/sec, so that this result is rather reminiscent of 
Kopvillem's.* 

What happens to this disconcerting square shape, 
once we assume almost any kind of nonconstant angu- 
lar function f(8), is intuitively apparent. As we inte- 
grate over 0 we shall be summing different square 
waves corresponding to different values of f. These 
rectangles are all centered at the origin and their 
height is inversely proportional to their extent along 
the e axis. The result must inevitably be a function 
with a maximum at the origin and a falling mono- 
tonically to zero in the wings. If the range of /(@) 
includes the value zero, which it does in all practical 
cases, the resulting function must approach infinity at 
the origin. The area of x(w) is always equal to x(p),.0, 
and inspection of Eq. (46) shows that this area still 
remains 8z72/3rj, regardless of any assumption about 
mand f. Consequently, the convolution of this singular 
function with the bell-shaped transform of (p) will 
always give a finite, more or less bell-shaped result. We 
already know, then, from these very general con- 
siderations, that the cross-relaxation probability W (w) 
must have a resonant shape. 
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This function has a logarithmic singularity at the 
origin and an area of 8m(»?)/3r$, as expected. The 
“half-power” width is, of course, undefined. If we define 
a half-width 1/2 by the relation 


@1/2 [7] 
| xmi. xs, (51) 
0 0 j 
then wy2 turns out to be 
w= Le/5.4&2L e/ 2e. (52) 
Case c 
If |e| «qe, 
Arg v3—1 
x2) — 5 — E+ In 
27v3rege v3-+ 1 


Ee 


Econ, er) 
[1— (v/qe) |/?— [1+ (o/g ]2— 
(53a) 
If ge« |o] <2ge, 
drg’ v3—1 1 ONG 
1) eM t "BH (1+ qe ) 


1 «| /qe) 2 
[1+ (jo|/ge)] = ay 
[1—(lo|/ge) ]^— 


3 lel 
Š <{ 5——— |H-In 
ge 


Li 
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Fic. 2. x» [ Eq. co] and 
xc LEq. (53) ]. 


If |w|>2ge, 
x(w)=0. (53c) 


In Fig. 2 we display this function, together with the 
Se logarithmic function of Eq. (50). In Eq. (50) 
we have chosen L=2 and (m?)= (sin?0 cos0)— 2/15, 
and in Eq. (53) we have put g=q=1, so that the 
parameters match in both expressions. With this choice 
of numbers, the logarithmic function behaves like 
In|o|-999/* near the origin, whereas the more com- 
plicated function behaves like In |c|-99859/«. The exact 
function has the sharper singularity. The areas, as 
well as all qualitative features are the same. 


Case d 
If w< —qe, 
x(@)=0. (54a) 
If —geco«0, 
Amg? 1 [^] 1/2 [^] 
ers ente) 
27N8rggel | 3Nqe qe 
1 un 
ame e Ns |l. (54b) 
[14- @/ge) ] ^ — 
Ti 0 o«2ge, 
Ang? v3—1 1/o 1/2 
Oa Sin = ( +1) 
2iN3roege V8+1 3Nqe 
1+ up 
x(5-s e |. (54c) 
qe [1+ (o/qe) ] ^— 


GENERAL 


If w> 2qe, 


x(w)=0. (54d) 
We remark that this function is not symmetric in w. 
The asymmetry is important primarily in the wings. 
x(w) and x(—w) assume identical form as w approaches 
zero. The associated function @(w) behaves similarly. 
It seems likely, therefore, that any asymmetry arising 
from the present considerations would be accessible 
only to measurements made under conditions where 
large amounts of energy are transferred to or from the 
dipole system. 

Qualitatively there is little difference between the 
symmetric case c and the nonsymmetric case d, just 
as little difference results from using the approximate 
f and m of case b instead of the f and m of case c, 
which represent the dipole interaction exactly provided 
the Zeeman and crystal field operators commute. In 
all cases there is a logarithmic singularity at the 
origin, and an over-all width determined by the effec- 
tive nearest neighbor dipole energy. It is clear that 
these characteristics of x can be expected to hold even 
when the most general form of f and m allowed by 
Eqs. (36) and (37) is physically appropriate. 


Convolution 


As indicated by Eq. (24), the function we seek, 
W (w), is proportional to the convolution of (c) and 
x(e). 

The widths of two convoluted functions, generally 
speaking, add in some fashion. Two Lorentz curves 
add width algebraically, two Gaussians add width rms 
wise. In particular, in the limit that the width of one 
function goes to zero, the resultant width approaches 
the width of the other function. A 6 function simply 
leaves its partner intact. Our function x(w) is an ex- 
ception to this rule. Its peculiar properties under con- 
volution are of considerable importance in determining 
the shape of W (w). 

As far as the effect on the resultant width is con- 
cerned, the exact shape of $ does not make much 
difference, particularly when 4 is either very much 
broader or very much narrower than x. If we use the 
simplified form of x(w) given in Eq. (50), and simplify 
®(w) by a square shape of unit area and half-length a, 
we can do the convolution analytically. 

If a> Le, it turns out that the resulting half-width 
is always identically a. Thus, when 4 is broader than 
X, x adds no width at all, but merely smoothes out 
the discontinuity of the square shape. In this case x 
is practically equivalent to a à function. The more 
interesting case is when a« Le. Then the convolution 
gives: 
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If 0< |u| <(Le—a), 


Cr 2 pera | Le | 
moz) m 
4LLe Le\a l 


If (Le—a) € |w| < (Leta), 


ou i [e| 
W(o)= —|-+— ( -—Xn 
4La Le a i 


If |w| > (Leta), 


Le 
a —|+1)] z (55b) 
a— jæ] | 


W(w)=0. (55c) 

2r n 8r (m?) 

C-—- : (55d) 
Av 3ré 


From Eq. (55a), the quantity wi», the half-width of 
the resultant W (w), can easily be calculated. We define 


N=o4;/¢. (56) 
Then Eq. (55a) can be written 
C Le 
W (0) -—— 1m | 3 (57) 
2Le a 


W (Na)=— n— ——— 
2Le D (N+1)a 


(V—1) Le 
— Iud ] . (58) 
2 (N —1)a 


If the x width Le/2e is much larger than a, w1/2 will be 
likewise much larger than a, and N will be large. If we 
expand the logarithms to first power in 1/.V, Eq. (58) 
can be reduced to 


T (wW e 


W (Na)=ln(Le/Na). (59) 
If we now solve W (Na) 2 3W (0), we obtain 
Na=o1/2=[2a(Le/2e) ]^*. (60) 


This is a remarkable result. As the $ width a goes to 
zero, w1/2 of W does not approach w2 of x, Le/2e, as one 
would intuitively expect, but rather it approaches zero. 
This strange property of multiplying widths rather than 
adding them turns out to be very nearly true even 
when ® and x have comparable widths, as can be seen 
from Fig. 3. We have done the convolution of the 
physically real x of Case c, using square waves, Gaus- 
sians, and Lorentzians for d. The convolutions were 


done numerically on the IBM 7090, and the behavior - za 


of the half-widths is illustrated in Fig. 3. 


The shape of the resulting function is also notes 


worthy. When ® and x have comparable widths, the 
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a/Le 


Fic. 3. Half-width (ws) of convolution of P with Xe, versus 
half-width (a) of &. For b=square wave, «1/5799; P= Gaussian, 
wja~a?™, p= Lorentzian, ei/27a955. 


shape is nearly that of $. As P becomes narrower than 
x, the shape of their convolution somewhat resembles 
an upside-down funnel. These changes in the shape of 
W (w) are illustrated in Fig. 4. [The extreme changes 
in shape have necessitated drawing Fig. 4(a) on a 
different scale than Figs. 4(b) and 4(c), as indicated. ] 

The significance of the present result is that it is 
the key to the concentration dependence of the width 
of W(w). As we have already remarked, this depend- 
ence is contained entirely in $. Our x has a width of 
the order of the effective nearest-neighbor interaction. 
In general, one would expect x to be considerably 
broader than 4. If x added width, in convolution, like 
normal functions, one would predict a concentration- 
dependent width only for very high concentrations. 
The unusual behavior of x in this regard predicts a 
dependence proportional to the square root of the con- 
centration, even at vanishing concentrations. Further- 
more, x introduces the multiplicative factor (Le/6e)'?, 
which makes the widths substantially larger than the 
resonance width a. This convolution property of x 
thus enables us to meet all the experimentally given 
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requirements: to have W(w) much narrower than 
either the resonant frequency or the maximum dipole 
energy, but much broader than the absorption width, 
and at the same time concentration-dependent. 

Finally, it is clear from Figs. 2 and 3 that all the 
essential features of our results hold whether we use 
the x of Eq. (50) or the more complicated one of Eq. 
(53). Thus, it is unnecessary to evaluate x explicitly 
for general Aw and M [Eqs. (36) and (37)]. The 
relevant parameters are the average of M?, the range 
of f, which we have called L, and the value of ro. These 
parameters can, in principle, be readily calculated for 
any physical situation, and they will define x in terms 
of Eq. (50). 


V. SUMMARY 


We have laid a physical foundation for a general 
theory of cross relaxation, and we have applied our 
considerations particularly to single-spin transitions. 
Our formulation makes it unnecessary to appeal to 
moments or to plausibility arguments; rather, it yields 
the relevant functions exactly, within the limits of the 
physical assumptions. We review the principal steps of 
the argument. 


(1) For each spin, the dipole interaction causes both 
diagonal, “static,” energy shifts, and transitions be- 
tween Zeeman levels. 

(2) Due to the energy shifts, transitions between 
Zeeman states are possible which strictly conserve 
energy. 

(3) The transition probability depends on the den- 
sity of degenerate states. This density is obtained by 
counting the number of dipole configurations which 
meet the relevant energy constraint. The transition 
probability is given by integrating over this distribu- 
tion the square of the off-diagonal dipole elements. 
We note a certain analogy to the method of the micro- 
canonical ensemble, where the trick also is to calculate 


(c) 


o2 03 [X] 0.2 0.3 
w/Le ie 


Fic. 4. Convolution of xe and Gaussian d, for various values of w2 (Gaussian) /a/2(Xc). 


(a) exis (G) [eus (xe) =5. (b) eus (G) [eis (x) 7 0:2. (c) wirs (G)/ mus c9) — 0.008. 
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the phase volume (number of configurations) corre- 
sponding to a given energy. 

(4) We showed that the equations derived in this 
manner can also be derived from the Fourier analysis 
of the dipole operator and from any of several inter- 
pretations of first-order time-dependent perturbation 
theory. 

(5) For single flips, the transition probability W (o) 
was given by the convolution of two functions, x(w) 
and (w). x is essentially the power spectrum of the 
off-diagonal elements of 33», To an excellent approxi- 
mation, it depends on w as In | 1/o|. P describes essen- 
tially the level broadening due to the diagonal elements 
of ctir, It is equivalent to the absorption line shape. 
The convolution of $ and x, which yields W (w), has 
a sharp peak whose width is proportional to the geo- 
metric mean of the x width and the d width. This 
sharp peak is superposed on a broad spectrum whose 
width is of the same order as the nearest-neighbor 
dipole energy. The transition probability per spin is 
linear in the concentration n, and for low concentra- 
tions its width in w space is proportional to the square 
root of the concentration. 


We have considered only single-spin flips explicitly, 
while experimental measurements are made on double, 
triple, or higher multiple processes. Nevertheless, the 
frequency and concentration dependence of our cross- 
relaxation probability satisfies a number of experi- 
mental features which have not been accounted for 
previously in terms of a single unified calculation. 

The explicit extension of the theory to multiple-spin 
processes is the principal subject of the succeeding 
paper (part II); detailed quantitative application of 
the entire theory to typical experiments is made in 
part III. 


APPENDIX: EVALUATION OF x UNDER 
VARIOUS APPROXIMATIONS 


We refer to Eq. (46). If we assume that the diagonal 
perturbations Aw and —Aw come in pairs, then Eq. 
(46) becomes 


1 m? (8) g-iP/ Oe girs Ge 
x(o) =| d ses JC ———) (Ala) 
—1 arg —ipf(8)e 


1 2m? (0) \si 0)e 
=| d cos a Pp e } (A1b) 
= are pf(8)e 


where e is defined in Eq. (47). 
Case q. If we set m?(8)— (m?) and f(8)—(f), then 
Eq. (A1b) reduces to 


x (p)  (An/3r$) sine f)o/«(f)o)- (A2) 
The Fourier transform of (sinx)/x is well known. It 
*1 A. Erdely, F. Oberhettinger, W. Magnus, and F. G. Tricomi, 


does of Integral Transforms (McGraw-Hill Book Company, 
nc., New York, 1954), Vol. I, Chaps. 1 and 2. 
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is a square wave of amplitude + extending from — 1 
to 1. Hence, we arrive at the Eq. (48) for x(w), given 
in the text. 

Case b. If we replace f by a function which goes 
from —L to L as cos goes from —1 to 1, Eq. (A1) 


becomes 
L 2(m?)Nsine fp 
x(p)= i df (—) - (43) 
at 3Lr$/ efp 


The integral will be recognized as essentially the sine- 
integral function (Ref. 20, p. 145), and so 


_ Am?) Si(Lep) 
x(p)— : (A4) 
3rë Lep 


The Fourier transform of Si(x)/x is also well known,” 
and so we arrive at Eq. (50) in the text. 

Case c. If we set f—q(3cos0—1) and m= g sin’? 
X cos?0, Eq. (A1) becomes 


g / sin? cosh 1 
m 
3rgeq J 1 3 cos'9—1/ N— ip 
X texp[— ipeq(3 cos'8—1)] 
—exp[Zpeq(3 cos'9—1) ]). (A5) 


The angular integrals in Eq. (A5) cannot be done 
analytically as they stand. Instead, we shall avail our- 
selves of the well-known?! theorem that if 


| flo)e-**dp=F(w), (A6) 
then 


ie) ~isedp= | - Fours (A7) 


—1p — 


We use the following ploy: First we take the Fourier 
transform of Eq. (A5) ignoring the —ip in the denomi- 
nator. Then we do the angular integral. Finally, we 
integrate with respect to w as indicated by Eqs. (A6) 
and (A7). 

The first step immediately yields 


Arg? [(! sin? cos? 
| d os (I) 
Sreq J o 3 cos0—1 
X (G[o'-- eq(3 cos'8— 1) ] 
—é[«'—«q(3 cos'9—1)]). (A8) 


Since the integrand is symmetric in cos#, we have 
written the integral over half the interval and multi- 
plied by 2. 

We now perform the angular integral in Eq. (A8., 
Integrals of the form in Eq. (A8) are handled by the 


* 


* 
^ 


if —2eq<w'<eg. If w lies outside these limits, the 
result is 0. 

We have completed the angular integral. The ex- 
pression (A13) corresponds to F(w’) in Eq. (A7). It 
remains to integrate over w. The main point is that 
the integral over w, from — to o, must be con- 
sidered separately for three cases: 


(1) w<—2eq: The integrand is always 0 and the 
integral vanishes. 

(2) —2eq<w<eq: The integrand is given by Eq. 
(A13) and the limits are —2eq to w. 

(3) w>eq: The integrand is given by Eq. (A13) and 
the limits are —2eq to eg. 


The integration is straightforward. We quote only 
the results 


w<—2eq: 0; 
—2eq«o« eq: 


Ang? 1 wN 1/2 m 
HEES 
2irjeg- 3 eq eq 
1— (w/e) ]?—1 v3—1 
— _ a | v3-- 


In—— |; (A14b 
U= (o/e) 2 +1 xe TM 


(A142) 
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Finally, combining the results of Eqs. (A14) and 
(A15), we arrive at the function given in Eq. (53) in 
the text. 

Case d. If the Aw’s do not occur in pairs, we revert 
to Eq. (46) in the text rather than to Eq. (A1). We 
note that the only difference between Eq. (46) and 
Eq. (A1) is that in Eq. (46) there is a 1 in place of one 
of the exponentials in Eq. (A1). The computation is 
therefore essentially unaltered from Case c, the same 
basic arguments applying step for step. The result of 
the first step [Eq. (58)] will now have one of its ô 
functions replaced by 6(w’). The angular integration 
for that portion of Eq. (A8) which now contains 5(w’) 
offers no subtleties. The final integration over o 
merely replaces the 6(w’) by a unit step at w=0. The 
result for this portion of the integral is 


w<0: 0; (A162) 


90: 


g v3—1 
—— peri | , (A16b) 
27V3req v3-+1 


By combining Eqs. (A15) and (A16), we arrive at the 
x(w) given by Eq. (54) in the text, 


HEN ë 
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following general procedure : 
b F(b) 
l G(a)dLP (9) dx — i G(x) (QE /du)- 6 (P)dF 
a F (a) i 
G 3 
al if F(x) =0 at some point in the interval aSwSb 
(dF /da) p (yo (A9) | 
=0, if F (4) 40 at every point in the interval a€x& b. "og 
| 
Using the first 6 function in Eq. (A8), the corre- w> eq: ies MAPA aed 
spondence with Eq. (A9) is E ijr : | (A140) 
drg? sin? cos' (A10) 2Tró'eq ST xx 
= = E £ . . 
mu 3réeg 3 cos'0—1 These are the results for the first à function in Eq. 
T (A8). The computation for the second 8 function is 
F (cos) =u! + ej (3 cos0—0, (IER carried out analogously. The result is 
dF * 
= 6eq cosh. (A12) w<—eg: 0; (A15a) 
d cos 
—eq<w<2eq: 
From Eq. (A11) we have F=0= > cos0= (eq—w'/3«q)'”, : o 
and after some algebra, 4rg [1 wif w 
ae sd ee 
G 27V3rhegl3 eq eq 
F [1+ @/eq) ]^—1 
(dF/d cos6) F=0 Ing? wN 12 2eq +1n ik (A15b) 
= ——— (0-2) (4) . (A13) [1+ (/eq) ] ^1 
27N8rg eq? eq w w> 2eq: 
Ang? v3—1 
This result is valid if [w’-++eq(3 cos'0— 1) ]— 0 at some = T$ ls ps | ; (A15c) 
point on the interval 0 cos 1, or in other words, 2iv3reeq vV3-+1 
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The theory of part I is extended to multiple spin processes. The lowest order two-spin process yields a 
cross-relaxation probability Wer whose principal characteristics are the same as those of Wer for single 
spin flips. Higher order two-spin processes occur with much lower probability and this probability is not 
sharply peaked at harmonic coincidence. Three-spin and higher multiple K-spin processes yield functions 
Wcr(w) whose width becomes progressively larger and less dependent on the concentration, and whose 
magnitude depends on the concentration n as n. The effects of short-range forces (exchange) are easily 
embodied in the theory, but in the absence of such forces special attention must be paid to the dipole fields 
of near neighbors, which are then likely to dominate the cross-relaxation process. 


I. INTRODUCTION 


N part I! we presented a general formalism to de- 

scribe the effect of the dipolar interaction on the 
otherwise uncoupled spins in a lattice. We applied our 
considerations particularly to nonradiative single-spin 
transitions. In the extension of the theory to higher 
order processes, we presuppose the entire machinery of 
part I. 

In generalizing the previous results to more complex 
processes, we follow these steps: (1) generalize the sta- 
tistical procedure developed in part I, (2) discuss two- 
spin processes in detail, (3) consider triple and higher 
multiple processes, and (4) consider subsidiary problems 
arising from near-neighbor interactions and inhomo- 
geneous broadening. 


II. STATISTICAL PROCEDURE 


The matrix element used for single-spin flips was 
>; M2. This element corresponded to an interaction 
in which spin one interacts with all the other spins in 
such a manner that spin one flips and all the others stay 
put. The spin variables of Mı; have a form like S. 1S;*. 

Now let us consider a double flip. By a double flip 
we mean a process in which spin one flips down and 
any one of the others flips up. What is the appropriate 
statistical procedure and what matrix element do we 
embed in this procedure? 

The statistics breaks up into two parts. First, we con- 
sider the interaction between spin one and some fixed 
spin p. The matrix element has the form M;5(ri5) with 
spin variables typically of the form .S-1S,». With this 
one-? combination fixed, we count the possible con- 
figurations of all the remaining spins. Secondly, we sum 
Mi, over all possible interactions. When one and p 
interact in a double flip, their interaction with one 
another is evidently on a different footing from their 
interaction with all the other spins. 

A closely related question concerns the indistin- 
guishability of the spins. Two spins are distinguishable 


* This work was supported in part by the U. S. Army Signal 
Corps. under Contract DA-36-039-sc-89169. 
s J. C. Grant, Phys. Rev. 134, A1554 (1964), part I, preceding 
er. 
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if any of the following conditions obtain: (1) They 
belong to a different species; (2) they occupy different 
Zeeman states; (3) they occupy different positions in 
the lattice. Conditions (1) and (2) are obvious. The 
implication of condition (3) is that we do not get a new 
configuration if we interchange spin q for spin p, but we 
do get a new configuration if we move spin p (or spin q) 
to a different lattice site. 

Suppose that the distinguishability conditions (1) 
and (2) give N, spins of kind 1, V2 of kind 2, and so 
forth, with a total of JV spins. Suppose we fix the posi- 
tions of A spins: A; of kind 1, 45 of kind 2, and so forth. 
We now distribute the remaining N— A spins randomly 
into the remaining M— 4 lattice sites. The number of 
configurations is clearly 


(M— A)! 
C(A)= sann ee) 
(Nı— A1) N 2— A 2) fees (M—N)! 


By Stirling's approximation, this becomes 


(M— A ) M—Ag-N+A 
C(A)=— : — m0] 
CN 1—À 1) (N a— À 2) (M — N Ja 


Now 


lim srs. (M—AY)f-4 — limus Ma 


X- 4/M))4- Mc, (3) 
so that (putting n= N/M) 


C(A)= [M *-4/(N1— A 1) !(N5— A 2)! $ -] 
X[c*/—2)*-N]. (4) 


We notice that the second bracket of Eq. (4) is inde- 
pendent of A and will always cancel out when we com- 
pute a fractional number of configurations. For a two- 
spin process involving spin 1 and spin p, we have 
A,;=A2=1, A=2, and the fractional number of con- 


figurations is - 
CQ) NNa A 
CO) vs | fodr3° - dro dro; * dry, (3). 
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where 
v= V/M (6) 


and V has been replaced by fdr in the numerator. 

To obtain the transition probability, we need to 
introduce the transition operator and the appropriate 
energy constraint. The transition operator, as we have 
seen, is Mi! (Tıp) summed over all ri; The energy con- 
straint is discussed in the next section. The transition 
probability may be symbolically written 


x [ attdeusir 9 dri, padri, pk -drin 


Xenergy constraint. (7) 


We have introduced a factor 1/v to keep the units and 
dimensionality consistent in passing from the lattice 
sum for Map to an integral. Also, the reference spin 1 
has been chosen as origin of coordinates. 

For a single-spin process, C(1)/C(0) will have one 
N in front, and one more volume integral, so that W 
reduces to the expression 22 of part I. 

The extension of this counting scheme to processes 
involving an arbitrary number of spins is obvious, and 
some examples will be given below. 


Ill. TWO-SPIN PROCESSES 


First, we formulate the energy constraint for the 
two-spin process. Second, we dicusss the first-order 
two-spin process. Third, we discuss a second-order two- 
spin process. Fourth, we consider the long-range 
process. 


Energy Constraint 


In a two-spin process, spin one flips from state A to 
state B, while another spin flips from state C to state D. 
The labels A, B, C, D designate the quantum numbers of 
the unperturbed single-particle states. The correspond- 
ing unperturbed energies are E45, En’, Ec, E p*. These 
energies are perturbed by the dipole interaction, and in 
first order we write 


ptt E-- pio. (8) 


Etot and E’ are numbers, but Æt? depends both on the 
specific configuration of all dipoles which the spin sees, 
and on the quantum state in which the spin finds itself. 
For a transition to be possible, we require zero change 
of over-all energy. The net change in Zeeman energy 
must be balanced by the net change in dipole energy: 


(Eatt— Egt9*)-- (Ezgtet— Ep’) =0 ; (9) 


| Equation (9) replaces the analogous condition 
Eat*— E59t-—0, which was appropriate for single flips. 


= 
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To formulate the analogous energy constraint, we 
specify that: (1) spin 1 have energy /4'" when in 
state A; (2) spin 1 have energy Ep when in state B; 
(3) spin p have energy £c when in state C ; (4) spin p 
have energy Ep'** when in state D; and (5) the con- 
straint expressed in Eq. (9) holds. The total number of 
configurations for which Eq. (9) holds is obtained by 
integrating over all values of /£4595,75'^t*, Ec, Ep, 
The energy constraint is then expressed as the operator 


| 8E — (Ba? E tin) B Ent — (Ea? + En?) 


KO Ecet— (E+E) BE p'9*— (En + E ptio) ] 
XE ate! — Er + Ect! Ep] 
xdE td Eyed Eod Ept**. (10) 
Performing the integrations reduces expression (10) to 
aL (24^ — Ep 4- Ec — E p?) 
— (E yti»— Et? E tin— Ect?) ]. (11) 


The first parenthesis in expression (11) is the total 
change in unperturbed energy and we denote it by tw. 
The second parenthesis is the negative of the change in 
dipole energy. We put 


TiNo Ap — Ep ?— Eat, (12a) 
TiNocp- Epdi?— Ec. (12b) 


We again assume that the contributions of all spins to 
Ac are additive: 


AcAn— 2, AeAn(ru), (13a) 
kA 

Awen= > Awcp(rpr) - (13b) 
k#p 


Expression (11) now becomes 


S(—[XS Aean(ru)d- E Awen(rpx) D - (14) 


kl kztp 


Finally, replacing the 6 function by its Fourier trans- 
form, we have for the energy constraint: 


1 2o 
UE exp(—ip[o—?; Aeas(ru)— 22 Aocp(rax) ]) de 
k1 kp 


Dipl) ey 


1 [77 
=f (exp(2p[ AeAn(1r15)-- Awcpn (rp) |} 
x AT exp{ipL Awan (ru) + Awon (rox) Ie Tn 


First-Order Process 


We may now write the transition probability in the 
form of Eq. (7) using the energy constraint we have 


NA ONE 
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just derived. We put V;/V=n,/v. We then have 


y Pii 2 1 
ire)-— | BL IL exp{ip[ Ae An (rip) 


20) Phe v 


1 1 
+Aweo(rn) dr = | exptipfauan(e) 


N—2 
Foco(ru)]Mdr) Jem (16) 


This expression is very similar to Eq. (24) of part I, 
with the integral containing M,,? corresponding to x 
and the round bracket raised to the (N—2) power 
corresponding to d. The difference is that x and & 
cannot now be evaluated separately. They are coupled 
through the dependence of on r;,. This becomes more 
apparent if we write rj,— riy—ri, As we integrate 
over r5 to evaluate x, we should simultaneously have to 
integrate out the ri; dependence in ®. This calculation 
would be impossible. 

This impasse can be overcome through the following 
consideration: Let us consider the limiting situations, 
ri — 0 and m,— c. In the first case, the rp de- 
pendence of ® is eliminated and ® has exactly the form 
we discussed in part I. It will essentially describe a 
quasiresonance line whose parameters are the algebraic 
sum of the parameters for the resonance lines A €» B 
and C <> D. In the second case, we note that Aw(r) 
depends on r as 7~*. Within any arbitrarily large neigh- 
borhood of spin 1, rj; will be infinite, Aw(rpx) will be 
zero, and exp[ipAw(r,x) ] will equal one. The converse 
is true within any arbitrarily large neighborhood of 
spin p. Consequently, each of the (V —2) integrals in 
® breaks up into the sum of two integrals. 


1 
lim, = (— | texeCioawan (r) ] 
V 


N-2 
-Fexp[?pAvcp (ar) EST 


It is easily shown that ® now corresponds to the con- 
volution of the two resonance lines 4 €» B and C e D. 

In either limit, # is functionally identical to an ab- 
sorption line and has a width roughly twice of an ab- 
Sorption line. For most practical applications, this is 
Sufficient information. If more detail is necessary one 
must split the calculation into two parts corresponding 
to ri; very small and ri, very large. For ri; small it is 
always possible to calculate exactly the diagonal dipole 
elements which determine Aw. For rp large it is always 
Possible to convolute known absorption lines. 

What is the physical meaning of the preceding argu- 
ments? In the first place it is no longer exactly true that 
X and 9 represent, respectively, the rf power spectrum of 
the dipole interaction and the level broadening. Such a 
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separation was possible for the single-spin theory of 
part I because of our assumption that the interactions 
of spin 1 were independent. Now, however, the inter- 
actions of spin 1 are not completely independent; they 
are correlated with the interactions of spin p. From our 
present point of view, we still assign to x a meaning as 
power spectrum, and take care of the correlations in «P. 
In line shape language, (p) is the coherence relaxation 
function and its width corresponds to T;. When two 
spins are very near one another, they are very strongly 
correlated. With respect to the rest of the system, they 
behave as a coupled unit with a coherence time equal 
to that of a single spin. When two spins are very far 
apart, on the other hand, they are uncorrelated, and 
their joint coherence time is given by 


1/720,2) 2 [1/ T» (1) -L1/730)] , 
or, if T(1)=T:(2), 
T2(1,2)=372(1)=$72(2) . 


The dependence of $ on 71, is thus interpretable as the 
dependence of the pair coherence time on the spin 
separation. 

We may then write for the double-flip probability 


2m nm [° 


W(z)-— — | x(eX(p)e "dp, (18) 
where auo 
x)= | Materiae, (19) 
2r 
Acx(r) = Awan (tip) + 4wcenlrp) - (20) 


We now discuss briefly the more precise functional 
form of M ıp and Aw appearing in Eqs. (18) to (20). The 
matrix element My, is, of course, 


(4 (0C (p) [e| B(1))D (9). 
In the absence of a crystal field, we would have 
Mıp=g(3 cos'ü;,—1)/r9, 


Mi g sin'$i,/rij, 


(21a) 
(21b) 
The quantity g is the value of the appropriate spin 
matrix element. Like the matrix element for single 


flips, M1, can be expressed even in the most general case 
as the product of an angular function and'7~*. 


Mi,—m (81) /r15*. 


Ac 0 , 


Ami 252. 


Q2y 


The Aw’s have the same form as in the single-spin 
process. In the absence of a crystal field 


Awan (tz) = qan(3 cosój,—1)/rió, ~ (23) 
with an analogous expression for Accp. The qas is the 
difference of the values assumed by the spin variables, 
in the matrix elements 50447? and 3655diP, In the 
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general case, Aw too can still be expressed as the product 
of an angular function and r^: 


Ac (rix) f (01)/ru?. (24) 


We see then that the x of Eqs. (18) and (19) is 
identical in form to the single-spin x of Eqs. (25) and 
(38) in part I. The functions m(0) and f(0) are still 
sums of Legendre polynomials of order two. 

The entire discussion x developed in part I applies 
without essential change to the two-spin case. Thus, 
we may write 


Salm) 1 
x(@)= —In|Le/w| , |o|<Le 
ar 2Le (25) 
=0, |o|>Le. 


Here, (m?) is the average value of m?(0) over the 
sphere, L is the maximum value of f (6), and e— ro. 
Unlike the single flip, the double flip is not character- 
ized by the occurrence of Ae and — Ae in pairs. The 
resulting x is asymmetric and this asymmetry is well 
approximated by the addition of an additive constant 


My, =} x=B yap 


(A()C (9) | det» | X (D) Y (6) XX CD Y (9) | 39» | B) D (9) 


GRANT 


on one side of the origin. As mentioned in part I, such 
asymmetry manifests itself primarily in the wings. 

The discussion of W (w) developed in part I also 
applies, with two changes: (1) There is now a coeffi- 
cient of n? instead of 1, so that the transition proba- 
bility depends on the square of the concentration. 
(2) When calculating the central peak of W, we must 
use a & that is wider than its counterpart in the single- 
spin transition. The theory developed in part I re- 
garding the shape of W (w) and the concentration de- 
pendence of this shape applies to the double flip 
without further modification. 


Second-Order Process 


So far, we have treated the two-spin process in terms 
of first-order theory. It may happen, however, that the 
“resonant” condition is met, i.e., AE Ap - AEcp?— 0, 
or w=0, but at the same time the matrix element M15 
vanishes. Such is the case for ruby at a number of mag- 
netic field values when the field is parallel to the c axis. 
To see what effect is expected under these circum- 
stances, we must go to second-order theory. 

The second-order matrix element is given by 


where X and Y are the intermediate states for spins 1 
and p. Each of the first-order elements connecting the 
intermediate states to either the initial or final state is 
of the form of Mi, given in Eqs. (21) and (22). Equa- 
tion (26) may be abbreviated in the notation of 
Eq. (22), 


my(815)m»(015) / riz 
DOE 1 VID 15)/ T1» | 


: (27) 
AExy 

In other words, Mıp breaks up into products of an 

angular function, 7-5, and an energy denominator. The 

energy denominator is 


1/AE=1/(AB’--AE*») = (1/AE) 
X[1— (AESP/AE?)- (AESP/AE?---]. (28) 


Since, for the bulk of the transitions, AE?!» is small 
compared to AE?, we approximate the energy de- 
nominator by 1/AE". 
The number of terms in the XY sum is generally 
small. The reason for this is that the vanishing of the 
"first-order matrix element is generally due to some sym- 
metry condition which causes many of the other 
matrix elements to vanish as well. For instance, in the 
case of ruby at 0? orientation, for g8H — 4D, the tran- 
sition (3, —3) & (—4, —3) is linked by two inter- 
mediate states. Both terms in the sum over inter- 
mediate states are proportional to PzoP217®. The co- 
_ efficients of the coordinate functions are 0.4 and —2.4 
so that their sum is —2. We cite this example to illus- 
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trate that the order of magnitude of the sum is the same 
as the order of magnitude of each term. 

We now proceed exactly as in the first-order theory. 
The integral for x’ is now 


x (p)— : 
ad r2 


1 101 mêm? O 
Í my (8) m2?( yO (29) 
2r (AE)? 


We identify this integral with the integral J (p,4) of 
Eq. (38) in part I. Then, according to Eq. (43) of part I, 
the integration over r (and o) yields 


x o-(—). jn toon) 


E3, —ipef (8) 
Eies ' 


(30) 


where y represents the incomplete y function. We ap- 
proximate f(8), as before, by a function which assumes 
values — L< f< L as cos assumes values —1<cos#< 1, 
and we replace m2(@)m:2(@) by (m?m2). With eco 
we then have 


; 1 / 1N nts?) busy (3, —ipef) 
xe) (—) | de 
€ NAE 3L L (—ipef 


E 


-t d 
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Through the relation? 


Y (a,x) y(a— 1, x) 
Í dx--— ; 


Bye go 


(32) 


we obtain 


. 1/1 \? (mêm®é p pyQ,-—x)—v(,x) 
x (p) 2— ) : | i (33) 
€ NAE 3 te 


where «= —ipeL. By means of the recursion formula? 
yla+ 1, x) 2 ay (ayx)— x*e-* (34) 


and the relation 


240629 1 eas (35) 


Eq. (33) may be reduced to 


1/ 1 V Oiuem?)e 
x (p) -( ) 
3 


We have now explicitly evaluated x’ (p). 
To obtain x’(w) we need 
v= [ xcu. (37) 
—0 
This Fourier transform is obtained almost trivially 
from expression (36) by taking successive integrals of 


6 functions located at o— -ceL. The Fourier transform 
of expression (36) is 


2 


2x(mimy)e 3 w\? 
Ee eu 
Org(AE) 4Le Le (38) 


zu 
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x (v) turns out to be a parabola. It has a half-width of 
0.7Le. For L-24, as is appropriate for ruby at 0° 
orientation, this half-width corresponds to 1.4 times the 
effective nearest-neighbor dipole energy. This width is 
much larger than the width of any reasonable (o) at 
low concentrations. It follows that W’(w), obtained by 
convoluting ® and x’, will have essentially the same 
shape as x’ (v). 

The coefficient in front of the second-order W” is 
down from that of the first-order W by 4 (me/AEsy. 
Since me is approximately the dipole energy and A; is 
approximately the Zeeman energy, this factor is typi- 
cally of order 10-2. This is not all, however. The second- 
order W’(w) shows no sharp resonance. If we consider 
a (w) of width one-tenth the nearest-neighbor dipole 
energy, the sharp resonance of the first-order W de- 
creases the ratio of W’(0)/W(O) by almost another 
order of magnitude. Far away from the resonant con- 
dition, however, W’ eventually may become com- 
parable to W. 

In summary, the second-order process gives rise to 
an effect which is relatively small, as expected, and 
which is quite spread out in its frequency dependence. 
One would not expect this process to be directly ob- 
servable, but it does contribute, to some extent, to 
“general” cross-relaxation, ie., to relaxation which 
appears more or less independent of any resonance con- 
dition between levels. 


Long-Range Process 


Finally, we consider still a third type of two-spin 
process. In this process, spin 1 and spin p both flip, but 
independently of one another. It represents, in a sense, 
the joint probability of two single spin flips. Because the 
two spins operate independently, their separation 
becomes irrelevant. Our theory gives a reasonable 
account of the nonexistence of such a process. 

The matrix element governing this process is 


(A (1)X (4) (Fe BCL) XZ) )(C(p) Y (G) [aee] D(p) Y (7) 


My" = SE | 
XY Ec— Ep 


, €()Y (j) |sev| DG) Y (7) KA (0 X G) 389] BO) X G)) 


'The two portions of this matrix element correspond to 
the two possibilities of either spin 1 or spin p flipping 
first, thus giving rise to different intermediate states. 
Since our method always envisions exact conservation 
of energy in any one process, one energy denominator 
is the exact negative of the other. By abbreviating the 
notation in an obvious manner, we have 


1 
My" = 


-E (MuM — M My). (40) 


AE xy 


? A. Erdely, F. Oberhettinger, W. Magnus, and F. G. Tricomi, 
Higher Transcendental Functions (McGraw-Hill Book Company, 
Inc. New York, 1948), Vol. 2. 


(39) 
E4— Ep 
If ij, the spins referred to in Mi; and Mp; are all 
different, Mı; and Mp; commute, and M;," vanishes 
identically. If i=j, Mi," does not vanish. However, 
the extra index “uses up" one more volume integral in _ 
our configuration counting [see Eqs. (7) and (16) ], 
and as a result, there is an extra factor of 1/V left over. 
This leads to a transition probability which is down by 
a factor of N, where N is the number of spins in the 
crystal. Physically, spin 1 and spin p cannot interact 
at all without any interaction mechanism. They can 


interact via some common third spin i, but since they — — 
B us $9 
act independently, the chances of their “picking” out — 
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the same spin i from the V available spins are down by 
a factor of N. 
This completes our treatment of the two-spin process. 


IV. HIGHER ORDER PROCESSES 


Triple and higher multiple spin flips are handled in 
a fashion quite analogous to double flips. The principal 
difference is that dipole operators can never connect 
more than two spins at a time; for this reason, multiple- 
spin processes must be treated in higher order pertur- 
bation theory. 

We consider a triple flip in which spin 1 goes from 
state A to state B, spin p from C to D, spin q from Æ to 
F. If the selection rule Am=-b1 holds for each spin, 
the possible processes are that either spin 1 or spin 2 or 
spin 3 flips first with the remaining two spins flipping 
second, or the converses of these three possibilities. 
There are thus six intermediate states, with a given 
process and its converse haying energy denominators 
that are exact negatives of one another. These processes 
have been discussed in their detailed application to Lik 
by Pershan.? In the more general case, when 7 is not a 
good quantum number, there is no selection rule. The 
number of processes can become very large and the 
matrix elements very complicated. Nevertheless, a 
typical term, together with its converse, can be repre- 
sented by 


Mipg= (MiyM 14— M3,M35)/ AE . (41) 


M54 does not vanish since Mi; and Mı do not com- 
mute. The coordinate dependence of Mi, and Mi, is 
defined in Eqs. (21) and (22). We therefore have 


3 ma(015)m»(014) 
Miyyt—m E (42) 
AE Tipi. 


When following our previous reasoning, the operator 
for a triple flip becomes 


1 \2 emi(815)ms (614) 
mw-( ) J dae doa (43) 


vAE rip Tig? 


In our configuration counting, we must include the fact 
that we have already assigned three spins to three 
“boxes” and thus, the analog of Eq. (5) is 


C3) NN Na 


Xdrı, pil'^ dri, q-1dt, 05 *driy-. (44) 


In setting up the energy constraint, we require that 

AEH AE9dip — 0 when all three spins are considered. In 

analogy to expression (15), this constraint turns out to 
a P, S. Pershan, Phys. Rev. 117, 109 (1960). 


^ 


A 
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be 


1 oo 
-f exp (ipLAean (5) H Awan Ti) + een (to) 
Diy aes 


4 Awe (tod) +Aere (ta) +Aera (tar) J} 
x II exp(ipLAeAn rix) T Awcn (ton) 


kAl, p, d 


TAo relta) ]) e dp . (45) 


Finally, defining Aw, in analogous fashion as in Eq. (20) 
we have 


2r mnang [° 
W (w) = E CREE 


1 v —2 


1 /1wy ppm 
xo) f [enm] 
On NAE/ J ers 


m? 
x] preen pomemanan (46b) 


Tip 


x(p)b(o)e- "dp , (462) 


1 
Q (p) = Gl exp(ip[ Awas (34) + Awep (Ep) 


N—3 
Apc ra) dra : (46c) 


Once again, x and are not really separable, but our 
previous discussion of this point applies without change. 

A new difficulty arises, however, because x itself 
is now given by two integrals which are not separable. 
'They are not separable because of the appearance of 
rj, in the third exponential. To test the effect of this 
coupling term, we have computed x(w) numerically, 
both with and without the coupling term. We used P21 
for m(8) and Pao for Awy. We find that the two xis gen- 
erated in this way have virtually the same shape, 
except that the x without the coupling term is narrower 
by about a factor of 1.5. It appears to be still prac- 
ticable, therefore, to calculate x on the basis of separable 
integrals with the proviso that one expects the correct 
function to be qualitatively the same but somewhat 
broader. 

Each of the separate integrals gives the power spec- 
trum of the projection of the dipole operator governing 
the interaction between spin 1 and spin p and between 
spin 1 and spin q, respectively. If these two interactions 
were completely independent, there would be no 
coupling term, and the joint power spectrum would 
simply be the convolution of the individual power 
spectra. But the transitions do not occur independently; 
hence, a coupling term. In picturesque language, the 
relative squiggles of spins 1 and p are modulated by the 
squiggles of spin q, and the spectrum is broadened. 
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Fic. 1. Three-spin functions: W half-width versus 
$ half-width in units of x half-width. 


The calculation of the triple flip W (w) again resolves 
itself into the convolution of the x and ® functions of 
part I. Only now we need two x's with their associated 
P’s instead of one. We note that W (w) will have a factor 
proportional to the cube of the concentration and to 
the square of the ratio of the dipole energy to the 
Zeeman energy. 

Four-spin and higher order processes require no new 
physical or computational considerations. We note that 
a matrix element of the form MıpM qr gives zero con- 
tribution. The two factors commute and the element 
will be identically cancelled by its converse. This is 
important since there is no implicit requirement in such 
a matrix element that spins 1 and p be near spins q 
and r. 

When K spins flip, we will therefore require (K— 1) 
*th-order perturbation theory, and the matrix element 
will be a sum of terms each of which contains (K—1) 
products. Our counting scheme will generate a concen- 
tration dependence of order z*, and the (K—1) 
products of dipole operators will generate a factor of 
(dipole energy/Zeeman energy)*(€-?. The functional 
form can be estimated by convoluting (K— 1) x func- 
tions and their associated $'s. Owing to this increasing 
number of convolutions, it is clear that higher order 
processes will show a less and less marked resonance 
effect, and that their W (w) will become more and more 
smeared over w space. When this consideration is 
coupled with the step n dependence of these processes, 
we have at least a partial explanation of why the 
“general” cross-relaxation increases so markedly at 
higher concentrations. 

The concentration dependence of the shape of W (w) 
enters exclusively through the ® functions, as we have 
seen in part I. We showed that W (w) for a one-spin 
process has a width which varies as the square root of 
the $ width. If the resonance line is homogeneous, the 
width of this W (w) will vary as 2/*. The same argument 
applies without change to the two-spin process. For 
higher multiple processes, however, the concentration 
dependence of the width of W (w) is much weaker. In 
Fig. 1, we show the half-width of IV as a function of the 
half-width of $, for a three-spin process. The curve is 
virtually flat until the width is about one-tenth the 
x width. The slope of the curve then increases gradually 
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and reaches the value 4 only when d and x have equal 
widths. The concentration dependence of the width of 
Ww) rapidly disappears as the number of spins in- 
volved increases. The reason for this is that the succes- 
sive convolutions rapidly smear out the sharp central 
peak which alone depends critically on the width of 4. 


V. EXCHANGE 


We have so far ignored certain considerations which 
do not alter the general structure of our calculation, 
but which nonetheless may have a pronounced effect 
on actual results. 

An exchange interaction of the form J(S;-S;) fre- 
quently occurs between neighboring transition metal 
ions. If such an interaction is present, we distinguish 
two cases: (1) When the exchange energy is of the same 
order as the dipole energy, we treat these two inter- 
actions in parallel fashion. We still consider the un- 
perturbed energy of the ion as an eigenvalue of a single- 
ion state. We consider the diagonal elements of J (S;- S;) 
as contributing to Aw and we consider the off-diagonal 
elements as contributing to the transition matrix ele- 
ments. The integral defining x [ Eq. (25), part I ] will, 
of course, be split into two parts. The first part sums 
over those sites that are exchange coupled, the second 
over the rest of space. (2) When the exchange energy 
becomes comparable to the Zeeman or crystal field 
energy, the unperturbed states can no longer be con- 
sidered as single-ion states. If J is large enough, Æ” is 
given by the diagonal elements in a representation in 
which J (S1- S2) is diagonal. In any case, since the energy 
separations between pair states are now, in general, 
quite different from the energy separations between 
single-particle states, such exchange coupled pairs do 
not, in general, contribute to the resonant cross-relaxa- 
tion process. In terms of computational procedure, this 
means that the integral defining x starts at a radius 
where exchange is no longer large. 

This prescription is somewhat oversimplified for two 
reasons. First, it may happen accidentally that the 
separation between some of the pair levels does, in fact, 
coincide closely with the separation between some of the 
single-particle levels. Secondly, as we have seen, the 
frequency width of higher order processes becomes large. 
At the same time, the number of pair levels is (25-+-1) 
times the number of single-particle levels. As the re- 
quirement of harmonic coincidence is made less and. 
less stringent, it becomes more and more probable that 
some of the level separations in the larger pair manifold 
wil meet an approximate harmonic condition. One 
expects that exchange coupled pairs would, in general, 
enhance the probability of higher order processes, par- 
ticularly under conditions where the single-particle 
levels alone do not exactly meet the harmonic condition. 


» 


^ 


Such participation of exchange-coupled pairs in the 


> 


NH 


* 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


A1572 W. J.C. 


cross-relaxation process has been discussed by 
Bloembergen and Pershan* and by Gill and Elliott.’ 


VI. THE NEAR-NEIGHBOR SPECTRUM 


From the very beginning, we have replaced lattice 
sums by volume integrals, using the volume per lattice 
site v to keep the dimensionality consistent. Clearly, 
such a substitution is valid when the density of lattice 
sites per unit solid angle is large. For the calculation of 
& this procedure will introduce a negligible error, except 
possibly in certain concentrated systems. The principal 
reason for this is that the dominant contribution to P 
comes from distant lattice sites. For the calculation of 
x, the equivalence of the discrete and “continuous” 
lattice is of much more restricted validity, because the 


-5 o 5 10 15 20 25 
@ 


Fic. 2. W (w) for simple cubic lattice, [001 ] direction, 
M —sin& cos (arbitrary units). 


5 

4N. Bloembergen and P. S. Pershan, in Advances in Quantum 
Electronics, edited by J. R. Singer (Columbia University Press, 
New York, 1961), p. 373. : 

J.C. Gill and R. J. Elliott, in Advances in Quantum Electronics, 
édited by J. R. Singer (Columbia University Press, New York, 
+ 1961), p. 399. 
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dominant contribution comes from close-lying sites. It 
is valid when a substantial number of neighbor shells 
are strongly exchange coupled. In this case, as we saw 
in the preceding section, there is no contribution to x 
until the sites fall outside the exchange radius. An 
example is ruby, where exchange is large over at least 
the first seven neighbor shells. The equivalence is not 
valid when the nearest neighbors participate in the 
cross-relaxation process. The contribution of these 
neighbors will dominate over the contribution of the 
rest of the lattice because of the 7-9 dependence of the 
transition operator. The integral for x is then replaced 
by a sum. 
1  mj(0;) 
Xie x eee 


2m i Tj 


(QUA (47) 


where the sum need extend only as far as the first few, 
sometimes only the first, neighbor shells. Then x(w) 
will be given by a series of à functions 


m (05) 


7 Tj 


5(w—Aw,) . (48) 


Finally, W(@) will be given by a sum of resonance 
curves, each displaced from the origin by Aw; and 
weighted by the corresponding transition operator: 


mp? (03) 
Wlo)~ i ——® (w— Aoj) . (49) 
7 Te 


In systems that are very dilute and have no inhomo- 
geneous broadening, the various &’s may be resolvable. 
To illustrate, we show, in Fig. 2, W (w) for a simple cubic 
lattice with the magnetic field in the [001] direction 
and with Aw;= (3 cos’#;—1)/r;, mj- sinb; cos6;. The 
width of ® has been taken as 1/50 the nearest-neighbor. 
dipole energy. The peak at 18 is due to the (0,1,1) and 
equivalent sites. The six nearest neighbors are not 
effective because their o; vanishes. If we take 
m= (3 cos0— 1), then the neighbors at (0, 0, +1) will 
produce an effect 32 times greater than the (0,1,1) 
neighbors and 11 times as far away from the origin. The 
shape of W is quite sensitive to the angular functions 
involved. This means it is quite sensitive to the par- 
ticular matrix element for a given process and to the 
direction of the magnetic field. 

In more concentrated systems or in systems with 
substantial inhomogeneous broadening, ® will be too 
broad to resolve subsidiary peaks such as shown in 
Fig. 2. Nevertheless, the shift of the peak away from 
w=0, i.e., away from the exact resonance condition, will 
still occur. 

The shift of the peak cross-relaxation probability 
away from the harmonic condition becomes intuitively 
obvious if one considers that the matching of level 
separations must include the diagonal dipole energy 2$ 
well as the Zeeman energy. Thus, the peak in cross re 
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laxation still occurs where the total energies are exactly 
matched, but this point will be somewhat displaced from 
the point where the Zeeman energies alone are exactly 
matched. The amount of displacement depends rather 
critically on the specific operating conditions. 


VII. INHOMOGENEOUS BROADENING 


Inhomogeneous broadening affects the distribution 
of energy levels about their unperturbed value. In our 
formalism this distribution manifests itself in the func- 
tion P. The question whether ® should include the in- 
homogeneous broadening or not is isomorphic to the 
question whether the spin packets within an inhomo- 
geneous line cross-relax together or separately. For 
limiting cases one can give an unequivocal answer. If 
the spin-diffusion time within the inhomogeneous line 
is much shorter than the cross-relaxation time for one 
packet, then the whole line relaxes together. If the 
spin-diffusion time is much longer than the cross- 
relaxation time, then the packets relax separately. To 
put this in terms of numbers, we take Ta= 10-5 sec and 
Ti2=10 sec. For Tairs we use Bloembergen's rule of 
thumb? 

Tai TX (Ts/ Tz, (50) 


where 1/7 5* is the total width of the inhomogeneous 
line. We then obtain the condition that the homo- 
geneous width of the line must be much greater than 


€ N. Bloembergen, S. Shapiro, P. S. Pershan, and J. O. Artman, 
Phys. Rev. 114, 445 (1959). 
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297 of the inhomogeneous width, if the entire line is to 
relax as a whole. If this condition is met, *P should there- 
fore include the inhomogeneous broadening. If this con- 
dition is not met, the situation becomes complicated. 
Relaxation takes place between all the spin packets in 
one inhomogeneously broadened level and all the spin 
packets in the other inhomogeneously broadened level. 
At the same time spin diffusion takes place within each 
of the levels. 


VIII. SUMMARY 


We have extended the basic cross-relaxation theory 
of part I to relaxation processes of all orders. Among 
other results we have derived the following: (1) For a 
K-spin process, the magnitude of the transition proba- 
bility depends on concentration as n*, (2) The fre- 
quency width of the transition probability is propor- 
tional to the square root of the $ width for one- and 
two-spin processes. For three-spin processes, the width 
of W depends on the width of © only when $ and x have 
comparable widths, and even then the dependence is 
relatively feeble. For higher multiple processes, the de- 
pendence of the W width on the width is negligible. 
Instead, the width of the K-spin W is roughly propor- 
tional to K. (3) In the absence of strong exchange 
interactions, the nearest neighbors dominate the cross- 
relaxation process. In this case, W is obtained from the 
convolution of functions with a discrete near-neighbor 
spectrum, and its peak may be displaced from the har- 
monic point by an amount corresponding to the 
nearest-neighbor dipole energy. 
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The theory developed in parts I and II is applied to the classical experiments of Mims and McGee, and 


Pershan. The predictions of the theory regarding the dependence of W c j 
as the prediction of the actual magnitude of W cpr, are con- 


concentration and on the exchange radius, as well 


cr on the energy imbalance Aw on 


firmed in the Mims and McGee experiment on ruby. The crucial significance of near-neighbor dipole inter 


actions, in particular, the effect of the associated power spectrum, whic 


h is quite broad and quite sensitive 


to crystal direction, is illustrated by application of the theory to Pershan's experiments on Lil’. 


INTRODUCTION 


N the preceding papers,' we have developed methods 
for calculating from first principles the cross- 
relaxation probability Wer for processes of arbitrary 
complexity. The theory makes specific quantitative 
predictions about the dependence of Wer on the num- 
ber of spins participating in the process, on the energy 
imbalance involved, on the spin concentration, and on 
the existence of short-range forces. Our purpose now is 
to test the theory in its application to classical experi- 
ments in the field. We shall consider the experiments 
of Mims and McGee? on ruby and the experiments of 
Pershan? on LiF. 

Transition probabilities are related to observable 
quantities through rate equations. The transient solu- 
tions of rate equations containing both cross-relaxation 
and spin-lattice terms have been discussed at length in 
a previous paper,‘ both in general and with particular 
reference to the experiments we shall discuss. We shall 
avail ourselves here of those results. 


I THE MIMS EXPERIMENT ON RUBY 


We denote the four ground-state spin levels of 
Cr+ in ruby by 1, 2, 3, 4 in order of decreasing energy. 
As discussed in detail in Mims’ paper, the level separa- 
tion (2-3) is held fixed at 7.17 kMc. The magnetic field 
strength and direction are varied to bring the (1-2) 
separation into coincidence or near coincidence with 
the (2-3) separation. Harmonic 1: 1 coincidence occurs 
at about 22.3°. Levels 2 and 3 are saturated with a 
short pulse, and the decay of their populations is then 
observed. From the decays observed at a series of angles 
centered about 22°, the cross relaxation as a function of 
energy imbalance can be determined. 

The details of the decay function calculated from the 

_rate equations (see Ref. 4) depend rather critically on 
the assumptions one makes about the spin-lattice pro- 
babilities. Assuming equal s-/ probablities Wz between 


all pairs of levels, the observed signal S(/) is given by 
S(D)~1.5 exp(—t/71) +0.5 exp(—t/72), (la) 
rp 1 1.5Wert4W 1, (1b) 
Tg =4W 1. (1c) 
If one assumes s-/ probabilities proportional to the 
energy separation of the levels one obtains 
S(t)~0.8 exp(—1/71) — 0.3 exp(—4/72) 
+1.5 exp(—t/Ts), (2a) 


nol=2.17W 1, (2b) 
To = 6.28W 1, (2c) 
Tg = 1.5W eg4- 2.23W zr, z (2d) 


Wz is here taken as the s-/ probability between levels 
3 and 4. 

When the relaxation process is dominated by cross 
relaxation, one obtains reasonable estimates of the cross 
relaxation rate regardless of detailed assumptions about 
s-l probabilities. It is clear, for example, that Eqs. (1a) 
and (2a) coalesce for times of the order of Wer, when 
Wer/W becomes large. 

Fortunately, many of the processes observed by Mims 
are indeed dominated by cross relaxation. One can fit 
the decay traces either by a double exponential [ Eq. 
(12)] or by a triple exponential [Eq. (2a) ] in the region 
where cross relaxation is rapid. For cross-relaxation 
times shorter than a few msec, closely agreeing values 
result whether one uses equal, unequal, or infinite 
lattice times. The sensitivity of these particular experi- 
ments does not appear sufficient to allow consistent and 
quantitative interpretation of the residual trace after 
the fast component has been extracted. 


TABLE I. C-R probabilities from Mims experiment. 


E. : e (Mc) 7 (ms) Wen(sec-3) 
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We show in Table I the C-R probability versus the 
frequency separation of the (1-2) and (2-3) transitions, 
as derived from theMims experiment. The same data are 
also presented in Fig. 1. Because of the rapid variation 
of the energy imbalance o with the field direction, and 
because Wer becomes of the same order of magnitude 
as Wz for large w, the uncertainty in Wer ranges from 
~ 10% for small w to a factor of 2 or so for large w. We 
note that the last two points for negative c fall outside 
the main sequence. This effect is sufficiently marked so 
that it should probably be considered real, and we shall 
comment on these points later. 

The theoretical expression for Wer, we recall, is 
essentially the convolution of two functions: x, which 
is the power spectrum of the dipole operator, and d, 
which in the present case is itself the convolution of the 
(1 €» 2) and (2 3) resonance lines. In the following 
computations, we have used the form of x given in 
Eq. (53) of part I. This x function was computed using 
matrix elements appropriate for the magnetic field 
parallel to the crystal axis, and is consequently not the 
exactly correct function. As pointed out in part I, 
however, the salient features of the x function do not 
depend on the detailed angular functions assumed, and 
Eq. (53) of part I represents a close approach to the 
exact function, without requiring excessive numerical 
computation. We may then write 


Wes [32z?(n?y?/3hwLe W(o; Lea). (3) 


The quantity (m°) is the square of the transition dipole 
matrix element, averaged over a sphere of unit radius; 
the quantity Lis the change in dipole energy between 
two flipping spins, maximized with respect to angular 
position, and again for unit radius; v is the average 
volume per lattice site; e is defined by the relation 
e 37 = the effective nearest-neighbor distance; n is 
the fractional mole concentration of Cr?* in each level; 
a is the half-width of $. Le is directly related to the 
width of x, [ Eq. (52), part I]. The numerical constants 
have been so chosen that V is normalized for Le= 2g?8?. 

The quantities (m°) and L are derived from matrix 
elements that can be calculated precisely. The quantity 
a can be estimated with reasonable accuracy. In the 
present case, as discussed in part II, it is the width of a 
function obtained by convoluting the (1+2) and 
(2€» 3) resonance line shapes. (These line shapes in 
turn can be calculated from first principles by methods 
closely related to our present theory.) The quantity zo 
is a free parameter, in the sense that its theoretical 
prediction lies altogether outside the present theory. 
The value of rp which our theory extracts from the cross 
relaxation data must, however, be compared with the 
radius of strong exchange as determined by Statz and 
co-workers. Exchange-coupled pairs would not, in 

5L. Rimai, H. Statz, M. J. Weber, G. A. De Mars, and G. F. 
Koster, Phys. Rev. Letters 4, 125 (1960). H. Statz, L. Rimai, 
M. J. Weber, G. A. De Mars, and G. F. Koster, J. Appl. Phys. 


32, 2185 (1961). M. J. Weber, L. Rimai, H. Statz, G. A. De Mars, 
Bull. Am. Phys. Soc. 6, 141 (1961). 
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Fic. 1. Experimental data (from Mims, Ref. 2). 


general, be expected to participate in the two-spin cross- 
relaxation process, with its sharply defined resonance. 

We first discuss the calculation of the matrix elements. 
These are computed from the pair Hamiltonian 


Spar = par T Ipai P (4a) 


Koai? = 28H - (Sit 83) — DCS 9-8, , (4b) 


doa 3(r 2" S )(r So) 
Rath $e]. (4c) 


ru? rus 


For a particular value of H, taking into account both 
its magnitude, and its direction with respect to the c 
axis, one diagonalizes 3€? numerically. Then the same 
transformation is applied to 3€?/», The matrix elements 
relevant to a transition between the pair state (2,2) and 
the pair state (1,3), for example, are easily obtained 
from the resulting matrices. (The labels 1,2,3, refer to 
single-particle energy states in order of decreasing 
energy. They do not refer to m states, since m is not a 
good quantum number.) The matrix elements are 
complicated functions of the coordinates. For the Mims 
experiment they are 


Aw= (1,3) 36999] (1,3))— (2,2) 3c» | (2,2) 
= (g°8°/r8)[1.36(3 cos'8— 1) 
+0.73 sind cos6-- 1.34 sin'8] (5) 
m= ((1.3)|3c4i | (2,2)) £ 
= (g?8?/r°)[0.191 (3 cos'8—1) 
+ 1.562 sin# cos#-+-1.417 sing]. (6) 
These matrix elements lead to the values 
L-—2.182g8 , i 
(m?)= 1.425181, 
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Tac. 2. In y versus w/Le. 


with 
gg — 51x 10* Mc(À)*. 


We note that there is no need to use projection opera- 
tors. Through the device of the pair matrix we project 
the appropriate portion of the dipole operator simply 
by inspection of the appropriate matrix elements. Of 
course, the use of the pair matrix allows an exact compu- 
tation of the matrix elements for any operating condi- 
tion, regardless of the orientation and magnitude of H, 
that is regardless of whether or not 7 is a good quantum 
number. 

We next consider the dependence of Wer on a and 
on Le. 

Although in principle a is determined by our know- 
ledge of resonance line shapes, in practice the situation 
is complicated by the presence of inhomogeneous 
broadening and by the complex structure not only of 
the total resonance line but even of its homogeneous 
component. At a Cr?* weight concentration of 0.05%, 
the half-width ascribable to homogeneous broadening is 

‘about 6 Mc, and the homogeneous line shape is a 
Torentzian whose falloff gradually becomes Gaussian 
about 1 kMc from the center frequency. The inhomogen- 
eous broadening is a pure Gaussian of half-width about 
18 Mc. The total line is the convolution of these two. 
Tt has a half-width of about 23 Mc. At the center it is 
dominated by the Gaussian. At frequencies more than 

pbout 60 Mc from the center it is dominated by the 

Lorentzian homogeneous component. In the context of 

‘cross relaxation we are usually talking about frequency 
€ 
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separations of the order of hundreds of Mc, where the 
total resonance line behaves essentially like its Lor- 
entzian homogeneous component. Very near the exact 
harmonic condition, cross relaxation is so rapid that the 
inhomogeneous line may no longer relax as à whole, in 
which case we are observing relaxation of individual 
Lorentzian spin packets rather than of the entire system. 
We note that the function ® is the convolution of the 
(1€ 2) and (2 3) resonance lines, and that in con- 
volution, Lorentzians add width algebraically, while 
Gaussians add width in rms fashion. One way to over- 
come some of these complications is to find an approxi- 
mation to Y [ Eq. (3)] from which the dependence on 
Le can be extracted more or less independent of detailed 
assumptions about @, or about the shape of the func- 
tion ®. 

Fortunately, such an approximation exists. If one 
plots In¥ as a function of w/Le, one obtains curves 
which are very closely straight lines over several orders 
of magnitude in Y. The slope of these straight lines is 
dependent only weakly on the shape of ® or on the 
value of a. In Fig. 2, we show InY as a function of 
w/Le for various values of a/Le, and with ® Lorentzian. 
In Fig. 3, we have plotted the quantity (a/Le) 
XdW cr/d(w/Le) against a/ Le. Figure 3 is, in a sense, 
the meeting point between theory and experiment. If 
din Wer/dw and a are known experimentally, then 
Fig. 3 determines a/Le, and therefore e. We note that a 
straight line of unit slope in Fig. 3 would indicate that 
the slope of In Wer is independent of a. For a Gaussian 
®, the acutal curve deviates negligibly from such a 
straight line; for a Lorentzian ®, it deviates only slightly 
for small a more seriously for large a. 

Our theory then predicts that In Wer should vary 
almost linearly with w over several orders of magnitude 
(with some rounding at the very peak of the curve). 
The slope of this straight line yields a close estimate of 
the quantity e, that is, of the exchange radius 70. 

This linearity is a good approximation when a/Le 


Fic. 3. afid(Iny)/dw] versus (a/ Le). 
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is considerably smaller than unity, that is for magnetic- 
ally very dilute systems. As can be seen from Fig. 1, 
the linear approximation to InWer is consistent with 
the data of the Mims experiment. 

For the Mims experiment, dnW ¢r/dw=0.010 Mc. 
For a Lorentzian of half-width 12 Mc (the convolution 
of two Lorentzians of 6 Mc each), one obtains r= 6.28 
A. For a Gaussian of half-width 32 Mc (the convolution 
of two Gaussians of 23 Mc each) one obtains ro=6.15 
A. In this case, therefore, it happens to make very little 
difference whether one uses the width of the entire 
resonance line or only its homogeneous component. 
While this is partly fortuitous, it is primarily due to 
insensitivity of Le to the shape of 4. 

The work of Statz and co-workers? indicates that 
the exchange radius lies between about 5.5 and 6.5 A. 
Thus, our value of 6.15-40.15 A is in excellent agree- 
ment with his. We feel that such corroboration is rather 
encouraging in view of the startling conclusion that the 
first twelve or thirteen neighbor shells, or about the 
first fifty nearest-neighbor sites, are sufficiently ex- 
change coupled to be excluded from this cross-relaxation 
process. 

Because of the trick of working with the derivative 
of InW cp, we have so far concerned ourselves exclusively 
with the shape of Wer(w), completely separating out 
the question of its magnitude. The magnitude of Wer 
depends sharply on the concentration, and the absolute 
concentration is difficult to determine accurately. 
Nominal values of concentration are accurate within 
about 50%. Furthermore, there is some evidence of 
pronounced variations in concentration, over a scale 
of the order of microns, which makes the value of the 
true effective concentration even more obscure. In 
view of these uncertainties, the absolute magnitude of 
the cross-relaxation probability calculated by our 
theory, as we shall now show, agrees astonishingly well 
with the experiments of Mims. 

Since we have already accounted for the shape of 
Wer(w), it suffices to calculate its magnitude at one 
point. For convenience, we choose the peak of the curve, 
Wcr(0). The required value is given by Eq. (3), using 
Y (0; Lea). For the Mims experiment, Y(0; Lea) 2.9, 
if one assumes a Lorentzian ®. L and (m?) are given by 
Eqs. (7) and (8). The nominal weight concentration 
of 0.05% corresponds to a molar concentration of 
0.00034 or to a concentration per level of 8.2X 10-5 
One then computes Wer(0)=25 900 sec-!. Extrapola- 
tion of the curve of Fig. 1, yields Wer(0)= 18 000 sec-'. 

Even leaving out of account the considerable experi- 
mental, uncertainties in the determination of the points 
in Fig. 1, the discrepancy between these two values can 
be accounted for by an actual concentration 17% below 
nominal. One must also bear in mind that the x function 
on which this analysis is based is inexact, and that 
truly precise numbers cannot therefore be expected. 

n view of these circumstances, the agreement with 
experiment is excellent. 
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We note that a linear concentration dependence of 
W cr would give values that deviate from those observed 
by more than four orders of magnitude. 

The interpretation of a direct measurement of the 
concentration dependence must be made with some 
care, since the concentration enters not only as a 
coefficient, but also determines the shape of Y via the 
parameter a. In Fig. 4 we show Iny, evaluated at 
various values of w as a function of In (a/ Le). The curves 
are not linear and depend drastically on the value of w. 
As a/ Le becomes larger, the curves would all approach 
one another, and all would approach negative unit 
slope. This can be seen intuitively because y is nor- 
malized and approaches the normalized function 4 
as a/ Le becomes large. The curve for y(w=0) already 
has a slope of —0.8 at a/Le—1/e. For 0.05% ruby, 
a/Le is about 1/20, and the slope of V(0) is about 
— 0.5. Since a varies almost linearly with the concent/&- 
tion z in this region, one would expect to measure a 
dependence of 5! for the peak value of the cross- 
relaxation probability, the dependence becoming less | 
steep at higher concentrations. Such a prediction is 
somewhat sanguine, since it depends not only on the - 
precise form of x and ® but also on the precise values 
of a and Le. 
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Finally, we comment on the “stray” points in Fig. 1, 
which are obtained for negative values of w. Two ex- 
planations suggest themselves: (1) Our theory predicts 
an asymmetry in general, if | Aw max| # | Ac min|. In 
the present case [ Eq. (5) ] there does exist a difference 
between the two extremals, the difference being about 
10% of the extremum values. (2) It is also possible that 
the exchange-coupled pairs have a resonance about 800 
Mc below the signal frequency of 7.17 kMc, that is at 
about 6.37 kMc. To check the reasonableness of this 
suggestion we have computed the pair levels assuming 
the conditions of the Mims experiment and an exchange 
energy J of 28.75 kMc (or 5D). The antiferromagnetic- 
ally coupled pairs will almost all be in the manifold for 
which S,: $:— 9/4. It turns out that two out of the six 
transitions within this manifold have an energy interval 
of about 6.4 kMc. The energy separations within a 
given S: S manifold depend only very feebly on the 
precise value of J, as long as J is large. For this reason 
our computation indicates a reasonable likelihood that 
the pairs will contribute to the single ion relaxation 
when the single ion (1 €» 2) transition is about 800 Mc 
below the cross-relaxation harmonic frequency. 

We appear to be able, then, to explain all the features 
of the Mims experiment in a consistent and unified 
way. 

Our comparison of theory with experiment might be 
criticized on the basis that the experimental evidence is 
not sufficiently copious or conclusive. For example, in 
Fig. 1, we have drawn a straight line on the basis of 
three points—an extrapolation which admittedly lacks 
uniqueness. Nevertheless, we can assert minimally that 
our calculations are consistent with the experimental 
data available. These data include independently the 
cross-relaxation decay studies, the available information 
regarding the structure and concentration dependence 
of the absorption resonance, the independent determina- 
tion of the exchange radius. The area of consistency 
extends not only to the shapes of the functions, not only 
to parametric dependence, but, what is most important; 
to actual magnitudes. We are thus actually able to tie 
together, in a fully quantitative way, & substantial 
amount of dispersed experimental evidence. 


I. PERSHAN’S EXPERIMENTS ON LiF 


We next consider the cross-relaxation process in 
LiF, which has been investigated experimentally and 
theoretically by Pershan,? in a notable study which 
established decisively the existence of cross relaxation. 
Pérshan interpreted his experiments in terms of moment 
theory and the convolution of Gaussian resonance lines. 
We shall see that our own theory leads in some circum- 
stances to similar conclusions as Pershan's; but certain 
experimental features which proved anomalous in terms 
of nis formalism find their natural place in ours. 
^ Cross relaxation in LiF contrasts sharply in several 


respects with cross relaxation in ruby: (1) The process 
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of main interest is a three-spin process involving two 
Li’s flipping opposite one F, which requires slightly 
more complicated rate equations. Furthermore, neglect- 
ing quadrupole interactions, there is no zero-field split- 
ting. The 4 Li levels are equally spaced, and transitions 
occur simultaneously between all pairs of adjacent 
levels. (2) There is no exchange interaction in LiF. 
This frees us from some of the major concerns in ruby, 
namely the extent of the exchange radius and the possi- 
ble influence of exchange-coupled pairs. On the other 
hand, as we have seen in part II, the dipole interaction 
itself is now dominated by nearest, or by near, neighbors. 
This means that the use of a x function based on a 
continuous dipole density is not justified. To obtain the 
correct x we must sum over the discrete sites. (3) Lil 
is a magnetically concentrated system. The shape of d 
is Gaussian, and there is no question of inhomogeneous 
broadening. 

The three-spin process can occur in any of nine 
mutually exclusive ways, since each of the two Li 
atoms can be in any one of three initial states. In 
addition, a second-order two-spin process can occur in 
which a single Li spin changes its z component by two 
magnetons. Two initial states are possible for this 
process. The decay constants for each of these relaxa- 
tion modes are obtained from rate equations in Ret. 4, 
where it is also shown that the observed decay constant 
is their average. 

The calculation of the matrix element governing 
each of these decay modes has been discussed in general 
in part II and with special reference to LiF in Pershan's 
paper.’ We recall that each matrix element is a second- 
order element consisting of a sum running over six 
terms. The intermediate states correspond to the pos- 
siblities of any one of the three spins flipping first, and 
the converse. The actual computations gain consider- 
ably in simplicity and lucidity by evaluating the matrix 
elements of products of spin operators directly in the 
appropriate pair manifold. 

The function x(w) is given by an expression analogous 
to Eq. (48) of part II: 


X(w) = 2m YM? (risio) 
pa 


X[ Ao; (115) Aw, (tr) Ao (Ys) —w]. (9) 


An Li is here taken as the reference atom 1; p labels all 
other Li atoms; and q labels the F atoms. The Aw, (rij) 
are the changes in dipole energy induced by the flipping 
of atoms 4 and j, as discussed in detail in part II. As 
discussed in part II, correlations are explicitly included 
through the term Aw,(rp,). M? is the square of the 
transition matrix element we have just discussed. It 
consists of a series of terms typically of the form 


(1/AE)*[[m2 (1p) 102? (635) /ris8r148]]- (10) 


T he symbols m? and AE have been defined and discussed 
in part II, Eqs. (41)-(45), and (28). In the present case 


= 3 * z 
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AL=guBH when an Li flips first, AE= gyBH when an F 
flips first. 

We have computed the pq sums in Eq. (9) twice, once 
including the first neighbor shell only, and then the first 
two neighbor shells. The first shell consists of the nearest 
6 F atoms and 12 Li atoms. The second shell contains the 
next 8 F atoms and 6 Li atoms. There are thus 72 terms 
in the first sum, 252 in the second. Each of the calcula- 
tions has been done for the [001], [011], and [111] 
directions. It turns out that for the [001] direction the 
second shell already contributes only a small correction. 
For the other directions, the sum is also converging, 
but clearly has not yet reached its limit. Nevertheless, 
the main features of interest to us already appear 
clearly, as we shall see, from the summations over the 
first and second shells. For this reason we feel that the 
very steep escalation in computation time and labor, 
which would be necessary to carry the sum further, is 
unwarranted. 

The appropriately weighted average of the x functions 
is presented in Fig. 5, for each of the three-crystal direc- 
tions. The units along the w axis are gauss, with the 
correspondence w= (2g1;— gr)8H or 1 G— 0.697 kc. The 
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CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


EI = 


CROSS RELAXATION. III A1579 
6 functions of Eq. (9) have been smoothed by averaging 
over 0.6-G intervals. The w? dependence arising from 
the energy denominator has been suppressed. The 
reason is that this factor is excluded from the convolu- 
tion integral for the transition probability. That is 


Worl) o — f x' (Blod, (M) 
[m 


where x’ is &? times the x defined in Eq. (9), or in Eq. 
(46b) or part II. 

The crucial point in Fig. 5 is that x for the [111] 
direction is narrowly peaked around the origin, but x 
for the [001] direction is spread out to a distance of 
24 G. In part II we pointed out that the existence of 
large components of x(w) far from the origin will result 
in an apparent shift of the C-R shape function away 
from the point of harmonic coincidence. 

In the final step, convoluting x with a Gaussian, 
the standard deivation o used was 13 G for the [111] 
direction, 14 G for the [011] direction, and 19 G for 
the [001] direction. Pershan obtained a best fit using 
12.7 G for the [111] direction and 14.8 G for the [011] 
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direction. This slightly larger o for the [011] direction 
compensates for his implicit approximation of x by a 
ò function at the origin. For this direction, our x does 
not vanish until 12 G, although it has an enormous peak 
at the origin. For the [001] direction, Pershan cannot 
match a Gaussian to the experimental data. From our 
point of view, it is the convolution of a Gaussian with 
x that is relevant and for the [001 ] direction, x extends 
to 24 G. At fields beyond 20 G or so, the contribution of 
the central peak of x to the resultant function will be 
small. For example, at 65 G, the value of x at 24 G 
weights 34 times as heavily in the convoluted result 
as does the value of x at the origin. To a first approxima- 
tion we might indeed describe the resultant Wer by a 
Gaussian, but by a Gaussian centered at 24 G. 

In Fig. 6, we show calculated and experimental C-R 
times. Again, the w axis is in gauss. The experimental 
points shown are reproduced from Pershan's data. The 
solid lines show calculations based on the first two 
neighbor shells. The dashed lines show calculations 
based on the first neighbor shell only. For the [011] 
orientation, agreement between the calculated and ex- 
perimental values are excellent. The change between 
the one-shell and. two-shell calculations is negligible, 
indicating that the nearest neighbors account for prac- 
tically the entire transition probability. For the [001] 
direction, agreement of the two-shell calculation with 
the data is fair, the worst discrepancy occurring at 
around 60 G, where the calculated times are too long 
by almost a factor of 2. Nevertheless, the progression 
fom the one-shell to the two-shell approximation 

clearly indicates that the inclusion of further neighbors 
would push the calculated curve even closer to the 
experimental points. In this case too, the experimental 
numbers decisively corroborate our theory. For the 
[111] direction, the two-shell approximation yields 
times too long by about a factor of 2 over the entire 
observed range; the one-shell calculation yields times 
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too long by about a factor of 10. This is to be expected. 
'The x function for this case consists essentially of a 
single sharp peak at the origin. 

The contribution of the more distant neighbors is 
primarily in the neighborhood of the origin because of 
the 77? dependence of Aw. Where the near neighbors pro- 
duce no high-frequency peaks in x, the more distant 
atoms can be expected to alter the magnitude of the 
resultant Wer function, without appreciably altering 
its shape. To investigate this point, we have calculated 
C-R times for the [111] direction, extending the lattice 
to infinity, but replacing the lattice sums by integrals, 
exactly as in Eq. (46b) of part II. We have made the 
additional simplifying assumptions that all off-diagonal 
dipole matrix elements are given by gxi°6?/r* or 
grigr6?/7, and that all the Aw, are zero (i.e., x can be 
approximated by a single spike at the origin). The re- 
sults of this calculation are shown by the dotted line in 
Fig. 6, and are seen to be in astonishingly close agree- 
ment with experiment. 

Once again, our theory yields results consistent with 
Pershan’s LiF data. Since for LiF, the d function is 
Gaussian, our calculation and Pershan’s yield coin- 
cident results when x is concentrated at the origin. Our 
theory, however, appears to extend as well to an ex- 
planation of the process in the [001 ] direction, where x 
has a considerable spread in frequency space. 


III. SUMMARY 


: We have applied our cross-relaxation theory to the 
interpretation of two classical cross-relaxation experi- 
ments. Not only the shape but also the magnitude of the 
calculated cross-relaxation function is consistent with 
the experiments of Mims and Pershan. The dependence 
of Wer, in dilute systems, on the energy imbalance o; 
on the spin concentration, and on the exchange radius 
is confirmed in the interpretation of the Mims experi- 
ment. The dependence of W cr, in concentrated systems, 
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on interactions with near neighbors, and its associated 
spread and displacement in frequency is confirmed in 
the interpretation of Pershan's experiment. 


IV. COMPUTATIONAL NOTE 


This study has involved a sizeable mass of numerical 
detail. The generation and manipulation of large pair 
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CROSS RELAXATION. III .« — A1581 : 
matrices, the tabulation of the various x functions, the 
evaluation of different convolution integrals, to mention 
only the more obvious computational tasks, have had 
to be performed on a mass-production basis. The use of 
the Bell Telephone Laboratories IBM-7090 computer 
has proved indispensable. The computer programs were 
written by the author. 
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The susceptibility and magnetization of YIG of high purity have been measured from 4.2 to 650?K by 
means of a precision vacuum balance. The spontaneous magnetization has a saturation value of 37.90 
emu/g at 4.2°K and 27.40 emu/g at 292°K. The Curie point is at 559*K as determined by both the vanishing 
of the spontaneous moment and the discontinuity in the susceptibility curve. Using a program written for 
the IBM 7090, the molecular field coefficients were determined by fitting the experimental total magnetiza- 
tion curve. The sublattice magnetizations and the exchange interactions are calculated and compared with 
other results. On the basis of the molecular field model the intrasublattice interactions must be larger than 


previously supposed. 


INTRODUCTION 


CCURATE sublattice magnetization data are re- 

quired for an adequate description of such quanti- 
ties as the magnetic anisotropy and magnetostriction in 
ferrimagnetics. The first calculations of the sublattice 
magnetizations in the garnets were made by Pauthenet 
in 1957 based on the molecular field model.! Though his 
results have been the only ones available it has long 
been apparent that they could be improved upon by 
using purer samples and a better method of solving the 
molecular field equations. An alternative approach is to 
measure the magnetization of a sublattice indirectly 
by observing the magnetic resonance frequency or the 
Mossbauer absorption of nuclei situated in that sub- 
lattice. This was first done for YIG by Solomon? and 
Robert? and has been repeated by several others,t-? 


* This work is an excerpt from a thesis submitted to the 
University of Maryland in partial fulfillment of the requirements 
for the degree of Doctor of Philosophy. The research was per- 
formed at the U. S. Naval Ordnance Laboratory. 

1R. Pauthenet, Ann. Phys. (Paris) 3, 424 (1958). 

? I. Solomon, Compt. Rend. 251, 2675 (1960). 

3 C. Robert, Compt. Rend. 251, 2684 (1960). 

1G. K. Wertheim, Phys. Rev. Letters 4, 403 (1960); J. Appl. 
Phys. 32, 110S (1961). S: 

$C. Alf and o K. Wertheim, Bull. Am. Phys. Soc. 5, 428 

1960). 2 
‘ oe L. Boyd, L. J. Bruner, J. I. Budnick, and R. J. Blume, 
Bull. Am. Phys. Soc. 6, 159 (1961). 
7S. Ogawa and S. Morimoto, J. Phys. Soc. Japan 17, 654 
1962). * 
a TA D. Khoi and M. Buyle-Bodin, Compt. Rend. 253, 2514 
1961). 
‘ 9 D L. Boyd, V. L. Moruzzi, and J. S. Smart, J. Appl. Phys. 
34, 3049 (1963). 
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some of whose results will be discussed in a later sec- 
tion. It may be said here, however, that the NMR 
measurements have not been extended to sufficiently 
high temperatures to represent any real improvement 
over Pauthenet's results. 

In this present study the total spontaneous magneti- 
zation was obtained by subtracting the field-dependent 
magnetization from the measured values for high- 
purity YIG over the temperature range from 4.2 to 
650°K. These values of the spontaneous magnetization 
were fed into a program written for the IBM 7090 by 
Gerhard Heiche of this laboratory and the molecular 
field equations were solved for all temperatures. The 
molecular field coefficients computed in this manner are 
used to calculate the sublattice magnetizations and the 
exchange interaction energies for YIG. Similar results 
for the three sublattice garnets will be the subject of a 
separate paper. 


WON Mie ar ee RT RENTEN 


NOME RTT TR 


EXPERIMENTAL APPARATUS 


The technique used for the measurements r te i 
here is a modification of the Curie method. Briefly, : 
the sample is placed in a magnetic field having a large 
gradient and the force on the sample is measured by a 
sensitive balance. The essential unit is an automatic - 
vacuum balance and recorder which can weigh ac- 
curately to 3X 10-* g. A quartz sample holder 
pended from one pan of the balance so that the sai 


? P. Curie, Ann. Chim. Phys. (7) 5, 289 (1895). 
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is at the point where the magnetic field gradient is a 
maximum. The sample holder is protected by a vacuum- 
tight glass tube which may be either evacuated or 
subjected to a helium atmosphere at any desired pres- 
sure. For low-temperature measurements a stainless- 
steel double Dewar is used. High temperatures are 
achieved by using a noninductive wire-wound heater 
whose magnetoresistance coefficient is extremely low. 
The temperature of the sample is monitored by two 
thermocouples which are mounted close to the sample 
on the inside of the protective tube. In the interval from 
50 to 250°K the maximum temperature error is esti- 
mated at +0.5°C and above room temperature it is 
probably less than +0.2°C. The magnetic field is pro- 
vided by an electromagnet having parabolic pole 
pieces situated so as to produce a large gradient in the 
vertical direction. The maximum field attainable is 
10 kG and the corresponding gradient is 1241 G/cm 
at the optimum sample position. The latter was deter- 
mined by mapping the field of the magnet and choosing 
the point where 0?H/dz’=0. Ideally, if 0H/dz is the 
only nonzero derivative, then the force per gram of 
sample is given by 

F=c(0H/dz). (1) 


The gradient was determined for each value of magnet 
current from measurements on spectroscopically pure 
nickel (Johnson Matthey Code 890) using the value of 
54.39 emu/g for nickel at 20*C.!! The precision of this 
calibration is’ better than --0.1% and includes any 
errors due to sample weight, sample position, and field 
current setting. It also includes any uncertainties due 
To sample shape since spherical and nonspherical sam- 
ples were used to make the calibration runs. 


« SAMPLES 


YIG single crystals were grown from the melt by 
J. Richard Cunningham, Jr., of this laboratory using 


* 1 R. M. Bozorth, Ferromagnelism (D. Van Nostrand Company, 
Inc., New York, 1951), p. 867. 
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oxides of 99.99% purity. Polycrystalline samples were 
made by ceramic techniques using ultrapure yttrium 
oxide (Lindsay Code 1118). The single crystals were 
ground into spheres and were x-ray oriented so that the 
applied field could be directed along a [111] axis. 
Measurements on the polycrystalline samples were 
made on sintered powders as well as on spheres ground 
from toroids of high density. X-ray diffraction patterns 
were obtained for all of the polycrystalline materials 
using a Norelco high-angle goniometer spectrometer. 
The scanning rate was 1? per minute using unfiltered 
chromium radiation. None of the samples showed any 
unwanted phases or impurities and the powders ap- 
peared to be homogeneous and completely reacted. 


MOLECULAR FIELD THEORY OF YIG 


On the basis of the two sublattice model for ferri- 
magnetics !? the molecular field equations for YIG may 
be expressed as follows: 

O= Ta, 
847 75.70B5/2(3.359X 10-/ T) Ha; 
oa= 113.55B3/5(3.359X 10-4/T) Ha, 
H7 Naa8a— Nad d, 
Ha — Nadat Ndasa- (2) 

In Eqs. (2), o is the total magnetization, ca is the 
magnetization of the octahedral sublattice and c4 is 
the magnetization of the tetrahedral sublattice, all ex- 
pressed in emu/g. B(x) is the Brillouin function, T 
is the absolute temperature, and Ha,a is the molecular 


field in gauss. The molecular field coefficients sj are 
related to the 7; of Pauthenet! by 


Ng 92.25n5. 
'They may also be expressed in terms of the three 


12 L, Néel, Ann. Physik 3, 137 (1948). 
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Fic. 2. Spontaneous magnetiza- 
tion of YIG. The solid curve is 
the experimental curve of Fig. 1 
with data points omitted. The 
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exchange energies as follows: 
Naa= 5661J aa, 
Naa= 1887S aa, (3) 
Naa= 2821S aa, 


where the J,; are in units of cm™!. 

Equations (2) were solved by means of a computer 
program designed to determine the molecular field 
coefficients by fitting the experimentally determined 
total magnetization curve. 


EXPERIMENTAL RESULTS 


The spontaneous magnetization and the suscepti- 
bility of YIG are shown in Fig. 1 in units of emu/g. 
The value of 37.90 emu/g at absolute zero corresponds 
to slightly over 5 Bohr magnetons and is in excellent 
agreement with the value of 5.01 reported by Geller 
et al? This is about 5% higher than Pauthenet’s value 
and attests to the purity and homogeneity of these 
samples. The Curie temperature is 559°K and is 
indicated by the discontinuity in the susceptibility 
curve as well as the vanishing of the magnetic moment. 
There is little question about the shape of the magnetiza- 
tion curve as is evidenced by the smoothness of the 
data. The points shown in Fig. 1 represent only about 
one-fourth of those measured and the omitted points 
fit the curve just as smoothly. The total magnetization 
is again shown in Fig. 2, where it is compared with the 
data of Pauthenet and the NMR results. The latter, 
unfortunately, do not extend much above room tem- 
perature and thus shed no light on the region where 
the magnetization changes most rapidly. The broken 
curve in Fig. 2 is the molecular field solution obtained 


13S. Geller, H. J. Williams, R. C. Sherwood, and G. P. Espinosa, 
Phys. Chem. Solids 23, 1525 (1962). 
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by assuming that the a-a and d-d interactions are 
zero. Adjusting the a-d interaction coefficient to give 
the experimental Curie temperature of 559°K, one 
obtains the results shown in line 3 of Table I. However, 
the experimental curve (solid line) of Fig. 2 is best fit 
by the two sets of coefficients in lines 1 and 2 of Table I. 
The coefficients of line 1 were obtained by assuming 
them to be independent of temperature while those of 
line 2 were given Aléonard’s temperature dependence, 
namely, 


MT)=A(o) + (1—1.3X 10-47). 


The magnetization curve calculated from each of these 
sets of coefficients is extremely close to the experimental 
values. The relative error is generally less than a few 
tenths of a percent except near the Curie point. Two 
factors probably contribute to the latter: (1) The 
spontaneous magnetization in this region is quite small 
and the measured magnetization consists largely of the 
field-dependent contribution. Thus, a small error in 
the susceptibility will produce a large relative error in 
spontaneous moment. (2) The molecular field model 
neglects short-range order which is known to play an 
ever increasing role as the long-range order disappears. 


'TABLE I. Molecular field coefficients for 
YIG (J,; in cm). 


Aaa Aaa daa Ja Jon Ju TER 


Anderson 71505 47820 22304 25.36 845 11.86 559 
Anderson 71505 44200 20700 25.36 7.8 11.0 550 
Anderson 38 371 0 0 13.61 9 
Aléonard? 71100 34500 19500 25.2 


Pauthenet® 68500 32500 19700 243 
Wojtowicz 68 500 0 0 24.3 


* Reference 14. b Reference 1. 
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d DISCUSSION 


It is evident from Fig. 2 that the shape of the total 
- magnetization curve is not very sensitive to the values 
of the a-a and d-d interactions provided the e-d inter- 
action is constrained to satisfy the Curie temperature. 
"Thus, scattered data can be approximately fitted by a 
wide range of values for the coefficients. The curve in 
Fig. 1, however, is sufficiently smooth to give the 
unambiguous sets of molecular field coefficients of 
lines 1 and 2, Table I. These coefficients and the cal- 
culated exchange energies are compared with earlier 
values in the table. The tabulated values for the Curie 
temperature are all calculated values which were ob- 
tained from the solution of Eqs. (2) using the cor- 
sponding sets of coefficients. Note that the coefficients 
lines 1-3 are the only ones that give a Curie tempera- 
re which agrees with the experimental value of 
. Pauthenet obtained a Curie temperature of 
experimentally but his temperature-dependent 
ts predict a value 40° higher. 
g the results of spin-wave theory! to 
dispersion constant D and the Landau- 
ange constant A are related to the J’s 


=35;(5Jaa—8Jaa—3J aa)arh?. (4) 
= 3, 

8Taa—3J aa) ; 

Jas—3J aa)- 


1675 (1961). 
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A summary of the values of D and 4 that have been 
obtained by magnetization, specific heat!?-?? and micro- 
wave measurements??? js given in Table II. The values 
calculated from lines 1 and 2, Table I are smaller than 
the earlier magnetic results and are much less than 
those obtained from specific heat measurements. The 
discrepancy with the other magnetic results arises 


'TABLE II. Summary of exchange coefficients 


for YIG. 
5Jea—8Jaa—3Jaa — D(erg-cm?) A (erg/ cm) 
Magnetization 
Line 1, Table I 4.691075 erg 22.5X10- 11.8% 10-8 
Line 2, Table I 6.24 29.9 15.8 
Line 3, Table Y 13.5 64.6 34.1 
Pauthenet 8.9 43. 22. 
Aléonard 9.2 44. 23 
Wojtowicz 24.1 115. 61 
Specific heat 
Edmonds and 
Petersen 51. 27. 
Meyer and 
Harris 63. 33. 
Kunzler, Walker, : 
and Galt 83. 44. 
Shinozaki 83. 44. 
Microwave 
"Turner : 99. 53. 
LeCraw and 
Walker 96. 52: 


Ån 
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MOLECULAR 


FIELD MODEL AND MAGNETIZATION OF YIS A1585 
principally from the differences in the values for Jaa. Eeee n TE | 
The disagreement with the specific heat data is pro- a | 
bably due to differences in the spin wave and molecular | : Loans | 
field models. p ON | 

Figures 3 and 4 show the octahedral and tetrahedral a 
sublattice magnetizations for all temperatures as cal- E 
culated using the coefficients of line 1 (solid line) and ee 
line 3 (dashed line) of Table I. The dashed line is not — __ 
shown for the octahedral sublattice since it nearly © 
coincides with the solid curve. The calculations of EC 
Pauthenet and some of the NMR results are also s TIU PUPA AL 


shown. 

In Méssbauer and NMR experiments in magnetic 
materials it is assumed that the temperature dependence 
of the effective field at the nucleus is due entirely to 
the temperature dependence of the sublattice magneti- 
zation. This is probably not a bad assumption since 
it has been shown” that the hyperfine interaction is 
the dominant contributor to the effective field at the 
nucleus. 

The hyperfine term of the Hamiltonian may be 
written in the form 


H= — gifBil Ha, (5) 
Hets= (87/3) p (0) g8S (6) 


is the effective field seen by the nuclear spin. Since 
p(o) is nonzero only for s electrons, it is not obvious 
why Fe?*, which has no unpaired s electrons, should 
have an effective field at the nucleus. Sternheimer?* 
suggested that a net unpaired spin density at the 
nucleus is produced by a polarization of the core elec- 
trons due to the spin of the d electrons. If this core 
polarization is essentially constant up to the Curie 
temperature, then Hert and the hyperfine energy will 
show the same temperature dependence as the sub- 
lattice magnetization. That this assumption is valid, at 
least up to 430°K in YIG, is borne out by the NMR 
data in Figs. 3 and 4. 


where 


SUMMARY OF RESULTS 


(1) The saturation moment of YIG is 37.90 emu/g 
(5.01 Bohr magnetons) at absolute zero and 27.40 
emu/g (3.62 Bohr magnetons) at 292°K. The Curie 
temperature is 559°K. The magnetic susceptibility 
vanishes below 190°K. 

25R. E. Watson and A. J. Freeman, Phys. Rev. 120, 1125 


(1960); J. Appl. Phys. Suppl. 32, 118S (1961). 
2 R. Sternheimer, Phys. Rev. 86, 316 (1952). 
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Fic. 4. Magnetization of the tetrahedral sublattice of YIG. 
The solid curve was calculated using the coefficients in line 1 of 
Table I, the broken curve using line 3. 


(2) The experimental magnetization curve can be 
reproduced by means of the molecular field model. 
However, a unique set of interaction coefficients can- 
not be obtained by this method if there is much scatter 
in the data. 

(3) The sublattice magnetizations are less sensitive 
than the total magnetization to the exact values of the 
interaction coefficients. 

(4) The sublattice magnetizations reported here are 
in good agreement with available NMR results. 

(5) On the basis of this model the a-a interaction 
must be larger than previously supposed. 
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Nole added in proof. A. B. Harris [Phys. Rev. 132, 
2398 (1963) ] gives the following values for the molecular 


field coefficients of YIG: 4,4— 89 700, Maa = 35 950 and 


Aaa 11 980. However, this set of coefficients results in 
a Curie temperature that is greater than 900°K. > 
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A previous rotationally invariant perturbation tre 


neighboring sites interacting via direct d-d exchange 
alignment of two localized moments is never 


perturbed d-state energy (depending upon the exchange interactions) only 


atment of localized d states is extended to the case of 
and indirect s-d scattering. It is found that the parallel 
the ground state of the pair. For certain values of the un- 


one of the sites will be occupied 


by a d electron so the pair can have a net moment. If the conditions are such that both sites are occupied by 
d electrons, these electrons must be aligned antiparallel so there is only a small net moment due to scattering 


to and from the band states. 


I. INTRODUCTION 


N a previous paper, a rotationally invariant pertur- 
bation calculation of the susceptibility of a localized 
d state in a metal was made, using a model of this state 
proposed by Anderson. In this model the "d state” is 
assumed to have no orbital degeneracy, and to have a 
wave function distinct from the Wannier functions of 
electrons on the sites of the host lattice. When this 
state is singly occupied, its energy is ea, when doubly 
occupied by electrons with antiparallel spins, is 2Zeat+U, 
where U is positive. The Hamiltonian of the model 
consists of the band electron energies, the aforemen- 
3 tioned “d state” energies, and a scattering term capable 
; of converting electrons on the local “d” site into band 
: electrons and conversely. No other terms (such as 
: Coulomb interaction between conduction electrons and 
| electrons on the ‘‘d” site) are included. Physical reasons 
for restriction of the energies to the ones mentioned 
above, are given by Anderson? Within the framework 
of this model, he showed that, provided the scattering 
to the band was not too strong relative to the correla- 
tion energy U, and provided e4 was below the Fermi 
surface and 2ez4- U above, a localized magnetic moment 
would result, which could be calculated by the Hartree- 
Fock method. Since this method violates rotational 
invariance, it is hard to see how one can use it to com- 
pute a Curie constant as distinct from a saturation 
magnetization. 
In SFI it was shown how this calculation may be done 
by perturbation theory. The method divided the possible 
States of the system into magnetic and nonmagnetic 
ones, and calculated two partition functions Sa, 3s in 
these two manifolds. The Curie constant then has the 


* Supported in part by the U. S. Air Force Office of Scientific 
Research. 

VH. Suhl and D. R. Fredkin, Phys. Rev. 131, 1063 (1963) 
¢hereafter called SFI). 

2 p. W. Anderson, Phys. Rev. 124, 41 (1961). 
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form 


Car Cyl Bu/(8ut+3s)]- 


The actual calculation used the resolvent technique, 
deriving resolvents Yar and Jts for the two manifolds 
with the help of projection operators. These resolvents 
were then evaluated by perturbation theory, keeping 
only the lowest order self-energy corrections and, in 
fact, only their imaginary parts. The crucial point is 
the following: If these imaginary parts (due, of course, 
to the band scattering) were neglected, the partition 
function 3s of the nonmagnetic manifold would be 
smaller than 3y by a factor e®4, for the “empty” 
component of the nonmagnetic manifold, and by a 
factor e-&G«^U) for the “filled” component of that 
manifold. (Energies are measured from the Fermi level.) 
Hence, at very low temperatures, the magnetic moment 
would be essentially that of the isolated "d" state 
without scattering to the band. The imaginary parts of 
the self-energies appropriate to 3s, which are in effect 
the damping rates of the states of S due to the presence 
of M, cause 3s to assume the same exponential depend- 
ence on the temperature as that of Sj. For this reason 
there is always a reduction of the moment. Qualita- 
tively, the results agreed with those of Anderson, except 
no sharp cutoff was found in the moment, which tended 
to zero smoothly with increasing scattering. This might 
be a basic feature of perturbation theory, unless 
prescriptions are found for solving for the self-energies 
consistently in the way in which this is done in other 
many body problems. The complicating feature here; 
which makes the standard recipes hard to apply, is that 
the *d"-state correlation energy is treated rigorously 
as part of the unperturbed propagator. 

In the case of two d states, we distinguish between 
two situations: (1) The d sites are nearest neighbors and 
electrons on them interact directly, and (2) the d sites 
are far apart and interact only via band electrons. 

In case (1), we must include in the Hamiltonian an 
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MANY-BODY THEORY OF LOCALIZED d STATES 


essentially single-particle scattering between electrons 
on the two “d” sites, In addition, we will show that the 
direct Coulomb interaction between electrons on these 
sites may become important even if smaller than the 
former, provided its exchange part has ferromagnetic 
sign. To these interactions we must, of course, add the 
same Hamiltonians as in SFI, one for each site. In case 


_ (2), all direct d interaction is neglected and only the 


sum of the two Hamiltonians of SFI remains. 

Problem (1) has also been treated in Hartree-Fock 
approximation by Alexander and Anderson,’ who con- 
clude that the pair aligns antiparallel with zero net 
moment unless one of the energies e; or 2ea+U is close 
to the Fermi level. In that event, they can obtain 
parallel coupling, and therefore a net moment. As we 
shall see, perturbation theory parallels these conclusions 
somewhat. 


II. RESOLVENT METHOD 


We will treat the “d”? pair Hamiltonian rigorously, 
and, as in SFI, only the band scattering as a perturba- 
tion. (A simultaneous treatment of both these mechan- 
isms will be briefly considered at the end of this paper.) 
Then we may divide the unperturbed states of the 
system into three manifolds, broadly describable as 
singlet, doublet, and triplet, and the effective Curie 
constant, by a trivial generalization of the formula in 
SFI as* - 

Cior-- Cis 3p 


Byte dL 


where C, and Cj; are the “bare”? Curie constants 
appropriate to a free spin one and a free spin 2, respec- 
tively, and 37, 3p, 3s are the partition functions of the 
three manifolds. These manifolds, split into submani- 
folds, which may be classified according to their origin 


(1) 


Ceu 


` before the single particle scattering between the two 


sites is turned on. The result is as follows: 


(S) One electron on each site, the pair forming a 
singlet. Two antiparallel electrons on one site, 
the other site empty. Both sites doubly occupied 
by antiparallel electrons. Both sites empty. 

(D) One electron on either site, the other empty. 
Two antiparallel electrons on one site, the other 
singly occupied. 

(T) One electron on each site, the pair forming a 
triplet. 


Including the possible S; states in D and T, this gives a 
total of 16 unperturbed basis states. When the single- 


(19 A Alexander and P. W. Anderson, Phys. Rev. 133, A1594 
‘Tt should be emphasized again that formulas like (1) and (7a) 
of SFI give only the thermal expectation value of nat na} in the 
case of (7a) SFI, and of the total moment on the pair of d sites 
alone in the case of (1) of the present paper. Any polarization 
effects of the conduction electrons over and above their back- 
ground Pauli moment are not determined by these formulas. 


IN METALS  A1587 
particle scattering between the two sites is turned on, 
some of these form admixtures. The d part of the 
Hamiltonian (exclusive of the band scattering) is 


a= 2, eai, Hnos) +U (tma d matita) 
+V ie D (dyo*dag+doe*d yy) , (2) 


where d*, d are creation and annihilation operators on 
the sites indicated by the subscripts, « is the spin 
index and the ws are occupation numbers on the 
indicated sites. Coulomb interation is neglected for 
the present. Under the Vız term in Eq. (2), the sub- 
manifolds enumerated above (linearly combined where 
required) combine and regroup as in the following 
table. In the ket symbols, the first and second places 
denote occupancies of sites 1 and 2, respectively. The 
results are given in Table I. 

The low-temperature behavior of the partition 
functions of all these manifolds is dictated by that of 
the partition function of the energetically lowest 
manifold. (Just as in SFI, the others are “damped” by 
lowest manifold and so acquire its temperature depend- 
ence.) For this reason we need only determine which 
energy is lowest as the parameters ej, U and Viz are 
varied. 

We assume e4,«0, 2ea+U>0 so that the isolated 
sites favor single occupancy let Vi?70. If Viz»3U, 
the E(S,1) is always lowest. If Vis 4U, then E(S m1) 
or E(D;*) is lowest according to whether 


— «Vif 1—8(V 12/U)L14+ A+ 6Vi£z/U*)) F3. 


If Vi2<0, the same result holds, but with Vi replaced 
by |Vis] and D,+ by Dr. Presumably for reasonably 
small Vi» the situation with singlet lowest is more 
common. Only when e, approaches the Fermi level to 
within ~ Vi», can the doublet energy become lowest. 

It follows that in the common situation, any moment 
exhibited by the pair is due to band scattering. For 
small values of the scattering energy V, such moment 
as there is comes principally from 3p, since the doublet 
manifold can reach the singlet manifold by lowest order 
damping involving V?. The triplet manifold connects to 
the singlet only in order V*. Qualitatively, this agrees 
with the Hartree-Fock procedure of Alexander and 
Anderson, who find no moment at all unless! Vis] ~ | eal. 

The triplet energy is always above E(S,i). It is 
also below E(D;*), unless 


Vi — eu> Vif 1—8(V34/U) 
X[14- A+ (16V,2/02))]-$ 7 


in which case E(D;*) lies between the singlet S, and 
the triplet. In that case the triplet partition function 
acquires the exponential temperature dependence of 
that of the doublet, but at temperatures such that AT 


is much less than a quantity of order Vi2/U, the — 


damping process still favors the singlet. 
^ 
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"TABLE I. Energies of various states.® 


AND H. 


SUHL 


States 


Energies 


Singlets 


both sites empty, Se= |0; 0) 
both sites filled, 5,2 111; Tl) 

“polar” singlet; Sp= Q)*(11150)— 105 10) 
“mixed” singlets 


Si (D "pa (115 D— 15 DHT- 10,11] 
Smo= QyhE- (1; D— ll; D) +e (110) +101) J 


0 
4eq--2U. 
2ea--U 


2«4— U sinh?(0/2) 
Qea+U cosh*(0/2) 


Doublets 


one-electron doublets; Di* = (2)? (11,0) 10,0) 
three-electron doublets: D+ = (2) (115, 1)2: 1,1) 


and two similar pairs with the spins reversed. 


eae XU sinhd 
3eq--U7FXU sinh6 


Triplets 
ID 2a 
T= 2)3*(1; D--1 1) 2ea 
Is) 2«a 


a In this table sinhü 24Viz/U, a = —coth(0/2)/[1 -Fcoth2(0/2) ]?, and b —1/[1 -Fcoth?(0/2) ]/*. For brevity, the energies of these states Sy, Sp, etc. 


will be denoted by E(Sys), E(Sp), etc. 


Finally, when E(Di*) is also below E(Sm1) (see 
inequality), the susceptibility for small band scattering 
is dominated by spin 4 with V? corrections from spin 1, 
and spin 0. This is the situation in which Alexander and 
Anderson find triplet coupling favored. In the present 
theory no such coupling ever occurs. 

However, some caution is indicated by the fact that 
in the present model the singlet-triplet separation is 
only of order Vi1.?/U. If a Coulomb interaction between 
the electrons on the d sites is added, and its exchange 
energy W, say, turns out to have ferromagnetic sign, 
the above conclusions become invalid as soon as W 
exceeds Vı2/U. The triplet would then tend to be 
lowest, and the susceptibility would be dominated by 
spin one. 


HI. EVALUATION OF THE PARTITION FUNCTIONS 


We use the resolvent technique (as in SFI) for 
evaluation of the partition function. The complete 
Hamiltonian is taken to be 


go DE EkCr,o' Cko 
kyo 


2 
+y DD D» (etre, o dio te di Jc o) - 90a 


k,c i=] 
=R tu F V= 3054-0, (3) 


shere 304 is the Hamiltonian of the “d” states alone, 
<tr are the position vectors of the sites, and U is the 
band scattering operator. The resolvent operator 
J= (26—2)-!, could be projected onto the unperturbed 
singlet, doublet, and triplet manifolds as in SFI. The 
result will involve “nested” propagators and have an 
- ,e&tremely complicated appearance, each propagator 
involving the manifold one more remote than its 


^ 


[2 


predecessor. We have calculated one of these nested 
expressions in the approximation in which, at most, two 
conduction electrons or holes are excited in the inter- 
mediate states. 

The resolvent for the D, state (which exhibits the 
behavior of resolvents of other states as well), with the 
denominator evaluated to first order in A (A—pV?, 
p= density of states), may be written as 


Jibi - LEQD9 -2- Tetil, (4) 
where ô is an infinitesimal, 
KDVD m c. KPVS m 
ETa © E(S)-:—« 
(1| 0 [52 |?(1—22) 
p | A 


2 


and X} s’ means the sum over the three singlet states 
with two d electrons, Sp, Smi, Sm2- The matrix elements 
may be evaluated with the aid of the wave functions 
derived above. For example, N| (D| U| Sm) =YV" 
Xcosk.r, N|(Dt |V| T) | 3V? sin?k-r, where y4=1 
-E[AVis/ (U?2-16V332))/2]. The summation over k is 
somewhat complicated by the presence of sin?k-r (etc-). 
Since « is isotropic, the angular integration will give 


sin2kr 
sin?k-r 2kr Lm 
fal [zr =?r| |. 
cos?k-r sin2kr py 
1+ 
2kr 


If we assume 


| e(dk/dE) p | «max (kr, 1/7) , 
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then 
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sin2kr cos2kpr sin2[ e(dk/dE)er H-cos2[e(dk/dE)pr |sin2ker sin2ker 


2kr 2[kp-F- e (dR/dE)p ]r Qkpr 


Then u4 becomes 
o4 7 1+ (sin2krr/2ker). 
Now we will neglect the real part of »7,77, and calculate 
(2/A)Im 2 Tr=3n E(T)—z]æ- 
-FEuEE(S;) —z Ja--nLE(S m) —2 Javy— 
+nLE(S m2)—2 loyy4+2n[—z]a,, (5) 


where n[z] is the Fermi distribution. Following the 
procedure of SFI, the partition function may be 
written as 


8»e- | Im9f(z)e-8:dz 
is e [84- AI (1) Je-*tdt 
=en | SEE 
— CEA OT 


where 


1 
I()—Im?, T, 
A k 


and we have substituted (+ E(Di*) — z. 

I(t) will be infinitesimally small unless ¢>E(Sm1) 
—E(D;*), and the integrand of Eq. (6) will be infinites- 
imally small unless ¢<0. The problem then requires 
consideration of two separate cases depending on the 
sign of E(Sm1)— E(D1*). 


A. Magnetic: Es E(Ss1) — E(D1*) 0 


In this case the integrand behaves like a delta 
function near ¢=0, in the limit à—0. The 7 (f) term in 
the numerator can only contribute for ¢>Zo, and, 
since e~#=*<1, this contribution is negligible. (For large 
negative values of ¢ the integrand could become large 
because of the exponential factor, even compensating 
for the small factor 6. However, either the band bottom 
or the finite spatial extent of the impurity potential V 
will introduce a cutoff before this happens.) The parti- 
tion function for the magnetic case is now simply 


Bote Perr, (7) 


B. Nonmagnetic: Eo= E(Sm)—E(D1*) «0 


I(t) is now finite at ¢=0 so there is no contribution 
from the ô in the numerator in the limit 6-0. If we 
make the transformation 


€ in(a—t)- e ?on(t— a), 


then we immediately see that the largest term in e #7 ) 
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Hence, the partition function may be written 


Eo f 
ps i 
The /(f) does not contribute to the denominator 
because we are integrating over that portion of the ¢ 
axis where /(¢)=0. The straightforward evaluation of 
the integral then gives 


Boyt = —(A/2bEo)y-ae fE Sm), (8) 


Calculation of Curie constant. We could proceed to 
evaluate all the other partition functions in the same 
manner. However, examination of Eqs. (7) and (8) and 
the analogous results in SFI will enable us to write the 
following set of simplifying rules for the calculation of 
the partition functions: 


(1) All partition functions have the form 4,e ?£', 
where A; is independent of temperature, £,;=min 
X[E(Sm1), E(Dr*)], and n; is an integer indicating 
the multiplicity of the state. 

(2) The partition function for the state of lowest 
energy has A=1. 

(3) For any state coupled to the lowest state to 
first order in U (states coupled in higher order are 
neglected in the present paper), 


As (An os)/ (5, — Ei). 


The a and y factors are determined from the square of 
the appropriate coupling matrix element. 
Applying these rules, we can calculate the effective 
Curie constant from Eq. (1) for the two cases. 
Nonmagnetic: Eo= E(S 1) — E(D1*) «0 


Cei CiiF/ (434-F), (9) 
where 
A ay Y+ ; 
nA l 
2LE(Di)—E(S.) E(DT)—E(Smı) 


en ME 
E(Dj)—E(Sa) E(Di-)—E(Sa1) d 


The labelings of the energies in the denominators of F x 


indicate the four levels to which Sm1 is coupled in frst 
order. es 


^ 
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Magnetic: Ey E(Sm)— E(D1*) 20 


3C Aas. | 


Ca Cyt 
i | "ET ZEECD)- EQD:9] 


" “| Y+ ik Y-a- 
x ze 
| ILESE) E(Sm)— E(Ds*) 
j 3 -1 
me | —— || . (10) 
E(Sj—EQ;9 ETE! 


Each doublet state is actually counted twice, but 
since all the terms represent doublets, the factors of 
two cancel out. The accuracy of these expressions 
requires A«|E(S,)—E(Di*)|. In. the region of 
transition from magnetic to nonmagnetic, the two 
levels cross and this inequality cannot hold. We must 
therefore do a more detailed calculation involving only 
the S,,1 and D;* levels. 


IV. ZERO TEMPERATURE LEVEL SPLITTING 


When the 5,51 and D;* levels are close together, we 
are permitted to rediagonalize them with respect to U 
and ignore the other levels. In this section we will there- 
fore assume |E(D;*)— E(S43)| «| E(T) — E(D1)], 
[E(Sm2)—E(D1*)|, etc. For E(Sm)<E(Di*) it is 
appropriate to consider a two-electron manifold. We 
take the wave function for this restricted manifold to 
be of the form 


V-Y ara Di*)es FUE); (11) 


with the coefficients normalized so that 


D | Ane 2(1—1,)+|b|?=1. 


The renormalized eigenvalues are found by minimizing 
the energy [calculated with Eq. (11) as wave function], 
minus the normalization condition multiplied by A, 
with respect to a; and b. This procedure gives the 
equations 


[EDH eA Jat V (y-04/2)! 5 —0, 


12) 
V(y-o,/2) ue x ax(1 eT 779) + LE (Smi) ei aJo =0 , 
determining the new energies as solutions of 
1—1n& 
e =)+E(Smi)=3 Vy- $,—— ——————. (43) 
E à E(Di*)d-e,.—À 


Tt should be noted that we have actually renormalized 
only the states Smı and Dıt accompanied by an electron. 
Tf E(Sm1)> (Ds), the appropriate manifold contains 
only one-electron states. The wave function is then of 
the form 

P=a|Dy)+>d br) Smer. (14) 


L 


AND H. 


SUHL 


This leads to the characteristic equation 


ny 
NEE (Dav aC, ccce is 
(Di "T ASESOR e) 


which renormalizes the states D1* and S'mı accompanied 
by a hole. ae 
It can be seen at once that for | E(D1*) — E(S n1)|>>A, 
we have X= E(S,,1), X= E(Di*). Using this value of A, 
and the normalization condition we find 
V (y-a4/2) 1/2 
a= " 
(1-2- FP PLE(SS1) —E(D1*)— e] 


1—n 


where 


F = i V?y a4. xD i 
x [E(Sn)— E(Di*) er 


If the summation is performed, it is readily seen that F” 
is the same as the first term of the F used in Eq. (9). 
With this value of a;, we can obtain the result 


Cert — Cua $a =k) (16) 
k 


which is the same as Eq. (9), except that the terms 
involving levels other than Smı and Di* are missing. 
Hence, the choice of the wave function given by Eq. (11) 
corresponds to choosing E(Smı)<E(Dr*), as would be 
expected. 

By the same reasoning, we can use M— E(Di*) to 
find the coefficient a for the wave function given by 
Eq. (14). In this case 


Cott= Crea”, (17) 


which is easily seen to correspond to Eq. (10), with the 
terms involving states other than Smı and D;*+ removed. 
In order to determine the energy eigenvalues M when 
the states are close together, we rewrite Eq. (15) as 


Ty 


4 =V y X (18) 
k y'— ex Ly 
where 
y —E(Dy)—X. 


For positive Eo, Eq. (17) will always have one positive 
root, which gives the shift in energy of the D;* state 
when it is lowest. If we approximate the density of 
states by a parabolic band starting at k=0, then the 
sum can be converted to an integral from k=O to 
k=kr which can be easily evaluated. If |y-I-Eo| «&Er, 
then the resulting equation simplifies to 


a 1 4Er 1 
y =3V? In -—| : 
2Erg y +E, Er 


(18a) 


For a convenient numerical example, the parameters 
3V?/Eg?— 10-4, Eo=0 give the solution 


y'=3.6X 10-* Er. 
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The solution of Eq. (13) is quite similar, provided 
certain approximations are made. The & summation is 
over all electron states above the Fermi surface, so a 
simple parabolic density would cause the integral to 
diverge. This divergence is always stopped by the 
actual high-momentum cutoff of the interaction poten- 
tial V (which has here been assumed independent of £). 
For purposes of simplicity we will assume the density 
of conduction electron states to be symmetric with 
respect to the Fermi surface, although this is actually 
inconsistent with the assumption of a parabolic band. 
If we change variable to y= E(S,,1)—2A, then Eq. (13) 
becomes 


1—7 ny 


e ahve 
mena ym wae 
k y—er— Eo 


m o — —— 
k y+er— Eo 


For Es«0 (singlet lowest) this always has a positive 
root, and the result of the summation is the same as 
Eq. (18a) with the substitutions y — y, £o = — Ev. 
Hence, in this approximation, the unperturbed crossing 
point remains unshifted. Using the value of y’ calculated 
from Eqs. (18a), (16) gives 


C; 2 
Cer Say sh. (iom 
1+[Ay_-o;/2(y'+ Eo) ] 
and 
CipAy-a+/2(y— Eo) X 
eff Eo<0. (19b) 


iy GSE i 


The only restriction on the size of A is that A be very 
much smaller than the “distance” between E(S m1) or 
E(D,*) and any of the neglected states. There is, 
however, a limitation on the size of the correction term 
since ln4Er/y>1, Eq. (18a) shows that y, y’>V?/Er 
=A (for anything resembling a parabolic band), so the 
A terms never become larger than one. This implies 
that the effective moment is always larger for E90 
than for Eo« 0, in agreement with the previous distinc- 
tion between magnetic and nonmagnetic cases. In 
general in this order there will be a sharp discontinuity 
(but no longer an infinity) in the moment at the 
boundary between the two regimes. 
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Equation (18a) will permit positive solutions for y' 
for slightly negative My. These solutions are slightly 
larger than for small positive values of £o However, 
this does not mean that the corrected energies do not 
cross. The increase in y’ is only logarithmic in £s while 
the unperturbed separation of energies is linear in Eo 
and therefore the latter is dominant. 


V. LONG-RANGE EFFECT 

If the two impurities are separated by many lattice 
spacings, the d-d exchange interaction Vis negligible. 
In this case, the s-d scattering will dominate, and the 
resolvent can be calculated with respect to a simpler set 
of wave functions than those used above. For the 
magnetic case (ea<0; ea+U>0) the unperturbed 
ground state consists of single occupancy of the d level 
of each site. The two electrons can form a spin singlet or 
triplet, and these two levels will be degenerate in energy 
(to zero order in V). In order to determine whether the 
pair is magnetic or nonmagnetic, we must examine the 
manner in which the s-d interaction removes the 
degeneracy. 

In the unperturbed representation, the triplet and 
completely empty and filled states are the same as 
were used previously. The other states are: 


State Energy 
$1211150) 2e4- U 
52710; 1D) Zeat U 
Sp (DEUN Y= Ths 1) Zea 

Di 2—1150), 10; 7) €4 
D= ii Ts DD Seat U 


S,, and T are the degenerate unperturbed ground states. 
As far as s-d scattering is concerned, the only difference 
between the Sm and T states is that the T state (because 
of the exclusion principle) will not allow a scattering 
from the d to the conduction band and back to the d of 
the opposite site. 

The lowest order terms to show this effect will involve 
coupling the S, or T to the Sı or Sz via Di or D; 
intermediate states. For example, the interaction term 
for the sequence $,— D; —5,— Dj) — Sn and 
T Dj = Si,— Di >T is 


1342 Tr (cwr tew t 8 Chr a Cnt) (Cr tC E C ex *) cos(k+k'—k”—k”)-r 


; (20) 


(-- ea— zt €k— ewt ewr) (2e4- U — 3H ex — €x) (ea— 2+ €x) 


where the upper (+) signs refer to the Sm sequence and 
the lower (—) signs refer to the T sequence. Taking the 
trace of the c; operator product gives 


[s (1— 18) H- 04 (1— ri) JD 
FE — moa)2e 0 mea) JE Q0 — et) & met) 
X bearer. (21) 


The first term of this expression is the same for both 
Sm and T; hence, it does not help remove the degen- 
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eracy. It is also independent of position because the 6 
function conditions make the argument of the cos in 


Eq. (20) equal to zero. The second term will give zero 


in the T state unless the conduction electrons are 
polarized. We will therefore consider only that part of 
Eq. (20) which is due to the second term of Eq. (21). 
If we assume the density of conduction electrons to be 


isotropic and sum over £', k’, average the directiong of - 
k and k”, and replace k” by k’ for simplicity, then wè 


o 


e 
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eee ^ 
-VJ : 
8? 2e«4-U-—s 
where E 
y (1—7;)sin2kr 
> E 
k k(ea— ster) 


(It should be noted that this sum is really no longer 
discrete because the angle has been averaged out.) 
There are three other sequences of this class, and they 
are obtained by substituting Ds and D; in one or both 
places. The result of adding all four terms together is 


ysv2 (fFe) 
T= = 
8r? 2e H U— z 


where : 
ny; sin2kr 


ce . 
k R(3ea+U—2—ex) 

The main contribution to the partition function 
integral will come from z close to 2e4. Both f and g are 
real in this region. If we make the approximations 


ker>1, —(ream/hkr)>1, 
and neglect terms which are exponentially small in 
these quantities, then an asymptotic approximation 
gives 
ps CoS2krr 


ps cos2krr 
È g= i 
rm(3ea HU — 2) 


rm(ea— z) : 


Now 7 only occurs for S,1, and for zz2e4 we find 
720. Hence, the Sn: is lowered with respect to the T 
state. 


VI. SIMULTANEOUS CONSIDERATION OF d-d 
EXCHANGE AND BAND SCATTERING 


The above calculations seem to indicate that the 
singlet (351) is always lower than the triplet (T). In 
order to show that this is not due to our perturbation 
treatment for the band scattering, we have performed 
the diagonalization procedure of Sec. II, with the 
additional inclusion of all those states coupled by V. 
(This will be exact in the two-electron manifold.) The 
trial function consists of the four possible states with 
two d electrons with opposite spins (di1*ds*, diy*dot™, 
dıt*dıs*, d2t*d2,*), the four possible states with one 
electron and one conduction electron (dit*cxi, dot*cx1™, 

"dicent, doy*cx*, with the conduction electron aver- 


GOTTLIEB AND H. 
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aged over & states as in Sec. IV), and finally the four 
possible states with three d electrons and one conduction 
hole (dii*d21*d21 "Crt, didi dye, day dos dj, 
doy*dit*dys*ext, With the conduction holes averaged 
over k). Using the method of Sec. IV we can diagonalize 
the energy by means of the equations obtained from the 
energy by differentiating with respect to each of the 
twelve coefficients of the trial states (remembering 
that eight of these coefficients depend upon &). The 
solutions are rather cumbersome, but if we assume 
kpr>>1, then the four states with two d electrons 
combine in a manner similar to that found in Sec. II. 
The wave functions and energy eigenvalues have the 
same form as Sp, Sm Sw» T (S.=0) with the 
modifications 


> 1—7 
Vi Vi =V {it y? ——————- 
D : i k (ea - € —A)?— W? 


Wy. 
TV? 2. ] 
k (Seat U— e,—N) We 


and 


(eat+e.—A) (1 — 14) 
ry ee N 
(Zea t+U—A) nx 
* (Bet U— eA} Wt. 


cea => eg! = ea— V? 


=A 


As in Sec. IV, A is simply the corrected energy of the 
state under consideration. Since these corrections have 
the same form for both Smı and T, it is apparent that 
they have no effect on the previous conclusion that 
E(Sm1)<E(T). 


VII. CONCLUSION 


The calculations of this paper indicate that in 
perturbation theory neither the d-d exchange, the s-d 
scattering, nor any combination of the two will produce 
a parallel alignment of a pair of localized moments. 
The analysis does, however, show that the pair can 
show a sizable magnetic moment provided that the 
d-d exchange is such that E(D,+)<E(Smi). The condi- 
tions required for this relation are similar to those 
required by Alexander and Anderson? for parallel 
alignment. This suggests that the two methods may be 
equivalent, even though the representation of Alexander 
and Anderson? does not characterize the parallel align- 
ment as either doublet or triplet. 
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Low-Temperature Magnetoresistance in Magnesium and Aluminum 
Containing Small Concentrations of Manganese or Iron* 
F. T. HEpGcock AND Y. Murof 
The Franklin Institute Laboratories for Research and Development, Philadelphia, Pennsylvania 
up (Received 20 January 1964) 
Magnetoresistance measurements in magnetic fields up to 21 kOe have been made on Mg-Mn, Mg-Cd, Mg- j 
: 


Al, Al-Mn, and Al-Fe alloys in the temperature region of liquid helium. Magnesium alloys containing more 
"lv than 0.1-at.% Mn which exhibit a resistance maximum and minimum in zero field, show a negative mag- 
netoresistance, whereas the more dilute samples (0.001-0.1 at. % Mn) show a positive magnetoresistance, 
the magnitude of which decreases with decreasing temperature. The magnesium alloys containing non- 
transition element impurities, as well as the aluminum alloys containing transition metal impurities, are 
found to obey Kohler’s rule. From an analysis of these data it is found that the magnetoresistivity of a dilute 
alloy of magnesium containing manganese, can be considered as the sum of a normal positive magneto- 
resistivity (obeying Kohler's rule) and an anomalous term which is negative in sign, does not obey Kohler's 
ur rule and is presumably due to a magnetic scattering of the conduction electrons. Using values of the s-d 
exchange integral and the Coulomb scattering integral derived from an analysis of the zero-field resistivity 
permits an explanation of the magnetoresistivity based on Kasuya's theory, at temperatures near the Néel 
point. 


1. INTRODUCTION 


UMEROUS studies made on dilute monovalent 
and divalent!? metals containing paramagnetic 
impurities have indicated a negative magnetoresistance 


present work are studies on aluminum-manganese and 
aluminum-iron alloys where the magnetoresistivity is | 
studied to fields of 21 kOe. | 


i 
] N^ magnetic fields of$ 1.5 kOe and no anomalies were 
| found. In the present paper we wish to report some Conca 
l à magnetoresistance studies on Mg alloys iu to Alloy or Sample p EU 
Ja exhibit both a resistance minimum and a maximum’ as metal Dumb (at.%) pue a2 cm 
: well as measurements on normal magnesium alloys 3 
i | exhibiting no resistance anomalies. Also included in the S ae Gade ooon ree 
> f 87813 0.01 0.0111 4.28 
| ! * This work was supported by Aeronautical Systems Division, 90911 0.08 0.1231 4.54 
j Wright-Patterson Air Force Base, Ohio, under Contract AF 91081 (a) 0.35 0.3188 5.47 
2 i 33(657)-8744 and the U. S. Office of Naval Research under Con- 91081 (b) 0.74 0.5140 6.83 
| tract NONR-3644(00). MgCd 396 0.55 0.0955 4.55 
1 On leave of absence from The Research Institute for Iron, 397 1.01 0.1369 4.82 
: Steel, and other Metals, Tohoku University, Sendai, Japan. MgAI 400 0.29 0.1242 4.71 
1A. N. Gerritsen, Physica 25, 489 (1959) (a tabulation of pue o toes 7a 5 
experimental results up to 1959). : a 3 é > 
4G. J. van den Berg and J. de Nobel, J. Phys. Radium 23, 665 Pure Al GKP e 0.00187 ee oa 
(1962) (a recent review). AIMn GKO 0.045 0.1209 kieta 
AlFe GKK 0.02 0.0045 ELE 


at low temperature when the alloy exhibits both a low 
temperature resistance minimum and maximum. Recent 
studies on the? Cu-Fe and‘ Zn-Mn known to exhibit 
only a minimum down to the lowest temperatures 
available indicate the presence of a negative magneto- 
resistance in the more concentrated samples. A mag- 
nesium alloy containing small concentrations of manga- 
nese has been shown? to exhibit deviations from the 
Kohler rule, but no conclusive evidence for a negative 
magnetoresistance was found even at the lowest tem- 
perature (1.3°K) and the highest magnetic field avail- 
able (25 kOe). Magnetoresistance studies on dilute 
aluminum alloys containing either manganese or iron 
have been carried out at liquid-helium temperatures at 


(1994) Muto, K. Noto and F. T. Hedgcock, Can J. Phys. 42, 15 
* Y. Muto, Sci. Rept. Res. Inst. Tohoku Univ. (A) 13, 1 (1961). 
* R. T. Webber, Phys. Rev. 105, 1437 (1957). E 
€ J. G. Thomas and E. Mendoza, Phil. Mag. 43, 900 (1952). 
7 G. Gaudet, F. T. Hedgcock, G. Lamarche, and E. W. Walling- 
ford, Can. J. Phys. 38, 1134 (1960). 
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2. SPECIMEN PREPARATION AND 
EXPERIMENTAL METHOD 


The dilute Mg-Mn, Al-Mn and AI-Fe alloys used in 
the present investigation are the same as those used 
earlier in the studies of the electrical and magnetic 
properties of these systems."-? The high-concentration 
Mg-Mn alloys were prepared by quenching from near 
the melting point after the samples had received a high- 
temperature homogenizing anneal. The manganese 
concentration of each Mg alloy was estimated from the 


Taste I. Alloy number, solute concentration, resistance ratio 
and resistivity at 273°K for all samples studied. Sample number 
refers to detailed spectroscopic analysis given in Refs. 8 and 12. 


8F. T. Hedgcock, W. B. Muir, and E. W. Wallingford, Can. J. 
Phys. 3$, 376 (1960). 


(1962). 


*E. W. Collings and F. T. Hedgcock, Phys. Rev. 126, 1654 — 
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Fic. 1. (a) Resistivity of some 
Mg-Mn alloys as a function of tem- 
perature in a constant magnetic field. 
O: 0 kOe; O: 9.93 kOe; A: 20.9 kOe. 
(b) Anisotropy in the magnetoresistiv- 
ity at 4.2°K for pure Mg and the most 
dilute Mg-Mn samples. The angle 
axis was selected to obtain agreement 
in the minimum value of Aprmmax/ 
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p4.2°x for both samples. ^ prrmax means 
Ap at the maximum field studied 
(20.9 kOe). 


H = 20900 Oe 
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=e 
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smoothed graph of resistance ratio versus nominal 
composition.” The concentration and resistivity of the 
Mg-Al and Mg-Cd alloys were derived from the earlier 
-data reported by Salkovitz! ef al.? The concentration 
of manganese or iron in the aluminum alloys is derived 
trom the spectroscopic analysis. Table I contains values 
of the resistance ratio and resistivity for the alloys 


uE, -W. Wallingford, thesis, University of Ottawa, Ottawa, 
Ontario (unpublished)). 

u We would like to thank Dr. Salkovitz for his generosity in 
supplying these samples. 

* E. L Salkovitz, A. I. Schindler, and E. W. Kammar, Phys. 
Rey. 105, 887 (1957). 
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“87812 0.001 At.% Mn 


(b) 


studied. The resistance samples were in the form of 
rolled strips (approximately 10Xx0.3X0.03 cm); the 
contacts in the form of knife edges were pressed into 
place on a bakelite former containing the sample. A 
current-potential method was used for the resistance 
measurements with a galvanometer amplifier to read 
the voltages. The sensitivity of the amplifier was better 
than 310-9 V/mm deflection. The galvanometer and 
current stability did not result in more than a 2% 
change in calibration during any one magnetoresistance 
run. Since the magnetoresistance of the more highly 
concentrated samples was small while the zero-field 
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Fic. 2. Ap as a function of the magnetic field strength for a number of Mg-Mn alloys: (a) for samples 91081 (a) 
and (b); (b) for the sample 90911; (c) for samples 87812 and 728 (pure Mg). 
resistance was comparatively high, the magneto- significant difference appears for any chosen direction. 


resistance measurements on these samples was accom- 
plished by suppressing the zero-field sample voltage 
with the aid of a Leeds and Northrup Type K-2 potenti- 
ometer. A Harvey Wells Corporation electromagnet 
was used which produced fields of approximately 21 
kOe in a gap of 1.25 in. with a stability of magnet cur- 
rent of one part in 10? at midcurrent values. All meas- 
urements were made at 4.2°K or lower by pumping on 
liquid helium. The vapor pressure of the bath was 
controlled by a Cartesian manostat. 


3. EXPERIMENTAL RESULTS 


Figure 1(a) shows the resistivity of some of the 
Mg-Mnalloys asa function of temperature in a constant 
magnetic field and it can be noticed that all of the 
samples exhibit anomalies in the temperature-dependent 
resistivity. At the maximum field available samples 
87812 and 90911 exhibit a positive magnetoresistance, 
while both samples exhibit a resistance minimum. 
Samples 91081(a) and (b) exhibit an absolute decrease 
in the resistivity on the application of a magnetic field, 
the magnitude of the decrease increasing with magnetic 
field strength in the temperature region studied. These 
samples exhibit resistance maxima (and minima) in 
agreement with earlier reports." 

There is a marked anisotropy in the magnetoresis- 
tivity for Mg samples which is presumably due to 
preferred orientation induced by cold working the 
samples. The magnitude of the anisotropy can be seen 
for samples 728 and 87812 shown in Fig. 1(b). The 
Maximum magnetoresistance values were eed and at 
this value very little angular dependence could be noted 
in the relative magnetoresistivity. For more concen- 
trated samples, the anisotropy is so small that no 
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Figure 2 shows the magnetoresistance as a function 
of magnetic field strength for a number of specimens. 
As seen in Fig. 2(a) there is presumably sufficient 
magnetic scattering in the alloys to make the sign of the 
magnetoresistivity totally negative for samples 91081 (a) 
and (b). Samples 90911 in Fig. 2(b) and sample 87812 
in Fig. 2(c) show that Ap is smaller at the lower tem- 
perature, while no such temperature dependence is 
observed for a pure magnesium sample exhibiting no 
anomalous behavior in the zero field resistivity. [See 
sample 728, Fig. 2(c).] It can therefore be suggested 
that there is a negative magnetoresistivity component 
in the more dilute Mg-Mn samples. Figures 3(a) and 
3(b) show Ap/ps.o*x as a function of magnetic field 
strength for a sample of Mg-Cd and Mg-Al, respec- 
tively. As would be expected, Ap/p4.»*k increases with 
increase in magnetic field, and the magnitude of the 
change at fixed field decreases with impurity concen- 
tration increase (ie., normal impurity scattering). 
Figures 4(a) and 4(b) show Ap/pr as a function of 
magnetic field for an Al-Fe and Al-Mn sample, where it 
can be seen that there is no temperature dependence to 
the magnetoresistance in these alloys. 


4. DISCUSSION 3 
The change in resistivity Ap in a magnetic field H is” 
given by the Kohler rule as -— 
Ap/pr — F(Hps/pr) , (1) 


where” p; is the resistivity at the Debye temperature 6 


13 As there is some uncertainty in the value of 8 to be usech in. 
alloys, most workers approximate by using the value of umani 
at the ice point and we will follow the same approximation. 
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alloys. The magnetoresistivity data for sample 90911 is 
shown as an insert in Fig. 5, where it can be seen that 
Mg Cd Kohler's rule is not obeyed. The present studies on Al 
i XOU EOD AL te Cd alloys are in agreement with the conclusions of Thomas 
a SO a ef al.,* and Hedgcock et al.,° that the transition metals 
Mn and Fe do not induce a resistive anomaly in 
aluminum. | 
As discussed in an earlier publication by the authors? 
for the Cu-Fe system, the magnetoresistivity of a dilute 
alloy containing paramagnetic impurities can be con- b 
sidered as the sum of a normal magnetoresistivity term 
which is temperature independent (pn) and an anomal- 
ous term which is negative and decreases with decreas- | 
ing temperature (ps) at liquid helium temperatures. | 
This last term is presumably a result of the presence of | 
paramagnetic impurities since it usually appears only 
near and below the magnetic ordering temperature for 
the alloy system. Yosida™ and Kasuya!® have both 
considered the theoretical problem of the magnetic 
interaction between conduction electrons and localized 
magnetic ions where an s-d exchange interaction is | 
assumed to produce a molecular field. Below a magnetic 
ordering temperature (usually antiferromagnetic in 
nature) theory predicts a monotonic decrease of resis- 


0.10 © 400 0.29 At.% Al 
A 401 0.53 At.% Al 
X 402 0.80 At.% Al 
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o 2° 4 6 8 10 12 14 
H (KOe) 
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Fic. 3. Ap/ps.2°x as a function of magnetic field 
strength: (a) Mg-Cd and (b) Mg-Al alloys. 


wee : o 
and pr the resistivity at temperature T, the functional UE NU GAS Jg B9 UP 


form of F being unspecified. Kohler’s graph forsome of the 0.03 
Mg and Al alloys is shown in Figs. 5 and 6, where it can 
be seen that all of the alloys presented obey the Kohler’s 
rule. It can be noticed that the deviation from Kohler’s 
rule is larger for the Mg-Al alloys than for the Mg-Cd RU 
alloys, and it could be speculated that this is a result of EY 
the difference in valence between solute and solvent in 
the case of the Mg-AI alloys. In Fig. 5 is the magneto- 0.01 
resistivity data at 4.2°K for the more dilute Mg-Mn 
alloys. It can be seen that Kohler’s rule is obeyed which 
-is in agreement with the previous conclusions of 
Webber,® Yntema,“ and Thomas and Mendoza.’ As we (2 4 —6 @ 10 1 4 6 18 20 22 
will see later this Kohler behavior is a result of the fact H (kOe) 
that although there is a negative component in the ^ po 4 Ap/p1a°x as a function of the magnetic field strength 
magnetoresistivity of these alloys it is much smaller at 42 and 1.5°K: (a) for pure Al and an Al-Fe alloy; (b) the 
than the normal positive magnetoresistivity of these — ^'-Mn alloy. 


zT 15 K. Yosida, Phys. Rev. 107, 396 (1957). 
^u G, B. Yntema, Phys. Rev. 91, 1388 (1953). 16 T, Kasuya, Progr. Theoret. Phys. (Kyoto) 22, 227 (1959). 
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tance as the temperature is lowered. Both theories 
predict a negative magnetoresistance which is propor- 
tional to the square of the magnetization at low magne- 
tic field strengths. Neglecting a small term in Kasuya's 
expressions for the magnetoresistivity under various 
field and temperature conditions, we can write 


3r m* C 
H=0,T=0 m= [A+ (g— 1? 7277], (2) 
he Ne 
3r m* C 
H=0, TT Py —— 
2 ne Ne 
XTE- 17 0 DUST 
H#0, T¥0 
Irm C LA (Gya P 
Br E D (4) 
2 ne Ne xps 
At (in i) 
Gow: 


where m* and e are the electron effective mass and 
charge, respectively, ¢ is the Fermi energy, C is the 
concentration of paramagnetic ions, JV is the number of 
atoms/cm}, J is the s-d exchange interaction constant, 
A is the Coulomb interaction constant, x— gusH/ET, 
jz is the average z component of the angular momentum 
j of the paramagnetic ion, and (j2)sy is the average of 
the square of the angular momentum of the paramag- 


Apr 


Pax 


O PURE Mg 

Mg Cd 396 

MgCd 397 

MgAl 400 

MgAl 401 

MgA! 402 

MgMn 87812 

MgMn 87813 

MgMn 9031! 

MgMn 91081a (p, ONLY) 
Mg Mn 91081 (p, ONLY) 


A 
A 
ü 
s 
© 
e 
v 
Y 
x 
* 


Parsee 
PTS. u (08) 
Pa.2*K 


Fic. 5. Ap/ps.c°x as a function of (p273°K/p4.2°x) -H for Mg 
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for the sample 90911 at 4.2 and 1.5°K. 
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Fic. 6. Ap/pr as a function of (pa73°x/pr) -H for Al samples. 


netic ion. For the method of finding these averages, the 
reader is referred to the original publication of Kasuya.!^ 
Using a simple molecular-field model, H=H y when 
Hipp 0, and if Ho is zero, then it has the value Mapp in 
the presence of an external field. Collings and Hedgcock® 
have shown through electron spin resonance and sus- 
ceptibility studies that an antiferromagnetic transition 
occurs in the Mg-Mn system in the region of the resis- 
tance maximum. If one neglects the presence of the 
resistance minimum in these alloys and assumes the 
temperature of the maximum to be the Néel temper- 
ature," we can use Eqs. (2) and (3) to calculate A and 


TABLE II. J: the s-d exchange integral; A : Coulomb interaction 
constant; and A/J values of some MgMn alloys. The most reliable 
values for 4 and J are estimated from the data of Gaudet ef al. 
(Ref. 7), where the resistivity was measured below 1°K. 


Concen- 
tration J Pe 
(at.%) (1072 erg) (10-7? erg) AJJ 
0.35 0.491 2.87 5.85 Present 
investigation 
0.80 0.466 2.77 5.95 Wallingford’s 
thesis 
0.72 0.498 2.76 5.55 Wallingford's 
thesis 
0.46 0.478 2.78 5.82 Wallingford's 
thesis — 
0.16 0.473 2.74 5.80 Gaudet et al. i; 
average 0.48 2.78 5.80 


17 Magnetic studies on these alloys (Ref. 9) have shown that 
only in the most concentrated sample is there any evidence for a 
Néel temperature but there are deviations from Curie’s law in the 
region of the resistance anomaly for all of the Mg-Mn gi 
studies. Whether the temperature of the resistance F 
occurs at what one no y considers the Néel tem; 


certainly still an open question theoretically. [See ok 
Westinghouse Research Report 029-B000-P1, 1961 (unpublish 
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Fic. 7. (a) Ap/pis"k as a function of the magnetic field. O: 
experimental results for the sample 91081 (a). A: experimental 
results for the sample 91081(b), and the calculated curve 
(p.) based on expression (4). (b) Ap/pi.s°x as a function of the 
magnetic field. O: experimental results for the sample 91081 (a). 
: the calculated curve (p,) based on expression (4). - - - -: 
the corrected curve (p) after allowance has been made for pn. 


J. This has been done for a number of alloys and the 
resulting values of 4 and J are listed in Table IT. It can 
be seen that the values so derived for the various alloys 
are independent of concentration and result in an 
average value of 0.48X 107? erg for J and 2.78X 10" 
erg for 4. The manganese ion is assumed to be inan S 
state with the value of the effective magneton number 
as derived from susceptibility studies (S=2). The free 
ion value of J derived for the Mn* ion would yield a 
value of 0.35 10-” erg, whereas a value of J for the 
Cu-Mn system derived from the experimental results is 
0.67 X 10-” erg. 

Using the values of A and J derived from the zero- 
field results and expression (4) of Kasuya, it is possible 
to calculate the magnetoresistance as a function of 
magnetic field. Shown in Fig. 7 (a) is the result of such a 

_ calculation for a sample 91081(a) and (b) at the tem- 
perature of 4.2°K. The difference between the calcu- 
lated and experimental curves can be assumed due to 
the positive normal magnetoresistance component pr. 
If the difference between the theoretical and experi- 
mental curves is in fact equal to pn, then the difference 
should obey the Kohler rule. This difference is shown in 
Fig. 5 where as good a fit to the Kohler rule is found as 


AND Y. MUTO 

for alloys containing nontransition metal impurities, 
Figure 7(b) shows the same calculation done at 1.5°K, 
The solid curve corresponds to that calculated from 
Eq. (4) and the dashed curve after allowance has been 
made for pn, the normal impurity scattering discussed 
above. As can be seen, the agreement between experi- 
ment and theory at this temperature is not good. If one 
suggests a cooperative magnetic interaction and a re- 
sulting internal field, Ho will shift the zero on the field 
axis but will not alter the magnitude of the magneto- 
resistance. Only by altering A/J could agreement with 
experiment for a particular magnetic field be found, but 
it is unlikely that these constants should be a function 
of temperature or magnetic field—certainly no theo- 
retical treatment suggests that they should be. It 
therefore seems that some essential feature is lacking in 
the Kasuya theory for the temperature dependence of 
the magnetoresistance in metal alloys containing transi- 
tion metal impurities. 

Although there is no adequate theory to explain the 
resistance minimum, it is interesting to apply Kasuya’s 
expression for the magnetoresistivity in those alloys 
exhibiting a resistance minimum only [Eq. (4)]. For 
example, by assuming the same value of J and A as for 
the more concentrated alloys exhibiting a maximum, 
an estimate of Ap at maximum magnetic field for sample 
90911 yields Ap, — —2.4X 10-5 Q-cm at 4.2°K. This can 
be compared with the experimentally observed value of 
Ap—6.1X 10-5 Q-cm. Therefore, the normal magneto- 
resistivity term, the difference of these two values, 
equals 8.5X10-5Q-cm. Then Apn/pi.2*x=0.152 for 
H273/p4.2°x=149 kOe. This point is shown as A in 
Fig. 5, giving better agreement to the initial Kohler 
curve plotted for this alloy. If this improvement in the 
fit is a result of the influence of Aps, then it could be 
speculated that the magnetoresistance of alloys exhibit- 
ing a minimum will be negative? when A/J is small and 
the concentration relatively large and becomes positive 


when A/J is large and the concentration of para- 
magnetic ions is smaller. 


5. CONCLUSIONS 


(1) From the temperature dependence of the elec- 
trical resistivity of concentrated Mg-Mn alloys in zero 
magnetic field a value of the s-d exchange integral J 
and the Coulomb integral Q of 0.48X10-" erg and 
2.78X10-* erg, respectively, are found which are in- 


dependent of manganese concentration up to the solid 
solubility limit. 


(2) For Mg-Mn alloys exhibiting both a resistance 


18 Yosida pointed out that experi i i 
; : xperimental agreement with his 
ee Mood as the temperature was lowered below the Néel 
p TEIG remarked on the discrepancy and suggested the 
marni y ordered states should be represented by some sort of 
wa ve spin mode at sufficiently low temperatures. : 
GU OR ee magnetoresistivity has been observed 1n 
Cu-Fe, Cu-Mn and Zn-Mn alloys exhibiting only a resistive 
imum. (See Refs. 3 and 4.) 
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minimum and maximum the total magnetoresistance is 
negative. This magnetoresistance can be separated into 
a negative (p,) and positive (p,) component. The spin 
component p, can be explained using the same J and A 
values as derived above at temperatures near the Néel 
temperature. The normal component (pn) is found to 
obey Kohler's rule. The observed temperature de- 
pendence of the negative component (p,) does not follow 
from the existing theory. 

(3) For the more dilute Mg-Mn alloys exhibiting 
only a resistance minimum, there appears to be a nega- 
tive magnetoresistivity component again requiring the 


IN Mg AND Ai — Af599 
same A and J values as above. The positive magneto- 
resistance component obeys Kohler's rule. 

(4) Magnesium containing nontransition impurities 
follow Kohler's rule as do the alloys of aluminum con- 


taining transition metal impurities. 
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F-Band Shape in the CsCl Structure Under Pressure*] 
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Department of Physics, University of Illinois, Urbana, Illinois 
(Received 24 January 1964) 


Color centers have been formed by x irradiation of RbCI above the polymorphic transition pressure, and 
the shape of the new F band at 509 my in this CsCl-type structure has been examined under a pressure of 
6200 atm at liquid-helium temperature. The band shows no evidence of the multiplet structure observed 
in the cesium halides, thus indicating that the lattice structure is not responsible for this effect. The F band 
in CsCl has also been formed and studied under pressure at helium temperature, and the triplet components 
show the usual relative intensity and the characteristic F-band shift. No evidence of K' bands was observed 


for these hydrostatic pressures. 


INTRODUCTION 


ECENT studies of color centers in CsCl and 
CsBr'-? revealed structure on the principal ab- 
sorption band, in contrast to the usual F band in 
alkali halides with the NaCl-type lattice. This struc- 
ture, which Rabin and Schulman? decomposed into 
three components at liquid-helium temperature, ap- 
pears to be identical for centers formed by x irradiation 
or additive coloring, is.unaffected by bleaching or heat 
treatment, and is not sensitive to sample purity. It 
therefore seems reasonable to associate these multiplets 
with a single electron-excess center, presumably an F 
center composed of an electron bound to a negative-ion 
vacancy. Rabin and Schulman’ first proposed that if 
these components are all due to one center, then they 
must represent a splitting of the excited F state by a 


* This work was supported in part by the U. S. Atomic Energy 
Commission. 

+ Based on part of a thesis submitted to the University of 
Illinois in partial fulfillment of the requirements for the Ph.D. 
degree (1962). A preliminary report of this aspect of the work 
was given by D. Lazarus at the Conference on Crystal Lattice 
Defects, Kyoto, Japan, September 1962. f 

i Present address: Department of Physics, Cornell University, 
Ithaca, New York. 

1P. Avakian and A. Smakula, Phys. Rev. 120, 2007 (1960). 

2H. Rabin and J. H. Schulman, Phys. Rev. Letters 4, 280 
(1960); Phys. Rev. 125, 1584 (1962). 

3D: W. Lynch, Phys. Rev. 127, 1537 (1962). 


crystal-field effect peculiar to the CsCl-type lattice, or 
a splitting caused by the cesium ion itself. 

The present work was undertaken to determine the 
influence of crystal structure by studying the F band 
formed in the pressure-induced simple cubic phase of 
a normally face-centered cubic alkali halide. RbCl 
transforms from the NaCl lattice structure to the CsCl 
lattice structure at about 5000-6000 atm. On the basis 
of a Mollwo relation between peak position and lattice 
constant for other CsCl-type crystals, one expects the 
F center to appear at about 520 my in the high-pressure 
form. Jacobs‘ looked for this peak in RbCl crystals 
x rayed or additively colored at atmospheric pressure 
prior to the structure transformation, but was unable 
to detect any new center. Drickamer et al^ found a 
new band, which they called the K’ band, which re- 
placed the F band in simple cubic phases at very high 
pressures and room temperature. 

With the apparatus described below, it is possible 


to transform the crystal first and then form the CC 


type centers by x raying under pressure at room or 
low temperatures. In particular it is possible to form 


4T. S. Jacobs, Phys. Rev. 93, 993 (1954). 
sW. G. Maisch and H. G. Drickamer, Phys. Chem. Solids 5, 
328 (1958). 5 
i 2 A. Eppler and H. G. Drickamer, J. Chem. Phys. 32, 1418 
1 : 
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and study the new F center at helium temperature to 
investigate whether it has the triplet structure ob- 
served in the cesium halides. 

In addition, the F band was formed in CsCl under 
pressure at helium temperature to verify that the 
triplet components grow in the same ratio and exhibit 
the characteristic F-band pressure shift. 


EXPERIMENTAL PROCEDURES 


The four-window pressure cell and cryostat used in 
this experiment have been described in detail else- 
where.” Color centers were produced at high pressure 
and low temperature by irradiating the sample with 
50-kV x rays through a beryllium window in the 
pressure cell. Optical absorption spectra were measured 
at right angles through sapphire windows, using à 
Beckman DU spectrophotometer. The sample was 
suitably masked to measure only a region of nearly 
uniform coloration, and the radiation-induced colora- 
tion was determined by subtracting the pre-irradiation 
spectrum from those measured at later times. Helium 
gas was used to apply hydrostatic pressure in the 
range 0-8000 atm, the pressure being determined with 
a manganin gauge except at temperatures where the 
helium solidified, when thermodynamic PVT data for 
solid helium were used to estimate’ the pressure in the 
cell. 


RESULTS AND DISCUSSION 
a. RbCl 


Two runs were made using samples 0.43 and 0.27 
mm thick cleaved from a piece of “pure” Korth RbCl. 
Pressure was applied at room temperature at a rate of 
about 300 atm per min, and the transition pressure was 
detected by monitoring the optical transmission of the 
sample. At about 7300 atm, the transmission dropped 
abruptly to 0.17% in the first case and 1% in the 
second case. These values correspond to an optical 
density of roughly 80 cm. 

The crystallographic aspects of this type of structure 
transformation, which achieves a 15% reduction in 
volume,® have been discussed by Buerger? and applied 
to the phenomenon of opacity by Jacobs.* The trans- 
formation proceeds from a large number of nucleation 
centers giving rise to many mismatched grains. Light 
is scattered at the boundaries of these microscopic 
regions causing the observed opacity. 

. The transition in RbCl has been observed at 5700 
atm by Bridgman,® 6000 atm by Pierce, and 6500 
“aia by Jacobs, who also used the opacity as an 
indicator. The higher transition pressure seen in this 
case could be due to three causes. In another study of 


1 D. B. Fitchen, Rev. Sci. Instr. 34, 673 (1963). 

3 P. W. Bridgman, Z. Krist. 67, 363 (1928). 

?M. J. Buerger, in Phase Transformations in Solids (John 
Wiiey & Sons, Inc., New York, 1951). 
* 1 C, B. Pierce, Phys. Rev. 123, 744 (1961). 
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the polymorphic transformation in RbI with helium 
gas as a pressure fluid, Jacobs” found that the initial 
transformation for a virgin crystal occurred at about 
1095 higher pressures with the purely hydrostatic 
helium pressure medium than for liquid media. Ap- 
parently, the slight shear stresses due to viscosity of 
the liquid help to initiate the local shear involved in 
the transformation. A second reason might be hysteresis 
during the relatively rapid pressure rise, since Pierce 
found that the initial transformation required nearly 
45 min to go to completion. However, no change in 
transmission was seen until the sudden abrupt and 
complete change. Finally, the crystals of less than 
“atomic-weight purity" usually have a higher transi- 
tion pressure, but Pierce observed the transition at a 
lower pressure in Korth RbCl which had been doped 
with additional divalent impurity. Therefore, the first 
explanation seems the most probable here. 

After the crystal had been transformed, it was cooled 
to helium temperature and x rayed under pressure. A 
preliminary run on the coloration of Korth RbCl at 
helium temperature below the transformation pressure 
had shown that the normal F band grows very slowly 
at about 590 my. In the present runs, the principal 
absorption grew slowly at about 510 my. Because of 
the very high background absorption, only the range 
of 420-620 my could be measured with any certainty, 
and readings outside this range fluctuated erratically. 
No systematic growth of any other major center was 
detected. 

'The shape of the new center was measured carefully 
twice in each run, and no indication of structure ap- 
peared, as shown in Fig. 1. 

In the first run, a small peak was seen at 600 mp, 
presumably due to NaCl-type F centers formed in 
portions of the crystal which had not completely trans- 
formed. When pressure was released slowly at nitrogen 
temperature, very marked hysteresis was observed and 
the new centers were unaffected to about 3500 atm, 
at which point they disappeared and the NaCl-type 
F center appeared with almost identical magnitude at 
600 my. In the second run, the crystal was allowed to 
warm to room temperature under pressure and was 
then x rayed again. Instead of growth, the irradiation 
appeared to remove most of the residual centers which 
remained after warming. Similar hysteresis occurred 
when releasing pressure at room temperature. The 
optical density only began to decrease at about 4500 
atm. At atmospheric pressure, the transmission had 
returned to about 20% of its initial value. The samples 
appeared milky and were highly strained but still 
intact. 

The parameters of the new center as measured at 
various pressures and temperatures are listed in Table I. 

While the designation of this new band as the P 
band is still tentative, the following evidence supports 


uR, B. Jacobs, Phys. Rev. 54, 468 (1938). 
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TABLE I. Parameters of the principal absorption 
band in transformed RbCl. 


: Pressure 

Temperature (atm) Am (mp) H (eV) 
Helium 6400 500 — — 0222-001 
Nitrogen 5230 517 0.25-t-0.01 
Room 7370 514 0.35-£0.03 


it. The band appears to be the principal absorption 
center formed and has a shape and half-width similar 
to the P center in the low-pressure form. The peak of 
this new center shifts with temperature and pressure 
in the same manner as the F center, and its half-width 
increases with temperature in the same way. In the 
first run, it was possible to convert these centers 
directly to the normal F centers by returning to the 
low-pressure structure at nitrogen temperature. Finally, 
the new band appears at just the position predicted 
by a Mollwo-like relation for the CsCl structure, \m 
=K(d)", where d is the interionic distance. Using 
values of Am and d at low temperature for CsCl and 
CsBr, and the value of d for the high-pressure form 
of RbCl from Bridgman's density measurements,® the 
new F center should appear at 514 my, in good agree- 
ment with experiment. 

The lack of structure on this new F band in RbCl 
indicates that the triplet structure observed for CsCl 
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Fic. 1. Shape of the F band formed in the high-pressure CsCl 
Phase of RbCl. The curve shows the difference in optical density 
measured before and after x irradiation for 220 min at this tem- 
perature and pressure for a sample 0.27 mm thick. 
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Fro. 2. Pressure shift of the F band in CsCl. The solid curve is 
for centers formed at this temperature and pressure aíter x 
irradiation for 15 min with a sample 0.84 mm thick. The com- 
parison curve is taken from Ref. 2. 


and CsBr at liquid-helium temperature is not caused 
by a crystal-field effect indigenous to the CsCl-type 
lattice. Hughes and Rabin! have confirmed this result 
by their observation of clearly resolved peaks in the 
principal absorption band of CsF, the only cesium 
halide having the NaCl-type lattice. Thus the cause 
of the splitting seems to be the cesium ion itself. 

Speculation has centered on relativistic effects which 
would be appreciable only for the cesium halides. A 
calculation by Sufíczynski? of the spin-orbit splitting 
for a localized excited state in a cesium halide gave a 
result orders of magnitude smaller than the observed 
effect. Recently, Knox" has proposed that a spin-orbit 
type splitting of the conduction band might be re- 
sponsible. Calculations! of the other relativistic 
effects on band structure for heavy ions suggest that 
these might also contribute for cesium. 


b. CsCl 


A subsidiary experiment was performed to test 
whether the three components of the principal absorp- _ 
tion band observed in CsCl at helium temperature 
would exhibit the characteristic pressure shift of the — 
F band. 3 

A freshly cut and polished platelet of Semi-Elements 
CsCl was x rayed and measured at approximately 4600 — — 
atm and 11°K. Figure 2 shows one of the measured | 
spectra with a zero-pressure spectrum of Rabin an 
Schulman? normalized to the same peak height 
comparison. The component structure is clearly 3 
and appears essentially unchanged after a p 
shift of AE/E--0.021. (This shift is within 


12 T, Hughes and H. Rabin, Phys. Chem. Solids 24, 
3 M. Suffczynski, J. Chem. Phys. 38, 1558 (1963). 
“R, S. Knox, Phys. Rev. 133, A498 (1964). 
15 L. E. Johnson, J. B. Conklin, and G. W. Pi 
Rev. Letters 11, 538 (1963). 2 
1 F, Herman, C. D. Kuglin, K. E. Cuff, 
Phys. Rev. Letters 11, 541 (1963). : 
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Jacobs’ room-temperature shift! when corrected for the 
change in compressibility with thermal expansion.) If 
the two curves are superimposed, the low-energy side 
is almost identical, while the high-energy side differs 
slightly. The effect of this change is to increase the 
apparent total width at half-maximum under pressure 
by 3.5% to 0.017 eV. Jacobs* also measured an ap- 
parent increase in half-width in CsCl at room tempera- 
ture of about 5% in 4000 atm. 

Maisch and Drickamer? studied the effect of pressure 
to 50 000 atm on CsCl and found a new band, the K’ 
band, with peak energy about 0.1 eV higher than the 
F band which grows at the expense of the F band with 
increasing pressure and is reversible upon decrease in 
pressure. It is thus possible that the high-energy com- 
ponent of the triplet is related to this K’ band and is 
increasing in relative importance with pressure. How- 
ever, no such K’ band was seen in this range of hydro- 
static pressures either for RbCl above or below its 
transition pressure, or for KBr, in which they observed 
a prominent K’ band at considerably higher pressures. 
Either the K’ band is not observable in the present 
range of pressures, or its appearance in Drickamer’s 
experiments is due to shear and pressure inhomogenei- 
ties in his apparatus which are not present in our gas 
system. 
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SUMMARY 


Using a novel technique a new absorption band has 
been formed by x irradiation of RbCl above its poly- 
morphic transition pressure. The position, width, and 
temperature and pressure dependence of this band 
support its designation as the new F band in this 
CsCl-type phase. The band could be converted to the 
normal F band by reversing the transition at low 
temperatures. The shape of this new band has been 
examined under pressure at liquid-helium temperature, 
and shows no evidence of the multiplet structure ob- 
served in the cesium halides, thus indicating that the 
lattice structure is not responsible for this effect. 

The pressure measurements on CsCl prove that for 
still another way of forming and observing the prin- 
cipal band, the triplet components all appear to be due 
to the F center. 
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In this paper a group-theoretical approach to the problem of a Bloch electron in a magnetic field is given. 
A magnetic translation group is defined and its properties, in particular its connection with the usual transla- 


tion group, are established. 


J. INTRODUCTION 


HE translation symmetry of the Hamiltonian for 
an electron in a periodic potential leads to the 
classification of the solutions of the corresponding 
problem by means of a wave vector k and to the possi- 
bility of introducing Bloch functions. When a constant 
magnetic field is also present, the Hamiltonian is no 
longer invariant under the translation group. However, 
“operators may be defined which commute with the 
«Hamiltonian of a Bloch electron in a magnetic field.” 


* Supported by the U. S. Air Force Office of Scientific Research. 

T Permanent address: Department of Physics, Technion-Israel 
Institute of Technology, Haifa, Israel. 

1p. G. Harper, Proc. Phys. Soc. (London) A68, 879 (1955); 
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2 Recently, Brown [E. Brown, Bull. Am. Phys. Soc. 8, 257 
(1963); Phys. Rev. 133, A1038 (1964)] has considered magnetic 
éranslation operators for constructing a ray representation of the 
usual translation group. 
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In this paper, a magnetic translation group which 
commutes with the Hamiltonian is defined and its 
general properties are established. In a following paper, 
we derive the irreducible representations of the mag- 
netic translation group and give the classification of the 
solutions of the Schrédinger equation for an electron in 
both a periodic electric potential and a constant 
magnetic field. 


II. DEFINITION OF THE MAGNETIC 
TRANSLATION GROUP 


Let a Bravais lattice be represented by the vectors 
Re niay tnat 1383 (1) 


(where a1, a», a3 are the unit cell vectors and 11, 12, %3 
are integers), each of which defines a point in the Bravais 
lattice. Let us define a path joining the origin O with the 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


MAGNETIC TRANSLATION GROUP 


point defined by Rn. We start from point O and move in 
a direction of another lattice point, say, given by R;; 
from R; we move in a direction, say, Rz to the point 
given by R,-- R» and so on until we achieve the point 
given by R,. Clearly, the point given by R, can be 
achieved by different choices of vectors Ri, Rz and so on. 
Let one of the possibilities be given by the vectors 
Ri, Rs, +++, Ri. We denote the described path by the 
symbol 

| RR, 5 R). (2) 


According to (2) we achieved the point of R, by means 
of i steps. Also, 

R,-RuRGe z -+R;. (2a) 
In a similar way, we could choose another set of vectors 
satisfying (2a) and define a different path joining O with 
the point defined by R,. 

Let us now define an operator which depends on both 
the vector R, and the path joining the point O to the 
point defined by R,,: 

T(R, | Ry R5, EMT R) 
=exp{ (i/A)Rn- [p+ (e/c)A]} 
Xexp( G/2)[R; x Rz-- Ri x R+- 
+R; x R; ]-h} B (3) 
where h— eH/7ic, H is the magnetic field, A is the vector 
potential of the magnetic field, p is the momentum 
operator of the electron, and e is its charge. The order 
of the vectors in the vector products in (3) is given by 
their order in path (2). The expression 
i[R; x R+ Ri x R3+ "EPA +R x R:]-h (4) 


has a very simple meaning. To show this, let us take 
the projection of the path (2) onto the plane perpen- 
dicular to h. This gives a path 


|R17,R2?,: : RISE (5) 


where R;? is the projection of R, onto the plane perpen- 
dicular to h. It is easily seen that 


3[ Ri x Ro+Ri x R+- -+R xR;]-h 
— Ri? x Re+ Ri? x Rart +RexRr]-h. (6) 
The brackets with the factor 3 on the right side of (6) 
give the area of the polygon enclosed by the vectors 
R,?, R,?, ey R;?, —R,? (7) 


(in which the vector —R,? was added in order to close 
the path). We thus see that the expression (4) gives the 
flux of the magnetic field (with the factor e//ic) through 
the polygon which is obtained by projecting the vectors 


R;, Rs, Meo: R;, —R, (8) 
onto the plane perpendicular to H. Therefore, the 
elements (3) may be written in a different form 


7(R, |R, R,- - -,R) =exp{ @/A)Ra-[p+ (/2A]) 
xexp(i e(Ry Rs,- % R2) , (9) 
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where e(R, Rs,- - -,R?) is the flux of the magnetic field 
through the polygon (7) multiplied by e//ic. It should 
be noted that the path (2), and therefore its projection 
onto the plane perpendicular to H, can be very compli- 
cated. For example, from the point given by Ra, which 
may be reached along the path (2), one may continue 
along the vector — R, and then again turn back to point 
R, by path (2). In such a case, the area in (6) will be 
doubled. The second exponent in the definition of the 
clements (3) or (9) thus expresses the curl nature of the 
magnetic field. 

One may now show that the operators defined in (3) 
or (9) commute with the Hamiltonian of a Bloch electron 
in a magnetic field for a proper choice of the gauge. The 
Hamiltonian is 


H —1/2m[p— (e/c)A J*-- V (r), (10) 


m is the mass of the electron and V(r) is the potential 
of the periodic field. It is easily shown that 


€ € € OA, 0A; 
p n x—-A Had +] * (11) 
Cc 


& fr cL Ox, Ox, 


"Therefore, if the gauge in (10) is chosen in a way that 
0A,/dx;+0A,/dx,=0; i,k=1,2,3, (12 


the commutator (11) will vanish and the operators (3) 
will commute with the Hamiltonian (10). Relation (12) 
holds, for example, for a gauge 

A-—3[H xr]. (13) 


In order to show that the operators defined in (3) 
form a group, we shall check that a product of two such 
operators is again an operator of the same form. Let us 
take two operators 


7(R,|Ri,Re,---,R:) and 
Their product is 
(Rn | RR; gy ,R27(R,'| R^, Rz, ud Ri’) 
=exp{ G/2)) (R.-- R^) -Lp+ (e/c)A]} 
Xexp( (//2)[ Ri x Rot ---+Rıx R/3-- -- 
TROSXxRZ]h), (14) 
where the second exponent is defined by the path 
| Ry R;, is RR; Ry, shee Ry) . (15) 


In obtaining relation. (14) we used the following 
expressions: 


7(R,/| R^ R7. p Ri’) . 


i e 1 e e 

ES (o+<a), Ra (r^) |-ia. xR,)-h (16) 

h G h c 

(without any gauge limitations) and 
£AgB = e4tB+HIA,B] 


a 


for A and B such that [A,B] commutes with both ” 


E 


a 
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A and B. The product (14) can be written according to 
the definition (3) as follows: 


c (R, | Ry Rs;- a R37 (R,/ | KR RZ;- 6 - Ry) 
= (Rat Rn’ | Ry Rs: d RRi, Ra, zs Ry’) . (17) 


We now have a multiplication rule for the elements 
defined in (3): In order to multiply two elements [left 
side of (17) ], we have to add the vectors R, and R,’, and 
the paths |RyRs:::,;R) and |R;, RZ,: Re); the 
vector on which the product [right side of (17) ] depends 
is R, +R,’ and the path is | Ry, R2,” - -Ra Rr, Ry, Re’) 
[the latter is obtained by displacing the path 
|Ry’,R2’,--:,Rx’) by the vector Ra, thus obtaining a 
path joining point O with the point defined by the vector 
R,-- R,/]. Thus, the product of two elements of the form 
(3) is an element of the same form. One sees that the 
elements defined in (3) form a group because one may 
also easily show that 7(—R,|—Ri, — Ria, o — R) is 
reciprocal to the element 7(R, | Rz Rs, : :; R2). We shall 
call this group the magnetic translation group (M.T.G.) 
and denote it by G2"It should be noted that the 
operators 


exp( (i/h)R.: (p+ (¢/c)A)} (18) 


do not forma group (as pointed out by Brown’), because 
a product of two such operators is not an operator of the 
form (18). 

The group G defined in (3) is an infinite group for two 
reasons. First, there are an infinite number of vectors 
Ra. Secondly, for any vector R, an infinite number of 
different paths joining point O with R, can be defined. 
We can, for instance, obtain from path P a different one 
P' by starting at and returning to any point belonging 
to the Bravais lattice on path P and adding a closed 
path to the path P at the mentioned point. 

The structure of the group G becomes more apparent 
after its connection with the usual translation group, 
denoted by R, is established. To do this, let us define, for 
every vector Rn, a set of elements H (Rn) to which all 
the elements 7(R, | Ri,Rs,-- -,R:) of G belong. Each such 
set H (Rn) consists of an infinite number of elements. It 
is easy to see that we can look at the sets H (Ra) as at 
elements of a group. If we multiply an element of the set 
H(R,) by an element of set H(R,’), we shall get an 
element of the set  (R,-- R,)). H(O) is the identity and 
H(— R») is reciprocal to H(R,). This group, denoted by 
H, is an infinite commutative group, and is isomorphic 
to the translation group R. The isomorphism follows 
from the one-to-one correspondence between R, and 
QE (R;). The group G is homomorphic to H, 


7(R;| Ry Ro: dE R;) E H(R,) ? (19) 


2a After the abstract of this paper appeared in Phys. Rey. 
Letters, Gerald A. Peterson pointed out Greate communication) 
that in his Ph.D. thesis he has defined a closed set of magnetic 
translation operators and has called it magnetic translation group 
TG. A. Peterson, Ph.D. thesis, Cornell University, 1960 (un- 


‘= ~ published) ]. 
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where the arrow shows the one-sided correspondence of 
the elements of G with those of H. From the homo- 
morphism of G to H and the isomorphism of H and R, 
it follows that the magnetic translation group G is 
homomorphic to the usual translation group R. The last 
fact will be used to introduce the Born-von Karman 
boundary conditions for the representations of the 
magnetic translation group. 

The group G has an invariant subgroup which is of 
great importance for constructing its irreducible repre- 
sentations. Consider the vectors 7m in the plane aj, a; 
and construct elements of the group G: 
7(tm| Ry’ ,/ Rz: Ei Ry’) 

= exp( (i/f)tm- [pt (6/2) 

xexp( (G/2) LR: x Ro/+Ry x Ry- 
TRACUR/]hj), (20) 


(21) 


and Ry, Ry, --+, Ry are arbitrary vectors of (1) except 
for the requirement that they form a path joining point 
O with the point defined by rm. The elements in (20) 
form a subgroup of G because a product of any two 
elements from (20) is also an element belonging to (20). 
Let us denote this subgroup by F. A similarity trans- 
formation by means of any element 7 (R; | Ry Ro : :, R) 
of G when applied to an element 7(tm| Rr, R”, Ry) 
of F gives the following result: 


771 (R, |R Ra : :; R)7(r; | Ri, Re’, - -RA 
Xc(R.| Ry R5, XE ;Rà 

—R., REN —R,, RY, Rz, Dey 
Ry, Ri, Re, UE) R,) c 


where 
Imn =Ma} 11585 


zT (tm| ESE: R;, 
(22) 


The relation (22) shows us that subgroup F is an 
invariant subgroup of group G. It is clear that another 
invariant subgroup of G can be obtained by taking in 
(20) vectors rm’ in the plane a», as. This latter invariant 
subgroup is isomorphic to F. With respect to the in- 


variant subgroup F, group G can be written as follows: 
G=71(0|0)F+7(a2|a2)F-+--- 
+r (1325 | 1522) F+: (23) 


It is of interest to note that when the magnetic field H is 
in the direction a3 the subgroup F is a commutative one; 
while the elements of F do not commute with those 
elements of G that do not belong to F. 

So far, the groups have all been infinite. For con- 
structing their irreducible representation, it is COR- 
venient to deal with finite groups. To do this, the Born- 
von Karman boundary conditions can be applied. 


Ill, FINITE MAGNETIC TRANSLATION GROUP 


The finiteness of the usual translation group is 
achieved by looking for its special representations, 
namely, by seeking representations which satisfy the 


f 


| 
| 
| 
f 
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rá 
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Born-von Karman boundary conditions 
D((G|a2*) = D((ea2)*) = D((]a)*) —D((10)) , (24) 


where (e| az), (e| as), (e|as) are translations in the direc- 
tions. of the unit cell vectors (e is the identity of the 
rotation group in the usual notation?), N is a large 
integer, and D denotes a representation of the translation 
group A. When seeking representations of R which 
satisfy the conditions (24), we can consider the group 
R as being a finite group Jt of order N3, the elements of 
which are given by the direct product of the following 
three groups: 


(e| a3)55 (e (25) 


Similarly, in order to make G finite, we require 
similar boundary conditions to those imposed for the 
representations of the translation group (24). To do this, 
we use the fact that the translation group A is iso- 
morphic to H and require the following conditions on 
the representations of H: 


D(H? (aj)) = D(H" (as)} 
—D(H*(aj))-D(H(O)). (26) 
With the conditions (26), the group H can be considered 


as being finite and given by the direct product of the 
following three groups: 


{H*(a1)}; U1(a2)) ; (1*(a3)) ; 
d OUR Dc 


4); (e[a3)*; 1, J, k=1, 25 eee, Af 


(27) 
Let us denote the last group by H and the corresponding 
M.T.G. by G. It is clear that again R and H are 
isomorphic. 

The requirements (26) mean that elements 


r(Was|RiRs,---Ry); &-1,2,3, (28) 


of G can be considered as constant factors. From this it 
follows that the elements (28) commute with all the 
elements of G. It is easy to show that 


7(R,| Ry R5, Q0 0 R27 (Ra! | R^ Rz, Pay Ra) 
=r (R, |R, R: + RZ)r(R; | Ri Ro, - -; R2) 
Xexp{i(R, xR,)-h). (29) 
Hence, the elements (28) commute with all the elements 


of G if the exponential term in (29) equals 1 for R,= Va; 
(k— 1, 2, 3). We have therefore the following condition: 


(Na, x R,):h-2xm; k=1, 2,3, (30) 


where m is an integer and R,’ is any vector from (1). 
Before discussing condition (30), let us derive another 
condition from the requirements (26). Consider two 
elements of the form (28) and take their product 


7(Na;| RyR;,- es ,R) (Nai Ry’, Ro’, -- Ro’) 
= 7(Na;+Na;,| RyRs;- a - RRi, Re’, 23 Ro’) 5 (31) 


Te Seitz end D 
3G. F. Koster, Solid Stale Physics, edited by F. Seitz and D. 
Turnbull Co IO TES Inc.. New York, 1957), Vol. 5. 


TRANSLATION GROUP 
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But since the elements on the left side of (31) should 
behave as constant factors, the additional factor on the 
right side of (31), 


exp(3 (Na; X Na;-h) , (32) 
should be equal to 1. We have thus 
(Na; X Nay) -h=4rm; i, k= 1, 2, 3 $ (33) 


for odd N [for even N there is no additional condition 
to the one expressed by (30) ]. Combining (30) and (33) 
we get for odd N the condition 


(Na, x R,)-h-4xrm; k=1, 2,3, (34) 


which is stronger than the requirement (30). We now 
write the conditions (30) and (34) in a more convenient 
form, using the fact that (30) and (34) will be satisfied 
for any vector R,’ if they hold for the unit cell vectors 
21, 25, az: 


N(a.xa):h-2mn4 (foreven N), — (35a) 
N (as xai)-h=4rny; (for odd N), (35b) 
where 7, is an integer and k, J=1, 2, 3. By using the } 
definition of the reciprocal lattice vectors | 
K;= (2r/V)a; x az, (36) 


where i, j, k form the cycle (123) and V is the volume 
of this unit cell, we can write the conditions (35a), (35b) 
as follows: 


N(V/2mr)Km+h=2am 
N(V/2x)K,-h-4xm 


(37a) 
(37b) 


(for even N), 
(for odd N). 


Here m is an integer. Equations (37a) and (37b) show 
that 
h-(2z/V)(R,/N) (for even N), (38a) 


h=(47/V)(R,/N) (for odd N). (38b) 


The requirements (38a), (38b) which follow from (26) 
are limitations on both the possible directions and values 
of the magnetic field. Let us choose the magnetic field in 
the direction a; [this can always be done when condi- = 
tions (38a), (38b) hold] and let R.=a3; then 


h-(2x/V)(n/N)as (for even N), (39a) 
h-(4z/V)(n/N)as (for odd N). (39b) 


For N sufficiently large the limitations (39a), (39b) on A 
the magnetic field H are not essential. It should, how- T 
ever, be noted that when the crystal has dimensions oí ^ 
the order of 1 cm (V ~ 104), the fields satisfying condi- ; 

tions (39a), (39b) differ by quanta of the order of 10 G 
(ie., the lowest nonzero magnetic field satisfying. the 
above conditions is of the order of 10 G). Unlike the 
boundary conditions for the usual translation group 
(which lead to no physical consequences), in the case o! 
the M.T.G. we have restrictions on the magnetic field 


| 
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even for crystals of the dimensions of 1 cm on which 
bulk experiments have been carried out. 

We now show that the Born-von Karman conditions 
(26) on the representations of H, which led to require- 
ments (39a), (39b), turn the group G into a finite one 


G. By using (39a), (39b), we find that the elements of 
G are of the form 


7 (RR, Rs: : R2 - exp( G/2) Rs (P+ (e/c)A)} 
exp{2mim(n/N)} (for odd N), (40a) 
| exp{rim(n/N)} (40b) 


where m is the coefficient by the product a; x az in the 
expression 


Ri x RoR. x R+: : -+R x R;. 


(for even N), 


(41) 


From Eqs. (40a), (40b), we see that the number of 
elements of G is finite. In order to count the number of 
elements of G, we treat the cases for even and odd N 
separately. For odd M, condition (39b) is imposed and 
the elements of G are given by (40a). The number of 
different values of the vector R. is N3. For each R, there 
are different elements in (40a) arising from the second 
exponential factor. We first show that the number m in 
(40a) [and also in (40b) ] can take all integer values. 
This follows from the fact that we can add to the path 
joining the point O with point R, a closed path which 
will change (41) by an elementary area a; x a» and will 
change m in the expressions (40a), (40b) by unity. Let 
us now assume that p is the largest common factor of 
n and N (when there is no common factor, p=1), and 
let V/p=N’, n/p=n’'. In order to count the different 
elements of (40a) for a given Ry, let us check that 


exp(2mim(n/ N)) ~exp{ 2rim’ (n/ N)) (42) 


for m'm (we exclude here those cases for which m’ 
differs from m by the number WV’). By assuming the 
equality sign in (42) to hold, we get 


exp{2ai(m—m’)(n/N)}=1. (43) 


ZAK 


Relation (43) holds only when 
(m—m’) (n/N) = (m —m’)(n’'/N')=integer. (44) 


But since n’ and JV have no common factor, the relation 
(44) is possible only for m—m' > N', which means that 
m should differ from m at least by N’ in order to get 
the equality sign in (42). The second exponential in 
(40a) thus takes on JV" different values for a given R,. 
Since R, itself has JV? different values, there are N'N” 
different elements in G in the case of odd N. In a similar 
way, it is easy to show that there are N?(2N") different 
elements in G for even N. 

Tt is of interest to note that when the magnetic field 
H is very strong (of the order of 10? G), we can have 
the case that 


h= (4r/V)as, (45) 


in which case the group G consists of IV? commutative 
elements of the form 


exp( (i/#)Ra: (p+ (¢/c)A)} 


and is isomorphic to the usual translation group. 


(46) 


IV. CONCLUSION 


We have defined here a magnetic translation group 
G which commutes with the Hamiltonian for a Bloch 
electron in a magnetic field. In a following paper, we 
construct the irreducible representations of this group 
and give the classification of the solutions of Schréd- 
inger’s equation for an electron in both a periodic elec- 
tric potential and a constant magnetic field. 
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Magnetic Translation Group. II. Irreducible Representations 


J. Zax* 
National Magnet Laboratory,t Massachusetts Institute of Technology, Cambridge, Massachusetts 
(Received 13 January 1964) 


The physical irreducible representations of the magnetic translation group (M.T.G.) defined previously 
have been found. From these a set of solutions of Schródinger's equation for a Bloch electron in a magnetic 
field has been constructed. In general the M.T.G. is non-Abelian, However, when the magnetic flux through 

iy areas enclosed by any vectors of the Bravais lattice become multiples of an elementary “fluxon” /ic/e, 
the M.T.G. becomes isomorphic to the usual translation group. 


I, INTRODUCTION 


r a previous paper! a magnetic translation group was 
defined and its general properties were established. 
The elements of this group, which depend on both a 
point of the Bravais lattice and a path joining the origin 
of the lattice with this point, are given by 


7(R| Ri,Ro,-  -,R:)= exp( (i/4)Rn [p+ (c/o) A} 
Xexp(2mi(n/bN)m), (1) 


where R, is a vector of the Bravais lattice, A is the 
vector potential of the magnetic field H, m is the coeff- 
cient by the product a; X a» of the unit cell vectors ay, a» 
in the sum of ordered products 


Ri x Ro+ R: x Raet- e eR: x RR: x Rte 
+RiixR;, (2) 


n, N, and b appear from the Born-von Karman condi- 
tions which turn the magnetic translation group into a 
finite one and impose the following condition on the 
magnetic field : 


eH 4rR, 4r n 
A 3) 
hc VON V bN 


V is the volume of the unit cell, V =a; x a2-a3, and b 
takes the values 1 or 2 for N odd or even, respectively. 

The finite magnetic translation group G is of the order 
b(N/p)N?*, where p is the largest common factor of N 
and n. For constructing the irreducible representations 
of G we use the fact that it has an invariant subgroup P 
consisting of elements 


i e 
T(rn| R,,R,, SU, ;Rà m exar. (e+) | 
h c 


n 
x e| beim] , 4 
bN 


where 
I, — 121-1583. (5) 


* Permanent address: Department of Physics, Technion-Israel 


Institute of Technology, Haifa, Israel. s 
1 Supported by thew S. Air "Force Office of Scientific Research. 


1J. Zak, Phys. Rev. 134, A1602 (1964). 


The group G can therefore be written as 
G (a5 az) P4- - - ---r(Nas| Na). (6) 


The invariant subgroup F is a commutative group and 
its irreducible representations are well known. In order 
to find the irreducible representations of @ we use the 
method of Frobenius.?? This method enables one to find 
all the irreducible representations of a group from the 
irreducible representations of its invariant subgroup. 
In Sec. II the physical irreducible representations of 
the M.T.G. are constructed. They turn out to be the 
same in the case of the finite M.T.G. as the ray repre- 
sentations of the usual translation group that are given 
by Brown.‘ This result is not surprising because in the 
usual method for constructing ray representations of a 
finite group*:® one augments the group, finds the vector 
representations of the augmented group, and then 
chooses those representations which correspond to ray 
representations of the original group. It appears, how- 
ever, more natural to define a group which commutes 
with the Hamiltonian for a Bloch electron in a magnetic 
field than to treat ray representations of a group (the 
usual translation group) which does mot commute with 
this Hamiltonian. Moreover, by finding all the physical 
representations of the M.T.G., we are sure that we have 
all the possible symmetry types for the wave functions.” 
For constructing the irreducible representations of the 
M.T.G., we first start with the finite group obtained by 
applying the boundary conditions. From the method 
of construction, it Is possible to find a complete set of 
symmetry adapted functions for the problem under 
discussion.? These functions are presented in Sec. IIT. 
As shown in Ref. 1, the boundary conditions on the 
magnetic translation group lead to physical restrictions 
on the magnitude of the magnetic field. In Sec. IV we 


? F. Seitz, Ann. Math. 37, 17 (1936). 

1J. Zak, J. Math. Phys. 1, 165 (1960). 5 

*E. Brown, Bull. Am. Phys. Soc. 8, 257 (1963); Phys. Rev. 
133, A1038 (1964). 


i9 
51. Schur, J. Reihne Aug. Math. 127, 20 (1904) ; 132, 85 (1967). 
S. 


6 W. Döring, Z. Naturforsch. 14, 343 (1 

7 The statement used in Ref. 4 for showing the completeness of 
the irreducible representations, namely, that the sum of squares of 
the dimensions of nonequivalent irreducible ray representations. 
equals the order of the group is, in general, not true. There are, for 
example, finite groups that have no ray representations at all. 


* An alternative method for constructing symmetry adapted 
RE 


functions is discussed in Ref. 4. 
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remove these restrictions by dealing with an infinite 
M.T.G. Finally, in the same section, the significance of 
the elementary fluxon /ic/e for the magnetic translation 
group is discussed. 


II. CONSTRUCTION OF THE IRREDUCIBLE 
REPRESENTATIONS OF G 


The invariant subgroup I’ of G is a commutative 
group and its irreducible representations are 


Dk) (7 (r, | Ry Rs, see R3) 


n kiny mns 
=exp | » (7) 
bN N N 


where s takes the values from 1 to bN/>p, and kı, ms the 
values from 1 to N. Relation (7) thus defines (DN /9)N? 
one-dimensional representations of P^. These are all the 
irreducible representations of / because the order of the 
latter is just (DN/5)N?. However, some of the repre- 
sentations (7) are nonphysical as may be seen by setting 
r,=0. The element 7(0|Ri,Ro,°- -,R,;) is a constant 
factor which will be represented by different constants if 
we take different values of s in (7). When the operator 
r(fn|Ri,Re,---,Ri) itself operates in the functional 
space, the only physical representations in (7) will be 
those for s= 1. We therefore consider the representations 
of P only for s=1 and from these we construct the 
irreducible representations of G. 

Let us first consider the case p=1 (i.e., no common 
factor for N and 2) and denote the vector that spans 
representation (7) (for s= 1) by |i, m3). Then 


D Gvm3 {7 (rp | RR; ys R2) |R,mts) 
n 2nknı 
= exp] 2a] Lm + 
bN bN 


M3n3 


» |} lesa) (8) 


where a factor 2n/b was introduced for convenience in 
front of kını/ N. (This is equivalent to an alternative 
numbering of the eigenvectors |R1,m3).) We now 
perform a similarity transformation: 
+ (—n3| —152)7 (ra| Ry Ro, = : Rr (92282| 22282) 

—(r,| Ry Rs: - :;R) exp{4ri(n/bN)) mina}. (9) 


Tt follows from (9) that 
D(c(r, | Rs Rs: D - RD( (1525 | 12a2)) | ma) 
2n(ki-n2)ni Mna 
E 

oN N 
xDíc (13a:| 11532) ) |km) o (10) 


It is seen that the vector D((1a2|1222)) | kms) is an 
eigenvector of D( (x, | Rs Rs;- - -;Rj)) corresponding to 
the eigenvalues ki +12, ms (D now denotes a representa- 
tion of G). Since we assumed that N and m have no 
. "common factor we get? N vectors D(r(nas|n:22)) 


Li 


à 
i 
3 
E 


n 
-ople i 


m 
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X |n) for 12— i Yay aoe IN 
D{r(n2a2 (11) 


Let us show that relations (10) and (11) for each m; 
define an irreducible representation of G. According He 
(6) each operator 7(Rn|Ri,---,R:) can be written as 
follows: 


7(Ra| Rs: ei 3:5] 
— c (1522 | 22) r (t| — naz, Ry, +++, Ri), (12) 


71582) | xa) = | ydo no, ms). 


where R,— r,-- 125, ra 121-5585. Therefore, 
D(r(R,|Rs-- R3) | ma) 
2nkym 


n 
-ex ani] — a 
P| "T bN N 


Ma3 


|| | kın, m), (13) 


where g is the coefficient of the product aı X a» in 
— na XR, +R: x Rot Ri x Rs +++ Ra xR. (14) 
From (13) it follows that 
Dyx™{7 (R,|Rs,: R2) 
= (my | (R4 | Rs: : Rz) | es) 
n — 2nkm 


; {2 l a iie | 
= ôw rpne EXP 2m] since S 
uy UN. IWON | = 


where Dwg”? denotes a matrix element k'k of the 
operator c(R,| Ri: ::;Rj; &' and k take values from 
1 to N, and k+72 is taken modulo N. 

We next show that for each ms the matrices D™ form 
a representation of G. To do this we first calculate the 
product of two matrices (15) corresponding to two 
elements 7(R,|Ri,---,R,) and 7(R,’|Ry’,---,Ri’) and 
then the matrix which corresponds to the product 
7(R,|R;:-:,R)z(R,|Ri/,:-:,R/). The two results 
must be the same if (15) is to be a representation of G. 
From (15) we have 


N 
2 Dwra™fr(Ra| Ra: o R2) Dais(7 (R^ | Ry’, - : RY} 
2nk(ny4- n) 


n 
T [y „knang EXD | anil — reg) + 
bN oN 


Qu - d 
oN N 


On the other hand, the matrix corresponding to the 
product of the two elements is 
Dy y": (7 (R,4- R,/ | Rs, Io SR ay SRZ)) 
2 Ink (msn 
Ew. o m ba e Nu em 
bN oN 


| an 
N 


esr 


) 


6) 


he 


fe 
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where g” is the coefficient by the product a; x a; in the 
expression 
— Gunz )ai x (R,4- R,/)-- R; x Rot + 

+R x Ri-+Ry x Ry/+-- FR xR. (18) 


It is easily shown that 
g'— gg +2. (19) 


It therefore follows that expressions (16) and (17) are 
equal and thus that the matrices (15) for each m3 form 
a representation of G. These representations are 
irreducible because all the vectors | km) for a given ma 
and different k(k=1, 2; ---, N) form a closed subspace. 
An alternative way to check the irreducibility of (15) is 
to calculate the character for a given ms, 


N 
x^ Gr (R4 | Ra, am - Ri} = a. Dy 
k=l 


z | n Tilata 
= N ani05n30 exp [2e] e+ = || 7 (20) 
bN N 


and to find the sum 


XheGR RDN. — QD 
G 


Result (21) shows that the representation (15) with the 
character (20) is irreducible [for reducible representa- 
tions the sum (21) is greater than the number of ele- 
ments in G]]. Note that the way of constructing irre- 
ducible representations here corresponds to the method 
of finding conjugate representations of the invariant 
subgroup F by means of the elements of G that do not 
belong? to F. In the case when JV and 7 have no com- 
mon factor all the conjugate representations are non- 
equivalent and the representation of G which they 
induce is thus irreducible. 

According to the method of Froebenius, relation (15) 
gives us all the irreducible representations of the group? 
corresponding to s— 1. Since we are interested only in 
the physical representations of G these are all the repre- 
sentations of G. 

Let us now treat the case when the largest common 
factor of V and n is p. If we start with a vector | ym) 
[see (8)] and apply all the operators r(m2a2|m2a2) we 
shall get in general only JV —N/? independent vectors 
| Ems) for k=1, 2, - - -, N’. According to Ref. 2 it again 
follows that we may assume 


D(r (1325 | nzas)) | kms) 
— exp(2«i (mans/ N)) | 4-2, m3) (22) 


for nm=1, 2, ---, A. and m=0, 1, =, p—1. The 
representations induced by the vectors (22) can be 
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written as 
Dy, 7m) C (R,, | R, ste R2) 
Mii Mette n 


= On kpn ep [aei -H— 
ge NON 


Inkny mnz 
E, e» 
bN N 


where the values of m,, m» range from 0 to p—1; k, k’ 
assume values from 1 to N” (&-I-z5 is defined modulo 
IV), and m; ranges from 1 to N. Relation (23) defines 
PN irreducible and nonequivalent N^X N' representa- 
tions of the group G and again gives us all the physical 
representations of Œ. Note that in this case when WV and 
n have a common factor the conjugate representations 
of F, constructed by means of elements of G that do not 
belong to F, will in general no longer be nonequivalent. 
The representations of G induced by the conjugate 
representations of F will be reducible; the irreducible 
parts are given in (23). In the case when p=1, repre- 
sentations (23) go over into (15). Hence, relation (23) 
gives all the irreducible physical representations of G. 


TORT TETTE N E NONN N PEN 


II. SYMMETRY ADAPTED FUNCTIONS 


We now use the irreducible representations of the 
magnetic translation group G to construct symmetry 
adapted solutions of Schródinger's equation for a Bloch 
electron in a magnetic field. By using the gauge 


A=;[Hxr], (24) 


we can write the elements (1) of the magnetic transla- 
tion group as 


i n 
r(R,|Ry---,R)- exp| »pti—(n.2K,—m,Kz2)- 1 
h bN 


n 
Xex hri n] 25 
P| re (25) 


where Kı and Kz are vectors of the unit cell of the 
reciprocal lattice. Let 


V(r) =exp{ik-r}u(r) , (26) 
with 


1 Qnky 

k=] (mt) rmn. 

7,—0,1,--7,5—1; kia, N i 
P 


d 
v(r, | Ri: z - Riu) —exp(2zi(n/bN)g)u(r) . 


The last condition on u(r) means that 


n i 
u(r+ra)= e| mo luo , 
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in which case 


v(r,| Rs, ns RVG) 


n mu 2nkyn mans 
-eppri QUT. 0» 
bN N bN N 


In order to satisfy condition (28) and the boundary 
conditions, we choose u(7) as follows: 


DR | Kir Kahn (30) 
u(r) Sw) E irs Ne , 
with 
zo (r-- rs) = w(r) , 


(31) 
vw(r--N'ay) = exp( —iV'as x h-ryw(r). 


The function (26) will therefore be 
i n 
V(r) =exp{ik-r} esp e Repo (32) 
2a bN 


By using Eq. (32) and the relation (22) and by re- 
introducing the magnetic field according to Eq. (3) we 
get the functions 


Ya" (1) =exp{ik-r} 
Xexp(i[q4- (1/47) Ks:r ](as xh) -rywik(r--gas), (33) 


where k= (1/N) (miKy4-moKo--m5K;), mı, m» are inte- 
gers which take values from 0 to $—1 (with ? again the 
common factor of N and s), m; is an integer whose 
values range from 1 to V and the integer q has values 
from 0 to N’—1. The functions vo; (r) satisfy conditions 
(31). The meaning of / will be given later. For fixed k 
and 1 there are N’ (=N/p) functions in (33) which form 
a basis for an NW’ XN’ irreducible representation (23) of 
the magnetic translation group G. For different k we 
get nonequivalent representations of G. The functions 
in (33) are thus the symmetry adapted functions for all 
the irreducible representations of G. The physical 
picture connected with this description is the following. 
All the functions (33) with fixed 7 and k (and varying q) 
belong to the same energy. When k also varies the 
energy will be in general varying and one obtains an 
energy band for each /. The number / was thus intro- 
duced to distinguish between different energy bands in 
the same way in which it is done in the case of the usual 
translation group. 
From the fact that the function (33) form bases for 
irreducible unitary representations (23) of the magnetic 
* translation group, it follows that functions (33) are 


ZAK 


where the integration is over the volume of the crystal, 
we get 


Jem (ny? (r)d ye 6 qq’ Ori Ôw z (35) 


Hence, the functions (33) form a complete orthonormal 
set. 
IV. INFINITE MAGNETIC TRANSLATION GROUP 


The construction of the representations of (23) of G 
and of the symmetry adapted functions (33) was made 
possible by imposing the Born-von Karman boundary 
conditions. We know, however, that in the case of the 
usual translation group it is possible to write down the 
representations without using the boundary conditions, 
in which case the wave vector k describing the repre- 
sentations varies continuously in the first Brillouin zone. 
One may expect the same to be valid also in the case of 
the magnetic translation group. The crucial point in 
deriving the irreducible representations of G was the 
existence of an invariant Abelian subgroup F of the 
group G. The infinite magnetic translation group G 
always has an invariant commutative subgroup F 
[Eq. (23), Ref. 1] when the magnetic field H is in a 
direction of a lattice vector. The restriction imposed on 
the direction of the magnetic field only is a much weaker 
one and it leaves the group G infinite. Since the Born-von 
Karman boundary conditions do not appear explicitly 
in the functions (33), it is easy to obtain a generalization 
of the functions in the case of an infinite M.T.G. (with 
the restriction that the magnetic field is in a direction 
of a lattice vector, say, again in the direction of a3). 
We have, however, to distinguish the case 


hzé (4v/ V) (n/ N)as (36) 


from the case 


h= (4z/ V)(n/ N)as, (87) 


where n and JV have no common factor. The integers 
n and JV are no longer connected with the boundary 
conditions. In the case when (36) holds the symmetry 
adapted functions will be 


yí Qm (r) & exp (2msKs-r) exp{i[g+ (1/47) Ks- r] 
D (as xh. x) Wim, (r+ gas) D (38) 


where 0<m3<1 (continuously) and q takes on integer 
values from 0 to infinity. For the case (37) we get as 
in (33): 


y 4 O9 (r) =exp{zk-r} exp([g4- (1/47) Ke-r] 


= orthogonal for different k and q. [Note that the func- X (az xh) -r}wie(r+ga2), (39) 
tions wix(r) also depend on the vector k. This depend- where 
ence is obtained by substituting functions (33) into — — 
Schródinger's equation.] By assuming (as is usual) the q—0, 1, ---, N—1 E(QV/2)—1] for odd (even) N, 
orthogonality of wix(r) for different 7 mı mz 
E k-—K;--—K;-7;K;, (40) 
dit J wyr” (r)wix(r)d V = ôr , (34) x, y 
O<m, m<1 (2) for odd (even) N, 0€mi«t, 
r CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 
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and 
Wx(t-+ Na? = exp( —iNas x hr) (r) 
for odd N, 


wik[ rt (V/2)az]— exp( —1(V/2)a; x h-r)wis (1) 


for even N. 


(41) 


One should note that the appearance of /V/2 for even 
N is in agreement with the distinction between even and 
odd X in the construction of the irreducible representa- 
tions (23). 

There are two interesting points connected with the 
generalization for an infinite M.T.G. The first point is 
that by changing the magnetic field continuously we can 
pass from case (37) to case (36). This means that by 
changing the magnetic field by an amount which can be 
as small as we like we change the degeneracy from a 
finite one (given by the number N) to an infinite one. We 
thus see that the representations are very sensitive to the 
magnitude of the magnetic field. The relation (37) can 
also be written in a different way by multiplying both 
sides by a1 x a». We get (by using the definition of h): 


H-3(a:xas) m 


xm (42) 
hc/e N 


where / is Planck’s constant. In the numerator on the 
left of (42) we have the flux of the magnetic field 
through an elementary area enclosed by the vectors 
a a2 and —(ajd-a;). The quantity /c/e is the ele- 
mentary quantum of magnetic flux (fluxon) which is of 
great importance in superconductivity? and may have 
an influence on the significance of potentials in quantum 
mechanics? From Eq. (42) it follows that the repre- 
sentations of the magnetic group are finite when the 
ratio of magnetic flux through areas enclosed by any 
vectors of the Bravais lattice to the elementary fluxon 
are given by rational numbers. Through Eq. (42) we 
come also to the second point, namely, when the ratio 
given by this equation is an integer, all the constant 
factors in the definition of the elements of the M.T.G. 
[Eq. (1)] will be unity. In this case the magnetic 
translation group will be isomorphic to the usual trans- 
lation group and all the irreducible representations of 
the first will become one dimensional, and one has an 
energy band picture as in the Bloch case. In this con- 
nection, it is interesting to refer to the work of Azbel!! 


? M. Peshkin and W. Tobocman, Phys. Rev. 127, 1865 (1962) 


iven therein. 
E P ee D. Bobni Phys. Rev. 115, 485 (1959); 123, 
1511 (1961). ; 
n Mi Y) ‘Azbel, Zh. Eksperim. i Teor. Fiz. 44, 980 (1963) 
[English transl.: Soviet Phys—JETP 17, 665 (1963) ]. 


TRANSLATION GROUP, 
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who shows that for a special Hamiltonian all physical 
properties of electrons in a magnetic field are periodic in 
the magnetic field with a period 


h= (V7 V)as . (43) 


The relation (43) expressed in the form (42) is just a 
requirement on the magnetic flux through areas enclosed 
by any Bravais lattice vectors to be multiples of the 
fluxon /ic/e. The fields satisfying condition (43) must be 
of the order of 10!° G for a2210-5 cm, and cannot be 
practically achieved. The fundamental significance of 
flux quantization in the problem under discussion 
is, however, of great interest and requires further 
investigation. 


V. CONCLUSION 


We have indicated the usefulness of the magnetic 
translation group for investigating the dynamics of a 
Bloch electron in a magnetic field. From the irreducible 
representations of the M.T.G. a set of orthonormal 
functions was constructed. It turns out that when a 
magnetic field is present the imposition of boundary 
conditions leads to quantization of the field, i.e., by 
imposition of boundary conditions on crystals, even of 
dimensions of the order of 1 cm, the magnetic field can 
only take values which differ by about 10 G. (It has been 
usual to assume that the Born-von Karman boundary 
conditions have no influence on the physical description 
of an electron in a periodic potential.) It was therefore 
important for us to remove the boundary conditions. By 
doing so, we found the irreducible representation of the 
M.T.G. (and the symmetry adapted functions) when 
the magnetic field is required to be along a lattice 
vector—without any restrictions on the magnitude of 
the magnetic field. It turns out that the representations 
are very sensitive to whether or not condition (42) holds. 

It would be interesting to find the representations of 
the M.T.G. without any requirements on the magnetic 
field at all. 
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We have measured the nuclear rel 


axation times T1 and Ts for 1% Ni“ in 99% iron, 1% Ni“ in 99% 


cobalt, and 0.5% Co?? in 99.5% nickel at several temperatures using the pulsed-free precession method. 


The relaxation curves are generally nonexponential and power- 1 y 
to thermal fluctuations of the domain walls. At high-power 


om nuclei in domains, the longest measured relaxation times T, are 
s. The longest T's are found to be inversely propor- 
ĉl in cobalt, TT —1.2 sec °K for 1% Ni“ in iron, and 


are from nuclei in domain walls and 7 is due 
levels, where the signal is mainly fr: 


lower limits for relaxation times for nuclei in domain 
tional to temperature with T1T — 1.0 sec °K for 1% Ni 


dependent. The signals at low-power levels 


T3T —0.3 sec °K for 0.5% Co? in nickel, and are believed to be due to conduction electron relaxation. The 


quantity 1/y,?TiT (where yn is the nuc 
of the conduction electron relaxation mechan 
the nuclei of the solvent atoms in the pure so 


lear gyromagnetic ratio) is a normalized measure of the strength 
ism and is found to be smaller for the impurity nuclei than for 
lvent metal. This is believed to show that the relaxation of the 


nuclei by the conduction electrons (which is enhanced in the pure metals by spin waves) is slower for the 


nuclei on the solute atoms in the alloy 


s due to a reduction of the spin wave enhancement. T» for Ni“ in 


cobalt is 0.15 msec at room temperature and is believed to be due to a coupling of Nit! and Co? nuclei by the 


Ruderman-Kittel interaction. 


I. INTRODUCTION 


N the pure metals, iron, cobalt, and nickel, the 
nuclear relaxation times Tı and T» have been 
measured by Weger? by the free precession method. 
The decay of the longitudinal and transverse magnetiza- 
tion in the ferromagnetic metals is found to be non- 
exponential and a function of the rf power level. Nuclei 
in the domain walls experience a large enhancement of 
the applied rf level due to the domain wall motion. 
Thermal fluctuations of the domain walls also provide 
a relaxation mechanism; consequently, at low rf pulse 
levels, the nuclear signals are from nuclei near the 
center of the domain walls which have the largest rf 
enhancement and shortest relaxation times. At higher 
rf pulse amplitudes signals come from nuclei farther out 
from the centers of domain walls, which have lower 
enhancements of rf fields and longer relaxation times. 
Finally, at very high rf pulse levels, signals come 
primarily from nuclei in the domains which see a much 
smaller rf enhancement than those in the domain walls. 
For these nuclei, relaxation times 71 are longer, being 
determined by conduction electrons now and no longer 
by domain wall motions. For T», the experimental 
situation is more complicated. For low abundance of 
nuclear spins and at high temperatures the nuclear 
spin-spin interactions can generally be neglected. In 
this case, T% is found to be of the same order as Tı. For 


“cases in which nuclear spin-spin interactions are 


important, as in pure cobalt, these determine T». 
In the present work we have used the free precession 
method to measure T, and Tz for Ni® in dilute alloys 


1M. Weeer, thesis, University of California, Berkeley, 1961 
(unpublished). 
ae E. L. Hahn, and A. M. Portis, J. Appl. Phys. 32, 
1645 (1701). 
“sM. Weger, Phys. Rev. 128, 1505 (1962). 


of 1% Ni® in cobalt! and iron, and for Co? in a 0.5% 
Co® in nickel alloy.® Preliminary to studying relaxation 
times of the impurity nuclei in the alloys, some measure- 
ments were also made of Tı and T; for Ni? in pure 
unenriched nickel powders to determine the effect of 
annealing on the relaxation times. The Ni*' relaxation 
times were also measured in the 0.5% cobalt in nickel 
alloy to determine the effect of the cobalt impurity on 
the Ni® relaxation. 

In Sec. II the experimental methods and techniques 
are discussed. In Sec. III we review briefly the theory 
of relaxation, both in domain walls and in the domains. 
In Sec. IV we give the experimental results, and com- 
pare them with the theory. 


Il. EXPERIMENTAL METHOD AND TECHNIQUE 
A. Equipment 


We have used the free precession method to measure 
the inhomogeneous linewidth and the longitudinal and 
transverse relaxation times T, and T». A pulsed push- 
pull rf oscillator (Arenberg Ultrasonic Laboratories 
Model 650-C) was used to supply the rf pulses. The 
widths of the rf pulses used were about 1 usec and the 
maximum rf voltage level was about 300 V peak-to-peak 
rf. The metal powder was inserted in the rf coil of the 
oscillator. A second coil coaxial to the first was used to 
detect the free precession signal. It was convenient to 
heterodyne the nuclear signal with a mixing frequency 
differing by about 20 Mc/sec, using a television tuner. 
The television tuner was followed by a wide-band rf 


‘tR. L. Streever, L. H. Bennett, R. C. La Force, and G. F. Day, 
Phys. Rev. 128, 1632 (1962). 

5R. L. Streever, L. H. Bennett, R. C. La Force, and G. F. Day, 
J. Appl. Phys. 34, 1050S (1963). 
CA H. Bennett and R. L. Streever, J. Appl. Phys. 33, 1093S 
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amplifier and detector. The video signal was then 
amplified and observed directly on the oscilloscope. 
The dc pulses necessary to gate the pulsed oscillator 
were supplied by appropriate combinations of Tektronix 
Nos. 162 and 163 units. 


B. Measuring Methods 


To measure 7; and Ts, one usually employs a method 
in which one makes use of 90° and 180° pulses. In a 
ferromagnetic metal as a result of the skin depth and 
the distribution in enhancement factors, different 
nuclei see different rf fields. Consequently, 90° and 180° 
pulses are impossible to obtain. The decay of the 
magnetization in the transverse direction with time 
constant T, is, however, independent of pulse adjust- 
ments and T$» can be obtained from the echo decay 
envelope. To measure 71, we saturated the nuclear 
signal with one or more closely spaced rf pulses, and 
then monitored the recovery of the magnetization by 
observation of the echo following two closely spaced 
pulses applied some time after the saturating pulses. 
An echo rather than a single pulse was used to sample 
the recovery of the magnetization because for the broad 
resonance lines where the echo width is a few psec, the 
receiver blocking after the rf pulse makes the decay 
itself impossible to observe. Weger! has shown that the 
above method is subject to some error due to diffusion 
of magnetization in space out of the domain walls. By 
applying a large number of saturating pulses whose 
separation is much less than T°, this effect can to a large 
extent be overcome. 

The relaxation times Tı and Tz were measured as a 
function of rf pulse amplitude. The echo signal following 
two pulses of equal width goes from zero through a 
maximum at low rf pulse amplitudes as the pulse 
amplitude is increased from zero, then slowly decreases 
again as the pulse amplitude is further increased, as 
discussed in Ref. 1. The variation in the echo signal 
intensity results from the distribution in rf fields due 
to skin depth and a distribution in enhancement factors, 
and the signal reaches a maximum when ynniH sm 2. 
Here m is an average effective enhancement factor, 2H, 
is the linearly applied rf field, 4, is the pulse width, and 
yn is the nuclear gyromagnetic ratio. The relaxation 
curves were in general nonexponential, and a function 
of the amplitude of the rf pulses for a given pulse width, 
which was usually about one usec. T» and 7; can in the 
case of a nonexponential decay be defined for any point 
on the relaxation curves from the slope. That is, 


1/T:= —d nM (0/d!, 
where M(t) is the amplitude of the echo signal at a 
time / after the first pulse and 
1/Ty= —d MEM (9) M(H /dt, 


where M (I) is the value of the sampling signal at a time 
t after the last saturating pulse. 


A1613 


C. Samples 


The majority of the relaxation time studies were 
made on alloy powders of 1% Ni" and 99% cobalt, 
1% Ni" and 99% iron,* and 0.5% Co” and 99.5%, 
nickel (all atomic percents). The Ni“! in cobalt was 
annealed for $ h at 600°C. The Ni! in iron sample was 
annealed for } h at 680°C. The Co? in nickel sample 
was annealed for 1 h at 1100°C, but some measurements 
were also made on an unannealed sample. The relaxa- 
tion times for nuclei in the domain walls is expected to 
depend somewhat on the degree of particle annealing 
and will be discussed in Sec. III.A. The average 
particle size was about 10 u. More details of sample 
preparation have been previously discussed.*- 

Some studies were also made on the Ni?! resonance 
in pure unenriched nickel samples with various degrees 
of annealing to investigate the effects of strains on the 
relaxation times. These samples were all 10 u powders. 
The preparation of the pure nickel samples has been 
discussed previously.’ 


IH. THEORY 
A. Relaxation in the Domain Walls 


Nuclear relaxation in the domain walls has been 
studied both theoretically and experimentally by several 
authors.!?:$71? For the pure metals Weger’ gives for 1/7; 


1/T1= (Tras MP) benz). (0) 


Here k=Boltzmann constant, T=absolute tempera- 
ture, r,—correlation time for wall motion due to 
thermal fluctuations, ô= wall thickness, M,= saturation 
magnetization, and w= resonance frequency. For cobalt 
at room temperature Eq. (1) gives good agreement with 
experiment.’ For the pure metals and the alloys studied 
in this paper, wr,< 1 is a good approximation. In this 
limit 


1/Ti ET rM. Q) 


The weak temperature dependence of T; in the pure 
metals is believed to arise because while kT enters 
directly into Eq. (2), re probably increases with 
decreasing temperature due to greater eddy current 
damping of the domain wall motion.? 

Equation (2) was derived for the pure metals; how- 
ever, quite generally in the limit wr<1 the relaxation 
time will be given by (see Ref. 11) 


1/Ti— (H P yré'r- (3) . 


Here Hy is the fluctuating field perpendicular to, the» 
hyperfine field Hn resulting from thermal fluctuations 


1 R. L. Streever and L. H. Bennett, Phys. Rev. 131, 2000 (1963). 
Wn de Pru ep E ae ys. 311, 764 (1961). 
3 E. Simanek and Z. Sroul DE . 311, 764 (1961). 
» M, Matsuura ef al., J. Phys. Soc. Japan 17, 1147 (90), 
u C, P. Slichter, Principles of Magnetic Resi , (Harpes and 
Row, New York, 1963), p. 5, p. 153. i LORS 
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Ga 
of the domain wall. Equation (3) can be rewritten 


1/T == (y vH: (Tm (05) 2" j (4) 


where 0 is the angular displacement of the electron spin 
in the domain wall. : 
At a given temperature Eq. (4) shows that either an 
increase in the amplitude of the wall motion or the 
correlation time of the motion will shorten T. These 
parameters should be sensitive to the degree of anneal 
of the metal. Hence, in the pure metals, for example, in 
pure nickel, we might expect the Ni®! relaxation times 
T, to depend on the degree of annealing of the sample. 
Also, for the same annealing time we might expect the 
parameters (0?) and re to depend on the concentration 
of impurities so that the relaxation times Ti of the 
solvent nuclei might be expected to depend on the 
impurity concentration. We also see from Eq. (4) that 
samples with the highest rf enhancements (due to 
domain wall motion) will be expected to have the 
shortest relaxation times for the nuclei in domain walls, 
a result previously noted.” 

Consider now the relaxation time of the nuclear spin 
of the solute atom. From Eq. (4) we see that if the 
impurity spin in the domain wall moves with the same 
correlation time and angular displacement as the 
solvent atom spin in the dilute alloy, then the value 
of Ty? for the nuclei of the solute and solvent atoms 
should be equal. This relation is used in Sec. IV.C to 
discuss values of Tı measured for the nuclei of the 
solute atoms in the domain walls. 


B. Relaxation in the Domains 


The spin-lattice relaxation time for nuclei in the 
domains has been considered by Weger.’ The relaxation 
time T; for nuclei in domains due to conduction electron 
relaxation is given by the expression ; 


1/T,— (kT wa?) (ho3 2SN, (5) 


where a=lattice constant, w,=parameter describing 
spin-wave spectra, S— average spin/atom, Z—area of 
Fermi surface, w=nuclear resonance frequency, 
k= Boltzmann constant, and 7=absolute temperature. 
Equation (5) applies to the pure metals and the Tı 
mechanism is a second-order process which involves the 
coupling of the nuclei to the unpaired electron spins 
through the hyperfine interaction (AI-S) and the 
coupling of the electron spins to the conduction elec- 
trons through the sd coupling. The intermediate states 
involve the excitation of spin waves. 
~ In the cases where hyperfine fields are proportional 
to [dcal atomic moments (this is, at least, roughly true 
for the cases of concern here)? the factor (w?/S?) 
appearing in Eq. (5) is just proportional to yn’, where 
yn is the nuclear gyromagnetic ratio. Quite generally, 
the quantity 1/y,?T,T is a normalized measure of the 
stréngth of the conduction electron relaxation me- 
'chanism. If the mechanism in which the nuclear 


= 


STREEVER 


relaxation is enhanced by the spin waves applies to the 
solute nuclei in the dilute alloys, as well as to the pure 
metals, then the quantity yn211T would be expected to 
be of the same order of magnitude for the nucleus of the 
solute atom in the dilute alloy as for the nuclei of the 
solvent atoms in the same alloy or in the pure solvent 
metal. One might expect that the creation of spin waves 
by a spin flip of the solute atom will in general involve 
a higher energy than in the case of the same process in 
the pure metal and that the enhanced conduction elec- 
tron relaxation mechanism will be weaker for the solute 
nuclei. In this case the quantity y,?71T will be longer 
for the solute nuclei and closer to its value in the 
nonferromagnetic materials. 


C. Transverse Relaxation 


Weger has measured T» as well as Ti in the pure 
metals; iron, cobalt, and nickel.'~* In the case of Ni“ 
or Fe? in the pure unenriched metals, where nuclear 
spin-spin interactions are weak, T», at least at higher 
temperatures and in the domain walls, is of the order 
of T, and the transverse relaxation is related to the 71 
mechanism. In the case of Co® in pure cobalt where the 
abundance of active nuclei is high, T+ is determined by 
nuclear spin-spin interactions. 

At low temperatures in the domains where T' is long, 
T, may be determined by a mechanism of the type 
discussed by Moriya.? He considers in detail nuclear 
relaxation in antiferromagnets; however, similar con- 
siderations would apply to ferromagnets as well. 
Fluctuations in S; can arise from the interference of two 
spin waves and might be particularly important in the 
alloys. These fluctuations would be very effective for 
relaxation since they occur at lower frequencies than 
the spin waves. Hence, at low temperatures in the 
domains, we might expect T< T3, even in cases where 
the nuclear spin-spin coupling can be neglected. 


IV. RESULTS AND DISCUSSION 


A. T, and T; for Ni?! in Pure Nickel and 
in the Dilute Alloys 


Preliminary to studying relaxation times of the 
solute nuclei in the dilute alloys, measurements were 
made of Tı and Ts for Ni®! in pure unenriched nickel 
powder with a particle size of 10 u or less to determine 
the effect of annealing. The measurements were made 
at 77°K and the initial relaxation rates were measured 
at low pulse levels to determine T; and T» appropriate 
to the nuclei in domain walls. We found for unannealed 
samples or for samples annealed for 1 h at 300 and 
500°C that T; was roughly independent of annealing 
and about 1.2 msec with 7» about 0.3 msec. In nickel 
annealed at 1100°C for 1 h, however, Tı decreased to 
0.5 msec and T» to 0.06 msec. The Ni“ relaxation time 


1T. Moriya, Progr. Theoret. Phys. (Kyoto) 16, 641 (1956). 
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NUCLEAR MAGNETIC 
appropriate to nuclei in domain walls was also measured 
at 7 K in a sample of 99.5% nickel and 0.5% cobalt, 
which was annealed at 1100°C for 1 h. T, was found to 
be 1.3 msec, which is about the same as the T, in the 
pure nickel which was not well annealed. 

The decrease in T, in the well-annealed samples can. 
be explained as follows: In the well-annealed nickel, the 
initial permeability is high and we expect a larger 
amplitude for the domain wall motion (for a given 
excitation). This is in agreement with previous 
continuous-wave studies,’ where the large dispersion 
signals in the well-annealed samples were explained 
partly by the larger rf enhancements in these samples. 
The decrease in T; is partly related to the T decrease; 
however, the ratio of Tı to T» is even greater in the 
well-annealed sample, indicating that fluctuations in S; 
along the direction of the nuclear hyperfine field may 
be more important here. We also see that in the well- 
annealed sample of 0.5% cobalt in nickel the Ni! 
relaxation time T, is about the same as in the un- 
annealed pure nickel. Both strains as well as impurities 
seem to have the effect of limiting the displacement of 
the domain walls and lengthening the relaxation time. 
For the same annealing time at least the relaxation time 
is longer in the 0.5% cobalt sample than in the pure 
nickel. 


B. T, and T» of the Solute Nuclei in the Alloys 


The longitudinal and transverse relaxation curves 
were obtained as a function of pulse level at various 
temperatures, as discussed in Sec. II.B, for the solute 
nuclei in annealed samples of 1% Ni® in cobalt, 1% Ni! 
in iron and 0.5% Co* in nickel. The relaxation curves 
were, in general, nonexponential and dependent on the 
pulse level. The longest and shortest times are given in 
Table I. The shortest times Tio and T2 were determined 
from the initial slopes of the respective relaxation curves 
obtained at a low pulse amplitude below that at which 


'TABLE I. Relaxation times for the impurity nuclei in the dilute 
alloys, Tio and T» are the relaxation times appropriate to nuclei 
in domain walls, while Tis and Tzu approach the relaxation times 
in domains as discussed in the text. 


1% Ni* in Cobalt 


Temp. (°K) Tio (msec) Tiar (msec) Tso (msec) Tw (msec) 


3.5 0.15 0.15 
D AG 13 0.18 0.18 
4 35 250 019 022 


-1% Ni“ in Iron 


Tiu (msec) Tzo (msec) Tzar (msec) 
4 


Temp. (°K) Tiv (msec) 


300 0.7 0.17 20 
77. 1.7 15 ois ae 
h a 1.0 3.0 


0.5% Co® in Nickel 


RELAXATION OF IMPURITY NUCLEUS A1615 


the echo signal went through a maximum, and are 
believed to approach the shortest relaxation times 
appropriate to nuclei near the center of the domain 
walls. The longest times Ty and Toy were measured 
at the highest pulse level from the slope of the respective 
relaxation curves at the farthest time out, and are 
believed to approach the values for nuclei in the do- 
mains. The highest rf pulse level which was roughly 12 
times that at which the echo signal went through a 
maximum corresponded to an H: of roughly 6 G for Ni“ 
in cobalt and Ni“! in iron and about 3 G for Co? in 
nickel. 

The longitudinal relaxation curves were dependent on 
the rf pulse level, nonexponential at low pulse levels 
and more nearly exponential at higher pulse levels, 
where the signal was mainly from nuclei in domains. For 
the case of Ni“ in iron and Co? in nickel, the transverse 
decay curves at the higher temperatures were also 
dependent on the pulse level and nonexponential and 7; 
in the domain walls was of the same order as T, 
indicating it was related to the T; mechanism. At lower 
temperatures, particularly in the domains, T» is seen 
from Table I to be much less than T, and less dependent 
on pulse level. In the case of Ni"! in cobalt, the trans- 
verse decay curves are nearly exponential and T4 was 
nearly independent of pulse level. The nearly unique 
value of T» for this case indicates it is being determined 
by nuclear spin-spin interactions, a similar result having 
been observed for the case of Co? in pure cobalt metal.' 

The inhomogeneous linewidths at room temperature 
obtained from the width of the echo (assuming an 
exponential decay) and measured by the quantity 
1/zTs;* were about 240 kc/sec for 1% Ni“ in cobalt 
and 190 kc/sec for 0.5% Co in nickel and were in 
approximate agreement with continuous wave measure- 
ments. The 1% Ni“ in iron width (~700 kc/sec) was 
too broad to estimate accurately from the width of the 
echo. 


C. Discussion of T; in Domain Walls 


In Table II we have compared the value of Tio (the 
Tı appropriate to nuclei in domain walls) for the nuclei 
in the dilute alloys with the shortest T';'s measured in 
the pure solvent metals by Weger? which correspond to 
our Tios. In view of Eq. (4), if the angular displacement 
and the correlation time associated with the thermal 
fluctuations of the solute electron spin in the domain 
wall were the same as for the electron spin in the pure 
solvent metal, then the quantity Tio? (where v is the 
resonance frequency) should be the same for the solute , 
nucleus as for the nucleus in the pure solvent meta? In 
the last two columns of Table If we compare the 
quantity »?To for the solute nuclei in the alloys and the 
nuclei in the pure solvent metal. We see that in all cases 


S Tso (msec) Tw (msec) : 7 ji 
NU K) T se (mse Tur z ec) i 0.12 this quantity is longer for the solute nuclei. This can 
7 0:13 4 0.04 du probably be understood as due to the fact that ‘the 
4 1.4 0.15 : thermal fluctuations of the domain wall in the diluse* 
© 
CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar =° 


—— 'WWA7/-W (" J""Gu""-"-——— M 9 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


A1616 R. L. STREEVER 3 
f LE of 
i itudi cati i i in we ‘yo, for the impurity nuclei in the dilute all 
"TABLE II. A comparison of the longitudinal relaxation times in the domain walls, Trio, for t JU EDUE oys 
(from Table I) with the corresponding times in the pure host metal measured by Weger et al.* The quantities » Tio are also compared,> “ 
Tio for Co? in pure cobalt and 1% Ni® in 99% cobalt. j 
È : a M Mc 2Tio (Mc?/sec) Tio (Mc 
Temp (°K) Tn ER Tn aec) v eaa »( Tere) »* Tio as 10 Ce /sec) 
LONE 
300 0.1 1.4 213 70 4.5 6.9 
77 0.2 2.0 217 72 9.4 10.4 
4 0.2 3.5 217 72 9.4 18.1 
Tio for Fe’? in pure iron and 1% Ni® in 99% iron. d 
4 M Mc/sec vT 0 (Mc*/sec) v°T 19 (Mc/sec) 
Temp (°K) Tio inso Bie (nsec) v Mer »( s ) 10 i 1 ar 
300 0.9 0.7 45 85 1.8 5.1 
77 0.7 1.7 47 89 1.5 13.5 
4 10.0 6.0 47 89 22.1 47.5 


Tio for Ni?! in pure nickel and 0.5% Co? in 99.5% nickel. 


= 


Tio (msec) 


y (Mc/sec) 


» (Mc/sec) 


» Tio (Mc?/sec) 


Tio (Mc*/sec) 


Temp (°K) Tyo (msec)* 


Nij“ Cos? Nis Co? Nist Cos 
300 0.35 0.05 26 112 0.24 0.63 
7I 1.5 0.13 28 120 12 1.9 
4 15 1.4 28 121 11.8 20.5 
a See Ref. 2. 


b The resonance frequencies are rounded to the nearest megacycles. See A. C. Gossard, Ph.D. thesis, University of California, 1960 (unpublished) ; 
J. I. Budnick, L. J. Bruner, R. Blume, and E. L. Boyd, J. Appl. Phys. 32, 120S (1961) and Refs. 4-7. 


alloy tend to be decreased compared with those in the 
pure solvent metal. 

Although in Table II the Tio for the solute nuclei 1s 
compared with the T1 in the pure host nickel, it would 
be better to actually compare the relaxation times for 
the nuclei of the solute and solvent atoms in the same 
sample. In the case of 0.5% cobalt in nickel at 77^K we 
have done this. As already discussed (see Sec. IV.A), 
Tio for the Ni® in a sample annealed for one hour at 
1100°C was 1.3 msec, while for Co? the value of Tio in 
the same sample was 0.13 msec. From Eq. (4) we see 
then that (6?)7, is about one-half as large for the Co? 
as for the Ni®, and this probably represents a somewhat 
smaller angular displacement of the cobalt spin in the 
domain wall. 

In the 0.5% cobalt in nickel sample, the Co® relaxa- 
tion times in domain walls were found to be nearly 
independent of annealing time. This seems to indicate 
that in this sample the solute atoms themselves rather 
than the degree of anneal are limiting the wall motion. 


D. Discussion of T; in the Domains 


The longer times Tim given in Table I are nearly 
inversely proportional to temperature and are believed 
to approach the limiting value of T, appropriate to the 

nuclei in domains and to be determined by relaxation 
. through the conduction electrons. The quantity 
— 1/54 T14T is a measure of the strength of the conduc- 
tion electron relaxation mechanism normalized for the 
difference in nuclear gyromagnetic ratios. In Table III 
we compare Yn Tim for the nuclei on the impurity 
atoms with the corresponding quantity for the nuclei 
of the pure host metal using the value of 7,7 measured 
. jn the pure metals by Weger.’ We also list in the same 


r 


table the atomic magnetic moments measured by 
neutron diffraction. 

The quantity y,?T:1xT is seen in all cases to be greater 
for the impurity nucleus than for the nucleus of the 
corresponding pure host metal and this is particularly 
true for the case of 0.5% Co? in nickel. Also, especially 
in the case of Ni® in iron where the signal at high power 
levels was weak and the dependence of T; on the power 
level was quite strong, we cannot be sure we have 
measured the longest time so that the Tiar is only a 
lower limit for the value of T; in the domains. 

The longer relaxation times for the impurity nuclei 
can probably be understood qualitatively as represent- 
ing a partial decoupling of the spin on the impurity 


"TABLE III. A comparison of the quantity TiarT for the impurity 
nuclei in the dilute alloys (from Table T) with the corresponding 
quantity in the pure host metal, as measured by Weger.» The 
quantities y;?T13T are also compared. The atomic moments* in 
Bohr magnetons (bm) are also given. 


Bo TuT Yn 
Nucleus (bm) (sec?K) (cps G) —— wysTiuT 
Co*? in Co 1.7 008 627 x10 341 x10 
1% Ni“! in 99% Co 0.6 1.0 2.88 X103 5.7 X10? 
Fe?! in Fe 22 2.5 0.864103 — 1.9 X10 
1% Ni® in 99% Fe 1.0 1.2 2.88 X103 6.8 X10° 
Ni! in Ni 0.6 0.115 2.38 X10 0.65108 
0.5% Co in 99.5% Ni 1.7 0.3 6.27 X103 11.8 x10 
Ur. D e aaa E E CENE ME C or 
a See Ref. 3. 


b For values of yn see Freeman et al., Proc. Roy. Soc. (Lon 
- R. Locher and S. Geschwind, Phys. Rev. Letters 11, 


(1957) for Co**; P 


don) 242, 455 


333 (1963) for Nit; and C. W. Ludwig and H. H. Woodbury, Phys- Rev. 


117, 1286 (1960) for Fe57. 
* See Rets. 13 and 14. 


13 M. F. Collins and J. B. Forsyth, Phil. Mag. 8, 


4 M. F. Collins and D. A. Wheeler, Proc. Phys. 


82, 633 (1963), 
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atom from the spin-wave spectrum; that is, a spin flip 
of the impurity atom spin, which is different from that 
of the host (see Table III), will probably in general 
require a higher energy than for the case of the pure 
metal. This probably reduces the enhancement of the 
conduction electron relaxation by the spin waves. 

Note also that while 4?T,4T for Ni® in cobalt and 
iron is longer than in pure nickel, it is still shorter than 
Y'IiT for the case of a nonferromagnetic metal like 
Cu® in copper where *?T4T is about 453 10* (with y in 
cpsG^ and TT in sec ?K).'5 Similarly, while 4?T1,T 
for Co® in nickel is longer than for Co® in pure cobalt, 
it is still less than y?71T for V*' in vanadium, where 
Y*TAíT is about 39X 105.16 


E. Discussion of T, 


In the case of Ni“ in cobalt, Tz is being determined 
by nuclear spin-spin interactions. We will discuss 
various coupling mechanisms. The Suhl-Nakamura 
interaction!^! involving virtual excitation of spin 
waves is a nuclear spin-spin coupling mechanism for 
Co? in pure cobalt.! This mechanism, however, at least 
in the pure ferromagnets and at low temperatures (well 
below the Curie point), couples nuclear spins according 
to Iy*Ig---I3*Ii- and this would not be a broadening 
mechanism for unlike nuclear spins which would not be 
precessing at the same Larmour frequency. This would 
probably to a large extent be true also for the case of 
1% Ni* in cobalt, where the spin-wave excitations are 
still mainly transverse. 

The Ruderman-Kittel? interaction involving a scalar 
coupling between nuclear spins would be a broadening 
mechanism for unlike spins. One might argue that the 
same mechanism which leads to the short values of 7," 
for both Co9 and Ni® in the 1% Ni® in cobalt alloy 
would also lead to an enhanced Ruderman-Kittel inter- 
action. However, an enhanced interaction of this sort 
involving spin-wave excitations would at low tempera- 
tures be anisotropic in the same way as the Suhl inter- 
action and would not be effective in broadening unlike 
Spins. However, the ordinary Ruderman-Kittel inter- 
action is about the right value to explain the observed 
T». This can be seen as follows: In the case of Cu® in 


15 A. G. Redfield, Phys. Rev. 98, 1787 (1955). 

16 J. Butterworth, Phys. Rea > T (1960). 

1! H. Suhl, J. Phys. Radium 20, 59). 

AT. NER Progr. Theoret. Phys. (Kyoto) 20, 542 (1958). 
19M. A, Ruderman and C. Kittel, Phys. Rev. 96, 99 (1954). 
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copper metal, the experimental results could be ex- 
plained by taking the scalar indirect exchange coupling 
A/h between two nearest Cu® neighbors to be about 
420 cycles or three times the calculated value.!5 Assum- 
ing a similar coupling between the nickel and cobalt 
Pl leads to an exchange coupling between neighbors 
o 


420 (Co) (Ni*)/[y (Cu) P= 120 cycles. 


The nearest-neighbor contribution to the Van Vleck 
second moment is then given by 


(Av) - AII--1) A/M. 


This gives a width [(A»?)]'? of about 1 kc/sec or Ts 
about 0.2 msec. Although the above is the order of 
magnitude of the T observed, the fast spin exchange 
of the Co? nuclei would tend to narrow the line and 
make the shape more nearly Lorentzian so that an exact 
calculation of T% is difficult. 

In the case of Ni“! in iron and Co? in nickel (where 
we believe we can neglect nuclear spin-spin interaction) 
the T» relaxation mechanism is evidently related to Ty 
at high temperatures in the domain walls. At low 
temperatures, particularly in the domains, 7; may be 
due to the mechanism discussed by Moriya. However, 
the situation is not completely understood. 


V. CONCLUSION 


The longitudinal relaxation times for the nuclei at 
the solute atoms in the domain walls in these alloys can 
be understood as due to the motion of the domain walls. 
However, it appears that the relaxation time for the 
solute nuclei will generally be somewhat longer than 
the relaxation time for the nucleus at the host atom 
due to the pinning of the domain wall near the solute 
atom. In the domains it appears that the coupling of 
the nucleus to the conduction electron which determines 
Tı at high power levels is weaker for the solute atom 
than for the host atom in these materials. In the case 
of Ni* in cobalt, T» is being determined by nuclear 
spin-spin interaction. A  Ruderman-Kittel indirect 
coupling of nuclei by conduction electrons seems to 
account for the observed value of 75. 
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Following the methods presented in an 
equivalent of over 80 000 points in the first 


were calculated. The constant energy surfaces were constructed for severa 
energy. The hole and electron volumes were calculate 


by the requirement that the two volumes be equal. 
actually been used to determir 


was defined by about ten cross sections perpendicular to the 
results and experiment is generally good and slightly 


INTRODUCTION 


N a previous article! the Fermi surface of beryllium 
was determined theoretically and compared with the 
experimental results of Watts. The surface was con- 
structed from conduction electron eigenvalues at the 
equivalent of 5184 general points in the first Brillouin 
zone (BZ). These eigenvalues were calculated by ex- 
panding the wave functions in a linear combination of 
23 orthogonalized plane waves (OPW 's). The crystal 
potential used in these calculations was self-consistent 
and its construction was fully described in the original 
article. 
'The same program which was developed for the 
earlier calculations has been used to calculate the con- 
duction electron eigenvalues at the equivalent of over 
80 000 points in the first BZ. All of these points were in 
the immediate vicinity of the Fermi surface. From these 
eigenvalues the Fermi energy was determined for the 
first time by the well-known requirement that the hole 
and electron volumes be equal. The Fermi surface 
corresponding to this Fermi energy was constructed and 
is presented in this report. It differs only slightly from 
the previous results, but tends to bring the theoretical 
and experimental results into closer agreement. The 
interesting feature here is not the new results, but rather 
the reminder that there is available, through the applica- 
tion of a well-known conservation requirement, a 
method of determining the Fermi energy which requires 
the calculation of energy eigenvalues only in the vicinity 
of the Fermi surface. 


DETERMINING THE FERMI ENERGY 


In the work of Loucks and Cutler! the Fermi energy 
was determined in the usual way by arranging in in- 
creasing value the energy eigenvalues which were cal- 
culated for representative points throughout the BZ. 
The energy corresponding to the highest occupied state 


*Much of this work was done while the author was at the 
Pennsylvania State University, University Park, Pennsylvania, 
and supported by the U, S. Air Force Office of Scientific Research, 
Grant Number AF-AFOSR-61-100. 
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aT. L. Loucks and P. H. Cutler, Phys. Rev. 133, A819 (1964). 
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earlier paper, valu ylliur 
Brillouin zone (all in the immediate vicinity of the Fermi surface) 


the conduction eigenvalues for beryllium at the 


Ivalues of the energy near the Fermi 


d for each case, and the Fermi energy was determined 
This is the first time that this well-known result has 
ne the Fermi energy in a band calculation. The corresponding l'ermi surface 


[0001] direction. The agreement between these 


improved over the previous results. 


was taken to be the Fermi energy. This method, al- 
though correct in principle, suffers in that it is difficult 
to get a representative sample of points (particularly 
near the zone edges) without calculating the eigenvalues 
at an extremely large number of points throughout the 
zone. Many of these points are not near the Fermi sur- 
face and hence are of little value except in the counting 
procedure for the determination of the Fermi energy. 

The method proposed here has the advantage that all 
eigenvalues are calculated at points in the immediate 
vicinity of the Fermi surface. Thus, they serve not only 
to determine the Fermi energy but also to define more 
accurately the surface itself. In addition it provides 
enough information about the other energy surfaces 
near the Fermi surface to enable one to calculate first 
and second derivatives for use in determining such 
parameters as the density of states at the Fermi energy 
and the effective mass. 

Essentially, the method employed is based on the 
obvious conservation requirement that the hole volume 
in the first double zone must equal the volume of the 
electron pockets in the second double zone. Since the 
electrons spill out of one zone into the next in order to 
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Fic. 1. Dependence of electron and hole volume on 
virtual Fermi energy. 
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Fic. 2. Cross-sectional area of electron and hole surfaces in 1/24th 
zone for planes perpendicular to the [0001 ] direction. 


minimize the energy, the vacated volume must be 
exactly equal to the volume occupied by the electrons. 
This is an admittedly obvious fact. However, it has 
never been used in practice to determine a theoretical 
value of the Fermi energy from a band calculation. The 
reason it has been avoided is perhaps because of the 
large number of calculations necessary to determine the 
electron and hole volumes. In this work they were 
determined for each of several constant energy surfaces 
near the Fermi surface. The results are plotted in Fig. 1. 
The curves of the hole volume and electron volume were 
found to intersect at £Zp— 0.909 Ry. The Fermi energy 
presented in the original paper was £p- 0.901 Ry. 


FERMI SURFACE OF BERYLLIUM 


The constant energy surface was constructed at 
Ey=0.91, as well as at other points, in constructing the 
curves shown in Fig. 1. Since we have found that the 
true Fermi energy is E p= 0.909, it is within the accuracy 
of the procedure used in constructing the surfaces to 
consider the one for Ey=0.91 to be the Fermi surface. 
These results are presented in Figs. 2, 3, and 4. In 
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Fig. 2 the cross-sectional area perpendicular to the 
[0001] direction is shown for the holes and the electrons. 
In both drawings the left-hand side represents the 
symmetry point K, and the abscissa corresponds to 
distances measured along the symmetry edge KH. The 
identifications such as G1, G2, etc., correspond to the 
cross sections shown in Fig. 4. It should be noticed that 
in the lower curve of Fig. 2, corresponding to the elec- 
tron cigars, the abscissa extends from K to H. However, 
in the upper curve, corresponding to the hole coronet, 
the distance between G1 and G11 extends only 4 of 
the distance from K to H. Figures 3 shows the inter- 
section of the Fermi surface with the 1/24th zone faces. 


Fic. 4. Intersections of Fermi surface with planes perpendicular 
to the [0001] direction. 


DISCUSSION 


'The Fermi surface which corresponds to the Fermi 
energy determined by the above method is compared to 
the previous theoretical results and the experimental 
results in Table I. It is seen that the agreement between 


TABLE I. Comparison of representative dimensions of the 
Fermi surface as determined by the present work, the de Haas-van 
Alphen measurements, and the previous theoretical work. 


Designation Loucks Watts Loucks & Cutler 

be 0.453 0.48 0.44 

ba 0.141 0.09 0.13 

bt 0.075 0.09 0.07 

ak 0.008 0.08 0.01 

kl 0.246 0.23 0.26 

gh 0.118 013-2, Otis 

no 0.017 0.02 0.04 

ml 0.574 0.56 057 t 
aed 

= 


the present theory and experiment is very good, 
especially for the bottom four entries which correspond 
to the coronet. However, the theoretical cigar is still 
shorter than the experimental one and almost triangular, 
rather than circular in cross section. This almost tri- 
angular shape also yields extremal cross-sectional: ads 
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which are consistent with the observed de Haas-van 
Alphen frequencies. Further experiments or calculations 
which can distinguish between this model and the 
circular one proposed by Watts are necessary. It should 
be noticed in Fig. 2 that there is a waist on the almost 
triangular cigar. This is also consistent with the de 
Haas-van Alphen data. The larger cross-sectional area 
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is about 2% greater than that of the waist. The cor- 
responding experimental results predict 3%. 
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Specific Heat of Thulium Metal Between 0.38 and 3.9°K* 


O. V. LouNASMAAT 
Argonne National Laboratory, Argonne, Illinois 
(Received 20 January 1964) 


The specific heat C, of thulium metal has been measured in a He? cryostat. Between 0.38 and 3.9°K 
C, —2.839T3--17.94T2-23.43T-?— 1.79T *—0.060T * (in mJ/mole °K). The last three terms represent 


the nuclear specific heat Cy. 


On the basis of earlier estimates, we put Cy, =0.243T3 and Cg —10.5T for the 


lattice and electronic specific heats, respectively. According to the simple spin-wave theory, the magnetic 
specific heat Cy is proportional to T’ for a ferrimagnetic metal; experimentally one finds Cyr=6.2T 5? for 
thulium, which has a rather complicated ferrimagnetic structure. Further, there seems to be no evidence 
in Cy for an exponential factor, to be expected because of magnetic anisotropy. All conclusions on C are 
tentative, however, until data at temperatures between 4 and 20°K become available. Cy does not fit to the 
simple picture as given by Bleaney either. Since 7 —3 for the only stable thulium isotope Tm, quadrupole 
interactions are zero and there are only two nuclear energy levels, their separation being determined by the 


magnetic hyperfine constant o^. This would give a nuc 


lear specific heat with even powers of T only, with a’ 


determining the values of the coefficients. The observed Cy cannot be fitted into an equation of this type 
which indicates that other interactions, probably nuclear exchange interactions, are present. Formally, 
the experimental situation may be expressed by writing a’=ao—b/T, instead of treating a’ as a constant. 
Our results are in good agreement with recent Méssbauer data by Kalvius ef al. who found 22.9 for the 


coefficient of the T~? term. 


I. INTRODUCTION 


T low temperatures, the specific heat of rare-earth 
metals has four components which, depending on 
circumstances, can be separated totally or partially 
from each other. These are the lattice specific heat 
CL—AT*, the electronic specific heat Cg— BT, the 
magnetic specific heat Cyr, and the nuclear specific 
heat Cy. In the higher lanthanides, Cy, is primarily 
caused by exchange interaction between the 4f elec- 
tronic spins. At 4.2°K and below, thulium has a unique 
ferrimagnetic structure, to be described in some detail 
later (cf. Sec. IV.B).! It is interesting to see how well 
the magnetic specific heat follows the prediction, 
Cu=CT*, of the simple spin-wave theory.” 
Cw is due to splitting of the nuclear spin states by 
“interaction with the 4f electrons. By far the largest 


contribution to Cy comes from the magnetic field pro- 

—— a 

A Based on E performed under the auspices of the U. S. 
tomic Energy Commission, and the Finnish R: i 
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Tusk, Turku, Bilan E y: aboratory, University of 
1 W. C. Koehler, J. W. Cable, E. O. Wollan, and M. K. Wilkin- 

son, Phys. Rev. 12d, 1672 (1900). : à 
4 T. (an Kranendonk and J. H. van Vleck, Rev. Mod. Phys. 30, 


4 (1958). 


duced by the orbital angular momentum of the Af 
electrons. The resulting nuclear specific heat has the 
familiar appearance of a Schottky curve with its 
maximum somewhere below 0.1°K. Above the maxi- 
mum, Cy may be expressed in inverse powers of T, 
the leading term being proportional to T ?. 

According to the above discussion the total specific 
heat of thulium becomes 


C,—AT*- BT4-Cu Cy, (1) 


where A and B are constants. 

The present measurements on thulium are a part of 
our research program for studying the heat capacities 
of rare-earth metals between 0.4 and 4°K,*~ with 
particular emphasis on Cy. The specific heat of thulium 
has previously been measured by Jennings, Hill, and 


30. V. Lounasmaa, Phys. Rev. 126, 1352 (1962). 
(19 d Lounasmaa and R. A. Guenther, Phys. Rev. 126, 1357 
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60, V. Lounasmaa, Phys. Rev. 128, 1136 (1962). 

10. V. Lounasmaa, Phys. Rev. 129, 2460 (1963). 
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Spedding" between 15 and 360°K and by Dreyfus 
Goodman, Lacaze, and Trolliet? between 0.5 and AK. 
The former data do not extend to low enough tempera- 
tures for determining Cy. Measurements by Dreyfus 
el al. are published as a brief research note only and 
detailed comparisons are thus impossible; this is unfor- 
tunate in view of the interesting behavior observed for 
Cy in the present research (cf. Sec. IV.C). 


II. EXPERIMENTAL 


The heat capacity measurements were carried out in a 
He? cryostat which has been described earlier; only 
the most important experimental features are related 
here. He! exchange gas was used for cooling the sample 
down to 4.2°K; the space surrounding the calorimeter 
was then evacuated by pumping until a mass-spectrome- 
ter-type leak detector showed a very small helium 
reading. For further cooling a mechanical heat switch 
was employed. In this way, good thermal insulation 
was achieved when the heat switch was opened and 
desorption of helium gas from the sample during heat 
capacity measurements was prevented. By pumping on 
He’, a temperature of about 0.33°K was reached and 
maintained in the He’ pot for 48 h without recondensing. 
With the heat switch closed, the sample was cooled 
from 4.2 to 0.35°K in about 24 h. 

For the heat capacity measurements, a colloidal 
graphite (Aquadag) thermometer was employed; its 
construction has been described elsewhere.^* The 
thermometer was calibrated against the vapor pressure 
of He‘ between 4.15 and 2.2°K, against the vapor pres- 
sure of He? between 2.2 and 0.75°K, and against a 
magnetic thermometer (chrominum methylamine alum 
salt) between 0.75 and 0.35°K. He* temperatures were 
determined according to the Tss scale. For He? the 
1962 Los Alamos scale“ was employed; a correction 
was made for the 0.6% of He‘ in our He? gas. The con- 
stants in the susceptibility versus temperature curve 
for the magnetic thermometer were determined from 
calibration points between 2.2 and 0.75°K. Altogether 
about 30 points were measured for the carbon thermom- 
eter between 4.15 and 0.35°K. 


IH. RESULTS 


Our thulium metal was purchased from Research 
Chemicals, Inc. (Division of Nuclear Corporation of 
America). It was vacuum distilled by the manufac- 
facturer, then remelted in a vacuum and cast into a 
tantalum crucible. Next, the tantalum was machined 
off and the sample turned down to a cylinder 0.9 cm 


uL, D. Jennings, E. Hill, and F. H. Spedding, J. Chem. Phys. 
34 . 

E e B. B. Goodman, A. Lacaze, and G. Trolliet, 
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m. G RS ids B van Dijk, M. Durieux, J. R. Clement, 
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long and 2.7 cm in diameter; its weight was 49.902 [4 
(=0.29539 mole). The spectrographic laboratory at 
Argonne found the following metallic impurities (weight 
%): Al, 0.03%; Cr, 0.003%; Fe, 0.01%; K, 0.01%; 
Mg, 0.005%; Mn, 0.01%; Na, 0.02%, Ta, 0.12%; 
Ti, 0.01%. These analyses are uncertain by a factor of 
2 except in the case of Ta for which the precision is 
+10%. The Ta result was obtained by a very careful 
densitometric analysis. Of nonmetallic impurities, the 
Argonne chemical laboratory found: H, 0.0006%; N, 
0.20% ; O, 0.10%; F, 0.024%; C, 0.014%. The accuracy 
is about 10%. 

The experimental specific heat results are listed in 
Table I and the points are also plotted into Fig. 1. All 


"TABLE I. Specific heat (in mJ/mole °K) of 
thulium metal. Experimental results. 


T(°K) G T (°K) G T (°K) 67 
RunI 2.6331 101.5 Run III 
0.3876 129.6 2.8513 119.7 1.0345 42.04 
0.4304 — 110.1 3.0789 — 140.9 11744 — 4192 
0.4808 92.70 3.3205 167.8 1.3213 43.54 
0.5405 78.06 3.5976 199.1 1.4803 46.24 
0.6114 65.71 3.9122 241.7 1.6372 . 50.29 
0.6949 56.08 1.7819 — 5475 
0.7916 48.96 Run II 1.9211 60.12 
0.8996 44.46 0.3799 133.6 2.0550 66.33 
1.0157 42.26 0.4155 1162 2.1880 72.38 
1.1468 41.85 0.4572 100.2 2.3306 80.92 


1.2877 43.24 0.5111 84.73 2.4937 . 91.61 
1.4233 45.37 0.5746 — 71.73 2.6047 106.8 

1.5598 48.40 0.6495 60.86 2.9315 126.8 

1.6952 52.48 0.7350 . 52.63 3.1984 151.9 

1.8286 57.04 0.8341 46.87 3.49903 187.5 

1.9666 62.44 0.9428 43.44 3.8202 226.6 

2.1092 69.19 1.0000 41.96 

2.2622 71.53 1.2077 42.49 

2.4347 88.37 1.38524 44.19 


calculations were performed by an IBM-704 digital 
computer. The results have been corrected for curva- 
ture due to finite temperature increments when measur- 
ing Cp. The size of these increments may be computed 
from the separation of successive points in Table I. 

After a heating period (about 1 min), the sample 


came to equilibrium in less than 2 sec at all tempera- _ 


tures. The heat leak to the calorimeter was always less 
than 0.2% of the corresponding heat input during 
heating periods. A correction for this was made in the 
customary manner by assuming linear drifts. The scatter 
of the points belonging to the same rur: from a smooth 
curve is about 0.3% (cf. Fig. 2). The heating current „ 
was measured with a Rubicon No. 2781 potentiometér 
and timed with an electronic timer using à tuning? 
fork frequency standard. The potentiometer was cross 
checked against a Rubicon No. 2773 double potentiome- — 
ter, the timer compared with radio signals over a 24-h - 
period, and standard cells and standard resistors cali- 


15 P. R. Roach, Argonne National Laboratory Technical Ke kat 3 
No. 6497 (unpublished). > ae 
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Fic. 1. The specific heat of thulium metal. Solid curve corre- 
sponds to equation Cp=2.839T3+17.94T+23.43 —=2—1.79T3 
—0.066T-* (Cp in mJ/mole °K). 


brated against NBS certified equipment. Possible sys- 
tematic errors here are thus negligible. The electrical 
leads between the He! bath and the calorimeter were 
made of lead-covered constantan and were supercon- 
ducting below 7°K. As described in another paper? 
it is likely that the coating had tiny cracks at which a 
small amount of heat was generated. Due to this, there 
might be an error of 0.1% in the effective heater resist- 
ance (R7=362.13+0.03T 9). 

The heat capacity of the empty calorimeter (Cea 
—0.009079--0.116T mJ/°K) was known from an 
earlier experiment; it was never more than 1.205 of 
the heat capacity of the sample. Possible uncertainties 
here can cause only errors smaller than 0.1% in the 
final results. 

As is usual in low-temperature calorimetry, the 
largest errors in Cp are probably due to calibration of 
the carbon thermometer. The scatter of the calibration 
points was, with a few exceptions, less than 1 mdeg. 
'The calculated temperature is thus probably within 
1 mdeg of the temperature defined by the He* and He? 
scales. Uncertainties of about 2 mdeg may arise 
from the extrapolation of the magnetic thermometer 
calibration below 0.75°K. 

The three runs and the thermomenter calibrations 
were made during the same experiment without warm- 
ing the sample above 4.2°K in the meantime. Largest 
discrepancies between runs occur in C, at 2.0°K where 
they are about 0.5%. 

A£ter considering all the sources of errors and the dis- 
crepancies mentioned above, the accuracy of the present 
results is estimated as 0.6% between 1 and 4°K and 
2% at 0.4°K. 

The observed specific heat points from Table I, over 
the entire temperature range from 0.38 to 3.9°K, were 

fitted, after assuming. Cx - DT32—ET-*(F — D*/R, cf. 


n 
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Sec, IV.C) and Car CT*, to Eq. (1) by the method of 
least squares. The result was (Cp in mJ/mole °K): 


Cp=2.14T°+19.2T+19.15T2—0.044T =. (2) 


However, as may be seen from Fig. 2, deviations of the 
measured points from this curve are fairly large at low 
temperatures. This indicates that a term proportional 
to T must be taken into account when expressing 
Cy (for a discussion on the temperature dependence of 
Cy, see Sec. IV.C). By writing Cy - DT3— ET?— FT 
the least-squares result was 


Cp= 2.839T?+ 17.947 + 23.43T ^ 

—1.79T-3:—0.066T—*. (3) 
Standard deviations of the first four coefficients are 
0.012, 0.12, 0.26, and 0.11. Equation (3) gives a reason- 
ably good fit to the experimental points over the entire 
temperature range from 0.38 to 3.9°K (cf. Fig. 1); 
maximum deviations in AC/C are 1.9% at 1.3°K. 
Figure 2 shows that the fit is much better than with 
Eq. (2). 

IV. DISCUSSION 
A. The Lattice and Electronic Heat Capacities 


Since the temperature dependence of Cy is uncer- 
tain, we must assume that the first term in (3) is due 
to C; and C yr, the second term to Cz and C y, and the 
last three terms to Cy. It is thus impossible to separate 
Cz and Cg from Cy without additional information. 

On the basis of considerations described in an earlier 
paper, we estimate that the lattice heat capacity of 
thulium should correspond to a Debye characteristic 
temperature 6=200°K giving C,=0.2437%, and that 
Cg—10.5T (Cr and Cg in mJ/mole °K). The major 
contribution to the first term in Eq. (3) thus comes 


A [2 „mJy mole °K 


04 


0.8 
7, °K 


1o L2 


Fic. 2. Deviations AC, of the experi tal i t points 
from the expressions ‘Cp=2. ATS 19.27 3- 19. ecific heat aT 
Laua p n d 1 and Cp=2839T3H17.94T 423437 
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from Car. The precision of these values is estimated as 
5% for 0 and 7% for Cg.’ 


B. The Magnetic Heat Capacity 


The magnetic susceptibility of polycrystalline thulium 
has been studied by Rhodes, Legvold, and Spedding" 
and by Davis and Bozorth. A Néel point was observed 
at about 56°K and a Curie point at 22°K. According to 
Jennings, Hill, and Spedding" the heat capacity shows 
a pronounced peak at the Néel point but no anomalous 
behavior was noted at the Curie point. 

The magnetic structure of thulium has been deter- 
mined by Koehler, Cable, Wollan, and Wilkinson! by 
using neutron diffraction techniques. At 4.2°K, where 
saturation has very nearly been reached, thulium has an 
antiphase domain-type structure in which four layers 
of atoms with their magnetic moments pointing in the 
direction of the positive c axis are followed by three 
layers of atoms with their moments in the opposite 
direction. The low-temperature magnetic structure of 
thulium is thus ferrimagnetic. Each atom has an 
ordered magnetic moment of 7 ug, and the ferrimagnetic 
structure has thus a net moment of 1 ug per atom. 

According to the simple spin-wave theory? the 
magnetic specific heat of a ferrimagnetic metal, at 
temperatures well below the Curie point, may be 
written C3; — CT?, where C is a constant. Niira and 
Cooper? have shown, however, that in metals with 
magnetic anisotropy there is an energy gap E, at the 
bottom of the spin-wave spectrum. Due to this, 
C ur — CT?? exp(—E,/kT) for a ferromagnetic metal and 
Ca — CT? exp( — Ej/kT) for a ferri- or antiferromagne- 
tic metal? More complicated formulas have also been 
proposed.!? The physical reason for £, is that it always 
takes a finite energy to turn a spin against the aniso- 
tropy field. Energy gaps of the order of E,/k=20—30°K 
have been predicted for dysprosium™ and for terbium.? 

We have tried to fit the magnetic specific heat of 
thulium, calculated from the relation (in mJ/mole °K; 
C, is the measured specific heat) Cu=Cp—0.2437° 
—10.5T—23.43T?-+1.79T*+0.066T—, to the above 
formula for Cyr. The best agreement was obtained 
without an exponential term and with Cy proportional 
to T9? instead of T’. The magnetic specific heat has 
been plotted into Fig. 3; the straingth line corresponds 
to equation (Cy in mJ/mole °K). 


Cy= 6.275/2 . (4) 


Above 2.2°K, where Cx is more than 60% of the total 
Cp the agreement is quite good; at lower temperatures 
Cy is small in proportion to the other contributions 
and the relative precision with which it can be deter- 


16 B. L. Rhodes, S. Legvold, and F. H. Spedding, Phys. Rev. 
109, 1547 (1958). à 

1 p. D. cO Men Phys. Rev. 118, 1543 (1960). 

18K. Nii s. Rev. 117, 129 (1960). 

wB. pam Plor Phys. Soc. (London) 80, 1225 (1962). 

? A. R. Mackintosh, Phys. Letters 4. 140 (1963). 
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Frc. 3. The magnetic specific heat of thulium. The straight line 
corresponds to equation Cy — 6.2T*? (Cy in mJ/mole °K). 


mined becomes poor. Of course, the weak points in 
our calculation are the assumptions made about the 
size of Cz and Cz. By writing Cg— 17.94T [cf. Eq. (3)], 
Cu would be proportional to 7?. It seems unlikely, 
however, that the electronic specific heat is much larger 
for thulium than for the other trivalent lanthanides.* 
Of course, by subtracting Cr, Cz, and Cy from C; in 
Eq. (3), we get Cu=2.596T°+7.44T. This relation 
for Cu has, however, no theoretical basis. 

Before drawing any definite conclusions about the 
temperature dependence of Cy, measurements of C; 
should be made at higher temperatures, especially be- 
tween 4 and 20°K. This would allow us to decide 
whether or not an exponential factor is necessary in 
expressing Cyr. At the moment one can only say that 
such a factor does not seem to be very important. 

Measurements by Dreyfus et al.” gave C,=0.949T? 
--21.5T--27T- (C, in mJ/mole °K). Their data agree 
with ours at 1.5°K, are higher at lower temperatures 
and lower above 1.5°K; at 4°K the difference is 40%. 
Large discrepencies between specific heat results of 
different investigators are quite common between 2 and 
4°K for rare-earth metals.-*? The disagreement is 
usually attributed to impurities which, even in small 
concentrations, might significantly alter the energy 
gap E, at the bottom of the spin-wave spectrum.*52* < 
There is some difficulty with this explanation here since 
the exponential factor in Cy, seems to be absent ®r its 
effect is small in thulium. Another possibility is magnetic 
ordering in an impurity which, even in quantities of 
0.1% of the sample, will cause large contributions to 
the observed specific heat, provided a magnetic trans- 
formation occurs in the impurity in or near the *tem- 
perature range under investigation.’ T we 
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C. The Nuclear Heat Capacity 


Bleaney? has discussed the nuclear specific heat of 
rare-earth metals in detail. The splitting of the nuclear 
energy levels is given in general by the Hamiltonian 


ge a'id-P[2—31(-2-1)]-3 32 20; K L), (5) 
jk 


where the first two terms represent magnetic hyperfine 
and electric quadrupole interactions between the 
nucleus and its own electrons (usually by far the biggest 
interaction is due to the 4f electrons) and the third is 
exchange interaction between neighboring nuclei. 
Since J=4 for the only stable thulium isotope Tm'*, 
there are only two nuclear energy levels, corresponding 
to i=+4, —3 in (5), and quadrupole interactions are 
identically —0. The magnetic hyperfine constant a’ 
varies as (J), which measures the electronic magnetiza- 
tion. At temperatures below 2°K, where Cy is appreci- 
able when compared with the other contributions to 
C,, complete electronic magnetization is usually 
assumed and (J=) is replaced by J(=6 for Tm** ions) 
and thus a' — const. 

Once the nuclear energy levels are known, Cy can 
be calculated from the partition function in the usual 
way and, by expanding the nuclear specific heat in 
inverse powers of T, we obtain [cf. Eq. (3)] 


Cy — DT3—ET3—FT3:-, (6) 


where 


D/R—&(a 1 3-1) 
+ G/9)P(- I) 27 (KI-MREVMEKS)R, (7) 


E/ R- (4/9) (a i (Ky) PUHI, (8) 
F/R- (1/30) (a)!T +1) QP--21--1) (9) 


(P=0, R is the gas constant, magnetic field is assumed 
in z direction, some higher terms in K have been 
ignored). 

According to Bleaney's?! estimates, contributions 
from the nuclear exchange interaction should be rela- 
tively small. However, if we put K=0 only even powers 
of T appear in (6) and (/—$ 


F= (3D2/10R) (22-+21-+1)/(P+I)=D/R. — (10) 


By assuming D=19.15 mJ °K/mole [cf. Eq. (2)], we 
get F=0.044 mJ ?K?/mole. Even at 0.4°K, the T~ term 


‘accounts only for 1.5% of the total Cy. By ignoring 


relation (10) and by letting all the coefficients in (2) 
be freé in the least-squares analysis, we obtain a con- 
siderably larger value of F (C; in mJ/mole °K): 


€5—2.821*--18.2T3-21.6T— 0.4974. (11) 
This equation fits to the measured points almost as well 


TaB. Bleaney, J. Appl. Phys. 34, 1024 (1963). 
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as (3). Our experimental accuracy is not sufficient to 
decide whether the “residual” Cy is proportional to 
T= or to T^. 

In any case, the observed Cp below 1°K forces us to 
assume that other interactions, besides the magnetic 
hyperfine term, have a fairly large effect in the value 
of Cy. Due to the peculiar magnetic structure of thu- 
lium, the nuclear exchange interaction, acting through 
the spin waves, might be unusually large in this metal. 
We shall adopt Eq. (3) for representing the specific heat 
of thulium; this choice was made since it is impossible 
to explain the large 7'* term in (11) on theoretical 
grounds. In these two relations the coefficient of the T= 
term differs by 8%. On the basis of this and our earlier 
error estimates (cf. Sec. III), we put D=23.4+1 
m] ?K/mole. If the temperature dependence of Cy 
in Eq. (3) and the ratio F/D=D/R [cf. Eq. (10)] can 
be assumed correct, the limits of error in D would be 
+0.5 mJ °K/mole. 

Dreyfus ef al. have calculated from their specific 
heat data D=27++3 mJ °K/mole without taking 
higher terms in Cy into account. Since their measure- 
ments have been published as a smoothed curve only, 
it is impossible to reanalyze them. However, inclusion 
of higher terms would tend to increase D. From atomic 
beam experiments on neutral thulium atoms by Ritter,” 
Bleaney? has derived D=26.9 mJ °K/mole, which is 
in good agreement with the result by Dreyfus et al. 
However, recent Móssbauer measurements by Kalvius, 
Kienle, Eicher, Wiedemann, and Schüler? on thulium 
metal gave, at 5°K, the splitting of the ground state 
as 9.04-10-* eV, which corresponds to D=22.9 mJ 
°K/mole. If electronic magnetization was not complete 
at 5?K, this value would tend to increase towards 
lower temperatures; this effect cannot be larger than 
about 0.5%.” 

The present specific heat measurements are thus in 
good agreement with the Móssbauer data.? The ob- 
served temperature dependence of Cy remains a puzzle. 
Formally one can write for thulium, in the temperature 
range studied, a/— 29—/T, whatever the reason may 
be. Here a and b are constants. By extending the meas- 
urements to still lower temperatures, more would be 
learned about the nature of the interactions, in par- 
ticular, one could see whether cooperative effects are 
present. 
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A splitting of the ground acceptor state in silicon due to equilibrium homogeneous strains and a cor- 
responding transition temperature have been calculated. The equilibrium homogeneous strains are due to 
linear terms in the strain Hamiltonian which couple the acceptor states with the strain field. The transition 
temperature was found to be V4X5.6X 107? °K, where N4 is the acceptor concentration in acceptors per 
cubic centimeter. For indium, concentrations as high as 10? acceptors per cc ¢ should be possible before 
Coulomb and exchange effects between acceptors are dominant. 


I, INTRODUCTION 


HE acceptor state in silicon having the lowest 

energy transforms like I's of the tetrahedral 
double group. That is, the lowest acceptor state pos- 
sesses the same fourfold degeneracy as the Ps» valence 
band edge at k=0.! At temperatures sufficiently low, 
all holes can be considered to be in the ground acceptor 
state. 

It has been found?? that the valence band edge of 
silicon and germanium at k=0 is split under the ap- 
plication of a homogeneous uniaxial stress. These 
stresses also split the degeneracy of the ground accep- 
tor state.* 

If there are two acceptor states in the crystal, they 
should be able to interact through the strain, or 
phonon, field. If there are a great many acceptor 
states, they should collectively produce a strain in a 
manner analogous to the way in which a lattice of 
magnetic dipoles interact to produce ferromagnetism. 

In this paper the lowering of the energy correspond- 
ing to this collective equilibrium strain and a corre- 
sponding transition temperature are calculated. 


II. DIAGONALIZATION OF THE HAMILTONIAN 


In this section a Hamiltonian density will be set up 
and diagonalized, yielding the energy density associ- 
ated with various equilibrium strains in the crystal. 
The strains associated with the minimum energy den- 
sity and the maximum lowering of the energy density 
are obtained. 

Although the strain Hamiltonian for silicon doped 
with acceptor atoms contains strains of all wavelengths, 
and the equilibrium strain in the neighborhood of an 
acceptor state will contain strains having wavelengths 
comparable to the dimensions of the acceptor state and 
even shorter, the strains responsible for the collective 
effect of all acceptor states are homogeneous strains. 


* Supported by the National Science Foundation. 

+ National Science Foundation Predoctoral Fellow. _ 

1W. Kohn, Solid State Physics, edited by F. Seitz and D. 
Turnbull (Academic Press Inc., New York, 1957), Vol. 5. à 

2W. H. Kleiner and C. M. Roth, Phys. Rev. Letters 2, 334 
(1959). 

3J. C. G. Feher, Phys. Rev. 129, 1041 (1963). 
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placement of the atoms which destroy the symmetry" 


15 JUNE 19654 


Therefore, in all that follows, only homogeneous strains 
will be considered. 

If an individual acceptor is considered, it can strain 
in one of the normal-mode strains. However, the zero- 
point motion is large enough to average over all of the 
possible normal-mode strains and their negatives, so 
that on the average no strain exists. The only effect is 
to increase the amplitude of the zero-point motion. A 
homogeneous strain, however, induces a splitting even 
after averaging over the zero-point motions. Only the 
states split by the homogeneous strain are considered 
to be strain states of the system. This is in contrast 
with treatments of Jahn-Teller phase transformations of 
ordered impurities in which the individual atomic 
strain states are considered.* 

In the strain Hamiltonian there are two types of 
terms. First, there are the terms independent of strain, 
quadratic in the strain and of higher order in the strain 
which are present when the acceptor concentration, 
and hence the hole concentration, is zero. All powers 
of the strain higher than the second will be neglected. 

Second, there are terms which depend on the ac- 
ceptor concentration. Because the acceptor concentra- 
tion will always be much less than the atomic concen- 
tration of silicon, strain terms quadratic and higher in 
the strain which depend on acceptor concentration will 
be neglected. The change in elastic constants with 
acceptor concentration has been computed only for 
quite high acceptor concentrations? and will be neg- 
lected here. The linear terms in the Hamiltonian exist 
because the acceptor state is a wave packet of states 
taken from the top of the valence band, and exhibits ^ 
the degeneracy in the valence band. The Jahn-Teller 
effect? states that, if a nonlinear symmetric arrange- 
ment of atoms has a degenerate ground state (not 
including spin degeneracy), there will,exist in the 
Hamiltonian nonvanishing terms linear in the dis- 


and split the degeneracy. s. 
The most general form for these linear termS'for a — 
crystal with cubic symmetry has been given by Kleiner — — 


5 P, J. Wojtowicz, Phys. Rev. 116, 32 (1959). ` oy 

2 Csavinszky and N. G. Einspruch, Phys. Rev. 132, 2435 
1963). Re 
e H. A. Jahn and E. Teller, Proc. Roy. Soc. (London) 161, 220 - 
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and Roth? and can be obtained from symmetry con- 
siderations alone. The values of the deformation po- 
tential coefficients D, and D,’ for states in the valence- 
band edge of silicon at k=0 have been determined by 
Hensel and Feher? Because the ground acceptor state 
has the same symmetry as the valence-band edge, the 
linear terms caused by the acceptor state will have 
exactly the same form as those for the valence-band 
edge, except for a change in the deformation potential 
coefficients. 

If the binding energy of the acceptor state were 
much less than the energy lowering due to the equi- 
librium strain, the deformation potential coefficients 
obtained for the valence band edge by Hensel and 
Feher could be used for the acceptor state. Although 
this condition is not satisfied, these deformation po- 
tential coefficients will be used. The Hamiltonian den- 
sity due to acceptor states is taken to be the acceptor 
concentration times the energy per hole given by 
Kleiner and Roth.’ 

The Hamiltonian density is then 
=o (Seu) dH etc 22 exei (Seas) 22 6i? 

i >j >i 


+N a(Det X, eit GDE U2— Hes 
HGD EU J;JJ2/2]e , (1) 


i»j 

where c;; and c;, are the elastic stiffness constants, ¢;; 
and e;; are the conventional strain components,’ Da’, 
Du, and D,’ are the deformation potential coefficients 
appropriate to the various types of strain, 3C, is the 
Hamiltonian density with no strain, V4 is the acceptor 
concentration, J; is an angular momentum matrix for 
angular momentum 3, and the sums over 7 and j run 
from one to three over the Cartesian components. 
Where no angular momentum matrix is indicated, the 
identity matrix should be taken, because the terms 
independent of acceptor concentration do not split the 
acceptor states. 

If a conventional phase for our angular momentum 
matrices is chosen,® we can write our Hamiltonian in 
matrix form. 


erQit Qe Qs— 106 Q3—104 0 
e= QsT-1Qs €d-Q1— Qs 0 Q3—104 
Q3s+104 0 i e+Qi—Q2 —Qs+706| ’ 
0 QidiQ. ——Qs—iQs €+Q:1+Q2 
(2) 
where : 
= Q1=N Da? (ezzF evy t ezz); 


@:=Na ($D,) (a= Czar €yy) ] E 
5 Q3= Na (GDu) (2V3) (ezz— vy) ; (3) 
Qs=N4 GD.) (GN3)ezy; 
Qs— Na GD.) (Gv3)ez. ; 
Qs NA GD.) $V3)ey; 5 
81.1. Schiff, Quantum Mechanics (McGraw-Hill Book Com- 
pany, Inc., New York, 1955), 2nd ed., p. 146, 


- 
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and 


e=Eot Ben) iei 012 22 i566 
Haca) 26; (4) 


or, in terms of the Qi, 


pe cT 2612 Q+ C11— C12 (02-+02) 
6 (Da*N A 2 3 (2N4D,)* 
RO O rO (5) 
3 (4N aD? 


This matrix can be diagonalized to give eigenvalues 
for the energy density, 


N= et Ort (OP+OF+OP+0E+08)"?. (6) 


The minimum for the energy density cannot depend 
on the particular choice of Q> and Q; as long as Q?-+Q;” 
is equal to a constant. The same degeneracy holds for 
Qs, Os, and Qs. Also, strains of the type Qı are separated 
from the other types and cause no energy splitting. 
Minimizing the energy density À with respect to Qi, 
one obtains an equilibrium value for Qı. Terms in Qi 
can be included with the terms independent of strain 
to form the arbitrary zero of our energy. 

The energy density change due to the other strains is 


N= Aa?-- By (a?4-?) V2 , (7) 
where 


A=} (cunc) (GN aD)? B=32cu(GNaDw)?; (8) 
= 02+037; y=O0?2+02+0¢. 


The energy density can be minimized with respect 
to x and y by finding the values for x and y for which 
the first derivative of M with respect to both x and y 
is equal to zero. For AB, as is usually the case, 
either x or y must be zero at equilibrium. However, 
x=y=0 gives A —0 and is not a minimum for X. If 
A>B, as is true for silicon, the minimum energy 
density is 

Amin = "d 1/4B= —N4?D4/664 , (9) 


and the corresponding distortions are 
Czy tere + eye = N2D,2/3cd ; 
6227 Cyy= Cz2= Q1/3N ADq*. 


The strains ezz, €yy, and ezz are chosen so that Qz and 
Qs are zero, and their value is determined by the value 
of Q; at equilibrium. 


(10) 


II. THE WEISS FIELD METHOD 


The analogy with ferromagnetism will now be pur- 
sued, letting the strain field play the part of the mag- 
netic field, and the average value of the angular mo- 
mentum components of the acceptor state play the 
part of the magnetization. It will be found that the 
tatio of the average angular momentum components 
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COLLECTIVE 
to the applied strains obeys a “Curie-Weiss law.” and 
the transition temperature can be computed. : 

The response of the crystal to an applied strain must 
be determined. If the applied strain is of the form 
ezn then a “Weiss field” of the same form is assumed. 


= a SENT | 
£97 Czy| 0; Cxz| 0= Cyy| o= 62 0= ezl o= eyl 5 0. (11) 


The notation |o means to evaluate the strain with no 
external strain fields applied. 


Cry lot Ezy; Crz™ Ezd; Etc, (12) 
where ezy' is the applied strain. 
The Hamiltonian for the splitting of the acceptor 


state due to the total strain field esy is 
S6s— $DV[ Jyt Iy] 2)/2]e2y. (13) 


After diagonalizing (13), one finds that the acceptor 
level is split into two states, each twofold degenerate, 
with an energy difference 


AE- | Q/V3)DV'es,] . (14) 
For the higher energy state, the eigenfunctions are 
(Wijotiv_s)2)/V2; (ajo 1 —/2)/N2. This state will be 
denoted by +. The lower energy state has eigenfunc- 
tions (W1j2—iW_3/2) V2; (Jap — $9 12)/V2. This state will 
be denoted by —. 
The density of states at each level is 
N 4 exp(—Dy'ezy/V3kT) 
exp(Dy'ex,/V3RT) +exp(— Du’ ezy/V3kT) : 
N 4 exp(Dv'exy/ N3kT) 


7 exp (Duexy/VSRT) + exp (— Duen BET) 


N.- 
(15) 


N. 


'The density of the angular momentum coefficient is 


UIT Jyt Jw iG IJI Ji IN 
+4(— |I Jut JuJ 2| —)N-. (16) 


Here (+||-+) indicates that the expectation value 
between the upper states should be taken. This gives 
the upper eigenvalue +-V3/2. 

Substituting (15) into (16), one obtains 


WU Ju Ju] uem — Na(88/2) tanb(DV'es/NSRT) , (17) 


This form of strain dependence has been experimentally 
verified for the case of Oz centers in alkali halides.’ 
Equation (17) becomes, for Du'ezy&kT, 


aJ JytSyJ z)av 
= — (NgDul/2k) (€zy/T)=C(Cx/T). (08) 


The expectation values between dore states for 
all other angular momentum combinations in Our 


Hamiltonian (1) vanish, so that the assumed strain 


gives an average value for only its coefficient. 


This average splitting of the acceptor states 1n turn 


? W. Kanzig, Phys. Chem. Solids 23, 479 (1962). 
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leads to an equilibrium strain in the crystal. The 
energy density is 


= aD 4 Sly II 
cien 22 tiv bei 22 mieuesy 
Ficu Lowe (19) 


The energy density is then minimized with respect to 
the possible strains to obtain the equilibrium strains 


£zz| o= eyy| 077 €; o= zs| o= &y,| 970; 
exy| o7 697 —$DV [3 U Jy JJ sus] 
=U Syt Iuav]. (20) 
Equations (12), (18), and (20) yield 
A Jyt Jy Jav C 
TE yog 


(21) 


Cy, the transition temperature, is given by 
JE Cy =N 4D? kis (22) 


Comparing (9) and (22), one obtains for the lowering 
of the energy density 


Amin = —4N kT. (23) 
The splitting of the lowest acceptor level is then 
AE=kT., (24) 


which might have been expected. 

Using D,/—2.682:0.25 eV, determined by Hensel 
and Feher, and a value for cs,=8.007 X10" dyn/cm’, 
which it has at low temperatures and which varies 
little with temperature at low temperatures,” one 
obtains 


T,=N4X5.6X10-°K = 4X 2800°K, (25) 


where 224 is the fractional number of acceptors and V4 
is the acceptor concentration in acceptors per cc. 

Feher, Hensel, and Gere‘ found that a stress of about 
300 kg/cm? in the (100) direction was necessary to 
split the ground acceptor state enough for paramagnetic 
resonance absorption transitions to be observed over 
the broadening due to the background of random 
strains in the crystal. The energy splitting due to an 
applied stress in the [100] direction? is 


AEs=4D.T/(eu—a2), (26) 
where T is the applied stress. The splitting necessary 
for transitions to be observed is roughly 8X10-* eV, < 
or 0.9°K. 3 
In order to observe the effect of the equilibzium 
strain, one would like to increase the concentration to 
a value which would allow the energy splitting of (24) 
to be greater than the energy splitting necessary to 
overcome the random strains in the crystal. However, - 


» H, J. McSkimin, J. Appl. Phys. 24, 988 (1953). ipai 
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"TABLE I. Approximate concentrations and 
transition temperatures. 


Acceptor Na in acc/cc Tone 
B 1015 0.06 
Al 3X10!5 0.17 
Ga 4X 10!5 0.22 
In 102 5.6 


at very high concentrations, the acceptor states gain 
more energy by binding into hydrogen-like molecules 
and finally bands than they gain due to the Jahn-Teller 
effect. The energy of binding into hydrogen-like mole- 
cules comes from both direct Coulomb and exchange 
interactions. For concentrations in the range of inter- 


CALVIN S. 
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est, this binding energy is much larger than the energy 
due to Van der Waals forces between acceptor states 
and the direct magnetic coupling between acceptor 
states. 

Therefore, one must know at what concentration 
the molecular binding energy exceeds the energy lower- 
ing computed from (9). Table I gives the approximate 
concentrations at which this occurs, and the transition 
temperatures associated with these concentrations. 
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Band Structure of Gallium Phosphide from Optical Experiments at High Pressure* 


RICHARD ZALLEN AND WILLIAM PAUL 
Division of Engineering and Applied Physics, Harvard University, Cambridge, M assachusetts 
(Received 22 January 1964) 


The effect of hydrostatic pressure on the following optical properties of GaP has been measured at room 
temperature: the fundamental absorption edge region from 2.2 to 2.7 eV, an infrared absorption band 
appearing in zi-type material at 0.3-0.5 eV, peaks in the reflectivity spectrum at 2.8 and 3.7 eV, and recom- 
bination radiation in forward-biased p-n junctions at 1.7-2.3 eV. The results have been interpreted by means 
of a proposed energy band structure in which the conduction band states X1°, X2°, T°, Tis° are located at 
energies of 2.2, 2.5, 2.8, 3.7 eV, respectively, above the valence band maximum at T15”. The following pres- 
sure coefficients have been measured (the transition involved is given in parenthesis), where energy is 
expressed in eV and pressure in 10° bars: Ec(Di? — Xi) 2222—14P ; Eoi — 7i?) =2.78+10.7P; 
Eo (Tij? > Tis) =3.71+5.8P; AES (X16 > X3) =0.3+1P. The coefficients of Eg and Ey’ are close to those 
for the corresponding transitions in Si; that of Eo is close to the corresponding coefficient in Ge. The weak 
reflectivity peak at 2.8 eV, the direct gap, shifts with temperature at a rate of about —4.6X 10 * eV/?K, 
compared to a value of about —5.2X 1074 eV/°K for the 2.2 eV indirect gap. 


1. INTRODUCTION 


HIS paper describes an experimental investigation 
of the electronic energy band structure of GaP. 
The approach taken in this study was to measure the 
effect of hydrostatic pressure on certain optical prop- 
erties in the infrared, visible, and near ultraviolet 
regions of the spectrum. Included in these were absorp- 
tion near the fundamental absorption edge, extrinsic 
absorption. reflectivity, and recombination radiation. 
Several features of the lowest conduction bands have 
been elucidated in the energy range from 2 to 4 eV 
above the valence band maximum. 
Part of the basis for this work is a systematic correla- 


e 


group 4 and group 3-5 semiconductors.-* It has been 


* Research supported by the U. S. Navy Office of Naval 
Research. 
1Some of the principal results of this work were briefly men- 
tioned in R. Zallen and W. Paul, Bull. Am. Phys. Soc. 9, 61 (1964). 
2,W. Paul and D. M. Warschauer, Phys. Chem. Solids 5, 89 
1958); 5, 102 (1958); 6, 6 (1958). 


observed that corresponding energy separations in 
these materials have similar pressure coefficients. In 
other words, the effect of pressure on the energy of an 
electronic transition depends primarily on the particular 
type of final and initial states involved (each specified 
by a band index v, a reduced wave vector k, and a 
group-theoretic classification), and is relatively insen- 
sitive to choice of material from among this class 
of simple semiconductors. We have listed some pressure 


$ W. Paul, Phys. Chem. Solids 8, 196 (1959). 

* W. Paul, J. Appl. Phys. 32, 2082 (1961). 

PW. Paul and D. M. Warschauer, Solids under Pressure 
areor Book Company, Inc., New York, 1963), Table 8-4; 
p- 226. 

¢ W. Paul and H. Brooks, Progr. Semiconductors 7, 135 (1963). 

1R. Zallen, W. Paul, and J. Tauc, Bull. Am. Phys. Soc. 7, 18 
(1962); some of these data are shown in the article by J. Tauc, 1n 
Proceedings of the International Conference om the Physics 0 
Semiconductors, Exeter (Institute of Physics and the Physical 
Society, London, 1962), p. 341. 

*R. Zallen, Gordon McKay Laboratory of Applied Science, 
Harvard University, Technical Report HP-12, 1964 (unpub- 
lished); R. Zallen and W. Paul, unpublished data. 
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coefficients in Table I and briefly discuss two examples 

below. 

TABLE I. Pressure coefficients of Some energy separations in 
group 4 and group 3-5 semiconductors. 


Transition 
(low energy—high energy) 


y ðE/ðP in Representative 
group 4 group 3-5 units of aterial 
element compound 10-6 eV/bar measured 
Tos >To II, +(10—15) Ge,” GaAs,» InSb¢ 
Trl, TL, +5 Ge UST me 
Ia — Ay Tis Ay —(1—2) Si,! Get 
Ly—> Ly LL, -+(7—8) Ge, GaSb, InAs 

a See Ref. 10, 
b See Ref. 11. 
a See Ref, 12, 
d See Ref. 3. 


* S. H. Groves, Gordon McKay I. aboratory of Appli i arva 
d [ ( La Applied Science, Harvard 
University, Technical Report HP-10, 1963 is 
! See Refs. 2 and 26. UI 
£ See Refs. 3 and 38. 
b See Ref, 7. 


(1) The k=O vertical transition in germanium, 
responsible for the “direct” edge in optical absorption 
at 0.8 eV, takes place between a Tzs valence band 
maximum and a T» conduction band minimum. The 
transition in the zincblende structure 3-5 compounds 
which corresponds to I»;"— TD'»* in the diamond 
structure group 4 elements is 5? — T'3*? The pressure 
shift of the energy of this Ts? T»*(D-— Ii) 
transition is within the range (10-15) x 10-5 eV/bar for 
Ge,” GaAs," InSb,” and GaSb.' 

(2) Germanium and many of the 3-5’s exhibit 
doublet peaks in reflectivity in the 2-3 eV range.!4!5 
These are due to A3” — A;* vertical transitions along 
[111] axes in the Brillouin zone.!5 The pressure coeffi- 
cient of these reflectivity peaks in Ge, GaSb, and InAs 
is (7-8) X 10-5 eV/bar.? 

The results for GaP obtained in this study are 
consistent with this idea of correlation between pressure 
coefficient and transition assignment. 

Comparatively little is known of the band structure of 
GaP. The valence band maximum is assumed to be a 
T:s state at the zone center, in analogy with the other 
3-5 compounds.!’ The energy gap observed in optical 

NL unc ED ee Solids 17, 138 (1960). 

1 J, Feinleib, S. Groves, W. Paul, and R. Zallen, Phys. Rev. 
131, 2070 (1963); G. E. Fenner, J. Appl. Phys. 34, 2955 (1963). 

17]. Long, Phys. Rev. 99, 388 (1955); R. W. Keyes, ibid. 99, 
490 (1955). 

13 À. L. Edwards and H. G. Drickamer, Phys. Rev. 122, 1149 
(1961); T. Deutsch and B. Kosicki, Bull. Am. Phys. Soc. 9, 60 
wae Ehrenreich, H. R. Philipp, and J. C. Phillips, Phys. Rev. 
«ED SL nes cu H. Ehrenreich, Phys. Rev. 129, 1550 
DE Brust, J. C. Phillips, and F. Bassani, Phys. Rev. Letters 
9, 94 (1962). . 

17 F. Bassani and M. Yoshimine, Phys. Rev. 130, 20 (1963). 

( 18. Braunstein and E. O. Kane, Phys. Chem. Solids 23, 1423 
1962). oi ee 

19 i small terms linear in & near 

Cn Ee ignore rent in the 3-55, which displace the actual 


valence band maxima to positions very slightly removed from 


k=0 (see preceding reference). 
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absorption is 2.2 eV at room temperature, 2.33 eV at 
0^K.?-? Measurements on GaAs-GaP alloys reveal a 
nonlinear variation of energy gap with composition,” 
indicating that the conduction band minimum in GaP 
is not the same as in GaAs, i.e., is not a I’; state. This 
conclusion is supported by the large electron effective 
mass obtained from mobility and Faraday rotation 
experiments. That the fundamental absorption edge 
involves indirect transitions can be seen from the 
functional form of the dependence of absorption 
coefficient upon photon energy above threshold,” 
especially the distinct phonon-derived structure ob- 
served at low temperatures.” The effect of pressure on 
the band gap is to shift it, slowly, to lower energies,” 
an unusual behavior seen also in silicon.??* This suggests 
that the conduction band minima, as in silicon, lie along 
[100] directions in & space. Recent work by Spitzer 
and Mead on the photoresponse of surface barriers on 
GaAs-GaP alloys places the direct energy gap at 2.65 
eV." Our experiments indicate a somewhat higher 
estimate for this energy, closer to 2.8 eV, as will be seen 
in Secs. 3 and 5. Information available from ultraviolet 
reflectivity measurements and from measurements on 
an infrared absorption band in n-type material are 
discussed in later sections, along with our pressure 
experiments on these properties. 

The discussion of our experimental results in Secs. 
3-6 is presented in four parts. In Sec. 3 we describe the 
effect of pressure on the absorption edge between 2.2 and 
2.7 eV, showing how the shape of the edge and its 
pressure dependence implies the presence of indirect 
transitions of the type Tı —> X;^ near 2.2 eV, and 
direct transitions of the type Tı” — T';* near 2.8 eV. 
In Sec. 4 we discuss the 0.3-eV absorption band observed 
in n-type GaP, its pressure shift, and its interpretation 
as a direct inter-conduction band transition of the type 
X: — X3°. In Sec. 5 we describe the pressure shift of 
the strong reflectivity peak at 3.7 eV, as well as that 
of a weak peak at 2.8 eV not hitherto reported. We 
present evidence for attributing the former to Pys’—11s° 
and the latter to I'35* — T° direct interband transitions. 
In Sec. 6 we discuss the results obtained for the effect 


20. G. Folberth and F. Oswald, Z. Naturforsch. 9a, 1050 
(1954); F. Oswald, ibid. 10a, 927 (1955); H. Welker, J. Elec- 
tronics 1, 181 (1955); Doris Teichler (private communication). 

21 W. G. Spitzer, M. Gershenzon, C. J. Frosch, and D. F. Gibbs, 
Phys. Chem. Solids 11, 339 (1959). k 

2 M, Gershenzon, D. G. Thomas, and R. E. Dietz, Proceedings. 
of the International Conference on the Physics of Sémiconductors, : 
Exeter (Institute of Physics and the Physical Society, London, » — | 
1962), p. 752. motu 

2 à G. Folberth, Z. Naturforsch. 10a, 502 (1955). 3 

“T. S. Moss, A. K. Walton, and B. Ellis, Proceetinen E Le 
International Conference on Physics of Semiconductors, Exe D 
(Institute of Physics and the Physical Society, London, 1962), — 


. 295. 
PHA L. Edwards, T. E. Siykhouse, and Hi CO An 
Phys. Chem. Solids 11, 140 (1959); R. Zallen and W. Paul, — 
preliminary work quoted in Ref. 4. 

2 M. I. Nathan and W. Paul Ebys. Rev. 128, 38 
W. Paul and G. L. Pearson, ibid. 98, 1755 (1955). — 

1 W. G. Spitzer and C. A. Mead, Phys. Rev. 133, A872 
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"TABLE II. Optical components used for the various experiments. dali aT 
ed MR Fluid 
Spectral Spectral | 
o Dispersing slit-width bei surrounding 
Measurement (eV) element (eV) Source ee sample 
ene E y toco» LW. 526B photomultiplier or i 
Fundamental absorption edge 2.2-2.8 glass prism 0.004 W lamp APO ie tom Aplier isopentane 
s V NOS le CS; 
ared a i a 0.3-0. LiF prism 0.003 globar thermocoup E CS, 
RV e 23-15 NaCl Aer 0.01 D; lamp 9526B photomultiplier isopentane 
Reflectivity at 80°K 1.9-3.2 glass prism 0.004 W lamp geen photomultiplier vacuum 
Radiative recombination at 1.7-2.3 glass prism 0.02 GaP diode 9526B or ei isopentane 
300°K and 80°K or photomultiplier or 
7500-À grating 0.006 He gas 


of pressure upon recombination radiation emitted from 
forward-biased p-n junctions of GaP. The sharp edge 
emission lines at 2.1-2.3 eV, observed at 80°K, serve as 
accurate markers for monitoring the shift of the 
indirect edge. The results obtained in this fashion are 
compared to the results of Sec. 3. 


2. EXPERIMENTAL TECHNIQUES 


The spectrometer used in this work was a Perkin- 
Elmer model 12G unit used as a prism instrument or as 
a grating instrument in different experiments. Usually, 
the prism used was glass for the visible region of the 
spectrum, Lif for the infrared, and NaCl for the 
ultraviolet. Table II lists the prisms, sources, and 
detectors used for the various measurements, along 
with the spectral resolution available in each instance. 
In some of the low-temperature radiative recombination 
work, a grating blazed at 7500 A provided somewhat 
higher resolution for studying the relatively narrow 
(0.02-eV half-width) edge emission lines and their 
small pressure shift. The radiation was chopped 
mechanically at 13 cps and phase-detected. The 
resultant ac signal was then amplified, rectified, and 
recorded on a Leeds & Northrup type G recorder. 
Data were taken by continuous recording of the signal 
while scanning photon energy over the region of interest, 
rather than by a point-by-point method. Optical 
transmissions down to 10-! were measurable before 
reaching the limit set by scattered light within the 
monochromator. 

The technique of optical experiments performed at 
high hydrostatic pressure has been well documented 
elsewhere.223?9 The pressure fluid was transmitted 
between the Bridgman press and the sample chamber 
by means ôf fiexible, stainless-steel tubing.” Pressure 
was measured by means of manganin gauges," cal- 
ibrated by observing the freezing pressure of mercury 
—T 


(1958) M. Warschauer and W. Paul, Rev. Sci. Instr. 29, 675 
9 For.a bibliography of 22 references on optical high-pressure 
techniques, see Ref. 5, p. 442. 

3 W. Paul and D. M. Warschauer, Rev. Sci. Instr. 27, 418 


(1950). E 
"ap. W. Bridgman, The Physics of High Pressures (G. Bell 
raed Sons, London, 1949). 
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Fic. 1. Experimental arrangement for the measurement of 
optical absorption or reflectivity as a function of pressure: (a) 
block diagram of the optical system; (b) schematic diagram of 
the sample chamber with reflectivity samples in place. For the 


absorption measurements, the sample was the usual plane-parallel 
slab. 


at 0°C, 7566 bars.? Our experiments were conducted 
at pressures up to 9 kbars.? 

For the absorption and reflectivity pressure experi- 
ments, all performed at room temperature, the pressure 
fluids used were liquids; isopentane for the visible and 
ultraviolet measurements, carbon disulfide for the 
infrared measurements. The sample was placed in an 
optical high pressure vessel between two 4-in.-thick 
sapphire windows. A system of plane and spherical 
mirrors formed a real image of the monochromator 
exit slit at the sample position. The experimental 
layout is schematically shown in Fig. 1. For the 
measurements of absorption coefficient, the sample was 
a plane-parallel slab with polished surfaces normal to 
the direction of the light beam. For the measurements of 
reflectivity, four etched samples were arranged in 4 
periscope-like reflectivity capsule. The transmission of 

* D. K. Newhall, L. H. Abbot, and R. A. Dunn, in High 
Pressure Measurement, edited by A. A. Giardini and E. C. Lloyd 
(Butterworth Inc., Washington, 1963). : 

* Throughout this paper, we shall use the bar as our unit of 


pressure: 1 bar: 105 dyn/cm?— 1.020 kg/cm?=0.987 atm. 


(1961). Langer and D. M. Warschauer, Rev. Sci. Instr- 32, 32 
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this capsule, proportional to R*, exhibits peaks at 
photon energies at which the reflectivity peaks occur. 
Since we were interested in the position in energy of 
reflectivity peaks, rather than the absolute magnitude 
of reflectivity, the use of 45° angle of incidence rather 
than normal incidence does not present any difficulties. 
This is supported by measurements made with this 
technique on Ge, InAs, and GaSb,’ as well as on Sb 
which yielded values of peak positions in good agree- 
ment with values obtained at normal incidence.**% 
One feature omitted, for reasons of clarity, from Fig. 1, 
is a set of mirrors which sampled a small fraction of the 
beam from the monochromator for the purpose of 
monitoring the stability of the spectral distribution of 
the energy available in the source-monochromator- 
detector system. 

For the low-temperature measurement of the 2.8-eV 
reflectivity peak discussed in Sec. 5C, the reflectivity 
capsule was mounted in the vacuum chamber of a 
conventional metal cryostat, in thermal contact with 
a copper block connected to the liquid nitrogen bath. 

For the experiments on the emission from GaP 
diodes, some of which were performed at 80°K, the 
pressure fluid used was helium gas. This entailed the 
use of a second pressure system and sample chamber 
designed for use with gas. The diode was mounted on a 
combination electrical and optical plug in which a 
central sapphire window was encircled by electrical 
lead-in wires. For the low-temperature measurements, 
the pressure vessel was immersed in liquid nitrogen 
inside a stainless-steel Dewar. The radiation, emitted 
through the sapphire window, was conveyed by means 
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Fic. 2. Experimental arrangement for the measurement of 
radiative recombination as a function of pressure: (a) block 
diagram of the optical system; (b) schematic diagram of the 
sample chamber. 


E : sonal 
3 J. Tauc and A. Abraham, Proceedings of the Internationa 
Conference on Semiconductor Physics, Prague, 1! 960 (Czechslovak- 

ian Academy of Sciences, Prague, 1961), p. 315. 


3 M. Cardona, J. Appl. Phys. 32, 2151 (1961). 
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m . Bie m 
of glass fiber optics to the source position of the spec- 
trometer. This arrangement is schematically shown in 
Fig. 2. 
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Fic. 3. Effect of pressure, at room temperature, on the optical 
transmission of three GaP samples. The curves have heen normal- 
ized to unity in the transparent region. The two arrows show the 
spectral resolution used for these measurements. 


3. EFFECT OF PRESSURE ON THE ABSORPTION 
EDGE: 2.2-2.7 eV 


Transmission samples were prepared by polishing 
down bulk material. The range of absorption coefficient 
a covered in our measurements was from 5 to 5000 cm. 
Figure 3 displays representative data taken at room 
temperature on three samples of different thickness. 
Relative transmission, ¢/fo, where / is the transmission 
in the transparent region at low energy, is plotted 
against photon energy for each sample at a low and a 
high pressure. Pressure shifts the transmission curves in 
opposite directions for the two extreme samples: There 
is a small shift toward lower energy for the thick 
sample, which spans the regien of low « for Ay near 
2.2 eV; and a large shift toward higher energy for the 
thin sample, which spans the region of high æ from 2.5 
to 2.7 eV. The pressure effect reverses sign within the 
range covered by the intermediate sample, near 2.33 eV. 

Figures 4 and 5 show sets of transmission curves 
taken during two pressure runs. Figure 4 corresponds 
to absorption coefficients of the order of 10 cm, at 


p. 


photon energies just above the energy gap Eg near the - 


beginning of the absorption edge. Later we will show 
that the pressure shift of the curves in this region 
approaches the value of ôE g/3P, the pressure coefficient 
of the band gap. Figure 5 corresponds to “absorption 


coefficients of the order of 10? cme, at photon energies _ 


well into the fundamental absorption region. The change 
of shape with pressure is readily apparent; the lexge 
positive shift increases rapidly with 4v. The large 
difference between the ranges of photon energy covered 
in Figs. 4 and 5 illustrates the difference in magnitude 
between the small, negative shift with pressure near 
2.2 eV, and the large, positive shift with pressure near 
2.7 eV. "s 
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Fic. 4. A pressure 
run taken on a thick 
sample. The letters, 
arranged alphabet- 
ically, give the order 
in which the data 
were taken. 


RELATIVE TRANSMISSION 


002}- 


2235 2240 +2245 2250 2255 
hy (eV) 


Plots such as Figs. 4 and 5 were used to construct 
isoabsorption curves, /va=/va(P), in which photon 
energy for a given relative transmission (ie., for a 
given a) is plotted against pressure. These were linear, 
within our 0-9 kbar pressure range. In Fig. 6 the slopes 
of these lines, (@hv/0P)., are shown as a function of 
their zero-pressure intercepts /v(0). The two extreme 
points, at 2.78 and 2.23 eV, are obtained from other 
experiments to be discussed in Secs. 5 and 6, respec- 
tively. All that need be noted now is that they are 
consistent with the isoabsorption data spanning the 
range from 2.24 to 2.72 eV. Figure 6 essentially sum- 
marizes the effect of hydrostatic pressure on the room 
temperature absorption edge of GaP. 

To obtain 0E¢/dP from Fig. 6, it is necessary to 
know the room temperature value of Eg. One method of 
studying an indirect edge is a high-resolution analysis 
of phonon-absorption and phonon-emission components 


Fic. 5. A pressure 
run taken on a thin 
sample. The letters, 
arranged alphabet- 
ically, give the order 
in which the data 
were taken. 


AND W. 


PAUL 


of a(hv) at low a, as has been successfully performed 
for Ge and Si.37 Gershenzon, Thomas, and Dietz? have 
made such a study on GaP at low temperatures, 
observing phonon-emission components consistent with 
previous work on phonon combination bands in GaP. 
Their work is the most direct evidence thus far reported 
that the energy gap is indirect, and provides the best 
estimate available for Eg. Although unable to observe 
phonon structure at room temperature, they extrapolate 
their low-temperature data to a room-temperature 
energy gap of 2.22 eV. 
Using Eg 2.22 eV, we find from Fig. 6 that 


9Eg/0P — —1.2X10^* eV/bar. (1) 


This pressure coefficient is in fairly good agreement with 
previously reported values, less accurately determined 
than the present one, although it is somewhat smaller 
in magnitude.” The band gap of silicon, a T's? — Aj* 
indirect transition from the valence band maximum at 


a (cm^) 


22 23 24 27 28 


25 
hy (P=0) (eV) 


Fic. 6. The pressure dependence of the GaP absorption edge. 
Sample thickness (u): 16 e; 29+; 170 a; 550 m; 3000 x. Also, 
reflectivity peak [], edge emission O. 


the zone center to conduction band minima near the 
zone boundary along [100] axes, has a pressure coefh- 
cient of —1.5X10-5 eV/bar.26 In germanium, the 
corresponding T'oj? — A;* energy separation also has a 
small, negative pressure coefficient lying between 0 and 
—2X10-9 eV/bar?; the most accurate value is —1.9 
X106 eV/bar.** Thus the GaP band gap coefficient is 
similar to the coefficient of l';5* — A,* in silicon and 
germanium. In all three cases the energy separation 
slowly decreases with pressure. This may be contrasted 
to the behavior of the band gap of germanium, 8 
Tes? IL," indirect transition, which increases with 
pressure at a rate of -1-5X 10-5 eV/bar.? 


31 T. P. McLean, Progr. Semiconductors 5, 53 (1960). t 

5 W. E. Howard, Gordon McKay Laboratory of Applied 
Science, Harvard University, Technical Report HP-7, 1961 
(unpublished). 
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We conclude then that the indirect transition 
responsible for the energy gap in GaP is that which 
corresponds to the T35” — A;* transition in Ge and Si 
namely, T1” — A; Since the degeneracy at the X 
point in the diamond structure is split in the zincblende 
structure, it is likely that the conduction-band minima 
in GaP are at the X point on the zone boundary, 
rather than along the A axis in the interior of the zone. 
Thus we assign the band gap to the transition I'ig?— X4*. 

Figure 6 clearly shows that the absorption edge does 
not simply translate rigidly in energy along with Eq 
when pressure is applied, since for that situation 
(dhv/OP) would be equal to QEG/9P for all hv. Instead, 
(Ohv/OP) 4 varies linearly with hv between 2.2 and 
2.45 eV, beyond which energy it increases progressively 
more rapidly. Thus, for example, the photon energ 
corresponding to a=5000 cm™ increases with pressure 
at a rate of 8X10-* eV/bar, even though Eg itself 
decreases with pressure. 

To account for the variation of (dhv/dP)q with hy, 
we must first discuss the theoretically calculated and 
the experimentally measured shape of the absorption 
edge, a=a(hv). The simplest theory”? predicts that for 
indirect, allowed transitions, a is given by a sum of 
terms corresponding to phonon emission and absorption 
processes, and that each term varies quadratically with 
energy above threshold. Thus, 


a= 29 Gram), (2) 
where: 
0, AE#<0) AE*#=hv—Eg*, 
BE CA(AES?, AES»0) Eg*-Eg-hó. 


Here a;* is the component due to emission of a phonon 
of energy kbi, o;- is the component due to absorption 
of a phonon of energy &6;. In the more exact theory,*7:*9 
which includes the effect of the electron-hole interaction 
(excitons), it is shown that each phonon component, 
ait, is itself composed of several contributions starting 
at a series of energies given by Eo — (1/1?) Ec, where 
Ew is the exciton binding energy and n is a positive 
integer. For allowed transitions, each contribution for 
finite n (formation of bound electron-hole pairs) goes 
as (AE)!? for small AE, the energy above threshold. 
The highest threshold contribution, starting at hv 
— Eg** (formation of unbound electron-hole pairs), 
goes as (AZ)??? for small AE. à 
When experimental data on an indirect : edge is 
analyzed, it is usual to construct a plot of œa? versus 
hy. From the above it is clear that o! versus hy would 
not be expected to reveal simple linear behavior in the 
complex region near hy Eg. However, m the region in 
Which jy greatly exceeds the highest phonon-emission 
threshold (hv—Ec®>Ee for all i), the simple theory 


2R. J. Elliott, Phys. Rev. 108, 1384 (1957). 
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Fic. 7. A comparison of the absorption edges of Ge, Si, and 
GaP. The curves have been shifted horizontally so that the 
indirect energy gaps coincide. The Ge data (which are for 20°K) 
and the Si data (which are for 77°K) are from T. P. McLean, 
Progr. Semiconductors 5, 53 (1960). The GaP curve is our room- 
temperature, zero-pressure data, 


and the theory including excitons both give the same 
result: 


a C[v — Eg! )!/ hv]. (3) 


The quantity Eg' results from a weighted average of 
the phonon components and may differ from Eg by an 
amount of the order of a phonon energy. This expression 
holds provided that the assumption of parabolic energy 
bands, used in the theories, is valid to these energies. 
The factor (tv) usually does not vary by more than 
10% over the range of interest, so that we might 
expect a!/? versus hy to exhibit a region of linearity at 
photon energies exceeding Eg-+-0.05 eV. 
Our results on the room temperature, atmospheric 
pressure absorption edge of GaP are in quite good 
agreement with the data of Spitzer, Gershenzon, 
Frosch, and Gibbs? (henceforth referred to as SGFG). 
Figure 7 presents a comparison of the absorption edge 
of GaP with those of Ge and Si. The scales of photon 5 
energy have been shifted so that the indirect energy & 3 
gaps roughly coincide. The vertical scale is the square 
root of the absorption coefficient a. The germanium 
and silicon data, for 20° and 77°K, respectively, are 
from the work of Macfarlane ef al. quoted in Ref. 37. 
The silicon edge is accurately linear over the range . E 
shown, extrapolating to a/?=0 at a photon energy of . ^ — .- 
1.19 eV, 0.03 eV higher than the actual band gap at... —- 
77°K. The germanium edge is linear over a small range ——— 
of energy until the onset of direct transitions at Eo, — 
the direct energy gap. The extrapolated energy is 0.7: 
eV, in agreement with Eg (to the nearest 0.01 eV). The 
GaP edge is linear below about 2.5 eV, extrapo: 
to zero absorption at 2.20 eV, 0.02 eV lower than 
band gap as estimated by Gershenzon et al. 
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n 
2.5 eV, the absorption coefficient increases progressively 
more rapidly than the (/m—2.2 eV)? dependence 
obeyed between 2.25 and 2.5 eV. 

The GaP absorption edge and its pressure dependence 
may be represented, in the 2.2-2.5-eV energy range 
(the region of linear behavior in Figs. 6 and 7) by the 


expression 
a!2(hy,P)= A (P)[hv—Ec(P)]. (4) 


This is equivalent to (3) with the approximations 
Eg -Eg and (hv)?=constant. If we regard A» 
=hv(a,P) as the dependent variable, then partial 
differentiation of (4) yields, to lowest order in P, 


8). OEg 194 A 
(=) ei) 1g (0) I. (5) 
aP/, oP AaP 


This is just the observed linear variation of (Ohv/OP) a 
with kv shown in Fig. 6, where (9/1/9P)  — 0Ec/9P at hy 
- Eg(0), and where the slope of the linear region, 
(d/dhv)(8hv/8P)42:8, is given by — (1/4) (94/0P). In 
Sec. 7, the quantity B is estimated theoretically from 
the proposed pressure-dependent band structure of 
GaP. The result is in satisfactory agreement with the 
measured value of 1.15X 10-5 bar~ obtained from Fig. 6. 

In the region of photon energy above 2.5 eV, the 
curves of Figs. 6 and 7 deviate from their linear behavior 
at lower energy. Both a? and (dhv/dP)q increase, 
gradually at first and then more and more rapidly, 
above the line extrapolated from low energies. The 
suggestion which immediately comes to mind is that 
a new mechanism, strongly pressure-dependent, is 
beginning to contribute to the absorption above 2.5 
eV. 

Several possible explanations for the rise in absorption 
F are listed below: 


TERRE PET 


(a) the breakdown of the validity of the theory 
behind (3), due to such considerations as the k depend- 
ence of matrix elements, energy denominators, and 
especially, effective masses (nonparabolic energy bands 
away from the extrema); 

(b) the onset of a new indirect transition process 
at 2.5 eV, involving, for example, a set of higher 
conduction band minima; 

(c) the low-energy “tail” of a direct edge occurring 
at a higher energy not reached in our absorption 
measurements; - 

(d) the onset of direct transitions at 2.5 eV. 


€ 


= 


- - Both (a) and (b) are plausible possibilities. The 
——. valence and conduction band masses may increase at 
k Vectors away from the extrema, thereby providing a 

joint density of states for indirect transitions exceeding 

that predicted on the basis of parabolic bands. Also, 
regarding (b), it will be seen in Sec. 4 that there is 
evidence for a higher conduction band lying, at the 
X-point, about 0.3 eV above the lowest conduction 

^ band. However, our calculations indicate that these 
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two mechanisms are probably too weak to account for 
the observed increase in œ above 2.5 eV. Note, for 
example, that (a) is apparently unimportant in the 
case of silicon, for which o!? versus +v is linear over the 
0.5-eV range of photon energy shown in Fig. 7. If the 
additional contribution to the absorption of GaP is 
analyzed by constructing a2=a—a;, where o, is 
extrapolated from the low-energy region where a=; 
= A (hy —2.2 eV)’, then it is found that os does not at all 
resemble the (/nv—2.5 eV)? dependence expected of an 
indirect transition beginning at 2.5 eV, but increases 
much more rapidly than this. Thus, neither (2) nor (b) 
accounts for the observed shape of the edge. Moreover, 
neither mechanism readily accounts for the more rapid 
increase of (9/»/0 P), with hy above 2.5 eV. 

If the minimum *vertical separation between the 
valence and conduction bands in GaP occurs at k=0 
and the transition involved is of the type T15 — Ty’, 
as is the case in several other 3-5 compounds, then the 
pressure coefficient of the direct gap, 9o/0P, is 
expected to have a value in the range of 4-10 to 
-L15X10-* eV/bar. Mechanisms (c) and (d) would 
then predict an increase of (3hv/3P)a toward the value 
of 9Es/8P. In germanium it is found that (0hv/9 P). 
reaches the value of the direct gap coefficient at an 
energy somewhat less than Eo, and then levels off.!? 
For GaP we see that (dhv/dP)q is still below the 
expected value of QEo/8P at hy=2.7 eV, indicating that 
Ep occurs at a higher energy. Thus, only (c) is consistent 
with the variation of (@hv/0P). of Fig. 6. SGFG 
proposed (d) to account for the increase in a above 
2.5 eV, and to account for an infrared absorption band 
observed in n-type material. Our discussion in this 
section favors (c) as the explanation for the increase in 
a; the actual origin of the infrared absorption band will 
be discussed in Sec. 4. 

Ina direct absorption edge, the absorption coefficient 
rises rapidly to a high value at Eo, and then increases 
more slowly with hy.374° The direct edge of Ge is shown 
in Fig. 7, and clearly exhibits the characteristic “knee” 
at Eo. This curve is for 20°K and shows the exciton peak 
at an energy just under Eo. At room temperature the 
exciton peak is washed out but the marked knee at Eo 
persists. No indication of such a knee appears in the 
GaP absorption edge of Fig. 7. This also demonstrates 
that Eo does not fall within the range shown, and that 
it thus exceeds 2.7 eV. 

From these arguments we therefore propose that (c) 
is the dominant mechanism responsible for the deviation 
from linearity above 2.5 eV in Figs. 6 and 7. The 
presence of a Py; — T° direct transition is indicated 
at an energy Eo greater than 2.7 eV. The momentum 
matrix element for this transition, pe, which governs 
the optical absorption, is nearly equal for several 3-5 
compounds:' Assuming that the same value holds also 


39 M. D. Sturge, Phys. Rev. 127, 768 (1962). 
^! H, Ehrenreich, J. Appl. Phys. 32, 2155 (1961). 
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n-type GaP. The trough at 0.4 eV is due to the GaP absorption 
band. The cutoff at 0.7-0.8 eV is due to the absorption edge of a 
Ge sample which was included along with the GaP sample in 
order to provide a comparison of pressure shifts. 


for GaP, we can obtain a rough estimate for a(Eo), the 
absorption level at the knee of the absorption curve. 
This quantity is proportional to pa2m2/n%£o,37 where 
n is the refractive index, and where it is assumed thta 
both the density-of-states effective mass and the 
exciton reduced mass are determined primarily by me, 
the conduction band effective mass. In first approxima- 
tion, me is proportional to Eo!! so that a(E£y) becomes 
proportional to pe*/o/n*. For GaAs, a(Eo) is equal to 
0.9 10! cm“? Using the values of n and Ey for GaAs 
and GaP, where a trial value of Eo=3 eV is assumed for 
GaP, we obtain a(£o)=2X10! cm-! for GaP. The 
absorption curve of Fig. 7 extrapolates to this level at 
& photon energy of about 2.75-2.80 eV. In Sec. 5C 
we shall discuss a weak peak appearing in the reflectivity 
spectrum of GaP at about 2.78 eV, which we attribute 
to the T15” > T';* direct edge. 


4. EFFECT OF PRESSURE ON AN INFRARED 
ABSORPTION BAND IN n-TYPE GaP 


SGFG reported an absorption band in n-type GaP, 
in excess of the normal free carrier absorption, with a 
strength proportional to the electron concentration.” 
At room temperature, the band peaks at 0.41 eV and 
its low-energy threshold is at 0.31 eV; at low tempera- 
tures the threshold shifts to higher energy by about 
0.05 eV. Since the absorption is proportional to the 
concentration of electrons, the initial states of the 
transitions responsible are probably either at the 
conduction band minima, or at shallow impurity states 
derived from these minima. : 

Spitzer et al. suggested that the mechanism respon- 
sible is an indirect transition from the conduction band 
minima ‘to a higher minimum at k=0, ie, XTi’ 
As a test of this hypothesis, which is inconsistent a 
our previously discussed estimate of the energy of Ti ; 
we measured the effect of pressure on the absorption 
band in samples kindly provided us by Dr. Ser 
Since 8y/dP exceeds -- 10X 10-9 eV/bar, and ôEg/3P 
is negative, the pressure shift of Ey—Ee (the energy 


GaP A1635 
of the X> T* transition) should definitely exceed 
+10X t0-* eV/bar. Figure 8 displays the optical 
transmission, at a low and a high pressure, of an n-type 
GaP sample (n= 101 cm^, d=0.1 mm) in series witha 
Ge sample included for purposes of comparison. The 
shift. of the high-energy cutoff near 0.75 eV, due to the 
Ge, 18 -F9X1075 eV/bar. The shift of the dip in trans- 
mission caused by the absorption band is seen to be 
very much smaller than this, almost an order of 
magnitude smaller than that expected. This is strong 
evidence against this particular mechanism. 

In an earlier paper, one of us (W.P.) presented an 
explanation of this band based on vertical transitions 
of conduction electrons to the next higher conduction 
band. This simple model, unlike the indirect scattering 
process discussed above, leads naturally to a definite 
low-energy threshold, especially if the electrons are 
initially weakly bound to shallow impurities. It also 
explains reasonably well the magnitude of the absorp- 
tion band and its relative insensitivity to temperature, 
as no phonons are involved. The model is consistent 
with a small pressure coefficient. Recently, Allen and 
Hodby* have studied this absorption band in n-type 
GaAs-GaP alloys. They found that it appears only in 
the alloys which have the (100) minima lowest, strong 
evidence in favor of our model. 

The higher conduction band providing the final 
states for these direct transitions is logically X;*, 
which is degenerate with X;^ in diamond-structure 
materials like Ge and Si. Thus the X,^ — X3° energy 
separation in GaP is about 0.3 eV. Values for this 
quantity for some of the other 3-5's have been estimated 
by Greenaway from structure observed at low tem- 
peratures in the strong X-point reflectivity peak in the 
ultraviolet. He obtains values in the range of 0.4-0.5 eV, 
close to the GaP value. The interconduction band 
absorption in n-type GaP which has been discussed 
here is the analog of the intervalence band infrared 
absorption found in p-type Ge,“ GaAs,“ and InAs;# 
the shape of the absorption curve is similar in all of 
these materials. 


5. THE EFFECT OF PRESSURE ON THE 
REFLECTIVITY SPECTRUM: 2.5-4.5 eV 


A. Reflectivity Spectra 


Reflectivity spectra of semiconductors exhibit peaks 
in the 1-10-eV range of photon energies, due to corners 
(peaks or edges) in dV/dE, the joint density-of-states 
for direct interband transitions.4-!5 Corners in dN/dE 
are correlated with critical points in k space of E**(e) 


a T. W. Allen and J. W. Hodby, Proc. Phys. Soc. 82, 315 (1963). 
5 i Greenaway, Phys. Rev. Letters 9, 97 (1962) 


4 H. B. Briggs and R. C. Fletcher, Phys. Rev. 87, 1130 (1952); 
ibid. 91, 1342 (1953). f is 
PER ure Phys. Chem. Solids 8, 280 (1959). * 


48 F. Stern and R. M. Talley, Phys. Rev. 108, 158 (1957). — ^ 
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= E(k) — E (E), the vertical separation of the conduc- and temperature coefficients of this peak with those of La 
tion and valence bands. The critical points are of four the first peak in silicon and the A-point doublet peaks in 
types; minima, maxima, and two types of saddle germanium and other materials. Although there is 
points. The saddle points produce the strongest usually little relationship between temperature coeffi- 
singularities in dNV/dE, and it is their identifications cients belonging to a given type of gementes, the is 
with particular reflectivity peaks which are the most temperature coefficients of the A-point peaks are all ; 
certain. The strongest reflectivity peak in Ge, Si, and about the same.*® The pressure coefficients of Table TII 

the 3-5 compounds appears at about 4 or 5 eV andis were measured in this laboratory ; the e mpana 

due to saddle point singularities at X and Z points" — coefhcients are taken from the literature. The GaP on 
in the zone.!^!5 Doublet peaks appearing in the 2-3 eV and Si coefficients are alike, and seem to be significantly | 
range in Ge and some of the 3-5’s are due to saddle smaller than the A-point coefficients. These data | 
points at A points‘? in the zone,'^ the doublet structure indicate that the same mechanism is probably respon- 5 
being due to the spin-orbit splitting of the valence band. sible for the GaP and Si peaks, and that it is not a j 

Reflectivity peaks have been reported for GaP at A-point transition.” i ; 

3.7, 5.3, 6.9, and 10 eV.'5 The dominant 5.3-eV peak is That the Si 3.4-eV peak is other than a A-point us 
supposed due to the X, Z critical point transition./$ transition has already been conclusively established by 

The L;-— L; (or As’— As‘) transition has been work of Tauc and Abraham® on Ge-Si alloys, in which 
suggested for the 6.9-eV peak. No transition has been it was shown that the variation of the energy of the 
associated with the peak at 10 eV. The 3.7-eV peak may first reflectivity peak (followed as a function of composi- | 
be due to A, — A;° transitions, in analogy with the tion) shows a definite break at about 7997, silicon. It now | 
2-3-eV peaks in Ge and some of the 35's; or it may be remains to identify this peak, and thereby the GaP | 
due to Di5?— Ti transitions, in analogy with the 3.7-eV peak. There is a burgeoning body of evidence | 
Tos — Ps? transitions believed responsible for the first that the T25” — T45^ energy separation in silicon is 

peak in Si at 3.4 eV. 

In this section we shall discuss measurements which = 
we have made on the reflectivity between 2.5 and 4.5 eV. 
We have measured the effect of pressure on the peak 

: at 3.7 eV, as well as on a weak peak at 2.8 eV which 
has not been reported previously. 


i 


B. The Reflectivity Peak at 3.7 eV 


Figure 9 shows the effect of pressure on the 3.7-eV 


o 
: peak. The vertical scale is the transmission of a reflec- Z 
tivity capsule (four successive 45° reflections from z 
etched surfaces) of GaP immersed in isopentane, and is & 
; proportional to R*. The peak position shifts toward E 
ii higher energies at the rate of +(5.8+15%)x10° Ar E 
E eV/bar. In Table III we have compared the pressure = | 
7 
3 P in bars 
d TABLE III. Comparison of the behavior of the 3.7-eV reflectivity REC te 
; peak in GaP with that of some other reflectivity peaks. 
‘i 
E As Aa peaks 
f at 2-3eVin Peak at Peak at 
Ge, GaAs, 3.7 eV 3.4 eV 
a Coefficient . GaSb, etc. in GaP in Si 
36 . 37 38 39 
(8E/8P)r iri 10°* eV/bar Dev) 
/ id (2:1090) 31.25 +5.8 Spee) Fic. 9. Effect of pressure on the GaP reflectivity peak at 3.7 eV. 
ro 7 (OE/0T)r in 107* eV/°K The curves have been shifted vertically in order to separate them- 
pcm (+20%) —4.T7e4 —2.5* —2,6e4 The pressures are listed in the order in which the data were taken. 
ee Er e i 
s See Ref. 7. >See Ref. 8. See Ref. 36.— dSee Ref. 48. F. Lukes and E. Schmidt, Proceedings of the I nternational 
a Cet e m e Bose of Semiconductors, Exeter (Institute O 
| f z d m ysıcs and the sical Societ . 389. 
E 47 We are using the accepted notation for various positions of 9 Tt is worth M nue thatthe Re ee eee structure 


high symmetry in the Brillouin zone of a face-centered cubic in the Si and GaP peaks does not, by itself, make the A-point 
lattice: T, (000); A, along [100] axes; X, (100) at the zone explanation unlikely, since the El Shoat splitting 1? 
. bonndary; 2, along [110] axes; K, (110) at the zone boundary; these materials is small, less than 0.1 eV (Ref. 18). 

A, along [111] axes; L, (111) at the zone boundary. 2? J. Tauc and A. Abraham, Phys. Chem. Solids 20, 190 (1961). 
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3.4 eV. The band calculations for silicon seem to 
have successfully accounted for the reflectivity spec- 
gane photoernission experiments on silicon sur- 
faces,?;5* and the alloy experiments mentioned above. 
We therefore assign the 3.4-eV Si peak to Po? — T'3s*. 
We conclude then that the 3.71-eV reflectivity peak in 
GaP is due to Ti” — T, transitions (Top? in Si 
corresponds to T'3;? in GaP). 

GaP belongs to the isoelectronic sequence GeSi, 
GaP, ZnS, CuCl. All of these materials exhibit very 
similar reflectivity spectra,)>5.55.56 each peak occurring 
at an energy which is higher the more polar the com- 
pound. The data of Tauc and Abraham? indicate that 
the energies of the T25” > T'35^ and A3” — A;* transitions 
in GeSi are 3.1 and 2.9 eV, respectively. Cardona, 
from work on CuBr-CuCl alloys, puts the corresponding 
energies in CuCl at 6.8 and 6.5 eV. In both materials 
only a single reflectivity peak is observed, the weaker 
peak apparently being hidden under the stronger (which 
is less than 0.3 eV away). Cardona suggests that this is 
likely also the case in the other two members of the 
series. This would place the A3* > Ai‘ energy separation 
in GaP in the 3.5-4.0-eV range where, in reflectivity, 
it is hidden under the (in this case) stronger T15” > T'15° 
peak at 3.7 eV. A study of reflectivity in the alloy 
system GaAs-GaP would determine whether or not this 
is the correct picture. Finally, corroborating evidence 
for the T-point assignment to the 3.7-eV peak in GaP 
comes from the corresponding peak in ZnS which shows 
no splitting under the use of polarized light, and is 
therefore due to transitions at k=0.57 


C. The Reflectivity Peak at 2.8 eV 


In Sec. 3 we estimated a value of about 2.8 eV for 
Eo, the T15°—>T1° direct gap. This type of direct edge, 
at which there is a sharp rise (edge) in dV/dE due toa 
minimum in £°°(R), is much weaker than other direct 
transitions: Typical values of o for this edge are of the 
order of 10! cm71,37- while typical values of o in the 
region of (for example) the As” — A;* and the X4— X; 
transitions are of the order of 109 cm™.1°:*8 Tt is therefore 
not surprising that for Ge, no structure has yet been 
reported in the reflectivity spectrum near Eo—0.8 eV. 
For GaAs, there is some evidence of weak structure near 
Eo 1.5 eV.46 Structure near 2.8 eV has not been present 
in published reflectivity spectra of GaP. 

We have examined the transmission of a reflectivity 


51 T. C. Phillips, Phys. Rev. 125, 1931 (1962). 

D hs M. T. Cohen, and J. C. Phillips, Phys. Rev. Letters 
9, 389 (1962). = 

ow. E. ie and R. E. Simon, Phys. Rev. Letters 9, 385 

1962). 

Uc D Bassani and D. Brust, Phys. Rev. 131, 1524 (1963). 

s W. C. Walker and J. Ostantowski, J. Opt. Soc. Am. 53, 399 
(1963); M. Cardona, Phys. Rev. 129, 1068 (1963). 

56 M. Cardona, Phys. Rev. 129, 69 (1963). 

51 M. Cardona, Solid State Communications 1, 109 (1963). . 

88H. R. Philipp and E. A. Taft, Phys. Rev. 113, 1002 (1958); 


120, 37 (1960). 
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Fic. 10. Effect of temperature on the GaP reflectivity peak at 
2.8 eV. The inserts show the data obtained by subtracting out 
the smooth background. 


capsule of GaP in the vicinity of 2.8 eV. Figure 10 
shows the transmission (proportional to R*) of the 
capsule in vacuum at 300 and 80°K. [It was found that 
the arrangement of the samples accidentally placed a 
small transmission sample (thickness about 1 mm) 
effectively in parallel with the R‘ samples. This is 
responsible for the small additional contribution to the 
transmission below about 2.3 eV. The shift of the 
cutoff of this added component yields a value of 
— (5.2+15%) X 10-* eV/°K for dE¢/dT, the tempera- 
ture coefficient of the indirect gap, in agreement with a 
previously reported value? of —5.4X10-* eV/°K.] 
Superimposed upon the rapid rise of R* toward the 
3.7-eV peak, there is very weak structure near 2.8 eV. 
This takes the form of a peak having a height of the 
order of ARf/ R*— 1/30, and a width at half-maximum 
of about 0.2 eV. The temperature coefficient of the peak ` 
position is — (4.621596) X 107* eV/°K. 

Figure 11 shows the effect of pressure on the 2.8-eV 
“peak,” measured at room temperature, with the 
samples immersed in isopentane. Its pressure coefficient 
is 4- (10.72-1595) X 10-5 eV/bar. This large coefficient . ~ 
is appropriate for a T'1,* — T';* direct transition?-* It. 
has been plotted as the high-energy point in Fig. 6 
(at 2.78 eV, the zero-pressure photon energy of the 
peak), where it is seen to be consistent with the absorp- 
tion edge data obtained at lower energies. 

Structure in reflectivity in this region (where the 
extinction coefficient, K= (C/2u)a= 10-* cm Xa, is Jess 
than unity) closely follows structure in the refractive 


` ie) 


CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar "b 


Sus y 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


ZALLEN 


A1638 — R. 


R4 (ARBITRARY UNITS) 
D 


300bors 


8550 bars 
xs 
o = J n = 
24 26 28 30 32 
hy (eV) 


Fic. 11. Effect of pressure on the GaP reflectivity peak at 2.8 eV. 
The lower curves show the result of subtracting out the smooth 
background. 


index 22.58: From the magnitude of AR*/R* of the 2.8-eV 
peak in Figs. 10 and 11, we deduce a value of about 
0.02 for An, of the order of values calculated by Stern 
for n near Eo in GaAs. This value of An is about a 
factor of 10? smaller than the values for the first strong 
reflectivity peaks belonging to edges rising to œ= 10° 
cm-1,? implying that the edge involved has a strength 
of about 10* cm. 

From this evidence we conclude that this weak 
reflectivity peak is due to the onset of direct transitions 
of the type lı” — I';* at the center of the zone. The 
pressure and temperature dependence of the direct gap 
is then given by 


Ey 2.184- (10.715%) X 10-5P 
— (4.624-1595)x10-* (T—300), (6) 


where £ is in electron volts, P is in bars, T is in degrees 
Kelvin. Recently, Spitzer and Mead have estimated a 
value for Eo of 2.65 eV based on measurements of the 
photoresponse of metal-semiconductor surface barriers 
on GaAs-GaP alloys." We do not know the reason for 
the difference between our two results. 

The photoconductivity of GaP shows a sharp drop 
at about 2.8 eV." Similar effects have been observed in 
other materials,” and have been accounted for by the 
production of photoelectrons at k=0, where they 
contribute little to the current because of rapid direct 
recombination. This is consistent with our estimate 
of Eo. : 

Note added'in proof. D. F. Nelson, L. F. Johnson, and 
M. Gershenzon [Bull. Am. Phys. Soc. 9, 236 (1964) ] 
have recently observed structure in the photoconduc- 
tiyity spectra of GaP samples which they attribute to 


9 B. Velicky, Czech. J. Phys. B11, 787 (1961). 

6 F, Stern, Phys. Rev. 133, A1653 (1964). 

s L. F. Johnson and D. F. Nelson, unpublished data reported in 
Ref. 22; H. G. Grimmeiss, W. Kischio, and H. Koelmans, Solid- 
State Electron. 5, 155 (1962). 

?? E. Loh, Phys. Chem. Solids 24, 493 (1963). 
^ e 63E, Loh and J. C. Phillips, Phys. Chem. Solids 24, 495 (1963). 
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direct exciton formation. Their room-temperature esti- 
mate of Æo, 2.79 eV, is in excellent agreement with our 
result, as is the temperature dependence which they 
obtain for Æo. The agreement between these two inde- 
pendent determinations of the direct gap is strong 
evidence for the correctness of our interpretation of this 
reflectivity peak, as discussed above. 


6. EFFECT OF PRESSURE ON RADIATIVE RECOMBINA- 
TION IN FORWARD-BIASED p-n JUNCTIONS 


Diodes formed from GaP p-n junctions exhibit 
carrier-injection electroluminescence (EL) emission of 
low efficiency when biased in the forward direction.®:%& 
At low temperatures the emission in the green, at or near 
Eg, reveals a great deal of very complicated and 
interesting fine structure?*-' due to such effects as 
associated shallow donor-acceptor pairs? and bound 
exciton states. There is also a broad emission in the 
red involving levels well inside the forbidden zone.*! 

We have measured the effect of pressure on some of 
the EL lines observed in GaP diodes. A reason for doing 
this is that sharp lines emitted with nearly the band-gap 
energy serve as accurate and convenient markers for 
monitoring the pressure shift of the band gap, as has 
been demonstrated for GaAs." We can thus obtain in 
straightforward fashion an independent measurement 
of 0£¢/dP to compare with the result of the analysis 
given in Sec. 3 of the pressure dependence of a(/v). 

'Table IV summarizes our data on two green-emitting 


TABLE IV. Effect of pressure on GaP radiative recombination 
lines emitted at near band-gap energy. 


Relative 
Temp. Emission hv Alve inten- | —OAv/0P 
(°K) lines» (eV) (eV) sity? (10-9 eV/bar) 
80 EE-2LO 2.096 0.04 6 De 
80 EE-LO 2.145 0.04 25 1.0+0.1 
80 EE 2.194 0.04 100 1.0+0.1 
80 A-Ac-LO 2.238 0.02 60 1.20.1 
80 A-Ac 2.294 0.02 10 1.40.5 
80 A 2.308 0.02 2 1+1 
300 ee 2.23 0.1 fe 1.1+0.1 
a See Ref. 65. 


b EE is edge emission, A is an exciton line, LO is longitudinal-optical 
phonon, Ac is acoustical phonon. 

* Approximate full width at half-maximum. 

4 At a current of 10 mA. 


diodes fabricated at Bell laboratories by diffusing Zn 
into nonintentionally doped, epitaxially grown, #-type 
material. The diodes were provided to us by Dr. M 


“M. Gershenzon and R. M. Mikulyak, Solid State Electronics 
5, 313 (1962); T. Starkiewicz and J. W. Allen, Phys. Chem. Solids 
23, 881 (1962): 

55 M. Gershenzon, R. M. Mikulyak, R. A. Logan, and P. W. 
Foy, Solid-State Electron. 7, 113 (1964). 

$T. J. Hopfield, D. G. Thomas, and M. Gershenzon, Phys. 
Rev. Letters 10, 162 (1963); D. G. Thomas, M. Gershenzon, and 
F. A. Trumbore, Phys. Rev. 133, A269 (1964). 

*' D. G. Thomas, M. Gershenzon, and J. J. Hopfield, Phys- 
Rev. 131, 2397 (1963). 
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Gershenzon, who, along with co-workers, has carefully 
studied the EL spectra of such units and compared them 
with photoluminescence spectra. The labeling of the 
various lines given in Table IV follows the discussion of 
Gershenzon, Mikulyak, Logan, and foy a 
obtained by comparison of our spectra with the similar 
78? K spectra of their Fig. 8, 

The notation for the 80°K lines of Table IV is as 
follows: EE signifies edge emission and is ascribed to 
recombination. at widely separated donor-acceptor 
pairs.55.66 It is associated with the sharp-line pair- 
spectra which appear at higher energies. The A line 
has been ascribed to annihilation of an exciton bound to 
an impurity.95,67 The other four low-temperature lines 
are replications of the EE and A emissions at lower 
energies due to the simultaneous emission of phonons, 
acoustic (Ac), or longitudinal optical (LO). The edge 
emission lines saturate at higher currents while the A 
lines do not. The room temperature emission is either 
band-to-band or involves very shallow states. 

Although the results for the weakest lines listed in 
'Table IV are relatively less certain than the others, 
all yield about the same pressure coefficient. The 
consensus value for dhv/0P is — (1.12:0.1)X 10-5 eV/ 
bar, in agreement with the band gap coefficient obtained 
in Sec. 3 from the absorption edge data. The room 
temperature datum of Table IV provided the lowest 
energy point shown in Fig. 6. We found no appreciable 
effect of pressure on the intensity of any of the lines. 

The validity of equating the pressure coefficients of 
these emission lines with 9Eg/8P depends only on the 
assumption that the transitions involved are between 
shallow states derived from the band edges, which seems 
reasonable since their energies lie so close to Eg. How- 
ever, it has been suggested that the A lines are due toa 
direct exciton (derived from the valence and conduction 
band states at k=0) strongly bound to an impurity, and 
that they accidentally occur at an energy near Eg 
because of a large binding energy. If this were so, then 
their pressure coefficient should be close to QE/0P 
=+1110-® eV/bar, rather than to QEg/0P — zat 
X10-5 eV/bar. Table IV shows that the reverse is 
true. Thus, provided that we have correctly identified 
these lines as belonging to the A complex of Refs. 65 
and 67, the mechanism for the A lines more probably 
involves an ordinary indirect exciton. 

We have also measured, at room temperature, the 
effect of pressure on the broad emission band (width 
about 0.2 eV) appearing at 1.8 eV. This red emission 
has been attributed to radiative transitions from a 
associated with the presence of 
56 to the valence band. The 
0.5) X 1075 eV/bar. This 
g; it is opposite in sign 


deep donor level, 
oxygen in the crystal, 
pressure coefficient is + (1.0 
coefficient is slightly surprisin 
to dE¢/dP. [In silicon it was found that the pressure 
coefficient of the energy separation between a deep 
level and either band edge was a fraction of the (nega- 
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Fic. 12. Band structure of GaP. The states are labeled using the 
notation for the irreducible representations of the single group 
of the zincblende lattice. The left-hand superscript appended to 
each symbol gives the orbital degeneracy of the state. The arrows 
represent transitions observed in optical experiments. The five 
energy levels marked by solid circles were those studied by means 
of our pressure experiments. 


tive) band gap coefficient.**] This result implies that 
the donor binding energy decreases more rapidly with 
pressure than the band gap. 


7. THE BAND STRUCTURE AND ITS 
PRESSURE DEPENDENCE 


In Fig. 12 we have constructed an electronic energy 
band diagram for GaP based upon available experi- 
mental information. The five energy levels marked by 
the solid circles are those upon which our pressure 
experiments bear. The three marked by open circles are 
from other data discussed below, the L-point levels 


being relatively uncertain. The levels at the symmetry ^ 


points have been connected by plausible smooth curves 
which, although somewhat arbitrary, are consistent with 
various band structure calculations on similar semi- 
conductors, as well as such considerations as crossing or 
noncrossing of bands and relative magnitudes of effec- 
tive masses. We have neglected the spin-orbit splitting 
of valence band levels, which for GaP is about 0.15 V. 
Since we do not consider effects of the electron spin, we 
have labeled the states with the irreducible representa- 
tions for the single group, rather than for the double 
group, of the zincblende structure.’ 

The arrows shown in the figure represent the transi- 
tions which have been observed in optical experimerits, » 


> Li 
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i TABLE V. Effect of pressure on the band structure of GaP. 
ðE 
— in 1075 eV/bar 
Transition E in eV oP Measurement 
TiN. Eg=2.22 — (1.1+10%) absorption edge,* recombination radiation? 
Tint Eo —2.18 +(10.7+10%) absorption edge;* reflectivity peak? 
Ty: Ey =3.71 +(5.8+10%) reflectivity peaks 
XiX AEs—0.3 +(1+50%) absorption band 
XX? E:=5.3 + (2.920%): 
a See Sec. 3. 
b See Sec. 6. 
© See Sec. 3 


3 Sce Sec. 5C. 
* See Sec. 5B. 
1 See Sec. 


4. H "I 
e This is the coefficient for the corresponding transition is silicon, as measured by means of the 4.3 eV 


We have labeled them by their energies. Listed in 
ascending order of energy, they are as follows: 


(1) AE: is responsible for the infrared absorption in 
n-type material discussed in Sec. 4; 

(2) Eg is the indirect energy gap discussed in Secs. 3 
and 6; 

(3) Ep is the direct energy gap discussed in Secs. 3 
and 5C; 

(4) Ey’ is responsible for the reflectivity peak dis- 
cussed in Sec. 5B; 

(5) E; is the A-point transition, hidden under Ev, 
discussed in Sec. 5B; 

(6) Es is responsible for the 5.3-eV reflectivity peak 
mentioned in Sec. 5A. 


In this work, we have studied the effect of pressure on 
(1) through (4). 

The values of Eg, Eo, Eo’, E», and AE», are sufficient 
to fix the energies (relative to *I';", the valence band 
maximum) of !Xy°, "11°, "Iss, *X5*, and 1X35, respec- 
tively. To estimate the energies of !1L,* and ?Ls*, we 
made use of the following assumptions: 

(1) E; is equal to Ey’ (Ev is the stronger transition 
which prevents E; from being seen) ; 

(2) the separation of the two bands at L is not very 
different from E (true for Ge 16); 

(3) 2L, is about 1 eV lower than "Tis". (The known 
values of Eo, £3, and the relative energy of the T and X 
conduction band minima, put the P— £L valence band 
width at 1.1 eV in GaSb, 1.5 eV in Ge.”) 


The two L-point energies shown in Fig. 12 are then 
specified. In view of the assumptions needed for their 
computation, ‘they are clearly much less certain than 
the other levels. 


~- The right-hand side of Fig. 12 is very similar to a 
— band structure which has been calculated for AIP." 


AIP is the 3-5 compound which is isoelectronic with Si. 
This points up the fact that GaP more closely resembles 
Si than it does Ge. Calculations on Ge-Si alloys®* show 
that it is the silicon-rich alloys which most closely 
correspond to the GaP band structure of Fig. 12. 


. „68°F. Bassani and D. Brust, Phys. Rev. 131, 1524 (1963). 


E: iE < 


e 


silicon reflectivity peak (see Ref. 8). 


Our experiments determined the effect of pressure on 
Eg (Secs. 3, 6), Eo (Secs. 3, 5C), Ey (Sec. 5B), and 
AE» (Sec. 4). The pressure coefficients are listed in 
Table V. We have also listed a value for 0#/dP. This is 
the pressure coefficient, as measured by us, for the 
corresponding peak in Si (at 4.3 eV).? 

In the discussion of Sec. 3 on the pressure de- 
pendence of the indirect edge, we defined a quantity 
(d/dhv)(dhv/OP).=B. The measured value of B, ob- 
tained from the slope of the linear, low-energy region of 
Fig. 6, was 1.15X 10-5 bar~. We can now compare this 
experimental result with a theoretical estimate of B 
based, under certain assumptions, upon the information 
given in Table V. 

In Sec. 3, we showed that 8 is equal to — (1/4) 
X(904/8P), where A is the proportionality factor of 
Eq. (4). In the second-order perturbation theory used 
in dealing with indirect transitions, it is shown that the 
oscillator strength for this process [given by the co- 
efficients C;* of Eq. (2), C of Eq. (3), and A? of Eq. (4)] 
is proportional to a term of the form 


DN 
(= 2) ; 
7 E; 

b; is a matrix element and E; is an energy denominator 
corresponding to a virtual two-step process involving a 
particular intermediate state, specified by 7.99 The 
pressure dependence of this term is dominated by the 
pressure dependence of the energy denominators. If we 
assume, for lack of better information, that all of the 


pjs are equal, it follows that A=const- 25; Ej. 
Substituting this into the expression for 8, we find that 


g- 2 Ej )(9E;/0P) 


25; V/ E; n 


SINE have assumed that phonon energies are negligible relative 
to the energy separations of electron states (k6;««£; for all 2, 7)- 
Orhei wise Ej* would depend on a phonon energy (specified by 2); 
as well as on the energy of the intermediate electronic state 
(specified by 7). The expression for C;* would then just contain 


EET ot Ej, but the situation for C and 4? would be more 
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In Fig. 12 there are four intermediate states available 
for the indirect transition labeled by Ea: Pi, Tis? 
*X 3°, and °X”. The corresponding energy de inators 

: g gy denominators 
are Eo— Ea, Ey — Ea, Eg4- AE», and E:— Eg, respec- 
tively. Using (7) and the information given in Table V 
(and ignoring all other intermediate states but the ones 
lying within the energy range of Fig. 12), we obtain 
B—1.0x107 bar“. The excellent agreement with the 
experimental value of 1.15X10~ bar is most likely 
fortuitous in view of the drastic assumptions made; 
nevertheless, it does show that the measured f is 
consistent with the proposed pressure-dependent band 
structure of Fig. 12 and Table V. 


8. SUMMARY 


The effect of pressure on several optical properties of 
GaP has been used to construct a tentative electronic 
energy-band structure for this material over a region of 
energy within about 4 eV of the valence band maximum 
at k=0. The major experimental results are summarized 
in Table V; the deduced band structure is shown in 
Fig. 12. 

The data of Table V are consistent with the idea of 
correlation between pressure coefficients and types of 
electronic transitionst-*: The coefficients of £g and Ey’ 
are close to those for the corresponding transitions in 
Si??*; that of Eo is close to the corresponding coefficient 
in Ge? and GaAs." The result for Æg confirms the 
location of the lowest conduction band minima as lying 
along [100] directions in k space. 

Evidence for the direct gap Eo comes from absorption 
measurements at lower energies, as well as from the 
weak reflectivity peak at about 2.78 eV, which has not 
been previously reported. Its temperature coefficient 
is about —4.6X10-^ eV/°K, compared to about 
—5.2X 10-1 eV/°K for the indirect gap. 
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The small pressure shift of the 0.3-eV.. absorption 
band observed in n-type material indicates that vertical 
transitions between conduction bands is the mechanism 
responsible, as previously proposed by one of the 
authors. 

All of the narrow edge emission lines we observed from 
a green-emitting diode shifted with pressure at the rate 
of the indirect gap. 
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It is postulated that, at elevated temperatures, the core of a dislocation consists of a thermally disordered 


quasiliquid region, whose size increases with tempe 


rature. Calculations based on the effect of the disordering 


in relieving elastic stresses indicate that a flat strip core should be favored. A moving dislocation of this form 
will experience two distinct damping forces. One is thermodynamic, caused by the heat of disordering, and 
is analogous to the thermoelastic damping produced by the dislocation stress field; while the other is viscous, 
caused by the shear between the two sides of the strip. The calculated magnitudes of both of these damping 
forces in covalent and metallic crystals are comparable with those required to explain internal friction 
observations, although a positive demonstration of the validity of the mechanisms cannot be made on the 


basis of present data. 


INTRODUCTION 


DISLOCATION moving through a crystal ex- 
periences a damping force, the magnitude of 
which depends on the dislocation velocity. Measure- 
ments of this damping force must be made indirectly 
and interpretation of the experiments is often open to 
doubt. It is generally supposed, however, that for small 
velocities, corresponding to values of the applied shear 
stress that are small compared to the shear modulus, the 
damping force is proportional to the velocity. This is the 
region which will be of interest in this paper. 
Theoretically it is found that the nature of the damp- 
ing force depends very strongly on the type of model 
assumed for the dislocation. This dependence is par- 
ticularly sensitive to the structure of the dislocation 
core, where the Hooke’s law approximation breaks 
down. The simplest model considers the core to be of 
very small radius and to move as an approximately 
straight cylinder. If the remainder of the crystal is 
treated as a continuum exhibiting linear elastic be- 
havior, the only forces are those acting through the 
long-range stress field of the dislocation. Thermoelastic 
damping was treated by Eshelby! using this model in 
conjunction with a classical macroscopic thermodyna- 
mic method of calculation. If, in addition, nonlinear 
elastic effects occur near the core, interaction with 
thermal vibrations will occur. The periodic nature of 
_ the crystal can be taken into account by considering 
the dislocation core to be affected by a periodic Peierls 
stress field. Leibfried? and Nabarro? considered such a 
model, assuming that acoustic waves were scattered by 
- a straight dislocdtion moving through the lattice with- 
- out change"of form. 
: = A further improvement in the model is made by con- 
sidering that the dislocation core does not move in the 
*—"* Peierls field as a straight segment, but rather proceeds 


* This work supported by Aeronautical Systems Division, U. S. 
Air Force. 
1J. D. Eshelby, Proc. Roy. Soc. (London) A197, 396 (1949). 
3 G. Leibfried, Z. Physik 127, 344 (1950). 
. *B.R.N. Nabarro, Proc. Roy. Soc. (London) A209, 278 (1951). 
* 2 


by the formation and lateral motion of kinks.* The 
main sources of damping are then the energy losses 
experienced in thermal activation of kinks and the 
damping of the individual kinks. At low temperatures 
the damping is caused by the diffusion of built-in kinks 
(kinks necessitated by the geometry of the dislocation?) ; 
as the temperature is raised a little the thermal activa- 
tion of kinks becomes important.” It appears logical 
to use discrete kink models at these moderately low 
temperatures. As the temperature becomes higher, how- 
ever, the kink density becomes large and kink inter- 
actions become frequent. Eventually so much inter- 
action occurs that one can no longer identify individual 
kinks. In this situation it is better to treat the entire 
region of the dislocation occupied by kinks as a dis- 
ordered region of the crystal, and to consider this region 
as the dislocation core. 

Such a model of the dislocation is discussed in this 
paper. It is assumed that the disorder in the dislocation 
core is very similar to that found in the liquid state, 
while the crystal outside the core is treated as a solid 
isotropic elastic continuum. There are then two separate 
sources of energy loss which originate at the core of the 
moving dislocation; one thermodynamic, due to the 
heat of disordering of the material passing through the 
dislocation core, and the other viscous, due to the shear 
rate of the disordered material. In the next section we 
calculate the size of the disordered region. Following 
sections then contain calculations of the magnitudes 
of the two types of damping, and a discussion of the 
applicability of the theory to experiment. 


EQUILIBRIUM SIZE OF DISORDERED CORE 


: An immediate indication that disordering must occur 
in the core of a dislocation near the melting point is 


* W. T. Read, Jr., Dislocations in Crystals (McGraw-Hill Book 
Company, Inc., New Vork, 1953), p. 46. 

5 A. D. Brailsford, Phys. Rev. 122, 778 (1961). 

* J. Lothe and J. P. Hirth, Phys. Rev. 115, 543 (1959). 

1J. Lothe, Phys. Rev. 117, 704 (1960). 

$ A. Seeger and P. Schiller, Acta Met. 10, 348 (1962). 

us pont Z, po 149, 111 (1957). : 
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THERMAL DISORDER 
obtained by considering the lowering of the melting 
point of the material as a consequence of hydrostatic 
pressure produced by the stress field of an edge dis- 
location. In a purely clastic and isotropic material, the 
pressure within an off-axis cylinder of diameter r will 
exceed Gb/2zr(1— v), where G is the shear modulus, b 
the Burgers vector, and » Poisson's ratio. The sign 
depends on whether the cylinder lies above or below 
the glide plane. If one now applies the Clapeyron equa- 
tion for melting of bulk material, the melting point will 
be depressed in the positive pressure region for a 
material that contracts on melting, or in the tension 
region if the material expands. For example, in silicon, a 
material which has geometrically well-defined disloca- 
tions at low temperatures, a region of diameter of the 
order 10 À will have its melting point suppressed more 
than 10% (170°C). In this paper we will refer to the 
region which becomes melted or disordered as the 
"core." In reality the core will not retain a cylindrical 
shape, since the material there will not be able to sup- 
port shear stresses, and hence redistribution of the 
general stress pattern will occur, favoring a different 
core shape. 

'The exact method of calculating the core size would 
be to consider the free energy of the whole dislocation 
and then minimize this. Such a calculation would be 
very elaborate, involving a continuously decreasing 
disorder at increasing distances from the center. A more 
limited model is considered here, in which the core is 
taken to be a single phase, uniformly disordered, im- 
mediately surrounded by ideally elastic material. The 
core size may now be derived in a manner analogous to 
that employed for the standard Clapeyron equation. 

Consider the core to have fixed, but arbitrary, shape 
and variable size. Denote the core size by a parameter 
p, having dimensions of length. The independent 
thermodynamic variables are then p and the tempera- 
ture T. Initially it is supposed that the core is acted 
upon by an arbitrary stress field, whose net effect will 
be represented by a parameter f, conjugate to P, which 
may be thought as of analogous to a pressure which pro- 
duces melting point depression. Take a boundary im- 
mediately surrounding the core, of dimension po, and 
let the internal energy of the total material within the 
boundary be U(p,T). Then the heat entering through 
the boundary, in any general change, will be 


dQ—dU--dW - (9U/0T)dT-- (QU/ap)dp--pdp. (1) 


If these changes take place reversibly, the entropy 
change dQ/T will be a perfect differential. Hence, 


(8U/3p)4-b T (05/07). (2) 


involves the disordering of a volume 
ith latent heat of solidification L. 


f(p)dp of material, w t i ap 
isot his heat all passes through the 
Tn an isothermal change, this n nigh tne 


boundary. A change of temperature w 


A change dp 
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of specific heat also; so, in general, M 
dO=cdT+L f(p)dp. (3) 


ee (1) and (3) and noting that c= dU/dT, 
yields 
So) L= (0U/8p)-- p 
- T(0p/9T) from (3). 


In fact the material transforming is in a state of 
strain in the ordered phase, and in addition the total 
interface area may be changed during transformation. 
The effective latent heat is hence that for unstrained 
material, Lo, less the strain energy and surface energy 
contributions L,(p). 

At this point we restrict consideration to equilibrium 
conditions only, with the fixed dislocation stress field 
as the source of p. Now p and p are functions of T only, 
and the partial differential in (4) may be replaced by a 
total. Integrating, and inserting the sum in place of 
L, gives 


(4) 


p/(Lo— Ls) f(p) -1nT-I-const. (5) 


If now the elastic strain energy outside the boundary 
is Us(p,T), the parameter p is given by 


p= 9Us/ Op, 


since all changes of elastic energy appear as work done 
on the core. 

The constant in (5) may be evaluated by considering 
the dislocation at the melting point. Here the core re- 
mains melted in the absence of any peripheral forces, 
and hence the elastic system should be in a state of 
static equilibrium. The stress-field energy will con- 
tinuously reduce as the core size increases, while the 
surface energy of the solid-liquid interface will increase. 
The latter is also of the nature of an elastic energy, and 
so the equilibrium core size pm at Tn may be obtained by 
minimizing the total stress-field plus surface energy. 

Two possible configurations of the disordered core, 
representing extreme cases, have been considered. They 
are the flat strip, lying in the glide plane (or some 
minimum energy plane such as the most closely packed, 
for a screw dislocation), and the cylinder. It is shown 
in the Appendix that the rate of change of elastic 
energy outside a fixed contour close to the core, as the" 
core size changes, is finite for the strip, and zero for the 
cylindrical configuration. It is probable that the strip 
represents the most efficient configuration for the relief ^ 
of stress energy by the core disordering mechanism, and , 
it will be assumed from now on that the core takes this " ~ 
form. — - 

Let q be the thickness and 2p the width of the Strip, 
and suppose that only p varies with temperature. This 
constraint will affect the final answer only slightly, and 
is introduced as a simplifying approximation. The total 
elastic and surface energy (see Appendix) is 


(G88*/4z) In (ER/g)-4gs , as 


` > Ar 


> 
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where s is the energy per unit area of liquid-solid inter- 
face, and the constant 8 varies from 1/ (1—») for an 
edge dislocation to unity for a screw. Minimizing this 
gives 

ps7 GbB/16zs. 


Noting that the shape function f(p) will be 2q, and 
substituting in Eq. (5) yields 


Gis 
ERES): E23) 


for the equilibrium core size in the flat strip 
configuration. 

'The approximate stress distribution discussed in the 
Appendix gives a shear strain in the solidifying material 
at the strip edges of 58/274, and hence the strain energy 
modification to L is 


(6) 


Gb*8?/8r°p?. 


This turns out usually to be only a small fraction of Lo, 
and so it may be introduced into (6) as a second-order 
correction. The magnitude of the surface energy cor- 
rection is not certain, but it is assumed that this also 
is fairly small. 


DAMPING FORCE 


We now calculate the magnitudes of the thermo- 
dynamic and viscous damping forces mentioned in the 
Introduction, again assuming that in most respects the 
situation may be treated as macroscopic. Both sources 
of damping are additional to any which may act on the 
dislocation stress field. 

The thermodynamic damping is derived from the 
rate of entropy gain due to the temperature difference 
between the leading and trailing edges of the core. If 
the core were sufficiently wide, the temperature differ- 
ence would be proportional to the normal thermal 
resistivity as well as the rate of latent heat release. 
However, the width is usually comparable with the 
mean free path of the thermal phonons, and in this 
situation the heat-transfer mechanism becomes one of 
phonon radiation rather than diffusion. The phonons 
may be considered to travel with an effective group 
velocity vs rather less than that of lower frequency 
acoustic waves. If the rate of heat generation per unit 
length of dislocation at the solidifying edge is Qg, the 
excess heat density near the source will be Q/v.. The 
<. temperature difference between leading and trailing 

= edges is therefore 

— 


EL 0T — 20/av,c, , 


$ 


where c, is the specific heat per unit volume, and a is 
an efféctive radius, of the order of one atom distance, 
of the material which may be considered to have its 
temperature changed. 

* . The entropy gain rate, treating the system macro- 


* c 


SOUTHGATE AND K. S. 


ve CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


MENDELSON 


scopically, is 


dt 


if ôT is small. If the configuration of the dislocation 
core remains constant as it moves, the rate of change of 
internal energy of the system will be 


dU/dt=T (dS/dt)=Q8T/T , (7) 


which may be represented by a damping force coefficient 
B, Š 
dU/di= B,V?, (8) 


where V is the dislocation velocity. The heat-generation 
rate will be 


Q= (Lo— L)qV , (9) 


where Lo— Ls is the effective latent heat of solidfication. 
Thus, from (7), (8), and (9), the damping coefficient 
becomes 


B,22(Lo— L,q*/aTwse,. (10) 


Viscous damping may be calculated very simply for a 
strip-shaped core by assuming that the shear flow 
pattern is laminar. The relative displacement of the 
two faces as the core passes by is 0; since it takes time 
2p/V to pass, the shear rate is 


ô= (b/q)(V/2p). 
The total power dissipation is 2pg760?— B,V?, giving 
B,=b?/2pq. (11) 


In the case of a core which has a shape closer to the 
cylindrical, the flow pattern might become more com- 
plex. There will be a tendency for the core to "roll" 
as the dislocation progresses, lessening the frictional 
force. However, estimates of the flow pattern have been 
made for a cylinder, allowing the formation of four 
main eddies, which show that the viscous damping 
differs from Eq. (11) by a factor of the order of unity. 
It is supposed that this effect will be unimportant in 
the case of a nearly flat strip. 


APPLICATION OF MODEL 


The theory developed in the previous sections should 
apply to dislocations in crystals which have a low- 
umpurity content so that interactions between im- 
purities and dislocations are not significant, and in 
which dislocation motion is over such short distances 
that they do not actually intersect other dislocations 
in the course of their motion. Internal friction measure- 
ments, or microscopic studies of the motion of identifi- 
able dislocations, are two ways in which such a situation 
may be studied. Unfortunately there is a scarcity of 
experimental data, the tendency having been to con- 
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_Fic. 1. Widths of disordered cores computed for copper and 
silicon. Arrows denote Burgers vector of dislocation, below which 
size the width loses its significance. 


centrate on the behavior of dislocations at lower tem- 
peratures where appreciable dislocation broadening is 
not expected. We will take the case of two materials, 
for which there is some published information : silicon 
and copper. These form fairly typical covalent and 
metallic crystals. Ionic crystals are expected to behave 
in a similar manner to metallic, since the parameters 
involved have similar values. 

The most significant factor in the theory is the heat of 
disorder. As a first approximation, this could be equated 
to the heat of fusion of the material and in copper we 
will assume this is the value. In silicon, melting involves 
not only disordering but also a change from essentially 
covalent to essentially metallic bonding,’ which change 
presumably is exothermic. If we assume that in the 
core the disorder is sufficiently localized so that the 
covalent bonding is retained, and that one bond per 
atom is broken on the average, requiring about 1.5 eV, 
a heat of disorder is derived which is approximately 
twice that of melting; this will be used. 

The second factor involved is the surface energy of 
the interface, and the value of this is very uncertain. No 
direct measurements exist, and such values as are 
quoted are derived from observations of supercooling 
of solidifying melts, using nucleation theory.” It is 
not clear whether the surface energy of significance 


Taste I. Values used in computing core widths. 


Silicon Copper 
p 2X10" 1.6X 10° 
Latent heat L (ergs cm”) 1 
Shear modulus È (dyne cm ?) AR 104 TSX 104 
Burgers vector b (A) A 2n 
Strip width q (A) a EU 


Melting point (°K) 
UU ae 


uD. R. Hamilton and R. L. Seidensticker, J. Appl. Phys. 34, 


2697 (1963). ; in Metal Physics, 
13 J, H. Holloman and D. Turnbull, in Progress Tct New York, 


edited by B. Chalmers (Interscience Publishers, 
1953), Vol. 4, p. 333. 
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under these conditions is the same as that at the dis- 
location core. We shall therefore guess a value for the 
surface energy of one-tenth the disorder energy, per 
atom. The core width will be sensitive to error in this 
approximation only near the melting point. z 

Figure 1 shows the variation of core width to be 
expected for silicon and copper as a function of tem- 
perature. The dislocations considered are simple 60° 
dislocations in the (111) glide plane in both cases; 
Table I lists the values of parameters used. An isotropic 
average shear modulus is taken. It can be seen that as 
the temperature reduces to the Bordoni peak region in 
copper, below 110°K, and the region below 1200°K in 
silicon, the dislocation becomes less than one Burgers 
vector wide and it may be expected that the Peierls 
force should attain its full value. Above these tempera- 
tures, a smearing-out effect will occur. In both materials, 
dislocation behavior below these temperatures has been 
explained in terms of kink mechanisms, which require 
a large Peierls force; hence, the disordered-core theory 
is consistent with the kink hypothesis. 

The thermodynamic component of damping may be 
computed from Eq. (10), the assumption being made 
that the effective phonon velocity is of the order 10° 
cm/sec. In silicon, B, is then 2.5 dyn-sec-cm~? at 
1000°K and in copper 10-? dyn-sec-cm-?. These values 
are considerably larger than those obtained from the 
other models in which the Peierls force is not important, 
such as the thermoelastic theory of Eshelby (B~ 10-*) 
or the interaction of thermal vibrations via nonlinear 
behavior near the core (B~ 10-*), and hence will always 
dominate these other mechanisms. 

Viscous damping is more difficult to evaluate, since a 
considerable extrapolation of viscosity data to the super- 
cooled case is required. We assume that the viscosity 
is a thermally activated process, with the coefficient 
of viscosity n~10~* exp(H/kT). In copper, the activa- 
tion energy is assumed similar to that in other metals," 
about 0.1 eV. Tn silicon, a higher activation energy is 
taken, corresponding to a bond-breaking process; the — 
value of 1.4 eV gives a good match to the experimental 
observations. The viscous damping exceeds the thermo- 
dynamic damping below 0.1 Tm in copper and 0.85 Tm 
in silicon. It appears, therefore, that in copper (and., 
presumably many other metals also) the viscous 
mechanism is never important, since below 0.1 Tm dE 
the kink description of dislocation motion becomes 

valid. In silicon, on the other hand, it is anticipated — 
that it forms a good description Bic 0.7 T, and ^. — 
0.85 Tn. titer a> 

At the present time, comparison with 
cannot do more than indicate support fc 
as applied to silicon. A major difficulty is th 
of a damping coefficient from the me: 


1 C. J. Smithells, Metals Reference 
tific Publications, Du We hingto 
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internal friction, since the free length of dislocation 
is an important, and frequently unknown, factor. The 
Koehler-Granato-Lucke!^!5 stretched-string model of 
dislocation damping gives for the logarithmic decre- 
ment 6 of a crystal, at low oscillation amplitudes, 


aS ED NT? 
mC ien j 


wT 


ô 


with relaxation time 
tT L*B/a*C, 


where E is Young's modulus, JV the dislocation density, 
L the free length of the dislocation, C its line tension, 
and « an orientation factor of the order 0.25. Now if 
wr®1, both L and C vanish from the expression, and 
B can be computed readily from 6 and JV. For silicon, 
where the disordered-core theory predicts B always 
greater than unity, the inequality will hold for meas- 
urements at 10 kc/sec as long as L7» 4X 10-4 cm, which 
will be the case in pure material with dislocation den- 
sities of less than 107 cm-?. Figure 2 gives a curve for B 
derived from measurements of 6 on a silicon crystal bar 
oscillating at 8 kc/sec and containing a dislocation 
density of approximately 10° cm-?. The form of the 
curve is very similar to that obtained by summing B, 
and B, from the theory ; the factor of 2 or so difference 
in magnitude is not significant as it could easily arise 
from a nonuniform orientation distribution of a portion 
of the dislocation. 

In the case of copper, the situation is more complex 
as the damping coefficient is expected to be lower, and 
hence the dislocation length will become important. 
For most measurements reported, it is probable that 
wr<1. The internal friction will then be proportional 
to the damping coefficient, which itself varies inversely 
as the temperature. Such behavior is not observed, in- 
creases with temperature always being reported above 
about 200°K, sometimes gradual,!® sometimes rapid.!? 
Two additional effects, which would influence the 

- temperature dependence of the logarithmic decrement, 
might be mentioned here. The first is the possible re- 
lease of dislocation pinning points as the temperature 

—increases. This could cause an increase of decrement 
with temperature, either by increasing the free disloca- 
tion lengths, or by allowing some less limited motion 
at the pinning points. Secondly, dislocations in the 

^ (111) plane,of face-centered cubic metals split into two 


- half-dislocations. Changes with temperature in the 

« “spacing of these half-dislocations, particularly the 

~ possibility of their recombining at higher temperatures, 
e 


ee 


$ 


=n 3 
+ 


- 


M]. S. Koehler, in Imperfections in Nearly Perfect Crystals, 
edited by W. Shockley (John Wiley & Sons, Inc., New York, 
1952), p. 197. 

15A. Granato and K. Lucke, J. Appl. Phys. 27, 583 (1956). 

16 D. H. Niblett and J. Wilks, in Advances in Physics, edited by 
N. F. Mott (Taylor and Francis, Ltd., London, 1960), Vol. 9, p. 1. 

. „17A. S. Darling, J. Inst, Metals 85, 489 (1956). 
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DERIVED FROM Fic. 2. Dislocation 
LA INTERNAL FRICTION damping coefficient 

MEASUREMENTS in silicon derived 
from the disordered- 
core model, com- 
pared with that de- 
rived from internal 
friction measure- 
ments by assuming 
a uniform dislocation 
distribution and free 
length greater than 
107? cm. 
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ESTIMATED 


DAMPING COEFFICIENT B 
o 
— 


1 
0.8 
T/Tm 
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could affect the temperature dependence of the logarith- 
mic decrement. It is not clear what sort of effect this 
might be, although it is not expected to modify the 
behavior drastically. It is evident that further theoreti- 
cal and experimental investigation is required. 


DISCUSSION 


A thermally disordered dislocation is a complicated 
system whose behavior can only be specified accurately 
in terms of the motions of a large number of atoms. The 
situation lies midway between a macroscopic and micro- 
scopic system; too small for the averaging processes 
implicit in the treatment of macroscopic systems to be 
used with complete rigor, and yet so large that analysis 
in terms of microscopic variables such as local phonon 
modes, or the statistics of some disorder parameter such 
as the kink density, would become immensely complex. 
The model which has been presented here uses the 
macroscopic approximation, modified at the points 
where it obviously breaks down, such as in the thermal 
conductivity. However, the validity of this approxima- 
mation is purely intuitive, since the treatment is, 
strictly speaking, inconsistent; neither can it be said 
at present to be supported by any good fit to experi- 
mental data. In its defense, it may be argued that the 
dislocation is at least of macroscopic length, so that 
although thermal fluctuations, for instance, may ex- 
ceed the systematic difference due to dislocation motion 
over a very short length of line, over the total length 
they will be averaged out. : 

, One prediction of the theory is that even in an 
isotropic material, a screw dislocation will lose its 
radial symmetry at high temperatures. In a real crystal, 
the anisotropy will give preferred glide planes, and 


presumably the strip core would form in one of these 
planes. 


4» 
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The disordered-core theory modifies to some extent 
the concepts of interaction between impurities and the 
dislocation in several ways, which we will now briefly 
enumerate. Impurities which condense onto the dis- 
location will not only form a Cottrell atmosphere, but 
will also become incorporated in the core, the ratio of 
the equilibrium concentration in the core to that im- 
mediately surrounding it being of the order of the 
segregation coefficient for normal solidification. A core 
containing impurities will have a higher thermodynamic 
potential, and hence in order to minimize this by dilu- 
tion, a further increase of core width will occur. Motion 
of the dislocation away from the impurities will require 
the dissipation of a pinning energy, but this will be 
Jess than for the small-core dislocation normally con- 
sidered. In addition, an oscillation of the dislocation 
with amplitude less than the dislocation width will not 
involve the same pinning energy, and the impurities 
will act as "soft" pinning points in that case. If a 
dislocation moves sufficiently slowly for impurities to 
follow, there will be a significant frictional force, the 
relaxation time of the process being of the order of the 
time for an impurity to diffuse across the core. All 
these points require further exposition, and it is in- 
tended to do this in a later paper. 

A further effect of the disordered core will be seen 
in the “pipe diffusion" of defects. Diffusion coefficients 
for this process are high!*'!? and special structures, such 
as that of a hollow core, have been postulated to explain 
their magnitude. Queisser eż al.!8 derive an activation 
energy for dislocation diffusion of phosphorous in 
silicon which is about 1.5 eV less than that of bulk 
diffusion. This difference was interpreted in terms of 
excess concentration of vacancies and impurities near 
the dislocation due to the formation of Cottrell atmos- 
pheres. However, the disordered-strip model shows that 
in the 1050 to 1200?C region where the measurements 
were made, there is appreciable broadening, which 
significantly lowers the Cottrell binding energy. Excess 
vacancy concentrations outside the core will therefore 
not be so large, and this mechanism of enhanced 
diffusion becomes less important. The alternative ex- 
planation is that the lower activation energy is associ- 
with the disordered-core material itself, which pre- 
sumably would have an energy closer to that of the 
liquid state. Such a theory is being developed, taking 
into account possible changes of core size with im- 
purity concentration in the core and the ratio of this 
to the concentration in the surrounding solid. 


SUMMARY 


here attempts to deal with the 


J ented y 
E f dislocation configuration and 


very complex problem o 


18 H. J. Queisser, K. Hubner, and W. Shockley, Phys. Rev. 123, 


1245 (1961). à 
"d x ARAM A. Lasker, and R. Thomson; J. Appl. Phys. 31, 


445 (1963). 
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damping force by applying macroscopic tlermodyna- 
mics to a microscopic situation. Although not strictly 
correct, the model has the advantage of involving a 
relatively simple conceptual picture of the dislocation 
at high temperatures, that of the thermally disordered 
core. The disordered material is considered to be quasi- “ 
liquid, so that appeal may be made to the concepts of 
latent heat of disorder, and of viscosity, in the cal- 
culations. “Melting” of the core relieves the stresses 
in the vicinity. It seems to be a reasonable assumption 
that a flat strip configuration does this most efficiently, 
in both edge and screw dislocations, although this has 
not been proven. A cylindrical disordered-core coaxial 
with the dislocation does not relieve stresses. The core 
width increases considerably as the melting point is 
approached, and will be greater than several atomic 
spacings over a significant temperature range in many 
crystals. Near the melting temperature, the surface 
energy of the order-disorder interface is a controlling 
factor in the size, and uncertainty in the value of the 
surface energy is reflected in an uncertainty in the 
width here. 

Two separate contributions to the dislocation damp- 
ing coefficient are derived: thermodynamic and viscous. 
The thermodynamic damping originates in the heat 
required to produce disorder in the forward edge of the 
moving dislocation and the entropy changes in trans- 
ferring this heat from the solidifying trailing edge. It 
varies inversely as the temperature, but disappears when 
the dislocation narrows to only one or two atoms wide. 
Viscous damping decreases with temperature due to 
both the increase of dislocation width and decrease in 
viscosity of the disordered core. Both mechanisms 
appear to give damping coefficients of a significantly 
large order of magnitude; although good estimates of the 
viscosity of the disordered core are difficult to make. 

Application of the theory to silicon shows that the 
damping will be large and will predominate near the 
melting point, while at lower temperatures, it is reason- 
able to suppose that viscous damping becomes impor- 
tant. The magnitudes are consistent with those derived 
from experiment. In the case of copper, the thermo- 
dynamic damping is expected to dominate the viscous 
over a wide temperature range, but the form of the 
temperature dependence deviates considerably from” 
the experimentally observed variation. When the tem- 
perature becomes so low that the core is only two or 
three atoms wide, the disorder is better described as an * 
assembly of kinks, and the y" complex theories of , 


kink generation and diffusion may then be used in place^ *. | 
of the viscosity approximation. = 


A disordered core is expected to modify dislocation 
interactions with impurities, and to lead to a higher 
equilibrium concentration of impurities in the core, 
“soft” pinning effects, additional dislocation drag 
mechanisms, and enhanced impurity diffusion along 
the dislocation. j Ca 
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APPENDIX: DEPENDENCE OF STRESS-FIELD 
ENERGY ON CORE SIZE 


1. Cylindrical Core 


The dislocation core is considered to be a cylinder of 
radius p. We wish to calculate the stress-field energy per 
unit length, Uo, in the region outside a larger cylinder of 
radius po, and to obtain 


Lim (9U yp). 
PoP 


Consider a straight dislocation lying along the z 
axis and let o and e; be components of the stress 
and strain produced by this dislocation. Then 


1 
Us | oties dS , (A1) 
9 


4 


where the upper and lower indices denote contravariant 
and covariant components,” and the usual summation 
convention has been used. The integration is taken over 
the region of the xy plane between circles of radius po 
and R (large). For an isotropic medium the stress and 
strain components are related by ?! 


1 y 
ore xe , (A2) 
2G +1 


y 


where the g;; are the components of the metric tensor. 

The physical stress components? for a screw dis- 
location are given in Eq. (2.16) of Cottrell.?? They are 
independent of p; hence VU will also be independent, and 


Lim (9U/0p)—0. 
[a 


For an edge dislocation one cannot take the stress 
components directly from Eq. (2.10) of Cottrell. It is 
necessary to correct the components so that the stresses 
across the surface of the core vanish. Following the 
method outlined by Cottrell one obtains the corrected 
covariant stress components: 


Gb sin p? 
EV 
re 2r(1—»)r r 
Gbr sin p? 
E n 
. 2«x(1—») 7? 
K eae Gb cos p 
as e ard (A3) 
Te 2r(1—v) n 
zt e Cze= vlos t (72) 000) 2 


See, for example, G. E. Hay, Vector and Tensor Analysis 
(Dover Publications Inc., New York, 1953), Chap VI. 

2 This can be obtained by generalizing Eq. (12.3) of Ref. 24, 
which is given in Cartesian coordinates. The m in this equation is 
the reciprocal of Poisson's ratio (m=1/y). 

2A. H. Cottrell, Dislocations and Plastic Flow in Crystals 

> (Oxford University Press, London, 1953). 


Now, using the relation o%=gi*gi'oj. to obtain the 
contravariant stress components, and neglecting terms 
in 1/R? and 1/R*, we derive 


2 4 


Uo 


Gb? [ R p 


= n—— aF ] , (A4) 
4(1—»)L po 2(1—»)w 4(1—»)p& 

and again 

Lim (8U/8p) —0. 

Pye 


We might note that Lim,,., Uo does not agree with the 
expression given by Cottrell [the right side of Eq. 
(2.22b) of Ref. 22]. Cottrell calculates the energy by 
considering a half-plane cut made in the unstressed 
material and determining the work done in displacing 
the two sides of the cut by the Burgers vector. However, 
this procedure ignores the work done in changing the 
shape of the core during the displacement, and it is 
apparent that this work is of the same order as the 
correction terms. 


2. Flat Strip Core 


Let the dislocation core be a strip of variable width 
2p in the glide plane and fixed thickness q. Take a fixed 
boundary outside the core, width 2po, and thickness q; 
Us is the elastic energy outside this boundary. We are 
interested in the case g<p and hence, for simplicity, will 
calculate Uo in the limit g— 0. Suppose U’ is the energy 
outside the strip of width 2p. As q— 0, the volume 
between the two strips becomes vanishingly small and 
hence U’ — Up. Since the limits involved in calculating 
the partial derivatives are independent of q, they will 
also become equal as q — 0. 

A convenient way of deriving ÜU, is the use of the 
analogy between the magnetostatic field energy of a 
current and the elastic field energy of a dislocation.?* 
The analogy is effected in this case by substituting the 
elastic modulus G8 for the permeability uo, and the 
Burgers vector 5 for the current 7. We regard the strip 
as having a uniform distribution of Burgers vector 
between —p and +p, and hence take a uniform current 
distribution for the analogous case. The magnetostatic 
energy is related to the inductance L by 


(UL SEIS 


while L is readily obtained from the formulas of Oberhet- 
tinger and Magnus” for the characteristic impedance of 
a transmission line consisting of a flat strip surrounded 
by a large outer conductor. If the outer conductor is 
sufficiently large, a good approximation is 


L= (uo/2x) In(kR/p) , 


? R. de Wit, in Solid State Physics, edited by F. Seitz and D. 

Turnbull (Academic Press Inc., New York, 1960), Vol. 10, p. 249- 

** F. Oberhettinger und W. Magnus, Anwendung der Elliptischen 

ne es in Physik und Technik (Springer-Verlag, Berlin, 
> P- 63. 
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where k is of the order of 2.5 and depends on the 


seometry of the : i 
ge try the outer conductor, and mks units are 
used. Therefore 


Uo= (GBb?/4r) In(kR/p) 
and 


9Uo/dp= — (GBb?/4rp). 


IN 


DISLOCATION CORE A1649 
The same expression is obtained by an approximation 
in which the material within a circle of radius p is 
taken to be uniformly stressed, in tension above and in 
compression below the strip, while outside the circle 
1t IS stressed just as if the strip were of infinitesimal 
width. This relatively simple stress distribution appears 
to be a good approximation to the exact situation. 
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The secondary emission yield of metals by the surface effect 6 is expressed in terms of the surface photo- 
electric yield yw) for a radiation of frequency w and angle of incidence of cos"! 155-1) which is about 52°. 
It is shown that ô~ (2ra Ep) 1f 4,?? y(w)dw/w, where Ep is the primary energy in atomic units, a= 1/137, w1 
is the threshold frequency, and o» depends on the energy of the primary and may be replaced by «. For a 
square-well potential model for a metal, 6~10-4/£, with a relative error of order (Ep/Ey) In E, Er), 


where Kr is the Fermi energy. 


1. INTRODUCTION 


HE purpose of this paper is to establish a general 
and simple relation between the surface effect in 
secondary electron emission! (SSE) from metals and the 
surface photoelectric effect?? (SPE), and to use this 
relation to show clearly why the SSE is so small that it 
can be neglected in explaining the experimental facts. 
Such a relation is of interest for its own sake, and in 
addition a new examination of the problem is desirable 
since most of the published papers on the SSE, are 
incorrect.*:? 


1For a review of secondary emission see O. Hachenberg and 
W. Brauer, in Advances in Electronics and Electron Physics 
(Academic Press Inc., New York, 1959), Vol. XI, p. 413; A. J. 
Dekker, in Solid State Physics, edited by F. Seitz and D. Turnbull 
(Academic Press Inc., New York, 1958), Vol. 6, p. 251. 

2K, Mitchell, Proc. Roy. Soc. (London) A146, 442 (1934). 

3T. Adawi, Phys. Rev. 134, A788 (1964). This paper will be 
denoted by I. 

ON. Viatskin, Zh. Eksperim. i Teor. Fiz. 9, 826 (1939) treated 
a semi-infinite square-well potential model. The basic formulation 
is correct but the final integrations and conclusions are obscure 
and incorrect. : = ; 

5 W. Brauer and W. Klose, Ann. Physik 19, 116 (1956). This 
paper has been assumed correct in the two review articles cited 
in Ref. 1, but it contains unfortunately basic errors. They treat 
a finite square well of width 2a in the limit that a c. The cor- 
rect final state which should be used in the transition matrix ele- 
ment is the function v* used here. Using the notation and Eq. 
(4.5) of I we have that the incoming wave v ls given by 
p=} (69/A.*+2/Aa"). When the correct limiting procedure is 
applied as a, the results of the finite and the semi int nits 
square well become identical as has been discussed in genera i t 
With this in mind, and for a primary electron incident norma 
the metal surface, none of the four delta functions obtained by 
Brauer and Klose and on which essentially all their discussion is 


i i i ined by terms 
based should arise; and the effect is precisely determine : 
MERE oU they ignored. The yield by the surface et x 
secondary emission as in photoelectric emission 1S indepen Us of 
the dimensions of the model analyzed, and there is no n 


introduce an ad hoc depth d, for calculating the effect. 
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We shall use for convenience Hartree's atomic units 
in which A, the electron mass m, the Bohr radius ag, and 
the electron charge e are unity, and the speed of light c 
is 137. As was done in discussing the surface photo- 
electric effect in I, we treat the conduction electrons as 
independent noninteracting particles. The motion of a 
single electron in the y and z directions is free and can 
be described by the plane-wave L~! expi(kyy-+k.2) obey- 
ing cyclic boundary conditions and normalized to unity 
in a square area of side L. The « motion is bound by a 
general surface potential V(x) which is the same for all 
electrons and varies only in the direction x which is 
normal to the metal surface. This motion is described 
by the wave function L~'$(x) normalized to unity. 
The length L is the thickness of the metal plate which 
extends from +=—L to x=0. The function ¢o(x) 
satisfies the wave equation, 


Hi$o7 Ego, 


where . 
Hy(x) 7 —192/0x?- V (x). 


(1.1) 


(1.2)* 


Inside the metal, V(x) is a constant equal to — Vs, 
and we can write ¢o(x)=2'? sin(&,x-I-y) where y is a + 
phase factor depending on V(x), and lbj— Est Vo. ` 


The potential V(x) rises to zero in the surface barrier - + 


regions near x—0 and «=—L in a distance much less 


than L, and ¢o(x) behaves as exp(—px) for large x, - 


where —3p°= E». We ignore thermal effects and assume 
that all energy states below the Fermi energy Ep are 
occupied, and all energy states above the Fermi energy 
are empty. The conduction electrons in this model 
assume k values which fill “a Fermi hemisphere” gi 

by #=k and b> 0, where kp is the Fermi momen 


^ > oc. 
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i.c., Ep Y 2. The number of electrons per unit volume 
in the interval dk is given by 4d?k/(87*), where the 
factor 4 is contributed by the usual factor 2 for spin 
and an additional factor 2 to take account of the dis- 
tribution of the bound states of the x motion in the 
continuum limit? (large Vol/?Z). 
We consider a primary electron of momentum K and 
energy E, iK? incident normal to the metal surface, 
namely, K=(—K, 0, 0). The mutual Coulomb repul- 
sion between the primary electron and the conduction 
electrons bound by the surface potential offers a direct 
mechanism for a secondary electron emission from the 
bombarded metal surface. It is with this process that 
we are here concerned. We do not concern ourselves 
with those electrons which are left excited after colliding 
with the primary, and which diffuse in the metal, and 
appear later as secondaries. We hardly need to empha- 
size that the effect we are discussing exists only by 
virtue of the binding of the conduction electrons to the 
surface potential, since it is well known that no second- 
aries can be directly emitted from a metal surface 
bombarded with normally incident primaries if the 
electron gas is completely free.® 
In Sec. 2 we discuss the scattering problem of the 
system consisting of a primary electron and a conduc- 
tion electron, hereafter referred to as secondary. We 
shall consider only fast primaries (nonrelativistic) 
whose energy is much greater than the Fermi energy. 
Indeed, the situations of most practical interest! are 
those where the primary energy Æp is of order 10 (272 
eV) and E,/Er of order 100. We are then justified in 
treating the Coulomb field of the two electrons as a 
perturbation and neglecting the effect of the surface 
potential on the primary wave functions. We shall, 
however, treat exactly the effect of the surface potential 
on the motion of the secondary in the final state which 
leads to the use of the well-known “incoming wave"*? 
in the transition matrix element. The two electrons 
will be treated as distinguishable particles described by 
a product wave function, since we shall see that the 
main contributions to the SSE come from small-angle 
scattering for which exchange effects are not important. 
In Sec. 3 we use the small-angle scattering approxi- 
~ mation and derive the main result of the paper which 
expresses the secondary emission yield by the surface 
effect as an integral over the surface photoelectric yield. 
In Sec. 4 we illustrate this result by treating two ex- 
< amples. In the first example we use experimental meas- 
* urements on the SPE. In the second example we apply 
^ «our formulas to the previously treated*5 square-well 


potential model for the metal, and obtai the yield by 


<> 


— the SSE which tums out to be in agreement with 
Baroody's unpublished result.? 


6 H. Fröhlich, Ann. Physik 5, 13, 229 (1932). 
7See G. Breit and H. A. Bethe, Phys. Rev. 93, 888 (1954); 
I. Adawi, Am. J. Phys. 32, 211 (1964). 
8%. M. Baroody (unpublished report); and abstract in Phys. 
- Bev? 92, 843 (19053). — 
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2. FORMULATION 


The unperturbed Hamiltonian of the system Ho con- 
sists of three commuting parts: Hı defined in (1.2), H3 
the kinetic energy of the secondary for the y and z 
motions, and H; the kinetic energy of the primary. 
Thus, 


Ho= a+ Hots, (2.1) 
H= —1 (0?/0y*4-0?/07^) , (2.2) 
H37 —4Vg, (2.3) 


where r= (x,y,z) is the position of the secondary and R 
is the position of the primary. The perturbation po- 
tential H’ is given by the Coulomb field, 


H'=1/|R—r]. (2.4) 


We consider as in Sec. 1 a metal plate defined by 
—L<x<0, and discuss the secondary emission from 
a square region of area L? of the surface x ~0 which is 
bombarded by a uniform primary beam moving in the 
—x direction. The interaction volume is Z5 and we 
normalize the primary wave function to unity in this 
volume. Thus the initial primary state is L”? exp (;K- R) 
with K= (— K, 0, 0). The initial state of the secondary 
has been defined, and we take for the initial state of 
the system the wave function yo defined by 


Hobo Epo, 
Yo= L> expz(K- RJ- k.: x)óo(4) , (2.5) 
6— Eg4- 3 (ke +K?), 
where for any vector s, s, = (0,5,,5;). 
The solution v* of the perturbed problem for out- 
going waves satisfies the scattering integral equation,” 
Vt— yet [1/(6— Hotie) HY. (2.6) 
If we write 


Vi yet ys, (2.7) 


where y, is the scattered wave, we obtain to first order 
in H’ that 


v.=[1/(6—Hotic) HW. (2.8) 


We expand H' in terms of the eigenfunctions of the 
primary and write the usual Fourier series, 


H'—1/|R—r| =>). (4/2) L5 expfix-(r—R)], (2.9) 


in which the components of x are given by (27/L) 
times an integer which is also the case for k, and kz. 

If in (2.8) we substitute for H’ and v; the expressions 
given by (2.9) and (2.5) and observe that the relation 


(6— Hotie) expi(K’- R-- k,/- r) 
— [expi(K'- R+-k,’-1)] 
XL[6—H;—1(K?--k3)]3, (2.10) 
* B. A. Lippmann and J. Schwinger, Phys. Rev. 79, 469 (1950). 


9 M. Gell-Mann and M. L. Goldberger, Phys. Rev. 91, 398 
(1953). 
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SECONDARY AN D 
holds for a general K' and k', we obtain 


y=}, exp[ (K — x): R4- (k,-L- E r] 


X (42/2) L^6(wx), (2.11) 


(x) = | G*(E'; xx )e'"eo(z)dx', (2.12) 


G*(E)m (E' — Hy ie), (2.13) 
E'zipeke, (2.14) 
O= —k, K —$k2—k— kn, (2.15) 


which reduces the problem to a one-dimensional prob- 
lem, since the Green's function G*(£/) contains only 
the Hamiltonian of the x motion. 

To obtain the secondary current we examine the 
asymptotic behavior of y, for positive large x. For large 
x and finite x’ we have as in I that, 


GE’; 2,2’) ~ (ig) est (a?) , (2.16) 


where v(x) is the “incoming wave" defined as the 
incident wave exp(—iqx) plus the reflected and trans- 
mitted waves due to the potential V (x). From (2.16) and 
(2.12) 


$ sx) o (ig) Me, (2.17) 


M= [Gee demGespisn) (2.18) 


From the cylindrical symmetry of the problem, the 
total secondary particle current is in the x direction, 
and we need to calculate, therefore, the current element 
dI. contributed by V, and threading the area L’. Using 
(2.17) in (2.11) and the usual formula, 


à, pA 
3D Y Vs 4 [erase (2.19) 


2i Ox Ox 


dI (x) 


we obtain 

dI =F, 167c*| M |g1L7. (2.20) 
Notice that in (2.19) the y and z integrations give L’, 
the R integration gives Z°, and the orthogonality of the 
primary final states eliminates any interference terms 
between different values of x. The total current I, is 
obtained by summing (2.20) over the conduction elec- 
trons in the volume Z$, and if we replace the summations 


by integrations we obtain 


I= Í (Pk/ 209) Í (O/a, (2.21) 


over the Fermi 


integration is extended 1 
pm h tricted by the 


hemisphere and the x integration is res 
condition that q is real. 


If the normally incident primary is to lose energy tO 
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the Secondary, x, must be negative. Furthermore, w of 
(2.15) must exceed the metal work function v1 before 
any secondaries can be emitted directly by the present 
mechanism. This implies that some screening of the 


Coulomb field is automatically included in the model ~ 


and we see from (2.15) that, to a good approximation, ` 
all k<w1/K do not contribute.” 

The SSE yield 6, which is the number of secondaries 
emitted per incident primary, is obtained by dividing 
I; by the primary current K/L, namely, 


à— (2/2 K) Í (Pk/ 225) | Pulg) M2), (2.22) 


To show clearly that ô is independent of Las Lox, 
we must investigate the nature of M in this limit. It 
was shown in I that in discussing electron emission 
from the surface x—0, a finite plate model with two 
surface barriers, one at x=0 and the other at x— — L, 
reduces in the limit L—»« to a semi-infinite model 
with one barrier at x0. The limiting “incoming wave" 
v(x), which must be used in (2.18), is a solution of the 
wave equation for this limiting potential for x> — L, 
and is zero for x<— L. Thus inside the metal v(x) 
« exp(—ik,’x) where a cutoff is implied to render v(x) 
zero as x ^, and where k,”=¢?-+2Vj=k/+2w. The 
integrand in (2.18) for x« 0 is of the form 


expz(—R/ katre 


for which the phase cannot vanish, since k; >k and 
k,<0, and hence the integral cannot display a delta- 
function-type singularity which, if it were present, 
would bring a length? L in | M|?. Obviously, the inte- 
gration in (2.18) over x between x=0 and x= % is con- 
vergent since ¢o(x) is a damped wave for large x. We 
conclude that except for thin films, to which still the 
present formulation can easily be adapted, | M |°, and 
6 are independent of L, and we can set formally L=1 
in the preceding equations. 

For completeness, we give now the results of the 
time proportional transitions method in discussing the 
scattering problem. A detailed derivation can be found 
in Gell-Mann and Goldberger, or obtained by other 
means. The transition from the initial state y» of (2.5) 
to the plane-wave final state V; of energy 6’, where" 
Vy expi(K'- R--k'«r), K-K-x and k'— (ge, 57), 
is determined by the transition matrix element Tyo 


Ld 


u For a uniform electron gas screening sets in for momenta : 
«o,/ K where wy is the plasma frequency, see, e.g., H. A. 
Physica 13, 401 (1947), J. Lindhard, Kgl. Danske Viedenskab. 


Selskab, Mat. Fys. Medd. 28, 8 (1954). In the surface problem e = 


under consideration, the electron density, plasma frequencs, and 
dielectric constant are variable in the region of interest, and it is 
not deemed worthwhile to discuss screening beyond what naturally 
curs in the problem. ; 
h Notice that as Lo, 2rôlk) =f -u dx expGkx), and 
2x|5(5) |? is interpreted as 14(k). If M had a delta function be- 
havior the sum over z would have to be restricted to a length d, 
and not L, where d, is a characteristic escape depth for uq 


^ 
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which is given by 
Tyo= (p; | H'4- V |y?) 


After some algebra (2.23) gives to first order in H’ the 
result, 


(2.23) 


Tj Ws HW), (2.24) 
where V7 v* (x) expi(K'- R--k/* x), the only differ- 
ence between (2.24) and the simple version of the Born 
approximation is the replacement of exp (igx) by v*(x). 
The R integration in (2.24) gives 4x ?, and the y and z 
integrations give Kronecker deltas, since momentum is 
conserved for these directions, and we have 


T= Ank ? Mk, kyteyðkz etna (2.25) 


By summing the transition rate, 27 | Tyo ?5(6'— 6), over 
k, and k,’ holding x fixed, and integrating over q we 
obtain 16z2k-*| M |?*g7, which when summed over x and 
k gives exactly Eq. (2.21) for the total current (with 
"ab 


3. RESULT 


The yield 6 of (2.22) is controlled mainly by the Cou- 
lomb scattering which is proportional to x‘, and by 
the matrix element M which is the Fourier transform 
of vpo, where v and ġo are eigenfunctions of Hı whose 
energy difference is e. The problem is basically the 
same as the ionization of atoms by fast electrons for 
which the physical principles are well established. We 
shall here use the energy kr? as a characteristic energy 
to obtain order of magnitude estimates, in much the 
same way as the binding energy is used in atomic 
collisions. For x;—0, M —0, and for small x, we write 
exp (kx) c- 1--ikzx, and we have the well-known dipole 
approximation, 


M ~ixAv*| «| oo). (3.1) 


From the commutation relations, [v,H:]=0/dx=D, 
and [D,H,; |=0V/dx, we can rewrite? (3.1) as 
Mz —ikz9- 1M, 


3.2 
M;eG*pDléjm-elVlés. — 77 


~~ For kkr the binding effect of the potential V (a) 


` should become negligible, and the conduction electrons 
can be treated as free electrons. Since we know that in 
such a situation. no secondaries can be emitted, we 
conclude that. M must decrease very rapidly as kz 


^ 


< becomes large. This coupled with the fact that the 


< “Coulomb field strongly favors small momenta transfer, 
——— leads to the conclusion that the major contributions to 


+ Inc. Reading 


the yield come from those values of x for which x;Er 


13 H, A. Bethe, Ann. Physik 5, 325 (1930). See also N. F. Mott 
and H. S. W. Massey, The Theory of Atomic Collisions (Oxford 
University Press, New York, 1952), 2nd ed., Chap. XI; L. D. 
Landau and E. M. Lifshitz, Quantum Mechanics, translated by 
J.BeSykes and J. S. Bell (Addison Wesley Publishing Company, 

ing, Massachusetts, 1958), Chap. XV. 


fx i 
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and or. The dipole approximation (3.1) applies and 
there is no need to consider large values of «z. 

The expression (2.15) for w will now be simplied. To 
estimate various terms, we recall that for a fractional 
energy loss A by the primary and for a scattering 
angle © we have K’~ (1—4A)K, —xz=3(A+ O?) K, and 
kı% OK. The term 3x,” can certainly be neglected. 'The 
term k,-x, is zero on the average, and it is small com- 
pared to —x,K for @?KA~kr’/K? or O<kr/K. For 
O>kr/K, Kri kı. We can, therefore, neglect the 
term x,* k; without committing a serious error. Equa- 
tion (2.15) now takes the approximate form 


(3.3) 


3 
(zz —kz K — K, 


whose accuracy will be later discussed. 
By using (3.2) and (3.3) in (2.22) we can hold x 
fixed and integrate over k, and we can write 


ò= (TaK)! | Bk x ly (w) , (3.4) 
where, by definition, 
y(w) = 2e | (Bk/23)g| Mi|?, (3.5) 


and a=1/137. To identify y(w) we shall use the results 
of I on the first-order SPE. For a monochromatic 
radiation beam of angular frequency w and polarization 
(E vector) in the plane of incidence, we find by sum- 
ming Eq. (2.15a) of I over k that the photoelectric cur- 
rent density 7; is given by 


I,;=137ny(w) sin’6, (3.6) 


where n is the number of photons/unit volume in the 
incident beam, and @ is the angle of incidence. By 
dividing Jı by nc cos@ which is the normal component 
of the photon current density, we obtain the SPE yield 
Y (w,6), namely, 


Y (0,0) = y (w) (sin’6/cos6) . (3.7) 

Te y(w) is the SPE yield for 0=cos~ 3[(5)/?—1] 
=51°50/. 

The azimuthal integration in (3.4) gives 2r. The re- 


maining double integral is written using the new vari- 
ables, w= —k-K—x.? as in (3.3) and u=x,2, and we have 


ô= (2raE p) Í (dw/w)y lw) (o) , (3.8) 


where 
uo ð 
F@)= 22> | du +B JE Gee) 2B . (3.9) 
0 9E, 


Here, w2=KzoK, where x2» is a maximum value of —«z 
consistent with the approximation —«.<%r, and “o 


o 


Se —— E 


v En ET 


Mf 
v? 


o 
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=w2—w. Performing the integration i } ; : 
gration in (3.0), we obtain bracket of (3.13). Since this correction is sinaller than 
"E. o E, uA the second term retained in (3.13), we can ignore it. E 
| Ei n (b) Large x2: If in (3.10) we set mo=Ep—w which 3 
p U— 105 Td allow for the primary to lose all its energy we 
: tai «E —9 
r DL Ey Fayuka] „  Qbtain (for e«E,) that 1 
here (Es - 20) [4-2( Es4-o)u4-o]ls Ples Ey) OO E DRM 
wher 
» ; Comparing (3.14) and (3.11) we see how little the large 
11,35 — (Ey) 4e (E) -2E p0)". (3.10) momenta transfers contribute to the effect. Since the 
For of E &ug«cE,, Eq. (3.10) give dipole approximation of (3.2) is expected to overesti- 
puoi, Eq. (3.10) gives mate M for large xz, Eq. (3.14) cannot be trusted and 
HOR 1+ (w/E,) In(1--2u9E 72) . (3.11) will be discarded. 
hikes: e 1 ; (c) —4x2 term: This term can easily be included in 
i D QUE that P) is a slowly varying function of — (3.3). The resulting integral is similar to (3.9) and can 
SSE ed T DNE SPE ud in (3.8) controls the be evaluated exactly. We find that F(w) of (3.11) 
a 25 xm nex nown that?? ylw) reaches a peak remains the same (except that 1 is replaced by } inside 
S ; r rops rapidly with increasing frequency. the logarithm) and (3.13) is not altered. Equation ; 
If we take x;o7vr/4 we see that wkp? for K/kr>4 — (3.14), however, now takes the form i 
and wz may be replaced by «o in (3.8). This shows that j 
the important values of w are of order kp? and that the F (w, Ep—w) = (5/3)+ (w/ Ej) In(BEZ/3w) , (3.15) 
emitted secondaries are of low energy. If u«w?/E>, 
(3.10) shows that F(w) is small, but then y(w) is small to which the previous remarks apply. 
since z^ and these values of w are not decisive in Thus our approximations are good, and we conclude 
determining à. We set 4/9» in (3.11) and neglect the that the simple formula, 
one in the logarithmic term, and substitute the result 
in (3. i : 
in (3.8) to obtain 87» (2raE 5)! | (das/w)y(w) , (3.16) 
oo "t 
~ y jl 
87: (2raEp) i : (dee) (v) is quite satisfactory, and that (3.13) contains the major 
: corrections to the SSE yield 6. 
X[1-4-(e/Ej)lIn(QQesEg»?)]. (3.12) 
To simplify (3.12) we treat the logarithmic term as a AERE a 
constant in which we set w=fpıkrK and w=wmaz A. Experimental E 


where f, is a small fraction and wmax is the angular 
frequency for which the SPE yield y(w) is maximum. 
We can write wmax=Bokr where f» is of order unity, 
and denote 61/6: by a new parameter B. We obtain 
the result, 


o 


= QraE;) ] (ONO 


XL+ (w/Ep) InB(Ep/Er)*?]. (3.13) 


The leading term in (3.13) would have been obtained 
if we neglected the term x,’ in (3.3). The logarithmic 
term gives a correction term of order $(Er/E> 
XIn(E,/Er) which is about 25% for E,/Er~20. 

Let us now return to the approximations made: 


(a)- Ki* x1 term: This is the most bothersome term in 
the calculation and was neglected in (3.3). We can 


estimate the error by including this term in (3.9) and 


averaging the denominator over k to obtain (u+) 


-F2u(E,4-Er/5) for the new denominator. The integra- 
tion can be carried out and we obtain the correction 


term, (Ep/SE,) InB(E,/Er)'^ to be added inside the 
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To obtain an estimate of (3.16), let us use the experi- 
mental yield curves Y(w,6) for Na and K which are — — 
reproduced in Weissler's review article.* From the 
relation (3.7) we obtain y(w). By a crude numerical 
integration we find that the yield integrals, /-y(w)da/w 
are 76.7 X 10-4 and 410-5 for Na and K, respectively, 
which lead to the values 6~1.5X10~/EZ, and 0.9 
X10-/E,, respectively. This shows clearly that the 
effect is entirely negligible, since the observed values 
of à are of order unity for E57-10. ac 


B. Theoretical . 


A theoretical calculation of ô is possible tor a squa 
well potential model. Let V(x)-— —V» for z«0, 
V (x) =0, for x0. The wave functions v and ġo can 


4G. L. Weissler, Handbuck der Physik, edited by S. F 
(Springer-Verlag, Berlin, 1956), Vol. 21, p. 351. Fi 

the measurements of Maurer at @=60°, and for 
ments of Suhrmann and Theissing at 0=65°. The 
K has to be cut off, since a true SPE must as 
rapidly with frequency. For square well poten 
y(u) cw? as o — * as can be deduced from Eqs. 


"^ 
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written don easily as in I, namely, 


ence. 
Ge : P e i4 oF p) 
de MR (4.1) 
Li t. Iks 
gola) = zw g tker} eiker, ; eP? 
katip b. ip 
forx«0 and «x0, 


where the propagation constants have been previously 
defined. By using (4.1) in (2.18) we obtain after some 
lengthy algebra: 


| Mte 32e (9 (^ /R, 


(4.2) 
R= (Qo —2k,k Hre) 4e?+ (e? + 4]? — 4o) ]. 
We see that if ir Kw, Rz169! and 
|M|*e 2i 2gR2(5/—g)(kz--g))w *, — (43) 


which is precisely the dipole approximation of (3.2) 
with which we could have started. The advantage of 
giving the exact |M|? is to show how good are the 
approximations of Sec. 3. It is evident from (4.2) that 
for a finite w and large kz, | M|?« x;-* which justifies 
the neglect of large values of xz. 

Tt follows from Eqs. (4.3), (3.2), and (3.5) above, or 
from Eqs. (3.9) and (3.10) of I, that 


3) = (160/7) (e/ Er) To), (4.4) 
i el 
Tako) = — (1—e)(e+ 9-7)" 
NS ex0—3»0 
xeP[(erF9)?— (e+ Q—n)'? Pde, (4.5) 


where Q—«/ Er, e= (Vot Eo)/Er, and n=Vo/Er. 
If we substitute (4.4) and (4.5) in (3.13), define the 
parameter À by 
A= (Er/Ep) InB(E,/ Eg), (4.6) 


and introduce the integration variables e and ¢ where 
¢=e+2—n7, we have 


4 [2] 
ô= dt[ (trn) 2 — eee 
= | Ltn 2] 


Se p. 1 
X| 1—9e25(,6ede, (4.7) 
where r : 
i: S66) = mti 9-4 e. (4.8) 
X 
Li * : ^ r 
LI 
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Expanding S in a power series of e and integrating 
term by term we obtain 


16 1 o /—4 1 
i- vas Je» —— 
TÉ, 15 n-0N qn (2n4-5) 
1 3X E 
x| ar o (4.9) 
Qn+3 (n+1)(2n-+7) 
where 


Cam [ irt Eno- Per een-ema 
0 


1/2 N2n4-9 8 
: ) 4 10) 


(fe ? 
nm+2/2n+6 (2n+5)(2n-+3) 
C1— 12/8—8/3-— 0.0822, Co= (37/16) — 8/15 — 0.0557 , 

Cı= (117/128) — 8/35 —0.0414. 


-co( 


The leading term in (4.9) is 6~0.06 (12 E 59?) * which 
is of order 10-*/E, since 7 is about 2. The relative error 
committed by neglecting terms in à is of order 
[n(C_1/Co) 3-9/7 ]No- 4A, which could amount to 25% 
for E,/Er~100 (if we take 8— 1). 

This agrees with Baroody? who obtained the first two 
terms of (4.9) with A=0, by using the same method as 
did Viatskin, but making an independent evaluation of 
the final integral. It also agrees surprisingly well with 
the estimate obtained above from the measurements on 
the surface photoelectric effect of potassium. 

Note added in proof. Equation (3.16) predicts that 
the energy loss of the primary (essentially w) by SSE is 
nearly peaked at the energy value for which y(w)/w is 
maximum, and this is usually of order 5-10 eV. It 
would be interesting to investigate if such an energy 
loss can be detected in experiments on the character- 
istic energy losses of electrons in thin foils. Since the 
relative intensity of the scattered primaries by this 
effect is of order 102— 10-*/E,,, primary energies Ep of 
order 10 (ie., 272 eV) are preferable, if experiments 
with these energies are now feasible. 
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Scattering of Spin Waves by Magnetic Defects* 
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I. SCATTERING THEORY 


N a previous study! (which will be referred to as I), 
the general theory of the scattering of spin waves 
by magnetic defects was discussed, and results were 
obtained for the cross section and mean free path in 
simple cubic lattices. In the present note, the cumber- 
some procedures of that calculation are both simplified 
and generalized, and the results are extended to include 
body-centered cubic and face-centered cubic lattices. 
This is of particular interest since several of the recently 
discovered ferromagnetic insulators are face-centered 
cubic. The thermal resistivity produced by defect 
scattering is calculated by a method which includes 
some of the effects of spin-wave-spin-wave interactions. 
'The theory of the scattering of spin waves by mag- 
netic defects is an application of the general theory of 
the scattering of excitations by imperfections in solids. 
An account of this theory is being published elsewhere.” 
'Therefore, the present discussion will contain only as 
much of the theory as is required to make the applica- 
tion of those results to a spin-wave system intelligible. 
We consider a spin system described by a simple 
Heisenberg exchange Hamiltonian with nearest-neighbor 
interaction only: 


H-Y,J(R, R--A)S(R)-S(R--A). (0) 


MIA 


Neither external fields nor dipole-dipole interactions 
are included. The system is said to contain a magnetic 
defect if at some site Ro (which we choose as the origin), 
there is an atom whose spin S’ is coupled to its neighbors 
by an exchange integral J'. For the remainder of the 
atoms, these quantities are S and J, respectively. The 
quantity A is a vector connecting a lattice site with one 
of its nearest neighbors. 
The excited states of the system which contain a 
single spin deviation can be described by a set of func- 
tions ¢(R) such that le(R)!? gives the probability of 
finding the spin deviation on site R. It was shown in 


* Supported by the U. S. Air Force Office of Scientific 
Sete References to 
i vay, Phys. Rev. 132, 2003 (1963). Rete 5 
SU of aie spin-wave-defect interaction are given 


there. = 
aT Callaway, J. Math. Phys. 5, 784 (1964). 
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s M LII the scattering amplitude for the scattering of spin waves by magnetic defects 
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aR en for the mean free path, and the thermal resistivity due to defect scattering is calculated 
y a method which takes some account of spin-wave interactions. 
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Ref. 3 that 6(R) satisfies the equation 


EG(R) =2 Xs J (Rp Ri + ALS (Ri-+ A) @ (Ri) 
—(S(R)S(RH-A))"$(RH-A)]. (2) 

In this equation S(R;) is the “magnitude” of the spin 

on the site R; in that the eigenvalue of S?(Rj) in the 


ground state is S(R))LS(R;)--1]. Equation (2) can be 
written as a Schrédinger equation in the form 


Xun|E—Hs|De(R)-3i(m|VI)e(R), (3) 
in which 
(m| E— Hol) - (E—2J5S2)08,5--2JS801 a (4) 
and 
(m | V | l= ôm 255 (J'—J)61,0+2(J'S’—JS)b1,0 | 
— [J (S'S)*— JS Diosa (ôro Hôm,0). (5). 
Each atom is assumed to have z nearest neighbors. 
The subscript |— refers to Ri— Rn. 
The solutions of this equation for energies, E, within 


the continuous spectrum of Ho have been shown to 
have the following asymptotic form: 


QU2 n2 
Re — | eee — ], 6 
(R) (anya e fe (6) 


provided the energy is low enough so that the energy 
wave vector relation (energy band) for spin waves is 


spherical : 
Bye, (1) 


y72JSa, (8) 


in which a is the cubic lattice constant. The quantity f 
may be interpreted as the scattering amplitude for spin 
waves. It may be expressed as a sum of scattering ampli- 
tudes for partial waves transforming according to one 
of the irreducible representations of the crystal point , 


roup: 
f fe, (9) a 


in which 8 denotes an irreducible representation. The _, 
partial-wave scattering amplitudes are E — 


232? 
fas re E VeirePp.nn 2, Cy, O * (E, R5) - 
Y pnm » 
e XC 9 (ko Ra). (10) — 
3T. Wolfram and J. Callaway, Phys. Rev. 130, 2207 Ch S 
li 
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In this “equation, the functions C(k,R) are sym- 
metrized linear combinations of plane waves (0/879)! 
Xexp(ik- R) transforming according to the vth row of 
the Bth irreducible representation. These functions are 
characterized by wave vectors ko and k’, in which ko 
is the wave vector of the incident wave and k' is a 
vector of magnitude ko parallel to R; [see Eq. (6) ]. We 
can write 


Cg, (&, Rm) = 


Que 
(27)? 


PORE en (11) 


in which the quantities U (8,,R,,) are the matrix ele- 

ments of a unitary transformation. The prime on the 

summation in Eq. (11) indicates that the vectors Rm 

which are included are those which can be found from 

any one of them by rotation with all the operators of 

the point group. (Only distinct vectors are considered.) 
The quantities Vs,,, are given by 


Va,en=2,' U(B, R5) (|V |n tR p), — (12) 


in which Ut is the adjoint of the matrix U. Vg,pn is 
independent of the row of the representation v. The 
remaining quantities are found as follows. We consider 
the Green's function, G(R;— Ra), which satisfies 


2 (m|E—Ho|D S(Ri— Rn) Onn, (13) 


and define 
Th, n=). U(8,R)S(R;,— R;)U!(R,,8,). (14) 
ln 


Then, considering only those values of the indices 
l, m, such that Vg,ım 1s not zero, we construct the matrix 


Q8,1m=6im— > V8, V B,nm- (15) 


'The summation in (10) and (15) includes one term for 
each different symmetrized combination of plane waves. 
The quantities Pg and Dg occurring in Eq. (9) may now 
be defined by writing the matrix inverse to Q as 


[Q7 ]s,2m= Dg Pp,1m , 


Dp=det(Qs)=det[J—T'sV 6]. (17) 


The quantity Dg determines the locations of possible 
scattering resonances or localized modes. Let Æ’ (in 
- general, a complex number) be a solution of 


Dg(E)—0. (18) 


Put E'— Eg—310I/2. Then, if T0, Ep is the energy of 
a scattering resonance, and T is the width of the reso- 
nance. If T —0 (which occurs only outside the continuum 
of eigenstates of Ho), we have a localized mode. 

In the case of spin waves, the perturbation V ex- 
d tends only to nearest neighbors of the defect. The 


(16) 
-with 
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irreducible representations which occur in the sum over 
B in Eq. (7) have been determined for cubic lattices? ; 
They are Ty, T'is, and Tiz for a simple cubic lattice; 
Yi, Tis, Vas, and T for a body-centered cubic lattice, 
and T, Iis, Pas, T12 and T'ss for face-centered cubic.4 
The & dependence of the partial-wave scattering ampli- 
tude fg at low energies can be studied by expanding the 
symmetrized combinations of plane waves Cz, (b, R) 
in powers of k. One finds that 


Cp, (b, R) = (ER)'Kg, (0,0) ; (19) 


in which Kg,; is a Kubic harmonic belonging to the 
yth row of the bth irreducible representation, and is a 
linear combination of spherical harmonics of order /.5 
For I3, /—0; for Mis, J=1; for Te and Tos, l= 2, and for 
T» and Tæ, /—3. We conclude that (since | ko| =|k’|) 


feck”, (20) 


unless there are cancellations among the elements V, P 
which raise the power of k in (20). We do not expand 
Dg since this contains the possible resonances. 

For spin waves we find that the ?-wave amplitude 
(Tis) is proportional to &?, but cancellations do occur 
for the s-wave amplitude (Ti), which then turns out 
also to be proportional to &". Other (d,f) partial waves 
give terms proportional to kt and &*. Hence, the total 
amplitude is proportional to £? for small k. 

We will now determine the scattering amplitude to 
this order for simple cubic, body-centered cubic, and 
face-centered cubic lattices. The $-wave portion i 
simplest because only one term appears in the summa- 
tion over p,m, and s in Eq. (8). From this fact x 
follows, with the use of Eq. (16), that the only satrim 
element of P is unity. Hence Eq. (10) has the for it 


fo (—2n?/yD») V» D C», O* (E^, R1) 


XC, (k,R). Q1) 


The rows of the triply degenerate Tis representation 
contain functions transforming as x, y, or z. In each of 
the three lattices we have in the small £ limit 


C», =[0"?/ a3? WZika , (22) 
etc., and 
Vpu=—2IS[1—(J'S'/JS)]. (23) 
Then 
f»— (Q/2xDya*)[1— (J^S'/JS)]Ea? cosó, (24) 


in which @ is the angle between k and k’. 

: It is now necessary to express the determinant Dp 
in terms of the Green's functions. If (Ej) are the 
rectangular components of (R:—R,)/a, we have for 


* Notation for the irreducible representations of the point group 


is in accord with L. P. B i 
Wigner, Phys. Rev. 50, 58 (C936 M 


F. C. Von der Lage and H. A. Bethe, Phys, Rev. 71, 612 (1947)- 
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the three lattices: 


Simple cubic, 


D,-1—i[1— 05/75) ][9(0)— §(2,0,0)]; (25) 
body-centered cubic, 
D,=1—3[1— (5'/75)] 
face-centered cubic; 
D,—1—3[1— (V'S / IS) 50) +28 G,2,0) 


We infer from these results that for the same value 
of the lattice constant a, the scattering amplitude de- 
pends on the lattice structure partly through the cell 
volume 2 which has the value a’, a?/2, and a?/4 in the 
simple body-centered, and face-centered cubic lattices, 
respectively. In addition, more complicated dependence 
on structure is contained in the determinants, D>. 

We now turn to a consideration of the s-wave ampli- 
tude. This calculation is quite tedious and will not be 
given in detail. For each lattice, the relevant quantities 
V, P are 2X2 matrices. The symmetrized linear com- 
binations of plane waves are given by 


C,O (5,0) - QI? (22) ; (28a) 
C, (&,R)) = (Q/z) (2r)? Xa exp(ik- A). (28b) 


Equation (28b) can be expressed in terms of the spin- 
wave energy E(k), since 
E(k) -2JS[z— Da exp Gk- A)]. 
Hence, 
C,O (k, R1) = (2/2) (2r)! (a1— E/2J5). 

Equation (29) enables us to eliminate the functions 
C,0 from the s-wave amplitude. In addition, it is 
possible to express all the combinations of Green's 
functions which enter into D, and P, in terms of §(0), 
with the aid of certain identities which were derived in 
Ref. 3. Since $(0) can be expressed in terms of the 
density of states G (E), 


(29) 


Q dq (= : 
s = E—E(q) E-E 
=—/4JS, (30) 


this means that the s-wave amplitude can be deter- 
mined from the G(E) without explicit reference to 
E(k). The relevant identities are 


m ys g(a) = 0- (2/2759 ]9 (0 (1/2752) 


=—g/4JS, (31) 
; 7—A)= 1— (E/2J52)] Di S(A) 
25 g )-L usd 


a 
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In these equations, A; and A; indicate sitteaMith are 
nearest neighbors of the defect site (the origin). Only 
the particular combinations of Green’s functions appear- 
ing in (30), (31), and (32) are involved in the s-wave 
amplitude. 

In addition, we have 


e m/z 
yrs -us( ) (33) 
"V2 p 
1—gop—zng1 2) Gg) 
= ) p (34) 
z2 (gie-+Hgm) 1—goe—2n81 


in which 
e-1z(1—J'/7), (35a) 
273 [U"/J) S'/Sy—1], (35b) 
p-À(—J'S'/JS). (35c) 


After a rather tedious calculation, we obtain for the 
s-wave portion of the scattering amplitude to lowest 
order in k 


= gb qu S' 
Em 
2x D,a? 4JSNJ S 
=Q ss’ s 
= G-A (36) 
4rD.@\J S 
in which 


z J. En I'S S 
2, - eoi 7). (37) 
2 J/ ASS JS 


It will be observed that f, goes to zero as $? at low 
energies. This is the result of cancellation among some 
of the terms and of the Green's function identities (32). 
The expression (37) for D, is exact. We do not expand 
D, (or Dp) since they contain the possible low-energy 
resonances. 

The total amplitude to order 4° is the sum of (36) 
and (24): 


etm ses) 


2 J's! | 
= —(1-—— cosa |e. (38) 
DA ISP 


It will be observed that the amplitude in this order |. 
depends on the lattice structure in an obvious manner 
through the cell volume. There is, however, an equally 
important structure dependence incorporated in the 
function D, and Dp. The higher order (in £) terms in 
the amplitude also exhibit a significant dependence on 
the lattice structure since different representations 
appear. In general, it does not appear to be possible to 
express the scattering amplitude solely in terms of t 


a 


fa 
( 


ec 


and i TuS 


^ 
^ 
Li a ` o 
"a ^ 
/ Me». V. 
«geen the incoming and outgoing wave vectors 
SEE he. į 
dr order terms are inchided/ 
ho S Wi amplitattle is¢considered only to 
NW, spsthat'Tt depends only on the angle 0 
a simple calcütetion,of the-tiean free path 7p, for scat- 
tering of a spin wave by a defect is possible. This has 
been shown to be determined by the momentum transfer 
cross section c; which is given by 


SALLAWA Y 


ay Ser 


E 
7 


e fUr) cose. (39) 


"Then 
lp 1— N pes, (40) 


where Np is the concentration of defects. We obtain 
QUE b UA S^? 
E 
ár (| D,|2NJ S 
4 JES! 1 J'^S' 
E so) 
3 JS /L| Dj]? JS 
1/J’ S! 1 1 
EE eo). € 
2NJ S/\D*D, D,D, 


If we suppose that the denominators D, and D, can 
be set equal to unity, thereby neglecting the possibility 
of resonant scattering, we obtain 


Qi 7 J^? S^? 
2 dio) C-) 
Am LAS S 


ap JY P DPS) 
ea-—JQ-——) (42) 
QNT EUIS J aS 


IJ, THERMAL CONDUCTIVITY 


In this section we consider the thermal conductivity 
of a system of spin waves in interaction with each 
other and containing Np magnetic defects per unit 
volume. We hope to consider spin-wave-phonon inter- 
actions in subsequent work. Unless explicitly stated, 
the system will be assumed to be of infinite extent. 

We propose here to calculate the thermal conductivity 
“of spin waves by the same techniques which have proved 
to be reasonably successful in application to lattice 
thermal conductivity." We will consider each scattering 
“process te be described by a relaxation time, but ex- 
~— í plicit account will be taken of the conservation of 
* “crystal momentum by normal spin-wave-spin-wave 
^ . scattering processes. 

S The distinction between normal and umklapp proc- 
esses in spin-wave interactions is just as vital as it is 


6 T. M. Ziman, Electrons and Phonons (Oxford University Press, 


d, 1960). 
i S CU Phys. Rev. 113, 1046 (1959); J. Callaway and 


>% Ca Von Baeyer, ibid. 120, 1149 (1960). 
~ : . ae ad EE 
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in the case of phonon-phonon interactions. Since norma] 
processes conserve the total pseudomomentum of the 
spin-wave system, they cannot, if acting alone, produce 
a thermal resistance. This statement 1s essentially inde- 
pendent of the details of the dispersion relation for the 
magnons. It is true that the interactions between mag- 
nons do not conserve the heat flow at the microscopic 
level (as is the case for phonons if a linear dispersion 
law is assumed). A statistical argument shows, however, 
that a thermal resistance will not appear. An explicit 
demonstration is given in the Appendix. 

Nevertheless, it is not permissible to neglect normal 
processes in studying the thermal conductivity. Their 
contribution may be pictured qualitatively as that of 
converting some of the low-frequency magnons into 
high-frequency magnons which may easily be scattered 
by defects. We will show that in the limit of strong 
normal processes, the thermal conductivity becomes 
limited by the defect scattering in a fashion similar to 
that obtained by Ziman? for lattice thermal conductivity. 

'The importance of distinguishing normal and um- 
klapp processes has been stressed here because it has 
been ignored in other calculations of the spin-wave 
thermal conductivity.? We contend that such work may 
have yielded an erroneous thermal resistance. 

We begin by considering the Boltzmann equation in 


the form 
oN dN 
(=) — V: VT—=0, (43) 
c dT 


in which VT is the temperature gradient, JV is the dis- 
tribution function, and V, is the group velocity of the 
spin waves 

Vy— hV,E (k) 5 (44) 


The first assumption of the present approach is that 


the collision term (8/V/0!), may be approximated as 
follows: 


; (45) 


àNN  N(—N MN 
incer 


Ta Tu 
in which N (à) is a displaced Bose distribution 
N (3) — (expL(E—2.- k)/KT]— 1). (46) 


Nois the usual Bose function; r» is the relaxation time 
for a single mode via normal processes, and ry is the 
relaxation time for all those processes which do not 
conserve the crystal momentum.!? We assume that 2. is 
small so that N(3) may be expanded, and only first- 


* J. M. Ziman, Can. J. Phys. 34, 1256 (1956 

A QNS Phys. Kondens. Meee D 125 (1963). 
URL ettleton, Phys. Rev. 132, 2032 (1963), has partially 
justified this approximation for lattice thermal conductivity. In 
applying Nettleton’s discussion to the present calculation, one 
> ule note that in spite of appearances, an explicit expression 
or the normal process relaxation time, rw, is not required to ob- 


tain our essential results, E iti i 
are required are rp eres ED ane’ Gt SS gud gne 


Í 
} 
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b b seen | 
f order terms iA are retained. Put This condition leads to the relation ose 
| $-— —SVT/T, (sisa constant) (47a) k Tye tee 
» is [zen Ett 0. (54) 
j EN EN ee TN jp Jeet he 3 
g N— No mpg —»Vy- VT| - tcc e OA) 1 i t 
i RI) odit At 4 
á s eu SE Only the component of k parallel to the temperature 
No te ARM n, (476) gradient survives on integration over solid angles. We 
Ea substitute (51) and solve for s 
43 (These equations are definitions of s, v, and 7e.) We » Epig EIKT 
| also require the group velocity. We continue to usea , el eae di / 
PA quadratic dispersion law Ad ry (eP!ET—1} ! 
| E=yk+e, (48) 1 TaN  RichIKT 
so that Lf (1- ) — - a|. (55) 
es Vy-2(y/h)k. (49) TN vx (ePKT—1y 
| The constant term e in (48) does not play an important We combine (53) and (53), substitute for the spin- 
| . role in the mechanics of the thermal conductivity calcu- ^ wave specific heat 
lation and has been ignored in computing scattering 
amplitudes, but it is important in preventing the quan- . B se 
tity E—2.k in (46) from becoming negative for low Cu K =) ( ET py 
energies. It is therefore important to realize that a £ 
nonzero value of e is always obtained, since the material; + -oquce the change of variables 
is magnetized. One has, approximately," = 
LE e= g6[ (4/3) M - Hac] (50) x=yk/KT, 6=e/KT, (56) 
| where M is the magnetization and Hexe is a possible replace the relaxation time by mean free paths, 
| externa] field. LS 
| Equations (45)-(49) are substituted in the Boltz- iip 
mann equation, which then gives a relation between » etc., and obtain our final expression for the thermal 
and s. This is ... conductivity: 
| v= vd iths/(2y rx E) |. (51) 
| K(KTY| [77 eu 
i The thermal conductivity of the system 1s Km———— | slet oyi : dx 
p ór?yh lo (e+ —1) 
=ef V.2v(k) cos'6C sw (k)@R n d 
"mi gocce, 
o Ix (e+ — 17 
p * zm po 
4 = (272/37) i k»(E)Css (Ed, 52 | Í ES 24 (57) 
à o Ih enn di 
| = Quem) i Br (1 +Sh/ (2ytnE) JC (dk. The upper limit xn for these integrals is determined by 
an argument analogous to that establishing the Debye 


We have assumed cubic symmetry and also employed temperature for ee lattice and is ge by Lm 

» (49). Csw(k) is the contribution to the specific heat = (y/ Ed aye. we Meo i. T y 

i des E OL ` peratures low enough so that the upper limi t 
from a mode of wave vector k. Evidently another rela is m g B 


Kon wen emisor Dee t "Tie deles of he hal endet an en ME 
; rector of the system: external magnetic field can be obtained from (57). A 

change the total wave vec i magnetic field decreases the thermal conductivity by 
| an N(3)—N z reducing the probability of excitation of spin waves. 
| i | (—) kBk= | k@k=0. (53) qt is evident from (57) that in the limit of large à (low 
| dt İN TN temperatures or strong fields), the thermal conductivity 
must decrease as à^e-?; here the exponent n will depend 


s 


ev. 58, 1098 (1940); S. H. on the scattering mechanism. Further discussion of the 


` tei d Primakoff, Phys. Rev. 55, b ` ps = D 
aT o EE (ta be published). This dispersion relation is geld dependence of the conductivity will be reserved | EC 


5 .  Charap and E. Boyd (to S I c 
| only approximate since in the presence of dipole-dipole couplings, feneghecnuent OR B 
| the spin-wav® energies are anisotropic. 
n a 
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"T'hefzé« a. great similarity between Eq. (57) and the 
corresponding expressions which are obtained in the 
theory of lattice thermal conductivity. In particular, 
we observe that if the relaxation time for processes 
|! which do not conserve the wave vector becomes very 
‘Jong, the thermal conductivity of the spin-wave system 
increases without limit. Jf only norma] processes were 
present, there would be no thermal resistance. 

It follows that in the limit in which the mean free 
path for normal processes is very short compared to 
that for processes which do not conserve the wave 
vector, the thermal conductivity will depend sensitively 
on the latter processes. This is the Ziman limit. To 
obtain the thermal resistance in this limit, we drop the 
first term of (57) and neglect the contribution of lu 


to le Then 
grt? 
— dy L 
(gr —1) 


l ae Jeu 
x K(KT)J ly 


(x5) F 
| i nae | . (58) 

( grtà— 1)? 
j One interesting consequence of (58) is that the ther- 
i mal resistance in the Ziman limit is the sum of additive 


contributions from each possible type of “U” process. 
In particular we consider defect scattering, for which 
the mean free path is given by Eqs. (40) and (41). 
We will not consider here the effects of possible scatter- 
ing resonances which could increase the thermal re- 
sistance substantially. We abbreviate the mean free 
path due to defect scattering as 


: lp3—N 5AM, 
T in which 


PIT PNG S^? 4 IES 
E o» 
Am NJ S 3 JETS 


We can evaluate the thermal resistance due to defects 


(59a) 


d approximately by neglecting 6 in (58). We get 
j 6r hNpA [^ xe o y5/2pz 2 
ye in 
È Ky Jo (e7—1) o (e7—1)? 


3. 
t OWN Ah, 1(5)¢(4) NAR 


= = 38. : (60) 
Ky a L70(7/2)£(5/2)? JSKa - 
In the last step of (60), we have substituted Eq. (8). 
The thermal resistance due to the defects is inde- 
pendent of temperature. This expression is valid in the 
limif in which defect scattering is weak compared to 
ormal^magnon-magnon scattering. Our treatment has 
included the effects of magnon-magnon scattering on the 
distribution function, and thereby has taken account 
-interactions implicitly. Our formula, Eq. (60), 
depend explicitly on the details of the magnon- 
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magnon interaction. It does differ somewhat from the’ 
expression for the defect resistivity if these interactions 
are neglected. The situation in which defect scattering 
is weak compared to boundary scattering and normal 
processes are ignored was examined in J. In that case 
a thermal resistance Wp’ due to defects was obtained 
which is independent of temperature, and differs from 
(60) only in respect to the numerical constant. That 


result is? 


NpAt t(5) NpAh 
ie =70.8 


=70. SE (on 
Ky KGP * JSKe 1 


W p' = 20r? 


It will be noted that the thermal resistance due to 
defects given in (61) is larger by a factor of almost 2 
than that of Eq. (60). This situation should be con- 
trasted with that which obtains in the theory of lattice 
thermal conductivity where the contribution to the 
thermal resistance from defects is, in the Ziman limit, 
larger by a factor of 25 than when umklapp scatter- 
ing dominates. ` 

Next, we will discuss the thermal conductivity in the 
opposite limit in which the defect scattering is strong 
compared to magnon-magnon scattering. This requires 
a more detailed knowledge of the combined mean free 
path le, including normal processes. The interaction of 
two spin waves can produce bound states,” and prob- 
ably scattering resonances. A complete theory of the 
scattering of two spin waves has not yet been given, 
but for an introduction, we can make use of a calcula- 
tion reported by Dyson.“ Dyson showed that, to the 
extent that resonances and such may be neglected, the 
mean free path for a spin wave of wave vector k due 
to spin-wave-spin-wave interactions, may be written as 


1= BTR, 


This expression is valid at low temperatures. The quan- 
tity B is a rather complicated function of J and S, 
and is given by 


a- 3K 


5/2 : 
ZoS? \2rJSzu 


in which 
w= (1, 3x 2785, 2-49). 


64 

BE (1, 3X2, 213) ? 

for the simple cubic, body-centered cubic, and face- 
centered cubic lattices, respectively. 

With the use of (62) and (59), we have a combined 


mean free path including defect scattering and magnon- 


magnon scattering 
- 


l= BTE +N p ARs. (65) 


2A fact 2 dug : 
equation in Ret oo) * was omitted in the statement of tis 


13 M. Worti : 
Letters 11, 336 Gane 132, 85 (1963); J. Hanus, Phys. Rev — 


‘F. J. Dyson, Phys. Rey. 102, 1217 (1956). ^ 
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jjn the limit L«€1 which corresponds to defect scattering 
Strong compared to magnon-magnon scattering, we can 
drop all but the first term of (57). This follows since if 
we consider both /, and l, to be proportional to some 
small constant œ, the first term of (57) is proportional 
to a, whereas ihe second term is proportional to o£. 
We then substitute (65) into (57), let 6 — 0, and obtain 


Ky ie x? e 
6r hNpAJo x+y (e7—1)? 


( k= dx, (66) 


in which y= ByT3?/N pAK. 

We consider this integral at temperatures low enough 
so that the upper limit may be made infinite. Even so, 
the integral needs to be approximated if we are to avoid 
numerical computation. We follow the procedure used 
under somewhat similar circumstances in I, in which 
we replace a?e*/(e*—1)* by 1 for «<1, and neglect y 
for x2 1. This is valid only when y«&1, but if this con- 
dition is not satisfied, it is not legitimate to neglect the 
second term in (57). Hence 


Ky pde TOS 
K p «f aly 
6rANpALJy x+y Ji (e7—1)? 
Ky 1 o 1 
—— n=) 4 e] 
OTAN pA y/ n2 n 


KJ Sa NpAK 
EG )evon: (67) 
ByT3?? 


STN p Aft 


Evidently, in this limit, the temperature dependence of 
the thermal conductivity is weak. 


APPENDIX 


In this appendix, we give an explicit demonstration 
that for magnons whose energy is related to the wave 
vector by Eq. (48), normal scattering processes (N. P.) 
cannot, by themselves, produce a thermal resistance. 
We recall that a normal process is one in which the total 


: wave vector of the excitations involved is conserved. 


It is to be distinguished from an umklapp process, for 
instance, in which the wave vector changes by a re- 
ciprocal lattice vector. 

It is fairly easy to see intuitively that if only N. P. 
operate, the crystal relaxes to equilibrium subject to 
the condition that the total crystal wave vector be 
nonzero and constant X>, kz,— K (in which ns is the 
number of particles in a state of wave vector k). There 
must be a fivw of heat in the direction of K. 

Tne simplest mathematical justification of this in- 
tuition can be found by returning to the derivation of 
the Fermi-Dirac (F. D.) and Bose-Einstein (B. E.) dis- 
tribution functions for an ideal gas using the micro- 
canonical ensemble. A system of weakly interacting 


. excitations is considered. 
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To obtain the distribution function m, we maximize 
the entropy, e, subject to the constraints irffposed by 
conservation laws. We do this by multiplying those 
quantities g;(1) which must be conserved by Lagrange 
undetermined multipliers \;. Then we add their varia- 
tion with respect to 1; to the variation of « and set the 
whole equal to zero. Thus, 


óc ógj 
— +} A=. (A1) 
ôy Ông 
Let the upper/lower signs represent B. E./F. D. sta- 
tistics, respectively. We have 
óc /óny — In (-12c1). 


(A2) 
Let us choose the following conserved quantities: 
Pk". HN; Lx Euny— EK; Y km,—K. (A3) 


We vary these quantities, multiply by Lagrange multi- 
pliers, and substitute together with (2) in (1) to obtain 


Ey—p—2.-k $ 
n= (e ET 


where the multipliers of N, E, and K have been written 
in the conventional manner. (If the number of particles 
is not conserved, we should set „=0, and this must be 
done for magnons and phonons.) 

Clearly, since E; E_y, modes with k parallel to 2. 
contain more particles thanffpodes with k antiparallel, 
so we expect a net heat current Q in the direction of 3. 

To demonstrate this we consider the expression for 
the heat current 


(A4) 


Q=}; EgVyttg. 


We will evaluate this approximately on the basis of 
the dispersion relation!* 


(AS) 


E,=yk" +e. (A6) 
In this equation the relevant values of the exponent z i 
are 1 and 2. If n=1, we have a situation approximately 1 
describing low-energy phonons (also e=0 in this case). ; 
The case n=2 characterizes spin waves, but now ez£0; — 
a nonzero value of E is provided by the magnetizationg | 
(M) plus any external field (Hex), aÀàwas described in® 
the main text [see Eq. (50)]. With of (6), we 
obtain for Q =" 
ny (yk7-- ) ke? 


~ Quyh) (exp([y--e—a-K]/KT]—1) 


k. (AT) 


D 
3 $ 
p 


15 K. Huang, Statistical Mechanics (John Wiley & su Inc. 
New York, 1963), p. 192. ` . ; 

15 The importance of including « is that if it is not present, the 
argument of the exponential in (A4) will become DR 
small k when n=2. 
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& The total wave vector K is 


dk. (A8) 


; 1 k 
ae ats 
h QJ (Cexp([yE"-e—9- K]/KT) - 0) 
- "ea Neither of these integrals is zero. If n=1, (e 0) it is 


an easy to see that 
Q= (7/h)K. 


For n=2, (ez£0) the situation is somewhat more com- 
plex. The substitution q=k—2/2y enables the inte- 
grals to be transformed to a form in which the portions 
which vanish on integration over angle can be readily 
separated. The results are 


E EU. 5 
———| —— on 2. 
AS qu 


(eres ; (A102) 


x-— (reum 


af 


(A9) 


(A10b) 
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Equation (A10b) may be used to eliminate 3 fromf 
the expression for Q. In the limit in which e and À are F 
both small, we obtain a simple result 4 


Q=C(KT/h)yK, (A12)§ 
in which C is a numerical constant. 

We have shown that in the absence of processes} — 
which do not conserve the total wave vector, there is als 
constant nonzero heat current in the presence of noi 8 | 
zero total crystal wave vector. A state with a nonvanish+-= 
ing K, once established, could not be changed by Y 
collisions. Since no thermal gradient was assumed, thi Ea 
means that in the absence of umklapp processes the ^ aoe 
no thermal resistivity. There is no conflict wit. e 
result of the analogous classical problem of the thermal b 
conductivity of gas in a cylinder, since in that case a i 
state of nonzero momentum is not established. ` 
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